Use transposed buffers
Extend the use of transposed buffers to enable SIMD for SAD, weightedAverage and linearTransform. Transposed buffers are already used in some parts of the ECM; this MR extends their usage. SIMD becomes applicable because some blocks are treated as a row instead of a column. Please note that this changes only how the block is interpreted, not the underlying memory.
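
For reviewers unfamiliar with the idea, here is a minimal sketch of treating a column as a row without touching the memory. It is not the ECM code: `BlockView`, `asRow` and `sad` are hypothetical names used only for illustration. It also assumes the simplest case, a block of width 1 whose samples are contiguous (stride 1). The cases actually handled by this MR may differ.

```cpp
#include <cassert>
#include <cstdint>
#include <cstdlib>
#include <vector>

// Hypothetical non-owning view over a sample buffer (not the ECM type).
struct BlockView {
  const int16_t* buf;
  int width;
  int height;
  int stride;  // distance in samples between the starts of consecutive rows
};

// Reinterpret a contiguous column (width 1, stride 1) as a single row.
// Only the view's metadata changes; the samples are neither moved nor copied.
inline BlockView asRow(const BlockView& v) {
  assert(v.width == 1 && v.stride == 1);
  return BlockView{v.buf, v.height, 1, v.height};
}

// Plain scalar SAD. The inner loop runs along a row, which is the loop a
// SIMD kernel would process; for a width-1 column it has one iteration only.
inline int sad(const BlockView& a, const BlockView& b) {
  assert(a.width == b.width && a.height == b.height);
  int sum = 0;
  for (int y = 0; y < a.height; ++y) {
    const int16_t* pa = a.buf + y * a.stride;
    const int16_t* pb = b.buf + y * b.stride;
    for (int x = 0; x < a.width; ++x) {
      sum += std::abs(pa[x] - pb[x]);
    }
  }
  return sum;
}
```

Because SAD is a sum over all sample pairs, the result is the same for the column view and the row view, provided both inputs are reinterpreted in the same way. The row view simply exposes a long inner loop that a vectorized kernel can consume.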