Only the upper triangle of the block matrix is stored, thus when executing operations on the lower triangle, each block must be transposed. This transposition was not ocurring in the matrix-vector multiplication function, which is fixed by this commit. Reviewed By: brecht Differential Revision: http://developer.blender.org/D3619