Speed up float 5x5 matrix * vector multiplication with SSE

Question

I need to run a matrix-vector multiplication 240000 times per second. The matrix is 5x5 and is always the same, whereas the vector changes at each iteration. The data type is float. I was thinking of using some SSE (or similar) instructions.

I am concerned that the number of arithmetic operations is too small compared to the number of memory operations involved. Do you think I can get some tangible (e.g. > 20%) improvement?
Do I need the Intel compiler to do it?
Can you point out some references?

Ulterior · Accepted Answer

I would suggest using Intel IPP and abstract yourself of dependency on techniques

Speed up float 5x5 matrix * vector multiplication with SSE

Tags:

c++

vectorization

simd

matrix-multiplication

sse

Enzo

1 Answers

Ulterior

Recent Activity

Donate For Us

Speed up float 5x5 matrix * vector multiplication with SSE

Tags:

c++

vectorization

simd

matrix-multiplication

sse

Enzo

1 Answers

Ulterior

Related questions

Recent Activity

Donate For Us