vector-processing Questions
4
Solved
I have a __m256d vector packed with four 64-bit floating-point values.
I need to find the horizontal maximum of the vector's elements and store the result in a double-precision scalar value;
My at...
Gage asked 20/3, 2012 at 21:48
3
Solved
I have a packed vector of four 64-bit floating-point values.
I would like to get the sum of the vector's elements.
With SSE (and using 32-bit floats) I could just do the following:
v_sum = ...
Dukedom asked 19/3, 2012 at 18:11
2
Solved
The v4 series of the gcc compiler can automatically vectorize loops using the SIMD processor on some modern CPUs, such as the AMD Athlon or Intel Pentium/Core chips. How is this done?
Contented asked 3/1, 2009 at 16:22
2
Solved
Is it better in some sense to vectorize code by hand, using explicit pragmas or to rely on or use auto-vectorization? For optimum performance using auto-vectorization, one would have to monit...
Dais asked 3/1, 2009 at 18:40
1
© 2022 - 2024 — McMap. All rights reserved.