mmx Questions
1
Solved
The intel intrinsic functions have the subtype of the vector built into their names. For example, _mm_set1_ps is a ps, which is a packed single-precision aka. a float. Although the meaning of most ...
Infix asked 30/1, 2022 at 4:35
1
Solved
I use gdb info registers <tab> to see all the registers, but I don't see MMX registers.
My CPU is Xeon Platinum 8163, a modern Xeon cpu that supports SSE and MMX. So i think its a gdb proble...
Hammock asked 24/6, 2021 at 3:46
1
I'm currently working on updating a large codebase from VS2013 to VS2019. One of the compiler errors I've run into is as follows:
intrinsics.h(348): error C3861: '_mm_cvtpd_pi32': identifier not...
Epochmaking asked 30/3, 2020 at 15:4
3
When I look at diagrams and overviews of recent processors[1], I never see mention of the MMX registers MM0 - MM7. But from the specs, it seems like they still exist. Can one depend on them being p...
Journalism asked 7/6, 2013 at 9:43
1
I've been looking at MMX/SSE and I am wondering. There are instructions for packed, saturated subtraction of unsigned bytes and words, but not doublewords.
Is there a way of doing what I want, or ...
Antonioantonius asked 10/6, 2019 at 12:6
2
Solved
In AT&T syntax instructions often have to be suffixed with the appropriate operand size, with q for operations on 64-bit operands. However in MMX and SSE there is also movq instruction, with th...
1
I'm trying to revive an old Win32 game that uses 3DNow! instruction set to make 3D rendering.
On modern OSs like Win7 - Win10 instructions like FPADD or FPMUL are not allowed and the program throw...
1
Solved
I wrote this code in NASM:
PADDD mm0, mm1
And it was assembled with no errors, but this instruction, though it exists in NASM, I couldn't find it in Intel Instructions Manuals, all I found is th...
1
Solved
1
Solved
Is converting then shifting then bitwise-or'ing the only way to convert from two __m128d to a single __m128i?
This is perfectly acceptable to Xcode in an x64 build
m128d v2dHi = ....
m128d v2dLo ...
Wilbertwilborn asked 15/9, 2016 at 4:24
2
Solved
Using SSE intrinsics, I've gotten a vector of four 32-bit floats clamped to the range 0-255 and rounded to nearest integer. I'd now like to write those four out as bytes.
There is an intrinsic _mm...
1
Solved
I understand converting MMX 32bit mmx intrinsics no longer allows the __m64. So I was having great trouble upgrading this piece of code to SSE. I was told on another stack-Overflow post to post my ...
Paracasein asked 9/9, 2015 at 12:2
1
Solved
So when I started the conversion and set the target to 'x64', I get 7 unresolved externals. Two examples:
error LNK2001: unresolved external symbol _m_empty ...CONVOLUTION_2D_USHORT.obj CONVOLUTIO...
Motorboat asked 7/9, 2015 at 21:12
2
Solved
I am programming the following:
__asm__ volatile ("movq %%rax, %%mm1\n"
"movq %%rcx, %%mm2\n"
: : "a" (0xDEADBEEFDEADBEEF), "c" (0xBADFACE5BADFACE5));
In this case, I am moving the value from ...
2
Solved
I'm thinking I should familiarize myself with x86 SIMD extensions. But before I even began I ran into trouble. I can't find a good overview on which of them are still relevant.
The x86 architectur...
1
Solved
What is the proper way to convert an __int64 value to an __m64 value for use with SSE?
Schellens asked 30/1, 2012 at 8:46
3
Solved
I am trying to optimize some arithmetic by using the MMX and SSE instruction sets with inline assembly. However, I have been unable to find good references for the timings and usages of these enhan...
3
Solved
I am trying to optimize a small piece of code with SSE intrinsics (I am a complete beginner on the topic), but I am a little stuck on the use of conditionals.
My original code is:
unsigned long c...
Sigmund asked 9/6, 2011 at 9:7
2
Solved
9
Solved
I am curious, do new compilers use some extra features built into new CPUs such as MMX SSE,3DNow! and so?
I mean, in original 8086 there was even no FPU, so compiler that old cannot even use it, ...
1
© 2022 - 2024 — McMap. All rights reserved.