cpu-architecture Questions

5

Solved

Given some Random.exe on Windows, how can I determine its CPU architecture eg Intel/ARM, and its bitness eg 32 or 64. Is there a property in File Explorer, some other tool, or programatic meth...

1

Solved

given this code lp: vpaddq ymm0, ymm1 vpaddq ymm3, ymm4 add rbx, rax add rcx, rax vpaddq ymm1, ymm2 vpaddq ymm4, ymm5 sub rax, 0xA jge lp according to the https://uica.uops.info this is ...
Friesland asked 17/6, 2023 at 9:57

3

Need to create a script to check to see if the kernel is in PAE mode or not. Surely, it is not enough to just check if the /proc/cpuinfo flags have this 'pae' setting. We must know if the PAE mech...
Bohannan asked 12/12, 2012 at 0:6

1

Solved

In the document titled Data Operand Independent Timing Instruction Set Architecture (ISA) Guidance Intel is introducing a new IA32_UARCH_MISC_CTL MSR where toggling bit 0 enables the "Data Ope...
Sizzler asked 22/5, 2023 at 19:16

4

When disassembling an executable I encountered the cmove instruction. I've already searched on the Internet but I've only found that it's a conditional move, and if the source and destination are e...

0

I am currently looking for answers to why gcc generates strange instructions like "rep ret" in the generated assembly code. I came across a question on Stack Overflow where someone raised...

2

Solved

I am taking a course of system tools and architecture in my University and the first class is about how CPU and RAM talks and how CPU process the data. As explained by the Prof, CPU has an ALU (Ari...
Mannose asked 7/1, 2014 at 19:40

1

Solved

Since amd zen 4 has only 256bit wide operations on vector data, the following diagram from chipsandcheese's Zen 4 article shows 6 FP pipelines (4 ALU and 2 memory): Each FMA does 1 multiplication ...
Prieto asked 7/5, 2023 at 15:45

1

Solved

In the technical overview published by Intel, "Sub-NUMA Clustering" and "Hemisphere and Quadrant Modes" are described separately. But the main difference between them is not cle...
Sciamachy asked 28/4, 2023 at 8:51

1

Solved

Changing add to adc in the highlighted line below significantly improves performance. I find it very counter-intuitive, since add has more ports to execute and it does not depend on flags. CPU: Int...
Vickyvico asked 24/1, 2021 at 20:20

2

I stumbled upon a peculiar performance issue when running the following c++ code on some Intel Xeon processors: // array_a contains permutation of [0, n - 1] // array_b and inverse are initialized ...
Projective asked 7/9, 2020 at 15:23

6

Solved

Some CPUs (notably x86 CPUs) feature a parity flag on their status register. This flag indicates whether the number of bits of the result of an operation is odd or even. What actual practical purp...
Mining asked 7/9, 2014 at 4:41

1

I have an Intel Sapphire Rapids CPU with 56 cores. By default, SNC is not enabled. When core 0 accesses a certain memory address A, I think the following will happen: One of the cache agent is acc...
Sinuate asked 20/11, 2022 at 20:45

3

I am trying to find out what an x86 processor does when it encounters a store conditional instruction. For instance does it stall the front end of the pipeline and wait for the ROB buffer to become...

3

In xcode which is the developement environment for iOS. When creating a new project. Build settings states that it supports armv7, armv7s and arm64 architectures. The following chart shows apple ...
Harriott asked 11/12, 2016 at 22:38

1

Solved

I came across this code emission for x64 were "Atomic Load" is using a simple movq whereas "Atomic Store" is using xchgq. This link explains that Atomic Load/Stores on aligned a...
Countryside asked 30/1, 2023 at 19:22

2

Solved

Cache lines are often 64 bytes, other sizes also exist. My very simple question is: is there any theory behind this number, or is it just the result of the vast amount of tests and measurements th...
Unbearable asked 30/3, 2016 at 15:11

4

I remember in an assembly class, we learned the m68k processor, and there were 3 kinds of shifts you could do. Linear shift, circular shift, and circular shift with extend. The last one, circular ...

3

Solved

In languages like C, unsynchronized reads and writes to the same memory location from different threads is undefined behavior. But in the CPU, cache coherence says that if one core writes to a memo...
Sceptic asked 11/10, 2021 at 12:5

3

Solved

Suppose an external interrupt request is made to 8086. Processor will handle the interrupt after completing the current instruction being executed (if any). Before handling of the interrupt, the st...

4

Solved

I've been told and have read from Intel's manuals that it is possible to write instructions to memory, but the instruction prefetch queue has already fetched the stale instructions and will execute...
Gentianaceous asked 30/6, 2013 at 22:52

1

In Paul McKenny's famous paper "Memory Barriers: A Hardware View for Software Hackers" 3.3 Store Buffers and Memory Barriers To see the second complication, a violation of global memory ...

6

Solved

I am using Intel(R) Core(TM) i5 CPU M 560 @ 2.67GHz as I found out from cat /proc/cpuinfo. But I want to know exact hierarchy like how many sockets are there, and how many cores are there per socke...
Dara asked 23/8, 2013 at 11:43

1

Solved

NOT on a register holding binary 11111111 will produce 00000000, but ZF will still have its old value, so it might not be 1 even though the output value is all zero. XOR reg, -1 would do the same t...
Operant asked 7/11, 2022 at 3:47

1

I read that the multithreading is the feature that a single core can manage many threads (not in parallel) and the benefit is that the CPU is always working. For the Hyperthreading that a single co...

© 2022 - 2024 — McMap. All rights reserved.