loop-unrolling Questions

1

Solved

My 9600GT hates me. Fragment shader: #version 130 uint aa[33] = uint[33]( 0,0,0,0,0,0,0,0,0,0, 0,0,0,0,0,0,0,0,0,0, 0,0,0,0,0,0,0,0,0,0, 0,0,0 ); void main() { int i=0; int a=26; for (i...
Gird asked 1/9, 2013 at 10:54

8

Solved

I have a lot of if, else if statements and I know there has to be a better way to do this but even after searching stackoverflow I'm unsure of how to do so in my particular case. I am parsing text...
Corticate asked 11/9, 2013 at 22:59

2

I have a question about a Project Euler question and optimization using loop unrolling. Problem description: 2520 is the smallest number that can be divided by each of the numbers from 1 to 10 wit...
Nullify asked 8/11, 2013 at 19:18

3

Solved

#include <stdio.h> int main() { int i; for(i=0;i<10000;i++){ printf("%d",i); } } I want to do loop unrolling on this code using gcc but even using the flag. gcc -O2 -funroll-all-loo...
Sprang asked 30/9, 2013 at 21:31

2

Solved

I have the following loop that I am running on an ARM processor. // pin here is pointer to some part of an array for (i = 0; i < v->numelements; i++) { pe = pptr[i]; peParent = pe-&g...
Ploy asked 15/4, 2013 at 18:35

2

In CUDA it is possible to unroll loops using the #pragma unroll directive to improve performance by increasing instruction level parallelism. The #pragma can optionally be followed by a number that...
Hinterland asked 4/11, 2012 at 19:43

2

Solved

I am writing a Linux Kernel driver (for ARM) and in an irq handler I need to check the interrupt bits. bit 0/16 End point 0 In/Out interrupt (very likely, while In is more likely) 1/17 End poin...
Burmaburman asked 13/9, 2012 at 7:23

1

Solved

Do you recommend reading your kernel's PTX code to find out to optimize your kernels further? One example: I read, that one can find out from the PTX code if the automatic loop unrolling worked. ...
Reeder asked 10/11, 2011 at 14:22

4

Solved

When a compiler performs a loop-unroll optimization, how does it determined by which factor to unroll the loop or whether to unroll the whole loop? Since this is a space-performance trade-off, on a...
Anthracnose asked 7/10, 2011 at 18:12

1

Solved

When I increase the unrolling from 8 to 9 loops in my kernel, it breaks with an out of resources error. I read in How do I diagnose a CUDA launch failure due to being out of resources? that a mis...
Velites asked 28/9, 2011 at 14:37

3

Solved

wikipedia (here) gives a compile time unrolling of for loop....... i was wondering can we use a similar for loop with template statements inside... for example... is the following loop valid temp...
Arresting asked 19/7, 2011 at 4:44

6

I'm trying to use templates to unroll a loop in C++ as follows. #include <iostream> template< class T, T i > struct printDown { static void run(void) { std::cout << i <<...

9

Solved

I've been trying to optimize some extremely performance-critical code (a quick sort algorithm that's being called millions and millions of times inside a monte carlo simulation) by loop unrolling. ...

© 2022 - 2024 — McMap. All rights reserved.