gpgpu Questions

2

Solved

Background: I'm trying to understand whether a GPU's Last-Level Cache is invalidated or preserved across multiple kernel launches, so that the effective memory bandwidth can be increased. I'm aware ...
Spellbound asked 2/9, 2023 at 8:26

3

Solved

The CUDA documentation does not specify how many CUDA processes can share one GPU. For example, if I launch more than one CUDA program as the same user with only one GPU card installed in the system, what i...
Columbarium asked 27/7, 2015 at 0:55

5

Solved

When a computer has multiple CUDA-capable GPUs, each GPU is assigned a device ID. By default, CUDA kernels execute on device ID 0. You can use cudaSetDevice(int device) to select a different device...
Biddle asked 8/12, 2012 at 20:42
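The mechanism this excerpt describes can be sketched as below; the kernel, sizes, and the choice of device 1 are illustrative, and error checking is omitted:

```cuda
#include <cstdio>
#include <cuda_runtime.h>

__global__ void fill(int *out, int value) {
    out[threadIdx.x] = value;
}

int main() {
    int count = 0;
    cudaGetDeviceCount(&count);            // how many CUDA-capable GPUs are visible
    printf("devices: %d\n", count);

    // Select device 1 if present; otherwise stay on the default device 0.
    if (count > 1) cudaSetDevice(1);

    int *d_buf = nullptr;
    cudaMalloc(&d_buf, 32 * sizeof(int));  // allocation lands on the current device
    fill<<<1, 32>>>(d_buf, 42);            // so does the kernel launch
    cudaDeviceSynchronize();
    cudaFree(d_buf);
    return 0;
}
```

Note that cudaSetDevice affects subsequent allocations and kernel launches made from the calling host thread.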

5

Solved

I am working on high-performance code in C++ and have been using both CUDA and OpenCL, and more recently C++ AMP, which I like very much. I am, however, a little worried that it is not being developed ...
Putter asked 23/1, 2016 at 21:48

2

Solved

I am trying to get NVIDIA's CUDA set up and installed on my PC, which has an NVIDIA GEFORCE RTX 2080 SUPER graphics card. After hours of trying different things and lots of research I have gotten CUD...
Cnidoblast asked 23/7, 2020 at 18:45

13

Solved

My CUDA program crashed during execution, before memory was flushed. As a result, device memory remained occupied. I'm running on a GTX 580, for which nvidia-smi --gpu-reset is not supported. Pla...
Sandbox asked 4/3, 2013 at 8:22

1

Solved

In SYCL, there are three types of memory: host memory, device memory, and Unified Shared Memory (USM). For host and device memory, data exchange requires explicit copying. Meanwhile, data movement ...
Frequent asked 16/7, 2023 at 20:36
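A minimal sketch of the USM case the excerpt contrasts with explicit copies, assuming a SYCL 2020 compiler (e.g. DPC++); memory from `malloc_shared` migrates between host and device with no explicit copy:

```cpp
#include <sycl/sycl.hpp>

int main() {
    sycl::queue q;

    // USM shared allocation: accessible from both host and device;
    // the runtime migrates the pages, so no explicit copy is needed.
    int *data = sycl::malloc_shared<int>(16, q);
    for (int i = 0; i < 16; ++i) data[i] = i;

    q.parallel_for(16, [=](sycl::id<1> i) { data[i] *= 2; }).wait();

    // The host can read the results directly, again without a copy.
    int sum = 0;
    for (int i = 0; i < 16; ++i) sum += data[i];

    sycl::free(data, q);
    return sum == 240 ? 0 : 1;
}
```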

3

Solved

What are the key practical differences between GPGPU and regular multicore/multithreaded CPU programming, from the programmer's perspective? Specifically: What types of problems are better suited...

6

Solved

I'm not sure if it's possible. I want to study OpenCL in depth, so I was wondering if there is a tool to disassemble a compiled OpenCL kernel. For a normal x86 executable, I can use objdump to get ...
Prolusion asked 14/7, 2011 at 6:25

3

Solved

I am writing code to compute the dot product of two vectors using the cuBLAS dot-product routine, but it returns the value in host memory. I want to use the dot product for further computation on GPGPU...
Ronnyronsard asked 13/9, 2012 at 6:18
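One common approach is cuBLAS's device pointer mode, which makes the library write the scalar result to a device pointer so it never round-trips through the host; a minimal sketch (inputs uninitialized, error checks omitted):

```cuda
#include <cublas_v2.h>
#include <cuda_runtime.h>

int main() {
    const int n = 1024;
    float *d_x, *d_y, *d_result;
    cudaMalloc(&d_x, n * sizeof(float));
    cudaMalloc(&d_y, n * sizeof(float));
    cudaMalloc(&d_result, sizeof(float));   // the result stays in device memory

    cublasHandle_t handle;
    cublasCreate(&handle);

    // Tell cuBLAS that scalar results should be written to device pointers.
    cublasSetPointerMode(handle, CUBLAS_POINTER_MODE_DEVICE);
    cublasSdot(handle, n, d_x, 1, d_y, 1, d_result);

    // d_result can now be consumed directly by a subsequent kernel.
    cublasDestroy(handle);
    cudaFree(d_x); cudaFree(d_y); cudaFree(d_result);
    return 0;
}
```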

9

Solved

I'd like to extend my skill set into GPU computing. I am familiar with raytracing and realtime graphics (OpenGL), but the next generation of graphics and high performance computing seems to be in GP...
Mischiefmaker asked 10/10, 2012 at 21:2

1

I have a discrete NVIDIA GPU (say, Kepler or Maxwell). I want to clear my L2 cache before some kernel is scheduled, so as not to taint my test results. I could do something like allocate a large s...
Instill asked 15/7, 2015 at 11:39
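The buffer-streaming approach the excerpt alludes to can be sketched as a kernel that reads a buffer several times larger than L2 (tens of MB is a safe overestimate on Kepler/Maxwell), evicting previously cached lines; the `sink` write merely keeps the loads from being optimized away:

```cuda
__global__ void thrash_l2(const int *buf, size_t n, int *sink) {
    int acc = 0;
    size_t stride = (size_t)gridDim.x * blockDim.x;
    for (size_t i = (size_t)blockIdx.x * blockDim.x + threadIdx.x; i < n; i += stride)
        acc += buf[i];                  // every load displaces an L2 line
    if (acc == 123456789) *sink = acc;  // dead-code guard: never taken in practice
}
```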

1

So I'm exploring WebGPU and figured it would be an interesting exercise to implement a basic neural network in it. Having little understanding of both GPU shader programming and neural networks and...
Bowing asked 27/4, 2022 at 21:35

4

Solved

I am writing a CUDA program and trying to print something inside the CUDA kernels using the printf function. But when I compile the program I get an error: error : calling a host f...
Tyrannicide asked 31/12, 2012 at 22:45
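The usual cause of this error with printf is compiling for an architecture below sm_20: device-side printf requires compute capability 2.0 or higher, and below that the compiler only sees the host printf. A sketch:

```cuda
#include <cstdio>

__global__ void hello() {
    // Device-side printf needs compute capability >= 2.0;
    // compile with e.g. `nvcc -arch=sm_20 hello.cu` (or any newer arch).
    printf("Hello from thread %d\n", threadIdx.x);
}

int main() {
    hello<<<1, 4>>>();
    cudaDeviceSynchronize();  // flushes the device printf buffer
    return 0;
}
```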

3

Solved

I am trying to generate random numbers within the CUDA kernel. I wish to generate the random numbers from a uniform distribution and in integer form, starting from 1 up to 8. The ra...
Insulation asked 29/8, 2013 at 1:49
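A sketch with cuRAND's device API: `curand_uniform` returns a float in (0, 1], so scaling by 8 and taking the ceiling maps it onto the integers 1..8. The seed and per-thread sequence setup are illustrative:

```cuda
#include <curand_kernel.h>

__global__ void roll(unsigned long long seed, int *out, int n) {
    int tid = blockIdx.x * blockDim.x + threadIdx.x;
    if (tid >= n) return;

    curandState state;
    curand_init(seed, tid, 0, &state);  // one independent sequence per thread

    // (0, 1] * 8 -> (0, 8]; ceilf maps this onto {1, ..., 8}.
    float u = curand_uniform(&state);
    out[tid] = (int)ceilf(u * 8.0f);
}
```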

4

Solved

I want to import a PGP public key into my keychain in a script, but I don't want it to write the contents to a file. Right now my script does this: curl http://example.com/pgp-public-key -o /tmp/p...
Classroom asked 9/9, 2016 at 12:35
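Assuming the goal is simply to avoid the temporary file, `gpg --import` reads standard input when given `-` (or no file argument), so curl can pipe into it directly:

```shell
# Pipe the key straight from curl into gpg; nothing is written to disk.
curl -s http://example.com/pgp-public-key | gpg --import -
```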

4

When is calling the cudaDeviceSynchronize function really needed? As far as I understand from the CUDA documentation, CUDA kernels are asynchronous, so it seems that we should call cudaDevice...
Psilomelane asked 9/8, 2012 at 17:25
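The two common cases can be sketched as follows: a blocking cudaMemcpy on the default stream already waits for preceding kernels, while an explicit cudaDeviceSynchronize is needed when the host must observe completion without such a blocking call (e.g. before stopping a host-side timer). Illustrative only, error checks omitted:

```cuda
#include <cstdio>
#include <cuda_runtime.h>

__global__ void square(int *x) { x[threadIdx.x] *= x[threadIdx.x]; }

int main() {
    int h[4] = {1, 2, 3, 4}, *d;
    cudaMalloc(&d, sizeof(h));
    cudaMemcpy(d, h, sizeof(h), cudaMemcpyHostToDevice);

    square<<<1, 4>>>(d);  // launch is asynchronous: control returns at once

    // No explicit sync needed here: a blocking cudaMemcpy on the default
    // stream waits for preceding work on that stream before copying.
    cudaMemcpy(h, d, sizeof(h), cudaMemcpyDeviceToHost);
    printf("%d %d %d %d\n", h[0], h[1], h[2], h[3]);

    // The explicit form, needed e.g. before stopping a host-side timer
    // when no blocking copy intervenes:
    cudaDeviceSynchronize();
    cudaFree(d);
    return 0;
}
```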

7

Solved

In CUDA, there is a concept of a warp, which is defined as the maximum number of threads that can execute the same instruction simultaneously within a single processing element. For NVIDIA, this wa...
Coefficient asked 17/8, 2011 at 13:15

2

Solved

Under what circumstances should you use the volatile keyword with a CUDA kernel's shared memory? I understand that volatile tells the compiler never to cache any values, but my question is about th...
Marthena asked 11/3, 2013 at 4:2
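For context, the classic case where `volatile` mattered is the pre-Kepler warp-synchronous reduction: within a warp the threads run in lockstep, so `__syncthreads()` is skipped, and `volatile` forces the compiler to re-read shared memory instead of caching values in registers (on current GPUs, `__shfl_down_sync` is the preferred idiom). A sketch, assuming `sdata` holds at least 64 elements:

```cuda
__device__ void warp_reduce(volatile float *sdata, int tid) {
    sdata[tid] += sdata[tid + 32];
    sdata[tid] += sdata[tid + 16];
    sdata[tid] += sdata[tid + 8];
    sdata[tid] += sdata[tid + 4];
    sdata[tid] += sdata[tid + 2];
    sdata[tid] += sdata[tid + 1];
}
```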

2

Continuous integration services are wonderful for continually testing updates to packages for various languages. These include services like Travis-CI, Jenkins, and Shippable among many others. How...
Site asked 1/5, 2015 at 12:35

2

Solved

I know that nvidia-smi -l 1 will give the GPU usage every one second (similarly to the following). However, I would appreciate an explanation on what Volatile GPU-Util really means. Is that the num...
Gladsome asked 2/12, 2016 at 17:31
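Per the nvidia-smi documentation, GPU-Util is the percentage of time over the past sample period during which one or more kernels was executing, not the fraction of SMs occupied. The same counter can be sampled explicitly:

```shell
# Sample GPU and memory-controller utilization every second, as CSV.
nvidia-smi --query-gpu=utilization.gpu,utilization.memory --format=csv -l 1
```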

11

Solved

What features make OpenCL unique to choose over OpenGL with GLSL for calculations? Despite the graphics-related terminology and impractical datatypes, is there any real caveat to OpenGL? For examp...
Dorsman asked 26/10, 2011 at 18:57

1

With recent NVIDIA micro-architectures, there's a new (?) taxonomy of warp stall reasons / warp scheduler states. Two of the items in this taxonomy are: Short scoreboard - scoreboard dependency on...
Korney asked 9/2, 2021 at 17:14

2

I am a fairly new CUDA user. I'm practicing on my first CUDA application, where I try to accelerate the k-means algorithm using a GPU (GTX 670). Briefly, each thread works on a single point which is co...
Etrem asked 21/3, 2015 at 20:7

4

Solved

I understand there's an openCL C++ API, but I'm having trouble compiling my kernels... do the kernels have to be written in C? And then it's just the host code that's allowed to be written in C++? ...
Sporogenesis asked 7/7, 2016 at 17:29

© 2022 - 2025 — McMap. All rights reserved.