cuda-streams Questions
2
Solved
I have a CUDA stream which someone handed to me - a cudaStream_t value. The CUDA Runtime API does not seem to indicate how I can obtain the index of the device with which this stream is associated....
Huh asked 17/7, 2015 at 11:28
1
Solved
Glancing from the official NVIDIA Multi-Process Server docs, it is unclear to me how it interacts with CUDA streams.
Here's an example:
App 0: issues kernels to logical stream 0;
App 1: issues k...
Horrific asked 7/3, 2018 at 23:35
2
Solved
We are having performance issues when using the CUDA Dynamic Parallelism. At this moment, CDP is performing at least 3X slower than a traditional approach.
We made the simplest reproducible code to...
Orontes asked 19/7, 2017 at 21:10
2
Solved
From K20 different streams becomes fully concurrent(used to be concurrent on the edge).
However My program need the old way. Or I need to do a lot of synchronization to solve the dependency proble...
Escheat asked 11/2, 2013 at 9:53
2
Solved
For my CUDA development, I am using a machine with 16 cores, and 1 GTX 580 GPU with 16 SMs. For the work that I am doing, I plan to launch 16 host threads (1 on each core), and 1 kernel launch per ...
Knuckleduster asked 6/9, 2012 at 5:56
1
This question is related to using cuda streams to run many kernels
In CUDA there are many synchronization commands
cudaStreamSynchronize,
CudaDeviceSynchronize,
cudaThreadSynchronize,
and also cud...
Fulminant asked 14/8, 2012 at 13:48
2
Solved
I have something very similar to the code:
int k, no_streams = 4;
cudaStream_t stream[no_streams];
for(k = 0; k < no_streams; k++) cudaStreamCreate(&stream[k]);
cudaMalloc(&g_in, size1...
Rysler asked 20/5, 2011 at 10:10
1
© 2022 - 2024 — McMap. All rights reserved.