cuda-streams Questions

2

Solved

I have a CUDA stream which someone handed to me - a cudaStream_t value. The CUDA Runtime API does not seem to indicate how I can obtain the index of the device with which this stream is associated....
Huh asked 17/7, 2015 at 11:28

1

Solved

Glancing from the official NVIDIA Multi-Process Server docs, it is unclear to me how it interacts with CUDA streams. Here's an example: App 0: issues kernels to logical stream 0; App 1: issues k...
Horrific asked 7/3, 2018 at 23:35

2

Solved

We are having performance issues when using the CUDA Dynamic Parallelism. At this moment, CDP is performing at least 3X slower than a traditional approach. We made the simplest reproducible code to...
Orontes asked 19/7, 2017 at 21:10

2

Solved

From K20 different streams becomes fully concurrent(used to be concurrent on the edge). However My program need the old way. Or I need to do a lot of synchronization to solve the dependency proble...
Escheat asked 11/2, 2013 at 9:53

2

Solved

For my CUDA development, I am using a machine with 16 cores, and 1 GTX 580 GPU with 16 SMs. For the work that I am doing, I plan to launch 16 host threads (1 on each core), and 1 kernel launch per ...
Knuckleduster asked 6/9, 2012 at 5:56

1

This question is related to using cuda streams to run many kernels In CUDA there are many synchronization commands cudaStreamSynchronize, CudaDeviceSynchronize, cudaThreadSynchronize, and also cud...
Fulminant asked 14/8, 2012 at 13:48

2

Solved

I have something very similar to the code: int k, no_streams = 4; cudaStream_t stream[no_streams]; for(k = 0; k < no_streams; k++) cudaStreamCreate(&stream[k]); cudaMalloc(&g_in, size1...
Rysler asked 20/5, 2011 at 10:10
1

© 2022 - 2024 — McMap. All rights reserved.