cuda Questions

4

OS: Ubuntu 22.04.1 Python 3.8.1 (Conda) GPU: RTX4090 Nvidia driver: 530.30.02 When I set the environment of Deep Learning, I found that in pytorch, the torch.cuda.is_available() function is alway...
N asked 24/4, 2023 at 3:48

1

Solved

UDPATE I found an error in my code. I was running my render function in sub-blocks from years ago and forgot I had set it as such. So it was calling the GPU read function far more often than I thou...
Presentational asked 17/8 at 19:21

1

I am attempting to launch thrust::fill on two different device vectors in parallel on different CUDA streams. However, when I look at the kernel launches in NSight Systems, they appear to be serial...
Aldarcy asked 22/7 at 18:28

7

I have ubuntu 18.04, and accidentally installed cuda 9.1 to run Tensorflow-gpu, but it seems tensorflow-gpu requires cuda 10.0, so I want to remove cuda first by executing: martin@nlp-server:~$ su...
Phelloderm asked 3/6, 2019 at 16:44

9

Solved

so I've been successfully working on my CUDA program on my Linux but I would like to support Windows platform as well. However, I've been struggling with correctly compiling it. I use : Windows 1...
Cymophane asked 17/6, 2019 at 18:9

3

Solved

CUDA document does not specific how many CUDA process can share one GPU. For example, if I launch more than one CUDA programs by the same user with only one GPU card installed in the system, what i...
Columbarium asked 27/7, 2015 at 0:55

3

Solved

I have just upgraded Visual Studio 2022 to the latest version 17.10, and found that my program using CUDA v12.0 does not compile because of the error in NVIDIA GPU Computing Toolkit\CUDA\v12.0\incl...
Eliseoelish asked 22/5 at 7:35

12

Solved

I got the message: "cutilCheckMsg() CUTIL CUDA error : kernel launch failure : CUDA driver version is insufficient for CUDA runtime version." While trying to run an example source code. Als...
Flex asked 15/7, 2010 at 7:13

1

The task is like How to set bits of a bit vector efficiently in parallel?, but for CUDA. Consider a bit vector of N bits in it (N is large, e.g. 4G) and an array of M numbers (M is also large, e.g....
Picofarad asked 27/7, 2022 at 7:12

11

I got the following error when I ran my PyTorch deep learning model in Google Colab /usr/local/lib/python3.6/dist-packages/torch/nn/functional.py in linear(input, weight, bias) 1370 ret = torch.ad...
Seagoing asked 28/4, 2020 at 5:39

5

I am a newbie to Thrust. I see that all Thrust presentations and examples only show host code. I would like to know if I can pass a device_vector to my own kernel? How? If yes, what are the operat...
Crissman asked 1/4, 2011 at 8:14

3

Solved

I am trying to compile some CUDA and I wish to display compiler warnings. Equivalent to: g++ fish.cpp -Wall -Wextra Except NVCC doesn't understand these, and you have to pass them through: nvcc...
Lawler asked 20/12, 2019 at 12:24

8

Solved

When I run my code with TensorFlow directly, everything is normal. However, when I run it in a screen window, I get the following error. ImportError: libcuda.so.1: cannot open shared object file: N...
Illinois asked 18/1, 2019 at 7:39

8

Solved

I'm trying to compile a cuda test program on Windows 7 via Command Prompt, I'm this command: nvcc test.cu But all I get is this error: nvcc fatal : Cannot find compiler 'cl.exe' in PATH What...
Orthodoxy asked 14/11, 2011 at 17:49

2

Solved

I'm trying to run stylegan2 on Google Colab but with all the files on my Drive and avoiding using !git clone from the github of stylegan2. Here is my code on the specific cell: %tensorflow_version ...
Hirschfeld asked 2/7, 2020 at 13:20

5

My laptop doesn't have a nVidia graphic cards, and I want to work on CUDA. The website says that CUDA can be used in emulation mode on non-cuda hardware too. But when I tried installing CUDA driver...
Monteith asked 18/11, 2009 at 5:0

3

Solved

I would like to determine the maximum int value in a CUDA kernel. Unfortunately I can't find anything similar to std::numeric_limits for CUDA. Trying to use the ::std function results in a error: ...
Mycah asked 30/6, 2014 at 20:6

5

Solved

When a computer has multiple CUDA-capable GPUs, each GPU is assigned a device ID. By default, CUDA kernels execute on device ID 0. You can use cudaSetDevice(int device) to select a different device...
Biddle asked 8/12, 2012 at 20:42

4

Solved

I successfully installed CUDA 8.0 in my PC and I can see its files by running the following commands in my Ubuntu 16.10: $ sudo find / -name nvcc /usr/local/cuda-8.0/bin/nvcc $ sudo find / -name...
Magnesia asked 23/12, 2016 at 20:56

15

Solved

I have searched many places but ALL I get is HOW to install it, not how to verify that it is installed. I can verify my NVIDIA driver is installed, and that CUDA is installed, but I don't know how ...

2

I have a question on the eigen-decomposition of hundreds of small matrices using CUDA. I need to calculate the eigenvalues and eigenvectors of hundreds (e.g. 500) of small (64-by-64) real symmetr...
Dail asked 9/7, 2012 at 18:51

8

Solved

I am using Clion to develop a cuda program. The code highlight works fine when the extension is .h. However, when it is changed to .cuh, Clion just consider the new file a plain text file, and I ha...
Anitraaniweta asked 11/10, 2016 at 15:13

6

Solved

Quite often, I get the CUDA library to completely fail and return with an error 46 ("all CUDA-capable devices are busy or unavailable") even for simple calls like cudaMalloc. The code runs successf...
Onomastics asked 6/8, 2011 at 11:28

3

Solved

How can I calculate FLOPS of my application? If I have the total number of executed instructions, I can divide it by the execution time. But, how to count the number of executed instructions? My q...
Terris asked 30/9, 2012 at 9:2

5

Solved

Windows Version: Windows 10 Pro 21H2 19044.1706 GPU: rtx2070 import tensorflow as tf import torch print(torch.__version__) #1.10.1+cu113 print(torch.version.cuda) #11.3 print(tf.__version__) #2.9.1...
Crossland asked 4/6, 2022 at 11:36

© 2022 - 2024 — McMap. All rights reserved.