Pytorch is not detecting GPU

manusai.manoj · August 4, 2022, 10:27am

Hi All,

I was trying to use PyTorch with GPU in one VM installed with Ubuntu 18.04.

GPU is displaying nvidia-smi.

±----------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=============================================================================|
| No running processes found |
±----------------------------------------------------------------------------+

Installed following packages as well

conda list | grep -i cuda
cudatoolkit 11.3.1 h2bc3f7f_2
cudnn 8.2.1 cuda11.3_0
pytorch 1.12.0 py3.7_cuda11.3_cudnn8.3.2_0 pytorch
pytorch-mutex 1.0 cuda pytorch

I am getting following error while running script.

UserWarning: CUDA initialization: CUDA unknown error - this may be due to an incorrectly set up environment, e.g. changing env variable CUDA_VISIBLE_DEVICES after program start. Setting the available devices to be zero. (Triggered internally at /opt/conda/conda-bld/pytorch_1656352464346/work/c10/cuda/CUDAFunctions.cpp:109.)
return torch._C._cuda_getDeviceCount() > 0
Traceback (most recent call last):
File “test.py”, line 4, in
torch.cuda.get_device_name(0)
File “/home/ubuntu/miniconda3/lib/python3.7/site-packages/torch/cuda/init.py”, line 329, in get_device_name
return get_device_properties(device).name
File “/home/ubuntu/miniconda3/lib/python3.7/site-packages/torch/cuda/init.py”, line 359, in get_device_properties
_lazy_init() # will define _get_device_properties
File “/home/ubuntu/miniconda3/lib/python3.7/site-packages/torch/cuda/init.py”, line 217, in _lazy_init
torch._C._cuda_init()
RuntimeError: CUDA unknown error - this may be due to an incorrectly set up environment, e.g. changing env variable CUDA_VISIBLE_DEVICES after program start. Setting the available devices to be zero.

I have ran nvidia-bug-report.sh script as well but could interprept much from that.
Here I am attahced the result of the script.

nvidia-bug-report.log (277.8 KB)

Could any one please help to debug this ?

Topic		Replies	Views
There is no mismatch between CUDA version then why Is CUDA available: False? CUDA Programming and Performance cuda , cudnn	4	934	April 16, 2024
No devices were found - nvidia-smi Linux	0	382	June 10, 2023
Dell A100 gpu issues nvida driver Drivers - Linux, Windows, MacOS	2	892	October 31, 2023
NVIDIA-SMI Shows "No Devices Found" CUDA Setup and Installation	0	156	August 30, 2024
Unable to determine the device handle for GPU unknown error Linux cuda	0	690	April 2, 2022
CUDA driver initialization failed CUDA Setup and Installation cuda , ubuntu , python	0	1905	June 7, 2023
Unable to determine the device handle for GPU0000:C1:00.0: Unknown Error Linux cuda , kernel , ubuntu	4	506	February 23, 2024
Emergency problems about GPU Driver Jetson Xavier NX cuda , ubuntu , python	7	605	July 17, 2023
GPU doesn't respond after training with PyTorch Linux cuda , pytorch , python	5	734	December 28, 2023
A100 GPUs visible on nvidia-smi not visible for Pytorch or on cuda-samples Linux cuda	5	4568	October 12, 2021

Pytorch is not detecting GPU

Related topics