CUDA initialization error in deep learning frameworks (TensorFlow, PyTorch) and deviceQuery

Hi,

I have a problem using deep learning frameworks (TensorFlow, PyTorch) with my GPUs.
This is my setup:

  • 4 x A100-SXM 80 GB
  • NVIDIA driver version: 470.57.02
  • CUDA version: 11.0

I installed the NVIDIA driver and CUDA from the official runfiles; in particular, I installed a different version of the CUDA toolkit for framework compatibility.

Both the nvidia-smi and nvcc -V commands work fine.

However, I get a CUDA initialization error in both frameworks.
When I try to run deviceQuery from the CUDA samples, it fails with the same issue :(
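
For reference, this is the kind of minimal check that reproduces the error on my machine (a rough sketch; it assumes the standard pip wheels of both frameworks are installed):

```python
# Minimal reproduction sketch: both calls fail / report no GPU on my setup.
import torch
import tensorflow as tf

# PyTorch: prints a CUDA initialization warning and returns False when it fails
print("torch.cuda.is_available():", torch.cuda.is_available())

# TensorFlow: returns an empty list when CUDA cannot be initialized
print("GPUs visible to TF:", tf.config.list_physical_devices("GPU"))
```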

Here is a screenshot of the issue.

Please help me solve this issue! 🙏

Hello @line.k0
In the screenshot, the driver’s CUDA version is 11.4 (top right corner of the nvidia-smi output).

Can you downgrade your driver to a version matching CUDA 11.0, or upgrade the frameworks’ packages to builds for CUDA 11.4?

Additionally, the outputs of your nvidia-smi and nvcc commands show different CUDA versions. Did you upgrade your GPU driver recently?
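
A quick way to check which CUDA version your framework builds expect (a rough sketch, assuming the standard pip packages of both frameworks and TensorFlow 2.x):

```python
# Sketch: print the CUDA runtime versions the installed wheels were built
# against, to compare with the driver's CUDA version shown by nvidia-smi.
import torch
import tensorflow as tf

print("PyTorch built for CUDA:", torch.version.cuda)
print("TensorFlow built for CUDA:", tf.sysconfig.get_build_info().get("cuda_version"))
```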

Regards

Hi @mehmetdeniz!
Thank you for your reply.

I tried both of the options you suggested.

  1. It still shows the same issue with the lower NVIDIA driver version and with CUDA 11.4.
    I attach a screenshot of my trial with CUDA 11.4.

I also wonder whether a single driver version can cover multiple CUDA versions. The release notes indicate that the driver only needs to be at or above the minimum version required by the toolkit. https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html

  2. I installed the NVIDIA driver first, then downgraded to CUDA 11.0 for the framework.
    I found that it doesn’t matter that nvidia-smi and nvcc report different CUDA versions (see the sketch after this list).
    https://stackoverflow.com/questions/53422407/different-cuda-versions-shown-by-nvcc-and-nvidia-smi
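
Since the version mismatch does not seem to be the cause, this is the kind of minimal check I run to surface the underlying error text instead of just a False result (a rough sketch; it assumes the standard PyTorch pip wheel, and the exact message depends on the build):

```python
# Force CUDA context creation so the real initialization error is raised
# as an exception rather than swallowed into a warning.
import torch

try:
    torch.cuda.init()                     # forces CUDA initialization
    print(torch.cuda.get_device_name(0))  # should print the A100 name if it works
except RuntimeError as err:
    print("CUDA initialization failed:", err)
```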