TensorFlow 2.3.1 + CUDA 10.1.105 + cuDNN 7.6.5.32: failed to create cublas handle: CUBLAS_STATUS_ALLOC_FAILED

I am currently following a tutorial to generate a model for handwritten text recognition: https://github.com/arthurflor23/handwritten-text-recognition (Handwritten Text Recognition (HTR) using TensorFlow 2.x).

To do so, I have the following hardware:
CPU: Intel i7-9700K
GPU: RTX 2070 Super
OS: Windows 10

After reading online about using CUDA and cuDNN with TensorFlow 2.3, the generally recommended versions seem to be CUDA 10.1 (I installed 10.1.105 from the NVIDIA downloads) and cuDNN 7.6 (I installed 7.6.5.32, also from the NVIDIA downloads).
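As a quick sanity check on that pairing, one option (a minimal sketch, assuming a standard TF 2.3 pip install; nothing here comes from the tutorial) is to ask TensorFlow itself what it was built against and whether it can see the GPU:

    import tensorflow as tf

    print("TF version:     ", tf.__version__)
    print("Built with CUDA:", tf.test.is_built_with_cuda())
    print("Visible GPUs:   ", tf.config.list_physical_devices("GPU"))
    # Reports the CUDA/cuDNN versions the wheel was compiled against;
    # if your build does not expose this, just skip the line.
    print("Build info:     ", tf.sysconfig.get_build_info())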

The above tutorial uses Google Colab but I’m relatively certain that my GPU is going to be faster than the ones available on Google Colab.

To install the CUDA version, I visited the following URL: https://developer.nvidia.com/cuda-10.1-download-archive-base?target_os=Windows&target_arch=x86_64&target_version=10&target_type=exenetwork

and to download the cuDNN version, I followed this one: https://developer.nvidia.com/compute/machine-learning/cudnn/secure/7.6.5.32/Production/10.1_20191031/cudnn-10.1-windows10-x64-v7.6.5.32.zip

With all this installed, and after various restarts to be sure, I added the required paths to my PATH variable following the instructions in the "Install TensorFlow with pip" guide (https://www.tensorflow.org/install/pip).
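One way to confirm the PATH entries actually work is to try loading the DLLs TensorFlow 2.3 needs on Windows (a sketch; the file names below are the ones shipped with CUDA 10.1 / cuDNN 7.6 and would need adjusting for other versions):

    import ctypes

    for dll in ("cudart64_101.dll", "cublas64_10.dll", "cudnn64_7.dll"):
        try:
            ctypes.WinDLL(dll)          # resolves through PATH on Windows
            print("OK      ", dll)
        except OSError as e:
            print("MISSING ", dll, "->", e)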

I then proceeded to follow the Jupyter notebook instructions provided with the GitHub repository, making sure to install the requirements. One issue: I needed to install numpy 1.16.0, since the current 1.19 release is incompatible with TensorFlow.
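For completeness, a tiny check (a sketch, not part of the notebook) that the pinned versions ended up in the environment:

    import numpy
    import tensorflow as tf

    print("numpy:     ", numpy.__version__)   # expecting 1.16.0 here
    print("tensorflow:", tf.__version__)      # expecting 2.3.1 here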

With all this, I run the cells (excluding the Google Colab cell since I am running this locally). I additionally added the following code to the first TensorFlow cell since I noticed a lot of debate on the memory growth option:
    gpus = tf.config.experimental.list_physical_devices('GPU')
    if gpus:
        try:
            for gpu in gpus:
                tf.config.experimental.set_memory_growth(gpu, True)
        except RuntimeError as e:
            print(e)
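Memory growth only stops TensorFlow from grabbing all of the VRAM up front. An alternative sometimes suggested for CUBLAS_STATUS_ALLOC_FAILED is to cap the memory TensorFlow may allocate so cuBLAS still has room for its own handle; a sketch of that (the 6144 MB figure is just an example for an 8 GB card, not something from the tutorial):

    gpus = tf.config.experimental.list_physical_devices('GPU')
    if gpus:
        try:
            # Must be called before the GPU has been initialized
            tf.config.experimental.set_virtual_device_configuration(
                gpus[0],
                [tf.config.experimental.VirtualDeviceConfiguration(memory_limit=6144)])
        except RuntimeError as e:
            print(e)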

Everything works fine until the training cell, at which point I get the failed to create cublas handle: CUBLAS_STATUS_ALLOC_FAILED message from the title in the console.

I’ll spare the details of my attempt at running this outside of Jupyter, which gave the same results.

I’ve tried to find answers online, but they all revolve around TensorFlow 1.x, which I’m not using. I’m wondering if maybe my GPU simply doesn’t have enough memory to run the training.
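One way to test that hypothesis before training (a sketch; it just shells out to nvidia-smi) is to check how much VRAM is actually free, since on Windows the desktop itself already holds part of the 8 GB:

    import subprocess

    out = subprocess.run(
        ["nvidia-smi",
         "--query-gpu=name,memory.total,memory.used,memory.free",
         "--format=csv"],
        capture_output=True, text=True)
    print(out.stdout)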

Thanks in advance to anyone who may help!

PS: Yes, I am a new member, so I may have numerous changes to make to the post/code.

Hi @anon3978859,
Which TRT version are you using here?
We recommend using the latest release.
To avoid system dependency issues, we suggest using the NGC container.

Please share the code and model in case the issue persists.

Thanks!