Could not create cudnn handle: CUDNN_STATUS_ALLOC_FAILED

flyingfishfusealt · November 30, 2019, 2:43pm

The stack overflow post about the error is :

https://stackoverflow.com/questions/59116872/could-not-create-cudnn-handle-cudnn-status-alloc-failed-on-a-project-that-sho

Like the link says, I just installed everything fresh, all the proper version, and its simply not working. I’ve looked through the code, its all good. I’ve been modifying it for my own purposes.

I don’t know where to ask, I feel like I’m stuck not being able to fully achieve my goals :(*

NVES_R · December 2, 2019, 7:32pm

Hi,

This could happen for a few reasons.

As you mentioned, it may be a a memory issue, which you could try to verify by allocating less memory to the GPU and seeing if that error still occurs. You can do this in TF 2.0 like so (https://github.com/tensorflow/tensorflow/issues/25138#issuecomment-484428798):

import tensorflow as tf
tf.config.gpu.set_per_process_memory_fraction(0.75)
tf.config.gpu.set_per_process_memory_growth(True)

# your model creation, etc.
model = MyModel(...)

I see the code you’re running sets dynamic memory growth if you have > 1 GPU (https://github.com/zzh8829/yolov3-tf2/blob/master/train.py#L46-L47), but since you only have 1 GPU, then it is likely just trying to allocate all memory (>90%) at the start.

Some users seem to have experienced this on Windows when there were other TensorFlow or similar processes using the GPU simultaneously, either by you or by other users: https://stackoverflow.com/a/53707323/10993413
As always, make sure your PATH variables are correct. Sometimes if you tried multiple installations and didn’t clean things up properly, the PATHs may be finding the wrong version first, and using that, causing an issue. If you add new paths to the beginning of PATH, they should be found first: https://www.tensorflow.org/install/gpu#windows_setup
As mentioned on your Stack Overflow post by another user, you could try upgrading to a newer version of CUDNN, though I’m not sure this will help since your config is listed as supported on TF docs: 從原始碼開始建構 | TensorFlow. If this does solve it, it may have been PATH issue after all since you will likely update the PATHs after installing the newer version.

satapathy.roshan · June 11, 2020, 10:34am

I had a similar issue with the same error(CUDA 10.0, Windows 10). I realized that it was happening because I was running the program from within an IDE. Once I ran the program from the cmd, it worked perfect.

teddyhaley1 · September 2, 2020, 8:21pm

This worked for me.

From the tensorflow 2.0 documentation: Use a GPU | TensorFlow Core

gpus = tf.config.experimental.list_physical_devices('GPU')
if gpus:
  try:
    # Currently, memory growth needs to be the same across GPUs
    for gpu in gpus:
      tf.config.experimental.set_memory_growth(gpu, True)
    logical_gpus = tf.config.experimental.list_logical_devices('GPU')
    print(len(gpus), "Physical GPUs,", len(logical_gpus), "Logical GPUs")
  except RuntimeError as e:
    # Memory growth must be set before GPUs have been initialized
    print(e)

clbroberts24 · October 7, 2020, 2:27am

I am receiving a similar error training a pix2pix model.
hardware:
1080ti in slot 1
2080ti in slot 2
msi tomahawk ac x299 mobo (both pcie x16 slots)

os: windows 10
tensorflow 1.15
Cuda10, CUDNN7.6

I only get the allocation error when trying to use the 2080 ti in the second pcie slot on my motherboard. I need it there for thermal reasons (the 1080 ti overheats with the 2080ti above it).
I have tried with just the 2080ti installed (this succeeded) as well as using cuda_visible_devices to select only the 2080ti when both were installed (this caused the error).
Is there some hardware limitation with allocating to the second pcie device?

frogfreak12 · November 10, 2020, 8:58pm

i love u man.

microscoper · January 30, 2021, 12:42pm

Thanks Man!! Works for me

Topic		Replies	Views
Could not create cudnn handle: CUDNN_STATUS_INTERNAL_ERROR cuDNN	3	8117	November 7, 2019
Issues with Tensorflow on CUDA10 and RTX2080 CUDA Setup and Installation	3	4368	March 6, 2019
tensorflow/stream_executor/cuda/cuda_dnn.cc:329 CUDA Setup and Installation	2	3683	February 18, 2020
"Failed to get convolution algorithm" problem cuDNN	4	8498	September 7, 2019
CUDA Error CUDA Setup and Installation	0	631	March 30, 2018
Tensoflow error: Could not create cudnn handle: CUDNN_STATUS_NOT_INITIALIZED CUDA Developer Tools	0	1272	January 25, 2021
Tensorflow issue:OP_REQUIRES failed at conv_ops_fused_impl.h:697 : Not found: No algorithm worked! cuDNN	3	8687	May 12, 2021
"ERROR: Could not create cudnn handle: CUDNN_STATUS_NOT_INITIALIZED" inside nvidia tensorflow docker CUDA Setup and Installation	3	1872	May 27, 2020
cuDNN Error cuDNN	1	938	April 25, 2019
CUDA_ERROR_OUT_OF_MEMORY: out of memory cuDNN cuda , tensorflow , windows-driver	1	1816	July 31, 2023

Could not create cudnn handle: CUDNN_STATUS_ALLOC_FAILED

Related topics