GTX 1660 Ti - Could not create cudnn handle: CUDNN_STATUS_INTERNAL_ERROR

Hi, we're having an issue running a number of models on a 1660 Ti. We tested on both Ubuntu 18.04.3 LTS and CentOS 7, and the error is “Could not create cudnn handle: CUDNN_STATUS_INTERNAL_ERROR”. The commonly suggested fix is to add “config.gpu_options.allow_growth = True”, which we did, but it doesn’t seem to help. We installed driver version 440.59.

import tensorflow as tf
import keras.backend as K

# Let GPU memory allocation grow as needed instead of pre-allocating
# the whole device, then register the session with Keras.
config = tf.ConfigProto()
config.gpu_options.allow_growth = True
session = tf.Session(config=config)
K.set_session(session)

Here is an example model that’s failing:
https://github.com/bedapudi6788/NudeNet/releases/download/v0/classifier_model

Hi,

This could be due to OOM. Could you try reducing the TF GPU memory fraction via config.gpu_options.per_process_gpu_memory_fraction? Something along these lines (the 0.3 value is just a placeholder; adjust it for your model):
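import tensorflow as tf
import keras.backend as K

# Cap this process at a fraction of the GPU's memory.
# 0.3 is only an example value, not a recommendation for this model.
config = tf.ConfigProto()
config.gpu_options.per_process_gpu_memory_fraction = 0.3
session = tf.Session(config=config)
K.set_session(session)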

Thanks

This didn’t seem to help. The part that’s really baffling me is that this exact same model works fine on a much lower-end P1000 GPU.

Hi,

Could you please share the sample repro script and model file so we can help better?

Also, can you provide details on the platforms you are using:
o CUDA version
o CUDNN version
o Python version [if using python]
o Tensorflow and PyTorch version
o TensorRT version

Thanks

Hi,

See answers below. The notebook is attached, and the model URL is in the original post

o CUDA version - CUDA Version: 10.0
o CUDNN version - CUDNN Version 7.6.2 (also tried 7.6.5, same result)
o Python version [if using python] - Python 3.6.8
o Tensorflow and PyTorch version - TF version: 1.15.0, no PyTorch
o TensorRT version - not installed
05_nudenet.zip (4.03 KB)

For what it’s worth, I’m experiencing the same problem with a laptop that has a 1660 Ti.

Thanks for the repro. When I run it on a 32GB V100, Keras grabs 95% of the GPU memory regardless of whether K.set_session() is called.

This looks like a problem in Keras, and there is an existing issue tracking it: https://github.com/keras-team/keras/issues/11584
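
One thing that may be worth trying (a sketch only, assuming the root cause is that Keras creates its own session before K.set_session() takes effect) is to configure and register the session before any Keras model is constructed or loaded:

import tensorflow as tf
import keras.backend as K

# Configure and register the session first, before any Keras layers or
# models exist, so Keras does not spin up its own default session.
config = tf.ConfigProto()
config.gpu_options.allow_growth = True
K.set_session(tf.Session(config=config))

# Only then load the model (path is illustrative; point it at the
# classifier_model file from the original post).
from keras.models import load_model
model = load_model('classifier_model')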