Tensorflow issue:OP_REQUIRES failed at conv_ops_fused_impl.h:697 : Not found: No algorithm worked!

I’m trying to run a simple CNN TensorFlow model on Ubuntu 20.04, on a Razer Blade 15 laptop with an RTX 2060 Max-P (6 GB of GDDR6).
The model fails to run and I get the following error:
OP_REQUIRES failed at conv_ops_fused_impl.h:697 : Not found: No algorithm worked!
I have the NVIDIA driver (460.32.3) installed via Ubuntu; nvcc reports:
nvcc: NVIDIA ® Cuda compiler driver
Copyright © 2005-2020 NVIDIA Corporation
Built on Wed_Jul_22_19:09:09_PDT_2020
Cuda compilation tools, release 11.0, V11.0.221
Build cuda_11.0_bu.TC445_37.28845127_0
I also installed cudnn from cudnn-11.0-linux-x64-v8.0.5.39.
The installed TensorFlow is 2.4.1.
In my zshrc I have added:
export LD_LIBRARY_PATH="$LD_LIBRARY_PATH:/usr/local/cuda/lib64"
export CUDA_HOME=/usr/local/cuda
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/cuda/extras/CUPTI/lib64

The only way to get things working is to add:
export TF_FORCE_GPU_ALLOW_GROWTH=true
in my .zshrc, as suggested here.
This is really strange, as I’d assume 6 GB of GDDR6 is enough for this CNN model.
I also tried to run ai-benchmark. Without the environment variable I was not able to finish the tests; with it set as shown above I get several warnings about not having enough memory to allocate, but I am able to finish all tests.
Watching nvidia-smi while running the notebook code for the CNN model, I noticed that with the variable set the GDDR6 allocated is quite small, while it is near the limit of the available memory when the variable is not set.
What’s the problem? Any suggestion to fix it?


Hi @ferlito.sergio,
If you are using cuDNN, this might be related to the large workspace required by cuDNN algorithms.
The link below should help you:

Thanks!
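On the cuDNN workspace point: one knob sometimes suggested is `TF_CUDNN_WORKSPACE_LIMIT_IN_MB`, which caps the scratch memory the convolution autotuner may request. Whether your TensorFlow build honors it depends on the version, and 1024 here is just an example value, so treat this as a sketch to experiment with:

```shell
# Cap cuDNN's autotune workspace (value in MB) before launching the
# training script; some TF builds read this variable in their conv ops.
export TF_CUDNN_WORKSPACE_LIMIT_IN_MB=1024
echo "$TF_CUDNN_WORKSPACE_LIMIT_IN_MB"
```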


Hi!

I think I’m having the same problem, this time with Nvidia TLT container tlt-streamanalytics:yuw-v2 and a tlt_pretrained_detectnet_v2:resnet18 model, on a GeForce RTX 2060.
(I’m working on this tutorial: Implementing a Real-time, AI-Based, Face Mask Detector Application for COVID-19 )

print(tf.__version__)

Tensorflow version: 1.15.2

!nvcc --version

nvcc: NVIDIA ® Cuda compiler driver
Copyright © 2005-2020 NVIDIA Corporation
Built on Wed_Jul_22_19:09:09_PDT_2020
Cuda compilation tools, release 11.0, V11.0.221
Build cuda_11.0_bu.TC445_37.28845127_0

I’m using a different dataset, but everything seems OK with the tlt-dataset-convert operation.

Now the training fails with the following error (see tlt-train.log (38.3 KB) )

OP_REQUIRES failed at conv_grad_filter_ops.cc:1038 : Not found: No algorithm worked!

I tried !export TF_FORCE_GPU_ALLOW_GROWTH=true inside my notebook, but it does not fix the problem.
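For what it’s worth, `!export VAR=...` in a Jupyter cell runs in a throwaway subshell, so the kernel process (and the TensorFlow it imports) never sees the variable. A minimal sketch of setting it in the kernel process itself, before TensorFlow is imported:

```python
import os

# Set the variable in the notebook's own process, *before* importing
# TensorFlow; a `!export ...` cell only affects a short-lived subshell.
os.environ["TF_FORCE_GPU_ALLOW_GROWTH"] = "true"

# import tensorflow as tf  # import only after the variable is set
print(os.environ["TF_FORCE_GPU_ALLOW_GROWTH"])
```

(`%env TF_FORCE_GPU_ALLOW_GROWTH=true` is an equivalent IPython magic.)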

I also found this snippet, which does not fix the problem either:

from tensorflow.compat.v1 import ConfigProto
from tensorflow.compat.v1 import InteractiveSession

config = ConfigProto()
config.gpu_options.allow_growth = True
session = InteractiveSession(config=config)

And this command, which raises an error:

physical_devices = tf.config.list_physical_devices('GPU') 
tf.config.experimental.set_memory_growth(physical_devices[0], True)

AttributeError: module 'tensorflow._api.v1.config' has no attribute 'list_physical_devices'

Can you please help me?

EDIT (SOLVED):

For TF 1.15, the working form of the last command was:

from tensorflow.config.experimental import list_physical_devices, set_memory_growth
physical_devices = list_physical_devices('GPU')
set_memory_growth(physical_devices[0], True)

And now it works!


Adding the code below right after the imports solved the issue for me! Thanks!

physical_devices = tf.config.list_physical_devices('GPU')
tf.config.experimental.set_memory_growth(physical_devices[0], True)