Error: CUDNN_STATUS_EXECUTION_FAILED

mhm_nab · June 25, 2020, 7:09am

Hi,
I was following one of the online tutorials but I was getting this error:
Traceback (most recent call last):
File “ssd_object_detection.py”, line 20, in
detections = net.forward()
cv2.error: OpenCV(4.3.0) /home/blah/opencv/modules/dnn/src/layers/…/cuda4dnn/primitives/…/csl/cudnn/convolution.hpp:461: error: (-217:Gpu API call) CUDNN_STATUS_EXECUTION_FAILED in function ‘convolve_with_bias_activation’

It a python script and this is the command:
python ssd_object_detection.py --prototxt MobileNetSSD_deploy.prototxt
–model MobileNetSSD_deploy.caffemodel
–input guitar.mp4 --output output.avi
–display 0 --use-gpu 1

This is my configuration:

Jetson Nano device
Ubuntu 18.04
/usr/local/cuda/bin/nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2019 NVIDIA Corporation
Built on Wed_Oct_23_21:14:42_PDT_2019
Cuda compilation tools, release 10.2, V10.2.89

– NVIDIA CUDA: YES (ver 10.2, CUFFT CUBLAS FAST_MATH)
– NVIDIA GPU arch: 53
– NVIDIA PTX archs:
– cuDNN: YES (ver 8.0)
– NVIDIA CUDA: YES (ver 10.2, CUFFT CUBLAS FAST_MATH)
– NVIDIA GPU arch: 53
– cuDNN: YES (ver 8.0)

opencv 4.3.0 built from source with OPENCV_DNN_CUDA=ON, CUDNN_VERSION=‘8.0’, WITH_CUDA=ON, WITH_CUDNN=ON, and many other settings enabled
Python 3.7.7

This is the code I am trying to run (it completes successfully if I don’t use the GPU). It fails at the line detections = net.forward()

CLASSES = [“background”, “aeroplane”]
COLORS = np.random.uniform(0, 255, size=(len(CLASSES), 3))

net = cv2.dnn.readNetFromCaffe(args[“prototxt”], args[“model”])
print(“[INFO] setting preferable backend and target to CUDA…”)
net.setPreferableBackend(cv2.dnn.DNN_BACKEND_CUDA)
net.setPreferableTarget(cv2.dnn.DNN_TARGET_CUDA)

print(“[INFO] accessing video stream…”)
vs = cv2.VideoCapture(args[“input”] if args[“input”] else 0)
writer = None
fps = FPS().start()

while True:
(grabbed, frame) = vs.read()
frame = imutils.resize(frame, width=400)
(h, w) = frame.shape[:2]
blob = cv2.dnn.blobFromImage(frame, 0.007843, (300, 300), 127.5)
net.setInput(blob)
detections = net.forward()

for i in np.arange(0, detections.shape[2]):
confidence = detections[0, 0, i, 2]
if confidence > args[“confidence”]:
idx = int(detections[0, 0, i, 1])
box = detections[0, 0, i, 3:7] * np.array([w, h, w, h])
(startX, startY, endX, endY) = box.astype(“int”)
  	label = "{}: {:.2f}%".format(CLASSES[idx],
  		confidence * 100)
  	cv2.rectangle(frame, (startX, startY), (endX, endY),
  		COLORS[idx], 2)
  	y = startY - 15 if startY - 15 > 15 else startY + 15
  	cv2.putText(frame, label, (startX, y),
  		cv2.FONT_HERSHEY_SIMPLEX, 0.5, COLORS[idx], 2)
if args[“display”] > 0:
cv2.imshow(“Frame”, frame)
key = cv2.waitKey(1) & 0xFF
  if key == ord("q"):
  	break
if args[“output”] != “” and writer is None:
fourcc = cv2.VideoWriter_fourcc(*“MJPG”)
writer = cv2.VideoWriter(args[“output”], fourcc, 30,
(frame.shape[1], frame.shape[0]), True)

if writer is not None:
writer.write(frame)

fps.update()

fps.stop()
print(“[INFO] elasped time: {:.2f}”.format(fps.elapsed()))
print(“[INFO] approx. FPS: {:.2f}”.format(fps.fps()))

Thanks!

AastaLLL · June 29, 2020, 2:17am

Hi,

It looks like your OpenCV inference the model with Caffe frameworks.

Suppose you have built Caffe from source on your environment first.
Please check that if you have built the library with correct architecture (sm_53) for Nano GPU.

Thanks.

blowbuz · March 15, 2021, 11:56pm

I’m using opencv 4.2.0 and have two Nvidia P5000 installed. getCudaEnabledDeviceCount() returns 2.
In the case of the first GPU(setDevice(0)) :

inputNet.setPreferableBackend(cv::dnn::DNN_BACKEND_CUDA);
inputNet.setPreferableTarget(cv::dnn::DNN_TARGET_CUDA);
runs without any errors.

In the case of the second gpu(setDevice(1)):
I get (-217:Gpu API call) CUDNN_STATUS_EXECUTION_FAILED in function ‘convolve’

kayccc · March 24, 2021, 5:44am

Hi blowbuz,

Here is for Jetson platform, for dGPU, please open topic at CUDA Programming and Performance - NVIDIA Developer Forums

Topic		Replies	Views
Cannot open display: [TensorRT] ERROR: 1: [convolutionRunner.cpp::executeConv::458] Error Code 1: Cudnn (CUDNN_STATUS_EXECUTION_FAILED) Jetson Nano tensorrt	3	1243	April 20, 2022
CUDNN failure 8: CUDNN_STATUS_EXECUTION_FAILED ; GPU=0 ; hostname=ubuntu CUDA Setup and Installation cudnn , jetson	1	150	January 8, 2026
CUDNN_STATUS_EXECUTION_FAILED cuDNN	2	3816	May 22, 2020
CUDNN_STATUS_EXECUTION_FAILED with cuda8.0.61 + cudnn6.0.21 + ubuntu16.04 + Caffe-BVLC cuDNN	0	1677	July 9, 2018
cudnn_conv_layer.cpp:53] Check failed: status == CUDNN_STATUS_SUCCESS (4 vs. 0) CUDNN_STATUS_INTERNAL_ERROR Jetson TX2	7	9873	October 18, 2021
How to solve cudnn_conv_layer.cpp:53] Check failed: status == CUDNN_STATUS_SUCCESS? Jetson AGX Xavier	2	1121	October 18, 2021
CuDNN error on Jetson Nano :: "reason: CUDNN_STATUS_NOT_SUPPORTED" Jetson Nano cudnn	6	1669	October 15, 2021
[convolutionRunner.cpp::executeConv::458] Error Code 1: Cudnn (CUDNN_STATUS_EXECUTION_FAILED) Jetson Nano jetson-inference	8	688	March 26, 2024
crash when using multi-GPU on caffe. CUDNN_STATUS_EXECUTION_FAILED cuDNN	3	1163	February 22, 2019
ERROR: Check failed: status == CUDNN_STATUS_SUCCESS (8 vs. 0) CUDNN_STATUS_EXECUTION_FAILED, device 0 TensorRT cuda , tensorflow , jetson-inference	1	1156	May 10, 2021

Error: CUDNN_STATUS_EXECUTION_FAILED

Related topics