CUDA Runtime Error when inferring ONNX model

Description

We developed a version of YOLOv3 in ONNX format to run inference with TensorRT.
After serializing the model, we randomly get this CUDA error:
1: [gatherRunner.cpp::execute::104] Error Code 1: Cuda Runtime (invalid configuration argument)
during execution of the “enqueueV2” call.
This error occurs with different GPUs and on different computers.

If you have any ideas that could help us resolve this issue, thank you.

Environment

TensorRT Version: 8.0.0
GPU Type: 1080 / 2080 Ti
Nvidia Driver Version: 465.19.01
CUDA Version: 11.3
CUDNN Version: 8.2.0.53
Operating System + Version: Ubuntu 18.04 (Docker)
Python Version (if applicable):
TensorFlow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag): nvidia/cuda:11.3.0-cudnn8-devel-ubuntu18.04

Hi,
Request you to share the ONNX model and the script if not shared already so that we can assist you better.
Alongside, you can try a few things:
https://docs.nvidia.com/deeplearning/tensorrt/quick-start-guide/index.html#onnx-export

  1. Validate your model with the snippet below.

check_model.py

import onnx

filename = "your_model.onnx"  # path to your ONNX model
model = onnx.load(filename)
onnx.checker.check_model(model)
  2. Try running your model with the trtexec command.
https://github.com/NVIDIA/TensorRT/tree/master/samples/opensource/trtexec
In case you are still facing the issue, please share the trtexec --verbose log for further debugging.
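For reference, a typical invocation that builds an engine and captures a verbose log might look like this (model.onnx is a placeholder for your model file):

```shell
# Build a TensorRT engine from the ONNX model and save the verbose log.
# "model.onnx" is a placeholder path; adjust it to your model file.
trtexec --onnx=model.onnx --verbose 2>&1 | tee trtexec_verbose.log
```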
Thanks!

Hi. Thanks for your answer.
Unfortunately, I cannot share my model because it is the property of my company.

I tried the check_model.py snippet and I get this error:

Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/local/lib/python3.6/dist-packages/onnx/checker.py", line 104, in check_model
    C.check_model(protobuf_string)
onnx.onnx_cpp2py_export.checker.ValidationError: No Op registered for BatchedNMSDynamic_TRT with domain_version of 11

==> Context: Bad node spec for node. Name: onnx_graphsurgeon_node_0 OpType: BatchedNMSDynamic_TRT

Otherwise, running the trtexec command does not reveal any issue with my model.

Hi,

We recommend making sure you are using enqueueV2 correctly.
Please refer to the following sample for reference:

BatchedNMS plugin - TensorRT/plugin/batchedNMSPlugin at master · NVIDIA/TensorRT · GitHub

Also, we recommend trying the latest TensorRT version, as the BatchedNMS plugin has been updated.

Thank you.