Trying to run a TensorRT project on AWS, getting a CUDA initialization error

Description

Hello,
I am trying to verify that my TensorRT installation is working, but I get the following error:

python3
>>> import tensorrt
>>> print(tensorrt.__version__)
>>> assert tensorrt.Builder(tensorrt.Logger())

[TensorRT] ERROR: CUDA initialization failure with error 222. Please check your CUDA installation: http://docs.nvidia.com/cuda/cuda-installation-guide-linux/index.html
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
TypeError: pybind11::init(): factory function returned nullptr

This error code is supposed to correspond to "cudaErrorUnsupportedPtxVersion", which I'm not sure how to solve.
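For context, cudaErrorUnsupportedPtxVersion (error 222) typically means the CUDA libraries in use were built against a newer toolkit than the installed driver supports. A minimal sketch of that check, using minimum Linux driver versions taken (approximately) from the CUDA Toolkit release notes:

```python
# Approximate minimum Linux driver versions per CUDA toolkit, from the
# CUDA release notes compatibility table (treat as reference values only).
MIN_DRIVER = {
    "10.0": (410, 48),
    "10.2": (440, 33),
    "11.0": (450, 36),
    "11.1": (455, 23),
}

def driver_supports(cuda_version: str, driver: str) -> bool:
    """Return True if the installed driver meets the minimum for this CUDA toolkit."""
    major, minor = (int(x) for x in driver.split(".")[:2])
    return (major, minor) >= MIN_DRIVER[cuda_version]

# Driver 450.80.02 meets the minimum for CUDA 11.0 ...
print(driver_supports("11.0", "450.80.02"))  # True
# ... but not for binaries built against CUDA 11.1, hence error 222.
print(driver_supports("11.1", "450.80.02"))  # False
```

This would explain why a 450-series driver fails with libraries from a CUDA 11.1 build even though CUDA 11.0 applications run fine.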

Environment

TensorRT Version: 7.2.1.6
GPU Type:
Nvidia Driver Version: 450.80.02
CUDA Version: 11.0
CUDNN Version:
Operating System + Version: Ubuntu 18.04.5 LTS
Python Version (if applicable): 3.6
TensorFlow Version (if applicable): 1.15.3
PyTorch Version (if applicable): 1.5.0
Baremetal or Container (if container which image + tag):

Hi @daniel.bancsi,
Can you please try running your model using trtexec with the --verbose flag, and share the logs with us?

Thanks!

Hi, I met the same issue when using TensorRT 7.2 with the Docker image "nvcr.io/nvidia/tensorrt:20.12-py3".

However, the same model works fine on TensorRT 7.1.

I reinstalled CUDA 11.0 inside the Docker container (the default version is 11.1), but the problem remains.

I guess @daniel.bancsi may face the same issue.

Environment

TensorRT Version : 7.2.1.6
GPU Type :
Nvidia Driver Version : 450.66
CUDA Version : 11.0
CUDNN Version :
Operating System + Version : Ubuntu 20.04.1 LTS

[01/08/2021-16:48:31] [I] Output:
[01/08/2021-16:48:31] [I] === Build Options ===
[01/08/2021-16:48:31] [I] Max batch: explicit
[01/08/2021-16:48:31] [I] Workspace: 16 MiB
[01/08/2021-16:48:31] [I] minTiming: 1
[01/08/2021-16:48:31] [I] avgTiming: 8
[01/08/2021-16:48:31] [I] Precision: FP32
[01/08/2021-16:48:31] [I] Calibration:
[01/08/2021-16:48:31] [I] Refit: Disabled
[01/08/2021-16:48:31] [I] Safe mode: Disabled
[01/08/2021-16:48:31] [I] Save engine:
[01/08/2021-16:48:31] [I] Load engine:
[01/08/2021-16:48:31] [I] Builder Cache: Enabled
[01/08/2021-16:48:31] [I] NVTX verbosity: 0
[01/08/2021-16:48:31] [I] Tactic sources: Using default tactic sources
[01/08/2021-16:48:31] [I] Input(s)s format: fp32:CHW
[01/08/2021-16:48:31] [I] Output(s)s format: fp32:CHW
[01/08/2021-16:48:31] [I] Input build shapes: model
[01/08/2021-16:48:31] [I] Input calibration shapes: model
[01/08/2021-16:48:31] [I] === System Options ===
[01/08/2021-16:48:31] [I] Device: 0
[01/08/2021-16:48:31] [I] DLACore:
[01/08/2021-16:48:31] [I] Plugins:
[01/08/2021-16:48:31] [I] === Inference Options ===
[01/08/2021-16:48:31] [I] Batch: Explicit
[01/08/2021-16:48:31] [I] Input inference shapes: model
[01/08/2021-16:48:31] [I] Iterations: 10
[01/08/2021-16:48:31] [I] Duration: 3s (+ 200ms warm up)
[01/08/2021-16:48:31] [I] Sleep time: 0ms
[01/08/2021-16:48:31] [I] Streams: 1
[01/08/2021-16:48:31] [I] ExposeDMA: Disabled
[01/08/2021-16:48:31] [I] Data transfers: Enabled
[01/08/2021-16:48:31] [I] Spin-wait: Disabled
[01/08/2021-16:48:31] [I] Multithreading: Disabled
[01/08/2021-16:48:31] [I] CUDA Graph: Disabled
[01/08/2021-16:48:31] [I] Separate profiling: Disabled
[01/08/2021-16:48:31] [I] Skip inference: Disabled
[01/08/2021-16:48:31] [I] Inputs:
[01/08/2021-16:48:31] [I] === Reporting Options ===
[01/08/2021-16:48:31] [I] Verbose: Enabled
[01/08/2021-16:48:31] [I] Averages: 10 inferences
[01/08/2021-16:48:31] [I] Percentile: 99
[01/08/2021-16:48:31] [I] Dump refittable layers:Disabled
[01/08/2021-16:48:31] [I] Dump output: Disabled
[01/08/2021-16:48:31] [I] Profile: Disabled
[01/08/2021-16:48:31] [I] Export timing to JSON file:
[01/08/2021-16:48:31] [I] Export output to JSON file:
[01/08/2021-16:48:31] [I] Export profile to JSON file:
[01/08/2021-16:48:31] [I]
[01/08/2021-16:48:31] [I] === Device Information ===
[01/08/2021-16:48:31] [I] Selected Device: GeForce RTX 2080 Ti
[01/08/2021-16:48:31] [I] Compute Capability: 7.5
[01/08/2021-16:48:31] [I] SMs: 68
[01/08/2021-16:48:31] [I] Compute Clock Rate: 1.545 GHz
[01/08/2021-16:48:31] [I] Device Global Memory: 11019 MiB
[01/08/2021-16:48:31] [I] Shared Memory per SM: 64 KiB
[01/08/2021-16:48:31] [I] Memory Bus Width: 352 bits (ECC disabled)
[01/08/2021-16:48:31] [I] Memory Clock Rate: 7 GHz
[01/08/2021-16:48:31] [I]
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::BatchTilePlugin_TRT version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::BatchedNMS_TRT version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::BatchedNMSDynamic_TRT version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::CoordConvAC version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::CropAndResize version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::DetectionLayer_TRT version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::FlattenConcat_TRT version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::GenerateDetection_TRT version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::GridAnchor_TRT version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::GridAnchorRect_TRT version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::InstanceNormalization_TRT version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::LReLU_TRT version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::MultilevelCropAndResize_TRT version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::MultilevelProposeROI_TRT version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::NMS_TRT version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::Normalize_TRT version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::PriorBox_TRT version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::ProposalLayer_TRT version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::Proposal version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::PyramidROIAlign_TRT version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::Region_TRT version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::Reorg_TRT version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::ResizeNearest_TRT version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::RPROI_TRT version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::SpecialSlice_TRT version 1
[01/08/2021-16:48:31] [V] [TRT] Registered plugin creator - ::Split version 1
[01/08/2021-16:48:31] [E] [TRT] CUDA initialization failure with error 222. Please check your CUDA installation: http://docs.nvidia.com/cuda/cuda-installation-guide-linux/index.html
[01/08/2021-16:48:31] [E] Builder creation failed
[01/08/2021-16:48:31] [E] Engine creation failed
[01/08/2021-16:48:31] [E] Engine set up failed
&&&& FAILED TensorRT.trtexec # trtexec --onnx= my_model

Hi @Martin.ZJ ,
Can you please check whether other CUDA applications work and update us?

Thanks!

I ran that on a k8s virtual machine, which I have since deleted (ノへ ̄、). For now I am using TensorRT 7.1; I will update here once I can reproduce the problem.

Hi, I have the same problem as @Martin.ZJ, on Ubuntu 18.04 on an AWS machine where five CUDA+cuDNN versions are preinstalled by AWS. Both of the following configurations failed:
(1) CUDA 11.1 + TensorRT 7.2.2
(2) CUDA 10.0 + TensorRT 7.0.0
Now I'm trying CUDA 10.2 + TensorRT 7.1.3.4, but I'm not sure it will work.
Do you have any suggestions?

I solved this problem with:
driver 450.80.02 + CUDA 11.1 (with cuDNN 8.0 installed in cuda-11.1/lib64) + TensorRT 7.2.2.3 for CUDA 11.1 and cuDNN 8.0,
and this combination is obviously not the only feasible one.
For me, there are three tips:
1. The cuDNN version in the CUDA lib64 directory must be consistent with the cuDNN version TensorRT was built against (you can find it in the TensorRT install file name).
2. You had better run all your commands in a plain terminal rather than the VS Code terminal (the environment variables in the two can differ), especially if you change your CUDA and TensorRT configuration frequently while searching for a working combination.
3. To run trtexec, you must use "sudo"; otherwise you may get a "cannot open engine file" error.
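On tip 1: TensorRT tarballs encode the expected CUDA and cuDNN versions in their file names (the name below is an example following NVIDIA's usual naming pattern, not a claim about any specific release). A small sketch that extracts those fields so you can compare them against what is installed:

```python
import re

# Example package name following NVIDIA's usual tarball naming pattern;
# the cuda-X.Y and cudnnA.B fields show which versions the build expects.
name = "TensorRT-7.2.2.3.Ubuntu-18.04.x86_64-gnu.cuda-11.1.cudnn8.0.tar.gz"

def expected_versions(pkg_name: str):
    """Pull the CUDA and cuDNN versions encoded in a TensorRT tarball name."""
    m = re.search(r"cuda-(\d+\.\d+)\.cudnn(\d+\.\d+)", pkg_name)
    return (m.group(1), m.group(2)) if m else None

print(expected_versions(name))  # ('11.1', '8.0')
```

If the cuDNN version in your CUDA lib64 directory does not match the one in the tarball name, that is a likely source of the initialization failure.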