Cuda Runtime error when switching from TensorRT 7.1.3 to TensorRT 8.0.1

realimposter · February 23, 2022, 3:55pm

Hi, I basically upgraded from TensorRT 7.1.3 to TensorRT 8.0.1 on AGX Xavier. I run everything in Python 3.6

I converted a face detector model from Pytorch to TensorRT in both with identical code. I then ran this exact same code in both:

#load in the models
stream = cuda.Stream()
TRT_LOGGER = trt.Logger()
explicit_batch = 1 << (int)(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)
#load in the face detector tensorrt model
with open("weights/yolov5s-face-448x800.trt", "rb") as f, trt.Runtime(TRT_LOGGER) as runtime:
	fd_engine = runtime.deserialize_cuda_engine(f.read())
for binding in fd_engine:
	if fd_engine.binding_is_input(binding):
		fd_device_input = cuda.mem_alloc(trt.volume(fd_engine.get_binding_shape(binding)) * fd_engine.max_batch_size * np.dtype(np.float32).itemsize)
	else:
		fd_host_output = cuda.pagelocked_empty(trt.volume(fd_engine.get_binding_shape(binding)) * fd_engine.max_batch_size, dtype=np.float32)
		fd_device_output = cuda.mem_alloc(fd_host_output.nbytes)
fd_context = fd_engine.create_execution_context()

Just testing with this code, I consistently find this error:

This happens when I only load the model in the entire script. It basically shows up at the end of every run. This did not show up with Jetpack 4.4 with TensorRT 7.1.3, only on Jetpack 4.6 with TensorRT 8.0.1.
Could you explain why and how to fix it?

AastaLLL · February 24, 2022, 4:04am

Hi,

There are some changes in destructors that might lead to this behavior.
You can find more details in our release notes:
https://docs.nvidia.com/deeplearning/tensorrt/release-notes/tensorrt-8.html#rel_8-0-1

Currently, please try to set the fd_engine to None before terminating and it should work.
For example:

if __name__ == "__main__":
    engine = PrepareEngine()
    Inference(engine)
    ...
    engine = None

Thanks.

realimposter · February 24, 2022, 6:04am

Thank you, it is fixed.

Setting fd_context = None fixed it. fd_engine = None didn’t do anything.

system · March 23, 2022, 5:23am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
[defaultAllocator.cpp::deallocate::35] Error Code 1: Cuda Runtime (invalid argument) TensorRT tensorrt	3	1105	May 5, 2022
TensorRT-7.1.3.4 Deserialize the cuda engine failed TensorRT cuda	9	8199	March 28, 2024
Issue with TensorRT 7.1.3 on Jetson AGX Jetson AGX Xavier tensorrt	13	947	October 18, 2021
TensorRT Cask Error in checkCaskExecError<false> Jetson AGX Xavier cuda , jetson-inference	9	752	January 4, 2023
Runtime error of Tensorrt 7.1.3 on Jetson Xavier AGX Jetson AGX Xavier tensorrt	5	769	October 18, 2021
cuMemcpyHtoD failed: context is destroyed TensorRT python	1	971	October 14, 2022
Using TensorRT3.0 to convert tensorflow model to create TensorRT engine Jetson TX1	3	619	March 8, 2018
Error [defaultAllocator.cpp::deallocate::35] Error Code 1: Cuda Runtime (invalid argument) Jetson Nano yolo	6	1182	April 25, 2023
Jetson AGX 16GB with jetpack4.6 [L4T 32.6.1] can not update tensorrt Jetson AGX Xavier tensorrt	6	525	September 14, 2022
Profiling fails with [E] Error[1]: [executionContext.cpp::syncShapeBindingsToDevice::1990] Error Code 1: Cuda Runtime (context is destroyed) TensorRT	2	477	August 28, 2023

Cuda Runtime error when switching from TensorRT 7.1.3 to TensorRT 8.0.1

Related topics