I’m trying to run a batch inference job with a MobileNetV2 TensorRT engine. I created the engine from the attached ONNX model using the following trtexec command, targeting a batch size of 128 images:
!trtexec --workspace=4096 --onnx=mobilenetv2-7.onnx --shapes=input:128x3x224x224 --saveEngine=mobilenet_engine_int8_128.trt --int8 --maxBatch=128
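For comparison, since --shapes implies an explicit-batch network (and, as far as I understand, --maxBatch only applies to implicit-batch engines), a dynamic-shape build covering all the batch sizes I care about would presumably look like this, assuming the ONNX input tensor is named input:
!trtexec --workspace=4096 --onnx=mobilenetv2-7.onnx --minShapes=input:1x3x224x224 --optShapes=input:128x3x224x224 --maxShapes=input:256x3x224x224 --saveEngine=mobilenet_engine_int8_dynamic.trt --int8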
When I run inference with the attached Jupyter notebook, I get the following error:
LogicError                                Traceback (most recent call last)
&lt;ipython-input-…&gt; in &lt;module&gt;
      1 # Warm up:
----> 2 trt_model.predict(dummy_input_batch) # softmax probability predictions for the first 10 classes of the first sample

/mnt/TensorRT/quickstart/IntroNotebooks/onnx_helper.py in predict(self, batch)
     67         self.context.execute_async_v2(self.bindings, self.stream.handle, None)
     68         # Transfer predictions back
---> 69         cuda.memcpy_dtoh_async(self.output, self.d_output, self.stream)
     70         # Syncronize threads
     71         self.stream.synchronize()

LogicError: cuMemcpyDtoHAsync failed: an illegal memory access was encountered
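From reading onnx_helper.py, my working theory is that the buffers are sized for a smaller batch than the engine actually writes, so execute_async_v2 runs past the end of d_output and the next copy fails. A minimal sketch of sizing the buffers from the engine's own binding shapes (allocate_buffers is my name, not from the notebook):

import pycuda.autoinit  # creates a CUDA context as a side effect
import pycuda.driver as cuda
import tensorrt as trt

def allocate_buffers(engine):
    # Size pinned host and device buffers from the engine's actual binding
    # shapes instead of a hard-coded batch size; a too-small d_output is one
    # way to get exactly the illegal memory access shown above.
    bindings, host_mem, device_mem = [], [], []
    for i in range(engine.num_bindings):
        shape = engine.get_binding_shape(i)                     # e.g. (128, 3, 224, 224)
        dtype = trt.nptype(engine.get_binding_dtype(i))
        host = cuda.pagelocked_empty(trt.volume(shape), dtype)  # pinned host buffer
        dev = cuda.mem_alloc(host.nbytes)                       # device buffer of the same size
        bindings.append(int(dev))
        host_mem.append(host)
        device_mem.append(dev)
    return bindings, host_mem, device_mem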
Note: batch sizes of 32 and 64 work fine; I want batch sizes of 128 and 256 to work as well.
Environment
TensorRT Version: 7.2.2.3
GPU Type: A100-40GB
Nvidia Driver Version: 460.39
CUDA Version: 11.1
CUDNN Version:
Operating System + Version: Ubuntu 20.04
Python Version (if applicable):
TensorFlow Version (if applicable):
PyTorch Version (if applicable):
Baremetal or Container (if container which image + tag): nvcr.io/nvidia/tensorflow:21.02-tf1-py3
Relevant Files
ONNX model link: https://github.com/onnx/models/tree/master/vision/classification/mobilenet/model
Jupyter Notebook attached along with the supporting code.
Steps To Reproduce
- Download and start the container nvcr.io/nvidia/tensorflow:21.02-tf1-py3
- Mount the attached Jupyter notebook and the supporting Python and ONNX files inside the container.
- Install and start the Jupyter notebook server.
- Open the MobileNetV2 notebook.
- Execute the commands in the notebook; the warm-up cell that fails is sketched below.
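For reference, the failing warm-up cell is roughly the following (dummy_input_batch and trt_model are defined in the attached notebook; the input shape is assumed to be NCHW float32):

import numpy as np

dummy_input_batch = np.zeros((128, 3, 224, 224), dtype=np.float32)
trt_model.predict(dummy_input_batch)  # raises the cuMemcpyDtoHAsync illegal memory access at batch 128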
MobileNet_Debug.zip (12.4 MB)