Can a huge model with a very large total weights size be run with TensorRT on an A10?

When I use trtexec to run inference on the opt13 ONNX model file, internal errors occur. The opt30 model is a huge model with a 60 GB weights size. What is the root cause? Detailed log below.

TensorRT version: 8.4.0.6

Hi,

We are unable to access the images. Could you please share the trtexec --verbose logs with us?

Thank you.

Each tensor in the opt13 model has no more than 2^31 - 1 elements, but the total size of the model's tensors is more than 120 GB.
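For reference, a minimal sketch (not the script actually used here; it assumes the model is saved as model.onnx with its weights in external data files, and that the weights are fp32) of how the per-tensor element limit and the total size can be checked without pulling the weights into memory:

```python
import onnx

INT32_MAX = 2**31 - 1  # per-tensor element limit mentioned above

# load_external_data=False reads only the graph structure and tensor metadata,
# leaving the >100 GB of weight data on disk.
model = onnx.load("model.onnx", load_external_data=False)

total_bytes = 0
for init in model.graph.initializer:
    elems = 1
    for dim in init.dims:
        elems *= dim
    if elems > INT32_MAX:
        print(f"{init.name}: {elems} elements exceeds 2^31 - 1")
    total_bytes += elems * 4  # rough estimate assuming 4-byte (fp32) weights

print(f"approx. total weight size: {total_bytes / 2**30:.1f} GiB (assuming fp32)")
```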

Hi,

There was a known issue similar to this that has been fixed in recent versions.
We recommend upgrading to the latest TensorRT version (8.5) and trying again with an increased workspace. If you still face this issue, please share the complete trtexec --verbose logs with us and, if possible, a minimal ONNX model that reproduces the issue.
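For what it's worth, here is a minimal sketch of the same build with a larger workspace done through the TensorRT Python API (the model path and the 8 GiB limit are placeholders, not values from this thread):

```python
import tensorrt as trt

logger = trt.Logger(trt.Logger.VERBOSE)
builder = trt.Builder(logger)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)
)
parser = trt.OnnxParser(network, logger)

# parse_from_file also picks up external weight files stored next to the model
if not parser.parse_from_file("model.onnx"):
    for i in range(parser.num_errors):
        print(parser.get_error(i))
    raise SystemExit("ONNX parse failed")

config = builder.create_builder_config()
# Equivalent of raising the trtexec workspace: allow up to 8 GiB of scratch memory.
config.set_memory_pool_limit(trt.MemoryPoolType.WORKSPACE, 8 << 30)

engine = builder.build_serialized_network(network, config)
```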

Thank you.

I tried again using the latest TensorRT version, v8.5. It reports a similar error. Log below.

Hi,

Could you please share the ONNX model with us (here or via DM) for better debugging?

Thank you.

Maybe you can try disabling cuBLAS and cuDNN when using trtexec.

@410069103 It does not work when I add the option "--tacticSources=-CUBLAS,-CUDNN".
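(For reference, a sketch of roughly what that option maps to in the TensorRT Python API, assuming a builder config created as in the earlier sketch; this snippet is illustrative, not from the original thread:)

```python
import tensorrt as trt

def disable_cublas_cudnn(config: trt.IBuilderConfig) -> None:
    # Start from the currently enabled tactic sources and clear the
    # bits for the cuBLAS and cuDNN tactic libraries.
    sources = config.get_tactic_sources()
    for src in (trt.TacticSource.CUBLAS, trt.TacticSource.CUDNN):
        sources &= ~(1 << int(src))
    config.set_tactic_sources(sources)
```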

@spolisetty The model is too big to upload. I just want to know how TensorRT can run inference on a huge model whose total weights size (60 GB) is larger than the A10's available device memory (22 GB), and how to use trtexec to run such a huge model.

From the log, we found that TensorRT tries to allocate 50 GB, which is hard to understand.

Hi,

This is not possible. The model has to fit within device memory.
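As a rough pre-check (a sketch, assuming the pynvml package and a placeholder directory opt_onnx holding the model plus its external weight files; neither is from this thread), one can compare the on-disk weight size with the GPU's memory before attempting a build:

```python
import os
import pynvml

MODEL_DIR = "opt_onnx"  # placeholder: directory with model.onnx and external weights

# Total bytes of the model files on disk, a lower bound on the memory the weights need.
weight_bytes = sum(
    os.path.getsize(os.path.join(MODEL_DIR, name)) for name in os.listdir(MODEL_DIR)
)

pynvml.nvmlInit()
handle = pynvml.nvmlDeviceGetHandleByIndex(0)
mem = pynvml.nvmlDeviceGetMemoryInfo(handle)
pynvml.nvmlShutdown()

print(f"weights on disk : {weight_bytes / 2**30:.1f} GiB")
print(f"GPU total memory: {mem.total / 2**30:.1f} GiB")
if weight_bytes > mem.total:
    print("The weights alone exceed device memory; the engine cannot fit on this GPU.")
```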

Thank you.