Running on multiple compute capabilities with one TRT engine

Hi, we’d like to run TensorRT 6 or 8 on both a Jetson AGX and in AWS (a Turing T4 on Arm). Do we have to build a separate engine locally on each platform, or is there a way to generate PTX, or at least force TRT to generate code for multiple CCs from a build on one platform?

Thank you

Hi,
This looks like a Jetson issue. Please refer to the samples below, in case they are useful.

For any further assistance, we recommend raising it with the respective platform team via the link below.

Thanks!

Thank you for your reply. This is not a Jetson-specific issue; it’s a TensorRT issue.
Is there any way TensorRT can be made to embed support for multiple CCs, or PTX, in the same engine file?

TensorRT doesn’t support that. Yes, you have to build a separate engine locally on each platform.
The generated plan/engine files are not portable across platforms or TensorRT versions. Plans are specific to the exact GPU model they were built on (in addition to the platform and the TensorRT version) and must be re-targeted to the specific GPU.
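For reference, a minimal sketch of rebuilding the plan from the same ONNX model on each target machine with the TensorRT 8 Python API (the file paths and the 1 GiB workspace size are illustrative assumptions):

```python
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

def build_engine(onnx_path: str, plan_path: str) -> None:
    builder = trt.Builder(TRT_LOGGER)
    # Explicit-batch network, as required for ONNX models.
    network = builder.create_network(
        1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
    parser = trt.OnnxParser(network, TRT_LOGGER)
    with open(onnx_path, "rb") as f:
        if not parser.parse(f.read()):
            for i in range(parser.num_errors):
                print(parser.get_error(i))
            raise RuntimeError(f"Failed to parse {onnx_path}")
    config = builder.create_builder_config()
    config.max_workspace_size = 1 << 30  # 1 GiB; deprecated after TRT 8.4
    # The serialized plan is tied to this GPU's compute capability.
    serialized = builder.build_serialized_network(network, config)
    if serialized is None:
        raise RuntimeError("Engine build failed")
    with open(plan_path, "wb") as f:
        f.write(serialized)

# Run once per platform, e.g. on the Jetson and again on the T4 instance:
# build_engine("model.onnx", "model.plan")
```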

Thanks!

Thank you for confirming. You may want to add support for PTX, an alternate code path, or a cuDNN fallback for use cases where performance is not important but the same code needs to run, say, in a CI pipeline on a card with a different CC.
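In the meantime, a workaround for the CI case is to keep one cached plan per compute capability and rebuild on first use on each machine. A rough sketch follows; the cache layout and helper names are hypothetical, pycuda is only used to query the device’s CC, and build_engine() is the helper sketched in the reply above:

```python
import os
import pycuda.driver as cuda
import pycuda.autoinit  # noqa: F401 (creates the CUDA context)

def local_cc() -> str:
    major, minor = cuda.Device(0).compute_capability()
    return f"{major}{minor}"  # e.g. "75" on a T4, "72" on a Jetson AGX Xavier

def cached_plan(model: str, cache_dir: str = "engine_cache") -> str:
    os.makedirs(cache_dir, exist_ok=True)
    plan = os.path.join(cache_dir, f"{model}.cc{local_cc()}.plan")
    if not os.path.exists(plan):
        # First run on this CC: rebuild locally (slow, but correct).
        build_engine(f"{model}.onnx", plan)
    return plan
```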


Because the cost of switching versions or CCs is sometimes stratospheric. I spent the last 120 hours trying to get a model running on CC 8.6 that was previously running and exporting fine on Turing. Can’t say I’m a huge fan of TensorRT at the moment.