Dear NVIDIA Community,
I am working on deploying the OpenVLA model on a Jetson Orin device and require assistance with TensorRT model compilation, specifically regarding cross-compilation from a powerful host system.
Problem Statement: My Jetson Orin device experiences an overcurrent shutdown during the TensorRT model compilation phase, likely due to the intensive computational and memory demands of optimizing a large model like OpenVLA. This prevents me from successfully generating a TensorRT engine directly on the target device.
Proposed Solution & Goal: To overcome this limitation, I propose leveraging a more powerful host system for the TensorRT conversion. I have access to a GPU server equipped with 8 NVIDIA L20 GPUs (x86_64 architecture). My goal is to compile the OpenVLA model (which is currently a PyTorch model) on this server such that the resulting TensorRT engine (.engine file) is optimized and fully compatible for efficient inference on the Jetson Orin (ARM64 architecture).
Specific Questions & Concerns:
1. Cross-Compilation Feasibility & Recommended Workflow:
   - Is this approach (compiling a TensorRT engine on a discrete GPU server for deployment on a Jetson Orin) officially supported and recommended by NVIDIA?
   - What are the recommended tools and steps for performing this cross-compilation (e.g., `trtexec` with specific flags, the TensorRT Python API, or other utilities)?
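To make the workflow question concrete, this is the kind of host-side command I had in mind. It is only a sketch: the file names are placeholders (the ONNX file would come from a prior PyTorch-to-ONNX export of OpenVLA), and I do not know whether `--versionCompatible` / `--hardwareCompatibilityLevel` are the right mechanism for an x86_64-to-Jetson transfer, or whether they apply to the Orin's integrated GPU at all:

```shell
# Sketch only -- not verified to work for cross-platform deployment.
# Run on the L20 host; openvla.onnx is a placeholder for the exported model.
trtexec \
  --onnx=openvla.onnx \
  --saveEngine=openvla.engine \
  --fp16 \
  --versionCompatible \
  --hardwareCompatibilityLevel=ampere+
```

If this is the wrong direction entirely (e.g., if engines simply must be built on the target device class), I would appreciate being pointed to the supported alternative.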
2. Compatibility Considerations (Versions & Architectures):
   - TensorRT Versions: Must the TensorRT version installed on my L20 server (for compilation) exactly match the TensorRT version available on the Jetson Orin (for inference), or is there an acceptable range of compatibility?
   - Architectural Differences: Given the fundamental difference between the x86_64 host (L20) and the ARM64 target (Orin), are there specific TensorRT builder flags, target GPU architecture specifications (`--device=DLA_0` if applicable, or more fundamentally, the `sm_XX` version), or explicit platform declarations needed during compilation on the server to ensure the engine is built for the Orin's specific hardware?
   - OpenVLA Specifics: Are there common pitfalls or special considerations when converting large Vision-Language Models (like OpenVLA) in a cross-compilation scenario?
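On the TensorRT version question: my understanding (please correct me if this is wrong) is that a serialized engine is, by default, tied to the exact TensorRT version that built it. The stdlib-only Python sketch below shows the strict parity check I plan to run before copying an engine to the device; the helper function and the version strings are purely illustrative, not from any NVIDIA API:

```python
# Illustrative helper, not an NVIDIA API: checks strict host/target
# TensorRT version parity before an engine is copied to the device.
def versions_compatible(host: str, target: str) -> bool:
    """Return True only on an exact major.minor.patch match (strict default)."""
    return tuple(host.split(".")[:3]) == tuple(target.split(".")[:3])

print(versions_compatible("8.6.1", "8.6.1"))  # exact match  -> True
print(versions_compatible("8.6.1", "8.5.2"))  # mismatch     -> False
```

If a looser rule actually applies (e.g., matching major.minor is sufficient, or version-compatible engines relax this), I would adjust the check accordingly.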
Context: This strategy would enable us to leverage the significant computational resources of the GPU server for the heavy lifting of model optimization and engine generation, while still achieving efficient, low-power inference on the Jetson Orin for edge deployment, without hitting power limitations during development.
Any guidance, best practices, recommended workflows, or warnings about potential compatibility issues would be greatly appreciated. If any further information about my setup (e.g., specific Jetson Orin SKU, exact TensorRT versions I’m planning to use, an anonymized model graph) would be helpful, please let me know.
Thank you in advance for your time and assistance.
Best regards,