Inference Time Hugging Face DETR

Hello,

I have been testing the inference speed of the Hugging Face DETR (DETR for Object Detection) algorithm on my Jetson Orin Nano (8GB, 15W), and the results have been surprising. Any help would be appreciated.

The model is run with the same configuration on all devices; the most relevant change I have made is setting config.num_queries = 500.
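For context, the setup looks roughly like this (a simplified sketch, not my exact script; the facebook/detr-resnet-50 checkpoint name is an assumption, and DetrImageProcessor is called DetrFeatureExtractor in older transformers versions):

```python
from transformers import DetrConfig, DetrForObjectDetection, DetrImageProcessor

# Start from the default config and raise the number of object queries
config = DetrConfig.from_pretrained("facebook/detr-resnet-50")
config.num_queries = 500

# ignore_mismatched_sizes is needed because changing num_queries
# changes the shape of the query embedding weights
model = DetrForObjectDetection.from_pretrained(
    "facebook/detr-resnet-50",
    config=config,
    ignore_mismatched_sizes=True,
)
model.eval()

processor = DetrImageProcessor.from_pretrained("facebook/detr-resnet-50")
```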

The image I am running inference on is 800x800 pixels.

On my iMac (3.3 GHz 6-core Intel Core i5, no GPU), inference time is about 1 second.

On the Jetson Orin Nano it takes about 4 seconds. Surprisingly, there is not much difference between using .to('cuda') and .to('cpu').

Time is measured only immediately before and after the model(inputs) call.
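Concretely, the measurement is along these lines (a minimal sketch reusing the model and processor from the snippet above; note the torch.cuda.synchronize() calls, which are needed for accurate timing on CUDA because kernel launches are asynchronous):

```python
import time

import torch
from PIL import Image

device = torch.device("cuda")  # or torch.device("cpu")
model.to(device)

image = Image.open("test.jpg")  # 800x800 test image; the path is illustrative
inputs = processor(images=image, return_tensors="pt").to(device)

with torch.no_grad():
    if device.type == "cuda":
        torch.cuda.synchronize()  # ensure pending work finishes before starting the clock
    start = time.perf_counter()
    outputs = model(**inputs)
    if device.type == "cuda":
        torch.cuda.synchronize()  # wait for the forward pass to actually complete
    elapsed = time.perf_counter() - start

print(f"Inference time: {elapsed:.3f} s")
```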

I would have expected inference on the Jetson Orin Nano to run at least as fast as on my desktop CPU, or at the very least to be accelerated by CUDA. I monitored the GPU usage: it is 0% when using .to('cpu') and only slightly higher when using .to('cuda').

For reference, I am running inference in the Docker container dustynv/transformers:git-r35.3.1.

Hi,

If the GPU utilization is low, the bottleneck might come from data access.

It sounds like you are using PyTorch.
Could you run the real inference call in a loop (see the sketch below) and check whether the GPU utilization increases? The first call pays one-time costs such as CUDA context initialization and memory allocation, so a single-shot measurement mostly reflects that overhead rather than steady-state inference speed.
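Something like this (a rough sketch, reusing the model and inputs variables from your snippets above):

```python
import time

import torch

with torch.no_grad():
    # Warm-up iterations: CUDA context init, memory allocation, kernel selection
    for _ in range(5):
        model(**inputs)
    torch.cuda.synchronize()

    # Timed iterations: steady-state inference speed
    n = 20
    start = time.perf_counter()
    for _ in range(n):
        model(**inputs)
    torch.cuda.synchronize()
    elapsed = time.perf_counter() - start

print(f"Average inference time: {elapsed / n:.3f} s")
```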

Thanks.

I implemented the suggestion; inference time went down to about 0.25 seconds and GPU utilization went up.

Thanks. I appreciate the help.
