I just installed JetPack 5.1.2 on my Jetson Orin Nano 8GB. I installed Ultralytics and got the PyTorch + CUDA setup sorted out.
I started benchmarking YOLOv8 models from the Ultralytics package, and I get the same performance for the FP32 and INT8 configurations (FP16 is, as expected, about half the inference time of FP32).
Is this a problem with INT8 support on the Jetson Orin Nano?
Thanks in advance.
test.py
from ultralytics.utils.benchmarks import benchmark
benchmark(model='yolov8n.pt', data='coco8.yaml', imgsz=640, int8=True, device=0)
The half and int8 arguments were changed accordingly for each of the benchmarks below.
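In other words, the three runs were along these lines (a reconstruction; the exact flag combinations are assumed from the description above):

from ultralytics.utils.benchmarks import benchmark

# Exact flag combinations assumed: only half / int8 are toggled between runs.
benchmark(model='yolov8n.pt', data='coco8.yaml', imgsz=640, device=0)             # FP32 baseline
benchmark(model='yolov8n.pt', data='coco8.yaml', imgsz=640, half=True, device=0)  # FP16
benchmark(model='yolov8n.pt', data='coco8.yaml', imgsz=640, int8=True, device=0)  # INT8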
FP32 Benchmarks complete for yolov8n.pt on coco8.yaml at imgsz=640 (905.18s)
Format Status❔ Size (MB) metrics/mAP50-95(B) Inference time (ms/im)
4 TensorRT ✅ 13.6 0.6117 12.63
FP16 Benchmarks complete for yolov8n.pt on coco8.yaml at imgsz=640 (919.86s)
Format Status❔ Size (MB) metrics/mAP50-95(B) Inference time (ms/im)
4 TensorRT ✅ 8.2 0.6092 7.04
INT8 Benchmarks complete for yolov8n.pt on coco8.yaml at imgsz=640 (423.97s)
Format Status❔ Size (MB) metrics/mAP50-95(B) Inference time (ms/im)
4 TensorRT ✅ 13.5 0.6117 12.61
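Separately from the benchmark output, one way to double-check whether TensorRT even reports a fast INT8 path on this board would be a query like the sketch below (standard TensorRT Python API; I have not included its output here):

import tensorrt as trt

# Ask TensorRT whether the GPU advertises fast FP16 / INT8 kernels.
logger = trt.Logger(trt.Logger.INFO)
builder = trt.Builder(logger)
print('platform_has_fast_fp16:', builder.platform_has_fast_fp16)
print('platform_has_fast_int8:', builder.platform_has_fast_int8)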
Your “yolov8n.engine” is a serialized engine that is deserialized before inference starts.
May I ask how you created your calibration cache? If your FP16 is working as expected, and I assume FP32 is as well, then maybe your INT8 calibration cache is incorrect.
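For reference, with a sufficiently recent Ultralytics release the INT8 export can build the calibration cache from a dataset you pass in via data; something like the sketch below (the data-based calibration behaviour is an assumption about the Ultralytics version you have installed):

from ultralytics import YOLO

# Export an INT8 TensorRT engine; Ultralytics builds the calibration cache
# from the dataset passed via `data` (assumes a recent ultralytics version).
model = YOLO('yolov8n.pt')
model.export(format='engine', int8=True, data='coco8.yaml', imgsz=640, device=0)

Note that coco8 only contains a handful of images, so for meaningful INT8 calibration you would want a larger, representative calibration set.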