Hi,
I have executed Jetson Benchmarks in Xavier AGX Xavier devkit.
The results didn’t matched the max throughput published here
Results I got matches, 97% with Limited Latency and 84% with Max throughput
Command used:
sudo python3 benchmark.py --all --csv_file_path benchmark_csv/xavier-benchmarks.csv --model_dir $(pwd)/models --jetson_devkit xavier --gpu_freq 1377000000 --dla_freq 1395200000 --power_mode 0
MAX-N mode | Published Results Jetson 4.4.1 |
% Achieved in published KPI’s Jetson 4.5.1 |
|||||||
---|---|---|---|---|---|---|---|---|---|
Model name | Devices | BatchSizeGPU | BatchSizeDLA | Avg FPS jetson 4.5.1 |
FPS(Limited latency) |
FPS(Max throughput) |
FPS(Limited latency) |
FPS(Max throughput) |
|
0 | inception_v4 | 3 | 4 | 1 | 513.388317 | 528 | 704 | 97.2326358% | 72.92447685% |
1 | vgg19_N2 | 1 | 4 | 0 | 270.036357 | 276 | 432 | 97.83925978% | 62.50841597% |
2 | super_resolution_bsd 500 | 1 | 4 | 0 | 277.767551 | 281 | 302 | 98.84966228% | 91.97601026% |
3 | unet-segmentation | 1 | 2 | 0 | 236.369876 | 240 | 251 | 98.48744833% | 94.17126534% |
4 | pose_estimation | 1 | 4 | 0 | 427.127834 | 439 | 484 | 97.29563417% | 88.24955248% |
5 | yolov3-tiny-416 | 1 | 16 | 0 | 1072.215892 | 1100 | 1127 | 97.474172% | 95.13894339% |
6 | ResNet50_224x224 | 3 | 16 | 4 | 1807.485788 | 1946 | 2109 | 92.88210627% | 85.7034513% |
7 | ssd-mobilenet-v1 | 3 | 16 | 2 | 1607.719843 | 1602 | 1919 | 100.3570439% | 83.77904341% |
Avg | 97.55224531% | 84.30639488% |
Nvidia AGX Xavier details:
- NVIDIA Jetson AGX Xavier [32GB]
- Jetpack 4.5.1 [L4T 32.5.1]
- NV Power Mode: MAXN - Type: 0
- jetson_stats.service: active
- Board info:
- Type: AGX Xavier [16GB]
- SOC Family: tegra194 - ID:25
- Module: P2888-0001 - Board: P2822-0000
- Code Name: galen
- CUDA GPU architecture (ARCH_BIN): 7.2
- Libraries:
- CUDA: 10.2.89
- cuDNN: 8.0.0.180
- TensorRT: 7.1.3.0
- Visionworks: 1.6.0.501
- OpenCV: 4.1.1 compiled CUDA: NO
- VPI: ii libnvvpi1 1.0.15 arm64 NVIDIA Vision Programming Interface library
- Vulkan: 1.2.70
- jetson-stats:
- Version 3.1.1
- Works on Python 3.6.9
Could you help me to reproduce the published benchmarks.
Thanks in advance,