NVIDIA AI Inference Performance Evaluation

lapd · April 30, 2019, 10:02am

The article gives only a little information about the system under test:

All data here gathered on ResNet-50 using TensorRT 5.
Throughput and Efficiency tests run at batch-size 128;
System config: Dual-socket Xeon Gold 6140 with 384GB of system memory and
- a single Tesla V100 OR
- Tesla T4
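
For reference, this is roughly how I would expect throughput at batch size 128 to be measured. It is only a minimal sketch under my own assumptions, not the method the article used: `run_batch` is a hypothetical stand-in for whatever executes one batch through the TensorRT engine, and the warm-up and iteration counts are arbitrary.

```python
import time


def measure_throughput(run_batch, batch_size=128, warmup=10, iters=50,
                       clock=time.perf_counter):
    """Time repeated calls to run_batch and report throughput and latency.

    run_batch is a hypothetical callable that runs ONE batch of inference
    and returns only after the work has finished (for a GPU this means it
    must synchronize before returning, otherwise the timing is meaningless).
    """
    if iters <= 0:
        raise ValueError("iters must be positive")

    # Warm-up runs are excluded from timing so one-off setup costs
    # do not skew the result.
    for _ in range(warmup):
        run_batch()

    start = clock()
    for _ in range(iters):
        run_batch()
    elapsed = clock() - start  # seconds for all timed batches

    images_per_second = batch_size * iters / elapsed
    mean_batch_latency_ms = 1000.0 * elapsed / iters
    return images_per_second, mean_batch_latency_ms


if __name__ == "__main__":
    # Placeholder workload; replace with a real inference call.
    ips, latency_ms = measure_throughput(lambda: time.sleep(0.001))
    print(f"{ips:.1f} images/s, {latency_ms:.3f} ms per batch")
```

The `clock` parameter is there only so the timing arithmetic can be checked with a fake clock; in normal use the default is enough.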

Could you please tell me, or give some hints on, how to replicate this test, such as:

Thank you very much!
