output of cat /etc/nv_tegra_release
:
# R32 (release), REVISION: 5.2, GCID: 27767740, BOARD: t210ref, EABI: aarch64, DATE: Fri Jul 9 16:01:52 UTC 2021
output of /usr/src/tensorrt/bin/trtexec --onnx=/usr/src/tensorrt/data/resnet50/ResNet50.onnx
:
tess@tess-test:~$ /usr/src/tensorrt/bin/trtexec --onnx=/usr/src/tensorrt/data/resnet50/ResNet50.onnx
&&&& RUNNING TensorRT.trtexec [TensorRT v8001] # /usr/src/tensorrt/bin/trtexec --onnx=/usr/src/tensorrt/data/resnet50/ResNet50.onnx
[01/30/2023-22:49:03] [I] === Model Options ===
[01/30/2023-22:49:03] [I] Format: ONNX
[01/30/2023-22:49:03] [I] Model: /usr/src/tensorrt/data/resnet50/ResNet50.onnx
[01/30/2023-22:49:03] [I] Output:
[01/30/2023-22:49:03] [I] === Build Options ===
[01/30/2023-22:49:03] [I] Max batch: explicit
[01/30/2023-22:49:03] [I] Workspace: 16 MiB
[01/30/2023-22:49:03] [I] minTiming: 1
[01/30/2023-22:49:03] [I] avgTiming: 8
[01/30/2023-22:49:03] [I] Precision: FP32
[01/30/2023-22:49:03] [I] Calibration:
[01/30/2023-22:49:03] [I] Refit: Disabled
[01/30/2023-22:49:03] [I] Sparsity: Disabled
[01/30/2023-22:49:03] [I] Safe mode: Disabled
[01/30/2023-22:49:03] [I] Restricted mode: Disabled
[01/30/2023-22:49:03] [I] Save engine:
[01/30/2023-22:49:03] [I] Load engine:
[01/30/2023-22:49:03] [I] NVTX verbosity: 0
[01/30/2023-22:49:03] [I] Tactic sources: Using default tactic sources
[01/30/2023-22:49:03] [I] timingCacheMode: local
[01/30/2023-22:49:03] [I] timingCacheFile:
[01/30/2023-22:49:03] [I] Input(s)s format: fp32:CHW
[01/30/2023-22:49:03] [I] Output(s)s format: fp32:CHW
[01/30/2023-22:49:03] [I] Input build shapes: model
[01/30/2023-22:49:03] [I] Input calibration shapes: model
[01/30/2023-22:49:03] [I] === System Options ===
[01/30/2023-22:49:03] [I] Device: 0
[01/30/2023-22:49:03] [I] DLACore:
[01/30/2023-22:49:03] [I] Plugins:
[01/30/2023-22:49:03] [I] === Inference Options ===
[01/30/2023-22:49:03] [I] Batch: Explicit
[01/30/2023-22:49:03] [I] Input inference shapes: model
[01/30/2023-22:49:03] [I] Iterations: 10
[01/30/2023-22:49:03] [I] Duration: 3s (+ 200ms warm up)
[01/30/2023-22:49:03] [I] Sleep time: 0ms
[01/30/2023-22:49:03] [I] Streams: 1
[01/30/2023-22:49:03] [I] ExposeDMA: Disabled
[01/30/2023-22:49:03] [I] Data transfers: Enabled
[01/30/2023-22:49:03] [I] Spin-wait: Disabled
[01/30/2023-22:49:03] [I] Multithreading: Disabled
[01/30/2023-22:49:03] [I] CUDA Graph: Disabled
[01/30/2023-22:49:03] [I] Separate profiling: Disabled
[01/30/2023-22:49:03] [I] Time Deserialize: Disabled
[01/30/2023-22:49:03] [I] Time Refit: Disabled
[01/30/2023-22:49:03] [I] Skip inference: Disabled
[01/30/2023-22:49:03] [I] Inputs:
[01/30/2023-22:49:03] [I] === Reporting Options ===
[01/30/2023-22:49:03] [I] Verbose: Disabled
[01/30/2023-22:49:03] [I] Averages: 10 inferences
[01/30/2023-22:49:03] [I] Percentile: 99
[01/30/2023-22:49:03] [I] Dump refittable layers:Disabled
[01/30/2023-22:49:03] [I] Dump output: Disabled
[01/30/2023-22:49:03] [I] Profile: Disabled
[01/30/2023-22:49:03] [I] Export timing to JSON file:
[01/30/2023-22:49:03] [I] Export output to JSON file:
[01/30/2023-22:49:03] [I] Export profile to JSON file:
[01/30/2023-22:49:03] [I]
[01/30/2023-22:49:03] [I] === Device Information ===
[01/30/2023-22:49:03] [I] Selected Device: NVIDIA Tegra X1
[01/30/2023-22:49:03] [I] Compute Capability: 5.3
[01/30/2023-22:49:03] [I] SMs: 1
[01/30/2023-22:49:03] [I] Compute Clock Rate: 0.9216 GHz
[01/30/2023-22:49:03] [I] Device Global Memory: 3956 MiB
[01/30/2023-22:49:03] [I] Shared Memory per SM: 64 KiB
[01/30/2023-22:49:03] [I] Memory Bus Width: 64 bits (ECC disabled)
[01/30/2023-22:49:03] [I] Memory Clock Rate: 0.01275 GHz
[01/30/2023-22:49:03] [I]
[01/30/2023-22:49:03] [I] TensorRT version: 8001
[01/30/2023-22:49:05] [I] [TRT] [MemUsageChange] Init CUDA: CPU +202, GPU +0, now: CPU 205, GPU 3626 (MiB)
[01/30/2023-22:49:05] [I] Start parsing network model
[01/30/2023-22:49:05] [I] [TRT] ----------------------------------------------------------------
[01/30/2023-22:49:05] [I] [TRT] Input filename: /usr/src/tensorrt/data/resnet50/ResNet50.onnx
[01/30/2023-22:49:05] [I] [TRT] ONNX IR version: 0.0.3
[01/30/2023-22:49:05] [I] [TRT] Opset version: 9
[01/30/2023-22:49:05] [I] [TRT] Producer name: onnx-caffe2
[01/30/2023-22:49:05] [I] [TRT] Producer version:
[01/30/2023-22:49:05] [I] [TRT] Domain:
[01/30/2023-22:49:05] [I] [TRT] Model version: 0
[01/30/2023-22:49:05] [I] [TRT] Doc string:
[01/30/2023-22:49:05] [I] [TRT] ----------------------------------------------------------------
[01/30/2023-22:49:06] [W] [TRT] onnx2trt_utils.cpp:364: Your ONNX model has been generated with INT64 weights, while TensorRT does not natively support INT64. Attempting to cast down to INT32.
[01/30/2023-22:49:06] [I] Finish parsing network model
[01/30/2023-22:49:06] [I] [TRT] [MemUsageChange] Init CUDA: CPU +0, GPU +0, now: CPU 304, GPU 3521 (MiB)
[01/30/2023-22:49:06] [I] [TRT] [MemUsageSnapshot] Builder begin: CPU 304 MiB, GPU 3519 MiB
[01/30/2023-22:49:06] [I] [TRT] ---------- Layers Running on DLA ----------
[01/30/2023-22:49:06] [I] [TRT] ---------- Layers Running on GPU ----------
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/conv1_1 + node_of_gpu_0/res_conv1_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/pool1_1
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res2_0_branch2a_1 + node_of_gpu_0/res2_0_branch2a_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res2_0_branch2b_1 + node_of_gpu_0/res2_0_branch2b_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res2_0_branch1_1
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res2_0_branch2c_1 + node_of_gpu_0/res2_0_branch2c_bn_2 + node_of_gpu_0/res2_0_branch2c_bn_3
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res2_1_branch2a_1 + node_of_gpu_0/res2_1_branch2a_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res2_1_branch2b_1 + node_of_gpu_0/res2_1_branch2b_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res2_1_branch2c_1 + node_of_gpu_0/res2_1_branch2c_bn_2 + node_of_gpu_0/res2_1_branch2c_bn_3
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res2_2_branch2a_1 + node_of_gpu_0/res2_2_branch2a_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res2_2_branch2b_1 + node_of_gpu_0/res2_2_branch2b_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res2_2_branch2c_1 + node_of_gpu_0/res2_2_branch2c_bn_2 + node_of_gpu_0/res2_2_branch2c_bn_3
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res3_0_branch2a_1 + node_of_gpu_0/res3_0_branch2a_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res3_0_branch2b_1 + node_of_gpu_0/res3_0_branch2b_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res3_0_branch1_1
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res3_0_branch2c_1 + node_of_gpu_0/res3_0_branch2c_bn_2 + node_of_gpu_0/res3_0_branch2c_bn_3
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res3_1_branch2a_1 + node_of_gpu_0/res3_1_branch2a_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res3_1_branch2b_1 + node_of_gpu_0/res3_1_branch2b_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res3_1_branch2c_1 + node_of_gpu_0/res3_1_branch2c_bn_2 + node_of_gpu_0/res3_1_branch2c_bn_3
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res3_2_branch2a_1 + node_of_gpu_0/res3_2_branch2a_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res3_2_branch2b_1 + node_of_gpu_0/res3_2_branch2b_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res3_2_branch2c_1 + node_of_gpu_0/res3_2_branch2c_bn_2 + node_of_gpu_0/res3_2_branch2c_bn_3
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res3_3_branch2a_1 + node_of_gpu_0/res3_3_branch2a_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res3_3_branch2b_1 + node_of_gpu_0/res3_3_branch2b_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res3_3_branch2c_1 + node_of_gpu_0/res3_3_branch2c_bn_2 + node_of_gpu_0/res3_3_branch2c_bn_3
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res4_0_branch2a_1 + node_of_gpu_0/res4_0_branch2a_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res4_0_branch2b_1 + node_of_gpu_0/res4_0_branch2b_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res4_0_branch1_1
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res4_0_branch2c_1 + node_of_gpu_0/res4_0_branch2c_bn_2 + node_of_gpu_0/res4_0_branch2c_bn_3
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res4_1_branch2a_1 + node_of_gpu_0/res4_1_branch2a_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res4_1_branch2b_1 + node_of_gpu_0/res4_1_branch2b_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res4_1_branch2c_1 + node_of_gpu_0/res4_1_branch2c_bn_2 + node_of_gpu_0/res4_1_branch2c_bn_3
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res4_2_branch2a_1 + node_of_gpu_0/res4_2_branch2a_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res4_2_branch2b_1 + node_of_gpu_0/res4_2_branch2b_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res4_2_branch2c_1 + node_of_gpu_0/res4_2_branch2c_bn_2 + node_of_gpu_0/res4_2_branch2c_bn_3
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res4_3_branch2a_1 + node_of_gpu_0/res4_3_branch2a_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res4_3_branch2b_1 + node_of_gpu_0/res4_3_branch2b_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res4_3_branch2c_1 + node_of_gpu_0/res4_3_branch2c_bn_2 + node_of_gpu_0/res4_3_branch2c_bn_3
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res4_4_branch2a_1 + node_of_gpu_0/res4_4_branch2a_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res4_4_branch2b_1 + node_of_gpu_0/res4_4_branch2b_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res4_4_branch2c_1 + node_of_gpu_0/res4_4_branch2c_bn_2 + node_of_gpu_0/res4_4_branch2c_bn_3
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res4_5_branch2a_1 + node_of_gpu_0/res4_5_branch2a_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res4_5_branch2b_1 + node_of_gpu_0/res4_5_branch2b_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res4_5_branch2c_1 + node_of_gpu_0/res4_5_branch2c_bn_2 + node_of_gpu_0/res4_5_branch2c_bn_3
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res5_0_branch2a_1 + node_of_gpu_0/res5_0_branch2a_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res5_0_branch2b_1 + node_of_gpu_0/res5_0_branch2b_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res5_0_branch1_1
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res5_0_branch2c_1 + node_of_gpu_0/res5_0_branch2c_bn_2 + node_of_gpu_0/res5_0_branch2c_bn_3
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res5_1_branch2a_1 + node_of_gpu_0/res5_1_branch2a_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res5_1_branch2b_1 + node_of_gpu_0/res5_1_branch2b_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res5_1_branch2c_1 + node_of_gpu_0/res5_1_branch2c_bn_2 + node_of_gpu_0/res5_1_branch2c_bn_3
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res5_2_branch2a_1 + node_of_gpu_0/res5_2_branch2a_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res5_2_branch2b_1 + node_of_gpu_0/res5_2_branch2b_bn_2
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/res5_2_branch2c_1 + node_of_gpu_0/res5_2_branch2c_bn_2 + node_of_gpu_0/res5_2_branch2c_bn_3
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/pool5_1
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] node_of_gpu_0/pred_1
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] (Unnamed Layer* 176) [Shuffle] + (Unnamed Layer* 177) [Shuffle]
[01/30/2023-22:49:06] [I] [TRT] [GpuLayer] (Unnamed Layer* 178) [Softmax]
[01/30/2023-22:49:07] [I] [TRT] [MemUsageChange] Init cuBLAS/cuBLASLt: CPU +158, GPU +72, now: CPU 552, GPU 3622 (MiB)
[01/30/2023-22:49:09] [I] [TRT] [MemUsageChange] Init cuDNN: CPU +241, GPU -9, now: CPU 793, GPU 3613 (MiB)
[01/30/2023-22:49:09] [W] [TRT] Detected invalid timing cache, setup a local cache instead
[01/30/2023-22:49:17] [I] [TRT] Some tactics do not have sufficient workspace memory to run. Increasing workspace size may increase performance, please check verbose output.
[01/30/2023-22:50:40] [I] [TRT] Detected 1 inputs and 1 output network tensors.
[01/30/2023-22:50:42] [I] [TRT] Total Host Persistent Memory: 131296
[01/30/2023-22:50:42] [I] [TRT] Total Device Persistent Memory: 82422784
[01/30/2023-22:50:42] [I] [TRT] Total Scratch Memory: 0
[01/30/2023-22:50:42] [I] [TRT] [MemUsageStats] Peak memory usage of TRT CPU/GPU memory allocators: CPU 98 MiB, GPU 192 MiB
[01/30/2023-22:50:42] [I] [TRT] [MemUsageChange] Init cuBLAS/cuBLASLt: CPU +0, GPU +3, now: CPU 1045, GPU 3669 (MiB)
[01/30/2023-22:50:42] [I] [TRT] [MemUsageChange] Init cuDNN: CPU +1, GPU +1, now: CPU 1046, GPU 3670 (MiB)
[01/30/2023-22:50:42] [I] [TRT] [MemUsageChange] Init cuBLAS/cuBLASLt: CPU +0, GPU +0, now: CPU 1045, GPU 3671 (MiB)
[01/30/2023-22:50:42] [I] [TRT] [MemUsageChange] Init cuBLAS/cuBLASLt: CPU +0, GPU +0, now: CPU 1045, GPU 3671 (MiB)
[01/30/2023-22:50:42] [I] [TRT] [MemUsageSnapshot] Builder end: CPU 1045 MiB, GPU 3671 MiB
[01/30/2023-22:50:43] [I] [TRT] Loaded engine size: 121 MB
[01/30/2023-22:50:43] [I] [TRT] [MemUsageSnapshot] deserializeCudaEngine begin: CPU 1077 MiB, GPU 3771 MiB
[01/30/2023-22:50:44] [I] [TRT] [MemUsageChange] Init cuBLAS/cuBLASLt: CPU +0, GPU +1, now: CPU 1077, GPU 3778 (MiB)
[01/30/2023-22:50:44] [I] [TRT] [MemUsageChange] Init cuDNN: CPU +0, GPU +1, now: CPU 1077, GPU 3779 (MiB)
[01/30/2023-22:50:44] [I] [TRT] [MemUsageChange] Init cuBLAS/cuBLASLt: CPU +0, GPU +0, now: CPU 1077, GPU 3779 (MiB)
[01/30/2023-22:50:44] [I] [TRT] [MemUsageSnapshot] deserializeCudaEngine end: CPU 1077 MiB, GPU 3779 MiB
[01/30/2023-22:50:44] [I] Engine built in 100.338 sec.
[01/30/2023-22:50:44] [I] [TRT] [MemUsageSnapshot] ExecutionContext creation begin: CPU 856 MiB, GPU 3622 MiB
[01/30/2023-22:50:44] [I] [TRT] [MemUsageChange] Init cuBLAS/cuBLASLt: CPU +0, GPU +1, now: CPU 856, GPU 3622 (MiB)
[01/30/2023-22:50:44] [I] [TRT] [MemUsageChange] Init cuDNN: CPU +0, GPU +1, now: CPU 856, GPU 3623 (MiB)
[01/30/2023-22:50:44] [I] [TRT] [MemUsageSnapshot] ExecutionContext creation end: CPU 857 MiB, GPU 3690 MiB
[01/30/2023-22:50:44] [I] Created input binding for gpu_0/data_0 with dimensions 1x3x224x224
[01/30/2023-22:50:44] [I] Created output binding for gpu_0/softmax_1 with dimensions 1x1000
[01/30/2023-22:50:44] [I] Starting inference
[01/30/2023-22:50:47] [I] Warmup completed 4 queries over 200 ms
[01/30/2023-22:50:47] [I] Timing trace has 56 queries over 3.08943 s
[01/30/2023-22:50:47] [I]
[01/30/2023-22:50:47] [I] === Trace details ===
[01/30/2023-22:50:47] [I] Trace averages of 10 runs:
[01/30/2023-22:50:47] [I] Average on 10 runs - GPU latency: 55.6723 ms - Host latency: 55.7345 ms (end to end 55.7758 ms, enqueue 3.94339 ms)
[01/30/2023-22:50:47] [I] Average on 10 runs - GPU latency: 55.139 ms - Host latency: 55.2022 ms (end to end 55.2432 ms, enqueue 4.25462 ms)
[01/30/2023-22:50:47] [I] Average on 10 runs - GPU latency: 55.1018 ms - Host latency: 55.1644 ms (end to end 55.2058 ms, enqueue 3.00271 ms)
[01/30/2023-22:50:47] [I] Average on 10 runs - GPU latency: 54.8281 ms - Host latency: 54.8893 ms (end to end 54.9259 ms, enqueue 2.22249 ms)
[01/30/2023-22:50:47] [I] Average on 10 runs - GPU latency: 54.7418 ms - Host latency: 54.8026 ms (end to end 54.8458 ms, enqueue 2.19004 ms)
[01/30/2023-22:50:47] [I]
[01/30/2023-22:50:47] [I] === Performance summary ===
[01/30/2023-22:50:47] [I] Throughput: 18.1263 qps
[01/30/2023-22:50:47] [I] Latency: min = 54.4517 ms, max = 58.1728 ms, mean = 55.1214 ms, median = 55.0273 ms, percentile(99%) = 58.1728 ms
[01/30/2023-22:50:47] [I] End-to-End Host Latency: min = 54.4902 ms, max = 58.2183 ms, mean = 55.1679 ms, median = 55.0585 ms, percentile(99%) = 58.2183 ms
[01/30/2023-22:50:47] [I] Enqueue Time: min = 2.00269 ms, max = 16.5146 ms, mean = 3.04357 ms, median = 2.31653 ms, percentile(99%) = 16.5146 ms
[01/30/2023-22:50:47] [I] H2D Latency: min = 0.0561523 ms, max = 0.0667725 ms, mean = 0.0589101 ms, median = 0.0580444 ms, percentile(99%) = 0.0667725 ms
[01/30/2023-22:50:47] [I] GPU Compute Time: min = 54.3936 ms, max = 58.1107 ms, mean = 55.0595 ms, median = 54.9636 ms, percentile(99%) = 58.1107 ms
[01/30/2023-22:50:47] [I] D2H Latency: min = 0.00195312 ms, max = 0.00354004 ms, mean = 0.00301089 ms, median = 0.0030365 ms, percentile(99%) = 0.00354004 ms
[01/30/2023-22:50:47] [I] Total Host Walltime: 3.08943 s
[01/30/2023-22:50:47] [I] Total GPU Compute Time: 3.08333 s
[01/30/2023-22:50:47] [I] Explanations of the performance metrics are printed in the verbose logs.
[01/30/2023-22:50:47] [I]
&&&& PASSED TensorRT.trtexec [TensorRT v8001] # /usr/src/tensorrt/bin/trtexec --onnx=/usr/src/tensorrt/data/resnet50/ResNet50.onnx
[01/30/2023-22:50:47] [I] [TRT] [MemUsageChange] Init cuBLAS/cuBLASLt: CPU +0, GPU +0, now: CPU 856, GPU 3699 (MiB)