Performance of Neural Network on Jetson

msee16012 · June 25, 2019, 12:18pm

Dear Concern,

I want to break down the performance of a neural network on Jetson.
Is there a method that reports layer-wise performance on Jetson in terms of time and energy?

Thanks

dusty_nv · June 25, 2019, 3:39pm

Hi msee16012, if you are using TensorRT, you could utilize its IProfiler interface, which will report the execution time that each layer takes.
See the following documentation for more info about it:
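As a rough illustration of that interface (a sketch, not taken from the linked documentation), a profiler in the TensorRT Python API could look like the following. `LayerTimer` and `summary` are hypothetical names chosen here; the `ImportError` fallback is only there so the class can be tried on a machine without TensorRT.

```python
from collections import defaultdict

try:
    import tensorrt as trt          # available on a Jetson with TensorRT installed
    _ProfilerBase = trt.IProfiler
except ImportError:                 # fallback so the class can be exercised off-device
    _ProfilerBase = object


class LayerTimer(_ProfilerBase):
    """Accumulates the per-layer times reported after each profiled inference."""

    def __init__(self):
        _ProfilerBase.__init__(self)
        self.total_ms = defaultdict(float)
        self.calls = defaultdict(int)

    def report_layer_time(self, layer_name, ms):
        # TensorRT invokes this callback with a layer name and its time in ms.
        self.total_ms[layer_name] += ms
        self.calls[layer_name] += 1

    def summary(self):
        # Average time per layer in ms, slowest layer first.
        avg = {n: self.total_ms[n] / self.calls[n] for n in self.total_ms}
        return sorted(avg.items(), key=lambda kv: kv[1], reverse=True)
```

Assuming an existing execution context named `context`, the profiler would be attached with `context.profiler = LayerTimer()` before running inference, and `summary()` read afterwards. The layer names reported are those of the built engine, so they may not match the layers of the original network one to one.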

msee16012 · June 25, 2019, 7:18pm

Thanks.

Can I profile a neural network layer by layer without using TensorRT, for example with NVIDIA Nsight Systems?

dusty_nv · June 25, 2019, 8:30pm

You can use the Visual Profiler to measure the execution time of the individual CUDA kernels, but you would have to map these back to the layers. You could instrument the code with markers via the NVTX API, such that the names of the code sections would appear in the profiler tool.

However, if you are using a framework, the invocation of the individual layers for inference might be abstracted away. I believe some frameworks have built-in profiling functions to serve this purpose.
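The NVTX instrumentation mentioned above could be sketched in Python as follows. This assumes NVIDIA's `nvtx` Python package rather than the C API; `layer_range` and `run_layers` are hypothetical helpers, and the list of `(name, callable)` pairs is a stand-in for a real model whose layers you can call one at a time.

```python
from contextlib import contextmanager

try:
    import nvtx                      # NVIDIA's `nvtx` Python package, if installed
except ImportError:
    nvtx = None


@contextmanager
def layer_range(name):
    """Mark a named range that the profiler can display; a no-op without nvtx."""
    if nvtx is None:
        yield
    else:
        with nvtx.annotate(name):
            yield


def run_layers(layers, x):
    # `layers` is a hypothetical list of (name, callable) pairs standing in for a model.
    for name, fn in layers:
        with layer_range(name):
            x = fn(x)
    return x
```

Each range covers the CPU-side call of one layer. Because CUDA kernels are launched asynchronously, the kernels themselves may finish after the range closes, so the ranges are a guide for matching kernels to layers in the profiler timeline rather than an exact per-layer GPU time.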

msee16012 · July 2, 2019, 11:04am

Hi,

I am able to get the execution time of each layer using TensorRT.

Here are my questions:

1. TensorRT applies optimizations, so profiling with TensorRT gives me the time for each layer of the optimized model, not the original model. I want to profile the original model.
2. I am getting execution time. Is there any way to get energy and memory accesses for each layer?

AastaLLL · July 9, 2019, 9:24am

Hi,

1. You can profile the original model directly with its framework.
For example, if a model comes from TensorFlow, you should be able to profile it with TensorFlow directly.

2. We only support execution time profiling at the layer level.
You can get some energy and memory information with tegrastats, but not at the layer level.

sudo tegrastats
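To turn that output into numbers for a whole inference run (not per layer), something like the sketch below could be used. The `SAMPLE` line is made up for illustration, and the assumed format (`RAM used/totalMB`, and rail names such as `VDD_IN` followed by current/average power in milliwatts) varies between Jetson modules and JetPack releases, so the regular expressions will likely need adjusting for your board.

```python
import re

# Made-up example line; real tegrastats output differs per module and release.
SAMPLE = ("RAM 2448/7860MB (lfb 1x2MB) CPU [2%@345,off,off,1%@345] "
          "GR3D_FREQ 0% VDD_IN 3964/3964 VDD_CPU 152/152")

RAIL_RE = re.compile(r"\b(VDD_\w+)\s+(\d+)(?:mW)?/(\d+)(?:mW)?")
RAM_RE = re.compile(r"\bRAM (\d+)/(\d+)MB")


def parse_tegrastats_line(line):
    """Return RAM usage in MB and current rail power in mW from one line."""
    stats = {}
    ram = RAM_RE.search(line)
    if ram:
        stats["ram_used_mb"] = int(ram.group(1))
        stats["ram_total_mb"] = int(ram.group(2))
    # Keep the first number of each "current/average" pair.
    stats["power_mw"] = {name: int(cur) for name, cur, _avg in RAIL_RE.findall(line)}
    return stats


def energy_joules(power_samples_mw, interval_s):
    """Approximate energy as the sum of power * interval over evenly spaced samples."""
    return sum(power_samples_mw) / 1000.0 * interval_s
```

For example, logging tegrastats while the network runs, collecting the `VDD_IN` value from each line, and passing those values with the sampling interval to `energy_joules` gives an estimate of board-level energy for the run. It includes everything else the board is doing during that time.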

Thanks.
