We are using a custom-developed controller board with the Jetson Xavier NX module. The setup runs fine when no deep learning models are in use.
We run our deep learning models as ensembles on Triton Inference Server. Whenever we query these ensemble models, we see a spike in current draw that sometimes causes the entire system to restart. The spike appears to come from the power drawn by the GPU while inference is running. The problem does not occur on the Xavier NX development kit, but I would like to solve it in software rather than modify the controller board we have developed.
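For reference, the ensembles are queried from a standard Triton gRPC client, roughly like the sketch below. The model name, input/output tensor names, and shape are placeholders here, not our actual configuration:

```python
# Rough sketch of how we query the ensemble; names and shapes are placeholders.
import numpy as np
import tritonclient.grpc as grpcclient

client = grpcclient.InferenceServerClient(url="localhost:8001")

# Dummy FP32 input for a hypothetical ensemble taking a single image tensor.
data = np.random.rand(1, 3, 224, 224).astype(np.float32)
infer_input = grpcclient.InferInput("INPUT__0", list(data.shape), "FP32")
infer_input.set_data_from_numpy(data)

# The current spike shows up as soon as a request like this reaches the GPU.
result = client.infer(
    model_name="my_ensemble",  # placeholder ensemble name
    inputs=[infer_input],
    outputs=[grpcclient.InferRequestedOutput("OUTPUT__0")],
)
print(result.as_numpy("OUTPUT__0").shape)
```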
Can we limit the power drawn by the device, or is there another way to solve this?
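What I had in mind on the software side is something like the sketch below: switch to a lower nvpmodel power mode and additionally cap the GPU devfreq clock before inference starts. This is only an idea, not something we have validated; the mode ID and the sysfs path are assumptions for Xavier NX and would need to be checked against /etc/nvpmodel.conf and /sys/devices/gpu.0/devfreq/ on the actual unit, and it has to run as root. Is this the right mechanism, or is there a better supported way?

```python
# Hedged sketch of the mitigation I am considering. Mode ID and devfreq path
# are assumptions for Xavier NX; must be run as root.
import subprocess
from pathlib import Path


def set_power_mode(mode_id: int) -> None:
    """Switch to a lower-power nvpmodel mode (mode IDs are defined in /etc/nvpmodel.conf)."""
    subprocess.run(["nvpmodel", "-m", str(mode_id)], check=True)


def cap_gpu_max_freq(max_freq_hz: int,
                     devfreq_dir: str = "/sys/devices/gpu.0/devfreq/17000000.gv11b") -> None:
    """Clamp the GPU's maximum devfreq frequency; the directory name is an assumption."""
    node = Path(devfreq_dir)
    available = (node / "available_frequencies").read_text().split()
    # Pick the highest available frequency that does not exceed the requested cap.
    chosen = max(int(f) for f in available if int(f) <= max_freq_hz)
    (node / "max_freq").write_text(str(chosen))


if __name__ == "__main__":
    set_power_mode(1)               # e.g. a 10 W mode instead of 20 W (assumed mode ID)
    cap_gpu_max_freq(510_000_000)   # cap the GPU at ~510 MHz (assumed value)
```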
TensorRT Version: 8.5.2.2
GPU Type: NVIDIA Volta architecture (Jetson Xavier NX integrated GPU)
NVIDIA Driver Version:
CUDA Version: 11.4
cuDNN Version: 8.6
Operating System + Version: JetPack 5.1
Python Version (if applicable): 3.8
TensorFlow Version (if applicable): NA
PyTorch Version (if applicable): NA
Baremetal or Container (if container, which image + tag): Docker