Description
We are observing spikes in RAM usage (~40 GB) while using Triton Inference Server. Our pipeline contains Python backend models (CPU and GPU) and TensorRT models, and we also use BLS (Business Logic Scripting); a rough sketch of the BLS pattern is shown below.
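For reference, the BLS path in our pipeline looks roughly like the minimal sketch below. The model name `trt_model` and the tensor names `INPUT0`/`OUTPUT0` are placeholders for illustration, not our actual configuration:

```python
import triton_python_backend_utils as pb_utils


class TritonPythonModel:
    def execute(self, requests):
        responses = []
        for request in requests:
            # Forward the incoming tensor to a TensorRT model via BLS.
            # "INPUT0", "OUTPUT0", and "trt_model" are placeholder names.
            input_tensor = pb_utils.get_input_tensor_by_name(request, "INPUT0")
            infer_request = pb_utils.InferenceRequest(
                model_name="trt_model",
                requested_output_names=["OUTPUT0"],
                inputs=[input_tensor],
            )
            infer_response = infer_request.exec()
            if infer_response.has_error():
                raise pb_utils.TritonModelException(
                    infer_response.error().message())
            output = pb_utils.get_output_tensor_by_name(
                infer_response, "OUTPUT0")
            responses.append(
                pb_utils.InferenceResponse(output_tensors=[output]))
        return responses
```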
Environment
Triton version: 22.12 (nvcr.io/nvidia/tritonserver:22.12-py3)
Python Backend version: r21.08
TensorRT Version: 8.5.1
GPU Type: GeForce RTX 2080 SUPER
NVIDIA Driver Version: 510.108.03
CUDA Version: 11.8
CUDNN Version: 8.7.0 GA
Operating System + Version: Ubuntu 20.04
Python Version (if applicable): 3.7
Relevant Files
Steps To Reproduce
We could not pinpoint the exact cause of the spikes, so we cannot specify exact steps to reproduce. The issue occurs mostly on long runs (8-12 hours) of Triton Server.
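Since the spikes only appear on long runs, we have been considering running a watcher like the sketch below alongside the server to timestamp when the resident memory of the tritonserver process grows. This is just an assumption on how one might catch the spike (it uses the third-party psutil package and is not part of our deployment):

```python
import time

import psutil


def log_tritonserver_rss(interval_s: int = 60) -> None:
    """Log the RSS of every tritonserver process once per interval,
    so a RAM spike during an 8-12 hr run can be timestamped."""
    while True:
        for proc in psutil.process_iter(["name", "memory_info"]):
            name = proc.info["name"]
            mem = proc.info["memory_info"]
            if not name or "tritonserver" not in name or mem is None:
                continue
            rss_gb = mem.rss / 1024 ** 3
            print(f"{time.strftime('%Y-%m-%d %H:%M:%S')} "
                  f"pid={proc.pid} rss={rss_gb:.2f} GiB", flush=True)
        time.sleep(interval_s)


if __name__ == "__main__":
    log_tritonserver_rss()
```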