TX2NX Swap Filling Up Due To Possible Memory Leak

geralt_of_rivia · August 18, 2021, 9:09am

Issue:

We are currently running a two-staged DeepStream pipeline with PGIE (Object Detection) and SGIE (Object Detection). What we have observed is after a while the swap space on TX2NX (which is 2GB) fills up and so does the RAM. This causes the application to slow down and in some cases stop the downstream tasks like relay actuation and so forth.

The application architecture is slightly complex but the following are the Docker containers that run at the same time:

Django application for APIs
Kafka container for message passing
DeepStream based application
NGINX

Right after running the application, jtop looks like following:

After several hours of running (usually 4 - 5), jtop looks like following:

As you can see the memory has gone up and so has swap space.

What we have tried

Profiling Tools
We have tried using several tools to profile the applications like valgrind, cuda-memcheck, heaptrack and have not found any significant leaks in the application.

Valgrind: detects 800+MB leaks in libcuda.so. But according to our research (refer link) valgrind reports false positives with cuda so we didn’t take this very seriously.
cuda-memcheck: no leaks/errors
heaptrack: no major memory leaks (detects ~6MB of memory leak in gstreamer which should be OK)

Restarting containers
We have also tried restarting certain containers to boil down the issue as restarting the containers frees a certain amount of swap memory. The exact memory freed in most cases is arbitrary and has the following range:

Restarting Kafka: frees 600 MB - 1.1 GB of Swap
Restarting DeepStream: frees 500 - 1.5 GB of Swap
Restarting APIs, and NGINX have almost no effect.

Swappiness
We have also things like setting the swappiness using vm.swappiness in /etc/sysctl.conf but it didnt change anything.

Disabling custom processing and postprocessing
To rule out the possibility of memleak in our custom code, we disabled all gstreamer probes, and even bbox parsing functions where the only thing that was allowed to run was the DeepStream pipeline (only PGIE would run at this point, as we disabled bbox parsing, no objects would reach SGIE) and we still observed slow increase in the swap space. This strengthens argument (1) in suspicion below.

Suspicion

Memory leak in DeepStream
I am 99% confident that there are no leaks in the custom code that is developed. For critical components like buffer conversion and creation, we are making sure we use appropriate unmapping, and destruction of streams.
Write caching
I have read that Jetson devices write to swap memory first before flushing to disk to improve latency. We are using SPD-logger extensively to flush logs so maybe too much disk I/O is causing Jetson to cache in the swap space?

• Hardware Platform: Jetson TX2NX
• DeepStream Version: 5.1
• JetPack Version: 4.5.1
• Issue Type: bug/question

geralt_of_rivia · August 20, 2021, 5:34am

Experiments I tried:

Running the same models that I have using deepstream-app in this case the memory does not go up (monitored for 2 hours)
Ran my application OUTSIDE of docker, in this case the memory consumption speed is slower. Eventually it does go up but it’s a lot slower outside of docker than inside of docker. Is this an issue?

I have read up on other alternatives to Kafka like Mosquitto that are more lightweight and suited for IOT devices, would that make sense using here? Looking forward to a response!

mchi · August 20, 2021, 2:34pm

Hi @geralt_of_rivia ,
Sorry for delay!
Do you mean there is memory leak in DeepStream?
Could you use the script in DeepStream SDK FAQ - #14 by mchi to capture the memory usage log for some time to find out which kind of memory it leaks?

What’s your pipeline?
And, in Valgrind log, did you see other suspicious leak log?
I think, you can use Valgrind to run the application for one hour and three hours respectively, and compare the Valgrind log to find out the suspicious leak.

Thanks!

Topic		Replies	Views
Memory leak in Deepstream 6.0 when running with RTSP Streams DeepStream SDK rtsp , gstreamer	12	780	October 16, 2023
DS 5.0.1 memory leak DeepStream SDK jetson-inference , nvbugs	7	626	October 12, 2021
Memory Leak in DeepStream Python bindings DeepStream SDK jetson-inference , gstreamer , docker , python	2	405	October 6, 2022
Unexpected memory usage in deepstream-test3 DeepStream SDK	25	2172	July 6, 2021
After 10h or more runtime, swap and memory is suddenly full used on jestson nano? DeepStream SDK	9	1781	October 12, 2021
Memory Leak of when running deepstream python (using grpc, triton-server, docker, Ubuntu) DeepStream SDK deepstream	11	168	March 10, 2025
Memory leak in DeepStream DeepStream SDK	33	5911	June 27, 2022
Default Deepstream app causing memory leaks! DeepStream SDK	3	468	October 12, 2021
Deepstream pipeline is causing memory leak DeepStream SDK	3	265	May 14, 2024
Memory leak in test5-app with smart-record DeepStream SDK gstreamer , nvbugs	14	2138	December 7, 2021

TX2NX Swap Filling Up Due To Possible Memory Leak

Issue:

What we have tried

Suspicion

Related topics