Extra memory usage on GPU0 when testing deepstream-segmentation-app on GPU1

• Hardware Platform (GPU) RTX 3080 ×2
• DeepStream Version 6.0.1
• TensorRT Version 8.0.1.6
• NVIDIA GPU Driver Version 470.82.01

• Issue Type (bug)
We found extra memory usage on GPU0 (the default device) when running a segmentation model in a multi-GPU application.

• How to reproduce the issue?
/opt/nvidia/deepstream/deepstream/sources/apps/sample_apps/deepstream-segmentation-test
We set all of the gpu-id properties to 1 (a rough sketch of the change is below).
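For reference, here is a hypothetical sketch of what "set all the gpu-id properties to 1" can look like in the sample app. It assumes the stock element variable names (streammux, decoder, seg, nvsegvisual) from deepstream_segmentation_app.c and is not the poster's exact diff; the nvinfer gpu-id can also be set in the [property] section of dstest_segmentation_config_industrial.txt.

```c
/* Hypothetical fragment (not the poster's exact change): inside main() of
 * deepstream_segmentation_app.c, point every element that exposes a
 * gpu-id property at GPU 1. Variable names follow the stock sample app. */
#define TARGET_GPU_ID 1

g_object_set (G_OBJECT (streammux),   "gpu-id", TARGET_GPU_ID, NULL);
g_object_set (G_OBJECT (decoder),     "gpu-id", TARGET_GPU_ID, NULL);   /* nvv4l2decoder */
g_object_set (G_OBJECT (seg),         "gpu-id", TARGET_GPU_ID, NULL);   /* nvinfer */
g_object_set (G_OBJECT (nvsegvisual), "gpu-id", TARGET_GPU_ID, NULL);
```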


./deepstream-segmentation-app dstest_segmentation_config_industrial.txt /opt/nvidia/deepstream/deepstream-6.0/samples/streams/sample_industrial.jpg


I am checking.

  1. After checking, deepstream_segmentation_app1.c can reproduce this issue while deepstream_segmentation_app2.c can’t; the difference is how nvinfer is used.
    deepstream_segmentation_app1.c (13.9 KB)
    deepstream_segmentation_app2.c (13.9 KB)

  2. Checked the result with gst-launch-1.0 commands:
    This command can’t reproduce the issue:
    gst-launch-1.0 filesrc location=…/…/…/…/samples/streams/sample_720p.mjpeg ! jpegparse ! nvv4l2decoder gpu-id=1 ! mux.sink_0 nvstreammux name=mux batch-size=1 width=1280 height=720 gpu-id=1 ! fakesink
    This command can reproduce it:
    gst-launch-1.0 filesrc location=…/…/…/…/samples/streams/sample_720p.mjpeg ! jpegparse ! nvv4l2decoder gpu-id=1 ! mux.sink_0 nvstreammux name=mux batch-size=1 width=1280 height=720 gpu-id=1 ! nvinfer config-file-path=/opt/nvidia/deepstream/deepstream/sources/apps/sample_apps/deepstream-test1/dstest1_pgie_config.txt gpu-id=1 ! fakesink

Will continue to check.

It is a reproducible bug. The nvinfer plugin is open source; please use this workaround:

  1. Modify /opt/nvidia/deepstream/deepstream-6.1/sources/gst-plugins/gst-nvinfer/gstnvinfer.cpp like this:
    static gpointer gst_nvinfer_input_queue_loop (gpointer data)
    {
      GstNvInfer *nvinfer = (GstNvInfer *) data;
      cudaSetDevice (nvinfer->gpu_id);   /* added: pin this thread to the configured GPU */
      /* ... rest of the function body unchanged ... */
    }
  2. Compile, then copy libnvdsgst_infer.so to /opt/nvidia/deepstream/deepstream/lib/gst-plugins; back up the old libnvdsgst_infer.so first. An optional check to confirm the rebuilt plugin is in use is sketched after these steps.
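Optionally, to confirm that the rebuilt plugin is really the one being loaded and that the input-queue thread is now pinned to the intended GPU, a small debug addition (my own hypothetical check, not part of the official patch) can be placed right after the cudaSetDevice call and removed once verified:

```c
/* Hypothetical debug lines, inserted just after the cudaSetDevice() call
 * above; prints which CUDA device this thread is actually using.
 * Not part of the official gstnvinfer.cpp source - remove after verifying. */
int current_dev = -1;
cudaError_t err = cudaGetDevice (&current_dev);
g_print ("gst_nvinfer_input_queue_loop: configured gpu-id=%u, current CUDA device=%d (%s)\n",
         nvinfer->gpu_id, current_dev, cudaGetErrorString (err));
```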

@wlzkobe Can you let us know if the above workaround works for your case? Thanks.

No, it still doesn’t work.

We’ve tested this workaround, but still the same result.

One correction:
2. Compile, then copy libnvdsgst_infer.so to /opt/nvidia/deepstream/deepstream/lib/gst-plugins; back up the old libnvdsgst_infer.so first.

We know the correct path, since it’s a GStreamer plugin lib.

The GPU 0 memory usage appears after the program has been running for a while, not right at the start.

  1. Using the workaround, is there any improvement?
  2. Using the workaround, I can’t reproduce this issue with ./deepstream-segmentation-app -t infer dstest_segmentation_config_industrial.txt /opt/nvidia/deepstream/deepstream/samples/streams/sample_industrial.jpg. Could you provide simplified code to reproduce it? Thanks!

  1. It seems there is no improvement.
  2. We also used this demo and the same command.

But our SDK version is 6.0.1, since the latest 6.1 requires Ubuntu 20.04,

and this is the source code we modified:
deepstream_segmentation_app.c (13.8 KB)



My code deepstream_segmentation_app1.c is similar to yours.
You can test in the DS 6.1 Docker container; here is the link: Docker Containers — DeepStream 6.1.1 Release documentation

This problem is confirmed to be a bug in the nvinfer plugin.
It can be reproduced simply with a gst-launch-1.0 command like this:

gst-launch-1.0 \
rtspsrc location=rtsp://RTSP_RESOURCE latency=200 drop-on-latency=1 ! rtph264depay ! \
nvv4l2decoder gpu-id=1 ! m.sink_0 \
nvstreammux gpu-id=1 name=m batch-size=1 width=1280 height=720 batched-push-timeout=40000 ! \
nvinfer gpu-id=1 config-file-path=INFER_CONFIG_FILE ! fakesink

This is annoying!

Yes, can you try the fix in comment 5?


I just tested the fix, and it works on a 2080 Ti / DeepStream 6.1-dev / NVIDIA driver 510.

Before the fix:
|    0   N/A  N/A     25172      C   gst-launch-1.0                    159MiB |
|    1   N/A  N/A     25172      C   gst-launch-1.0                    817MiB |

After the fix:
|    1   N/A  N/A     23992      C   gst-launch-1.0                    825MiB |

Would you give a brief explanation? I’ve read through most of the gstnvinfer and nvinfer code, but I can’t figure out how your one-line fix works.

Thanks for your update. Please refer to the cudaSetDevice explanation in the CUDA Runtime API :: CUDA Toolkit Documentation: the current device is per-thread state, so it needs to be set as the current device inside the gst_nvinfer_input_queue_loop thread; otherwise that thread keeps using device 0 by default.
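To make the per-thread behaviour concrete, here is a minimal standalone sketch (hypothetical demo code, not from DeepStream or the plugin): a worker thread that never calls cudaSetDevice allocates on GPU 0 even though the main thread selected GPU 1, which is why the unpatched input-queue thread shows up as extra memory usage on GPU 0. The build line assumes a typical CUDA install path.

```c
/* Minimal illustration (not DeepStream code): the current CUDA device is
 * per-thread state. A worker thread that never calls cudaSetDevice()
 * allocates on device 0, even if the main thread selected device 1.
 * Assumed build: gcc device_demo.c -I/usr/local/cuda/include \
 *                -L/usr/local/cuda/lib64 -lcudart -lpthread */
#include <stdio.h>
#include <pthread.h>
#include <cuda_runtime.h>

static void *worker (void *arg)
{
  int pin_device = *(int *) arg;   /* <0 means "do not pin", mimicking the unpatched loop */
  if (pin_device >= 0)
    cudaSetDevice (pin_device);    /* the one-line fix: pin this thread to the target GPU */

  void *buf = NULL;
  cudaMalloc (&buf, 64 << 20);     /* 64 MiB; counts against this thread's current device */

  int dev = -1;
  cudaGetDevice (&dev);
  printf ("worker (pin=%d): allocation went to device %d\n", pin_device, dev);

  cudaFree (buf);
  return NULL;
}

int main (void)
{
  cudaSetDevice (1);               /* main thread selects GPU 1 ... */

  pthread_t t;
  int no_pin = -1, pin_gpu1 = 1;

  pthread_create (&t, NULL, worker, &no_pin);    /* ... but this thread still uses GPU 0 */
  pthread_join (t, NULL);

  pthread_create (&t, NULL, worker, &pin_gpu1);  /* with cudaSetDevice(1) it uses GPU 1 */
  pthread_join (t, NULL);
  return 0;
}
```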
