Memory usage difference between multimedia_apis and gstreamer commands

Our program, which is based on multimedia_apis, uses an unexpectedly large amount of memory when encoding 3840x2160 video.

  • When the gstreamer command below is run (3840x2160 encode), the memory usage is about 150 MB:
  • gst-launch-1.0 v4l2src device="/dev/video1" ! 'video/x-raw, width=1280, height=720, format=(string)I420, framerate=25/1' ! nvvidconv ! 'video/x-raw(memory:NVMM), width=(int)3840, height=(int)2160, format=(string)I420' ! omxh265enc bitrate=2000000 ! 'video/x-h265, stream-format=(string)byte-stream' ! rtph265pay pt=98 ! udpsink host=192.xxx.xx.xx port=6000
    
  • When we use multimedia_apis to implement a similar function, the memory usage is much larger: 511 MB.
  • I built the demo tegra_multimedia_api_bk\samples\01_video_encode and then ran:
    ./video_encode Kimono_1920x1080.yuv 3840 2160 H265 Kimono_1920x1080.h265 -br 410000 -ifi 25 -fps 25 1 -hpt 1
    
  • I also tried to reduce the buffer count from 10 to 4 (in video_encode_main.cpp, lines 838, 843, 848, 855, and 1099).
  • This reduces memory usage to about 360 MB, but it is still much higher than with gstreamer.

    So, what is the reason? How can I optimize our program to lower its memory usage?

    Thx!

    Hi,
    One optimization is to map and unmap the buffer memory dynamically:

    ret = ctx.enc->output_plane.mapOutputBuffers(v4l2_buf, ctx.output_plane_fd[i]);
    
    ret = ctx.enc->output_plane.unmapOutputBuffers(i, ctx.output_plane_fd[i]);
    

    In the default code flow, mapping is done once at initialization and unmapping once at termination. Instead, you can map and unmap around reading every frame.
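
    A minimal sketch of the per-frame flow, assuming the DMABUF output-plane path of 01_video_encode (a sketch rather than the exact sample code; error handling is trimmed, and NvBufferMemSyncForDevice is the nvbuf_utils call the sample uses to flush CPU writes before encoding):

    NvBuffer *buffer = ctx.enc->output_plane.getNthBuffer(i);

    // Map just before the CPU writes the frame into the buffer.
    ret = ctx.enc->output_plane.mapOutputBuffers(v4l2_buf, ctx.output_plane_fd[i]);
    read_video_frame(ctx.in_file, *buffer);
    for (uint32_t j = 0; j < buffer->n_planes; j++)
        NvBufferMemSyncForDevice(ctx.output_plane_fd[i], j,
                                 (void **)&buffer->planes[j].data);

    ret = ctx.enc->output_plane.qBuffer(v4l2_buf, NULL);
    ret = ctx.enc->output_plane.dqBuffer(v4l2_buf, &buffer, NULL, 10);

    // Unmap as soon as the CPU no longer needs the buffer.
    ret = ctx.enc->output_plane.unmapOutputBuffers(i, ctx.output_plane_fd[i]);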

    Thanks!

    So, does this help on the decoder side as well?

    And here are some more questions…

  • Why does a multimedia_apis program use more RAM than the equivalent gstreamer command?
  • Do mm-api and gstreamer use different low-level libraries?
  • Or do they work in different ways?
    Thanks very much!!

    Hi,

    By default the decoded buffers are not mapped to CPU. If you want to do post-processing on decoded buffers, you can dynamically map/unmap the buffers via NvBuffer APIs.
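
    For example, a decoded buffer's dmabuf fd could be mapped for CPU reads roughly like this (a sketch using the nvbuf_utils NvBuffer calls; dmabuf_fd here stands for the fd of the dequeued capture-plane buffer):

    #include "nvbuf_utils.h"

    // Sketch: map one plane of a decoded buffer only while the CPU reads it.
    void *vaddr = NULL;
    if (NvBufferMemMap(dmabuf_fd, 0, NvBufferMem_Read, &vaddr) == 0)
    {
        NvBufferMemSyncForCpu(dmabuf_fd, 0, &vaddr);  // make the CPU view coherent
        // ... post-process the pixels at vaddr ...
        NvBufferMemUnMap(dmabuf_fd, 0, &vaddr);       // release the mapping
    }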

    The main difference should be in buffer map/unmap.

    No, the low-level libraries are the same.

    One accesses them through gstreamer and the other through v4l2.

    I tried to implement dynamic map/unmap based on the demo 01_video_encode. My approach is (showing only the important lines):

  • delete the code between roughly lines 909 and 1085 (the for loop that maps memory for the output plane)
  • in the while loop after that for loop, call
  • ctx.enc->output_plane.getNthBuffer(0)
    
    ctx.enc->output_plane.mapOutputBuffers(v4l2_buf, ctx.output_plane_fd[0])
    

    then call read_video_frame(), etc. (unchanged)

    after qBuffer() is called, call dqBuffer(), and then call

    ctx.enc->output_plane.unmapOutputBuffers(0, ctx.output_plane_fd[0])
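
    In code form, the steps above amount to roughly the following (a sketch; v4l2_buf is set up exactly as in the original loop):

    NvBuffer *buffer = ctx.enc->output_plane.getNthBuffer(0);
    ret = ctx.enc->output_plane.mapOutputBuffers(v4l2_buf, ctx.output_plane_fd[0]);

    read_video_frame(ctx.in_file, *buffer);  // unchanged

    ret = ctx.enc->output_plane.qBuffer(v4l2_buf, NULL);
    ret = ctx.enc->output_plane.dqBuffer(v4l2_buf, &buffer, NULL, 10);
    ret = ctx.enc->output_plane.unmapOutputBuffers(0, ctx.output_plane_fd[0]);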
    

    But the memory cost is almost the same (510 MB for the 4K encode, and the output h265 file is identical), so… am I doing something wrong, and how can I fix it? Or can you give me some code or a demo that uses dynamic map/unmap?

    Thanks!! :-)

    Hi,
    Please share how you profile memory usage.

    I use the tool: tegrastats

    I opened two SSH sessions: one runs video_encode, and the other runs:

    ./tegrastats --interval 50 | cut -c1-15
    

    to show memory usage.

    [memory usage while video_encode is running] - [memory usage while video_encode isn’t running] is about 510 MB.

    I carried out another test in the demo:

    setup_output_dmabuf(&ctx, NUM_BUFFERS)
    

    With the macro NUM_BUFFERS defined as 10, 6, and 4, the memory usage is 510 MB, 510 MB, and 359 MB respectively, regardless of whether I dynamically map/unmap.
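
    For reference, a rough estimate of what the raw output-plane surfaces alone should cost, assuming tightly packed I420 at 3840x2160 (an assumption; real NvBuffer allocations are pitch-aligned, so actual usage is somewhat higher):

    #include <cstdio>

    // Sketch: raw surface memory for N output-plane buffers (I420 = 12 bits/pixel).
    int main()
    {
        const double bytes_per_frame = 3840.0 * 2160.0 * 3.0 / 2.0;
        const int counts[] = {10, 6, 4};
        for (int n : counts)
            printf("%2d buffers: ~%.0f MB\n", n, n * bytes_per_frame / (1024.0 * 1024.0));
        return 0;
    }

    This works out to roughly 119 MB, 71 MB, and 47 MB, so the queued surfaces themselves explain only part of the totals above.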

    Hi,
    We will check and clarify.