RTSP Stream from USB Cam through DeepStream Python Bindings

• Hardware Platform (Jetson / GPU)
Jetson Nano
• DeepStream Version
5.0
• JetPack Version (valid for Jetson only)
4.4

Hi Guys,
I have been working with the DeepStream SDK 5.0, and it works fine.
I played with the example "deepstream-test1-usbcam"; most parts I understood and can work with. But now I am trying to send the USB camera output, after detection and drawing, to an RTSP stream, and this does not work because I am not really familiar with pipelines and linking. Do you have good documentation here that can help me?

I understand that I get a raw frame from my camera, and after detection it needs to be encoded to H.264, because the output for RTSP should be H.264, right?
If I try the following code, my RTSP stream works, but it lags by about 10-20 seconds, and detection / tracking does not work.

I am sure I did something wrong with the linking / encoding, so if someone can help me here with a code fix / explanation, it would be very much appreciated.

Thank you very much.

Here is my Python code:

#!/usr/bin/env python3

import sys
sys.path.append('../')
import platform
#needed to parse tracker:
import configparser


import gi
gi.require_version('Gst', '1.0')
gi.require_version('GstRtspServer', '1.0')
from gi.repository import GObject, Gst, GstRtspServer
#from gi.repository import GObject, Gst
from common.is_aarch_64 import is_aarch64
from common.bus_call import bus_call

import pyds

PGIE_CLASS_ID_VEHICLE = 0
PGIE_CLASS_ID_BICYCLE = 1
PGIE_CLASS_ID_PERSON = 2
PGIE_CLASS_ID_ROADSIGN = 3


def osd_sink_pad_buffer_probe(pad,info,u_data):
    frame_number=0
    #Initializing object counter with 0.
    obj_counter = {
        PGIE_CLASS_ID_VEHICLE:0,
        PGIE_CLASS_ID_PERSON:0,
        PGIE_CLASS_ID_BICYCLE:0,
        PGIE_CLASS_ID_ROADSIGN:0
    }
    num_rects=0

    gst_buffer = info.get_buffer()
    if not gst_buffer:
        print("Unable to get GstBuffer ")
        return

    # Retrieve batch metadata from the gst_buffer
    # Note that pyds.gst_buffer_get_nvds_batch_meta() expects the
    # C address of gst_buffer as input, which is obtained with hash(gst_buffer)
    batch_meta = pyds.gst_buffer_get_nvds_batch_meta(hash(gst_buffer))
    l_frame = batch_meta.frame_meta_list
    while l_frame is not None:
        try:
            # Note that l_frame.data needs a cast to pyds.NvDsFrameMeta
            # The casting is done by pyds.NvDsFrameMeta.cast()
            # The casting also keeps ownership of the underlying memory
            # in the C code, so the Python garbage collector will leave
            # it alone.
            frame_meta = pyds.NvDsFrameMeta.cast(l_frame.data)
        except StopIteration:
            break

        frame_number=frame_meta.frame_num
        num_rects = frame_meta.num_obj_meta
        l_obj=frame_meta.obj_meta_list
        while l_obj is not None:
            try:
                # Casting l_obj.data to pyds.NvDsObjectMeta
                obj_meta=pyds.NvDsObjectMeta.cast(l_obj.data)
                # added for tracker
                print("Show Object ID/Tracking ID?")
                print(obj_meta.object_id)
            except StopIteration:
                break
            obj_counter[obj_meta.class_id] += 1
            try: 
                l_obj=l_obj.next
            except StopIteration:
                break

        # Acquiring a display meta object. The memory ownership remains in
        # the C code so downstream plugins can still access it. Otherwise
        # the garbage collector will claim it when this probe function exits.
        display_meta=pyds.nvds_acquire_display_meta_from_pool(batch_meta)
        display_meta.num_labels = 1
        py_nvosd_text_params = display_meta.text_params[0]
        # Setting display text to be shown on screen
        # Note that the pyds module allocates a buffer for the string, and the
        # memory will not be claimed by the garbage collector.
        # Reading the display_text field here will return the C address of the
        # allocated string. Use pyds.get_string() to get the string content.
        py_nvosd_text_params.display_text = "Frame Number={} Number of Objects={} Vehicle_count={} Person_count={}".format(frame_number, num_rects, obj_counter[PGIE_CLASS_ID_VEHICLE], obj_counter[PGIE_CLASS_ID_PERSON])

        # Now set the offsets where the string should appear
        py_nvosd_text_params.x_offset = 10
        py_nvosd_text_params.y_offset = 12

        # Font , font-color and font-size
        py_nvosd_text_params.font_params.font_name = "Serif"
        py_nvosd_text_params.font_params.font_size = 10
        # set(red, green, blue, alpha); set to White
        py_nvosd_text_params.font_params.font_color.set(1.0, 1.0, 1.0, 1.0)

        # Text background color
        py_nvosd_text_params.set_bg_clr = 1
        # set(red, green, blue, alpha); set to Black
        py_nvosd_text_params.text_bg_clr.set(0.0, 0.0, 0.0, 1.0)
        # Using pyds.get_string() to get display_text as string
        print(pyds.get_string(py_nvosd_text_params.display_text))
        pyds.nvds_add_display_meta_to_frame(frame_meta, display_meta)
        try:
            l_frame=l_frame.next
        except StopIteration:
            break
			
    return Gst.PadProbeReturn.OK	


def main(args):
    # Check input arguments
    if len(args) != 2:
        sys.stderr.write("usage: %s <v4l2-device-path>\n" % args[0])
        sys.exit(1)

    # Standard GStreamer initialization
    GObject.threads_init()
    Gst.init(None)

    # Create gstreamer elements
    # Create Pipeline element that will form a connection of other elements
    print("Creating Pipeline \n ")
    pipeline = Gst.Pipeline()

    if not pipeline:
        sys.stderr.write(" Unable to create Pipeline \n")

    # Source element for reading from the file
    print("Creating Source \n ")
    source = Gst.ElementFactory.make("v4l2src", "usb-cam-source")
    if not source:
        sys.stderr.write(" Unable to create Source \n")

    caps_v4l2src = Gst.ElementFactory.make("capsfilter", "v4l2src_caps")
    if not caps_v4l2src:
        sys.stderr.write(" Unable to create v4l2src capsfilter \n")


    print("Creating Video Converter \n")

    # Adding videoconvert -> nvvideoconvert as not all
    # raw formats are supported by nvvideoconvert;
    # Say YUYV is unsupported - which is the common
    # raw format for many logi usb cams
    # In case we have a camera with raw format supported in
    # nvvideoconvert, GStreamer plugins' capability negotiation
    # shall be intelligent enough to reduce compute by
    # videoconvert doing passthrough (TODO we need to confirm this)


    # videoconvert to make sure a superset of raw formats are supported
    vidconvsrc = Gst.ElementFactory.make("videoconvert", "convertor_src1")
    if not vidconvsrc:
        sys.stderr.write(" Unable to create videoconvert \n")

    # nvvideoconvert to convert incoming raw buffers to NVMM Mem (NvBufSurface API)
    nvvidconvsrc = Gst.ElementFactory.make("nvvideoconvert", "convertor_src2")
    if not nvvidconvsrc:
        sys.stderr.write(" Unable to create Nvvideoconvert \n")

    caps_vidconvsrc = Gst.ElementFactory.make("capsfilter", "nvmm_caps")
    if not caps_vidconvsrc:
        sys.stderr.write(" Unable to create capsfilter \n")

    # Create nvstreammux instance to form batches from one or more sources.
    streammux = Gst.ElementFactory.make("nvstreammux", "Stream-muxer")
    if not streammux:
        sys.stderr.write(" Unable to create NvStreamMux \n")

    # Use nvinfer to run inferencing on camera's output,
    # behaviour of inferencing is set through config file
    pgie = Gst.ElementFactory.make("nvinfer", "primary-inference")
    if not pgie:
        sys.stderr.write(" Unable to create pgie \n")

    #added Tracker function by @gudio
    tracker = Gst.ElementFactory.make("nvtracker", "tracker")
    if not tracker:
        sys.stderr.write(" Unable to create tracker \n")

    # Use convertor to convert from NV12 to RGBA as required by nvosd
    nvvidconv = Gst.ElementFactory.make("nvvideoconvert", "convertor")
    if not nvvidconv:
        sys.stderr.write(" Unable to create nvvidconv \n")

    # Create OSD to draw on the converted RGBA buffer
    nvosd = Gst.ElementFactory.make("nvdsosd", "onscreendisplay")

    if not nvosd:
        sys.stderr.write(" Unable to create nvosd \n")

    #RTSP
    #Create a caps filter
    #caps = Gst.ElementFactory.make("capsfilter", "filter")
    #caps.set_property("caps", Gst.Caps.from_string("video/x-raw(memory:NVMM), format=I420"))
    #
    caps_v4l2src.set_property('caps', Gst.Caps.from_string("video/x-raw,width=640,height=480,framerate=30/1"))
    #caps_v4l2src.set_property('caps', Gst.Caps.from_string("video/x-raw,width=640,height=480,framerate=30/1"))
    #caps_vidconvsrc.set_property('caps', Gst.Caps.from_string("video/x-raw(memory:NVMM), format=I420"))
    caps_vidconvsrc.set_property('caps', Gst.Caps.from_string("video/x-raw(memory:NVMM)"))

    #RTSP
    # Make the encoder
    encoder = Gst.ElementFactory.make("nvv4l2h264enc", "encoder")
    print("Creating H264 Encoder")

    if not encoder:
        sys.stderr.write(" Unable to create encoder")
    encoder.set_property('bitrate', 4000000)
    if is_aarch64():
        encoder.set_property('preset-level', 1)
        encoder.set_property('insert-sps-pps', 1)
        encoder.set_property('bufapi-version', 1)

    rtppay = Gst.ElementFactory.make("rtph264pay", "rtppay")
    print("Creating H264 rtppay")

    # Finally render the osd output
    if is_aarch64():
        transform = Gst.ElementFactory.make("nvegltransform", "nvegl-transform")

    #print("Creating EGLSink \n")
    #sink = Gst.ElementFactory.make("nveglglessink", "nvvideo-renderer")
    #if not sink:
    #    sys.stderr.write(" Unable to create egl sink \n")
 
    sink = Gst.ElementFactory.make("udpsink", "udpsink")
    if not sink:
        sys.stderr.write(" Unable to create udpsink")

    #print("Playing cam %s " %args[1])
    #caps_v4l2src.set_property('caps', Gst.Caps.from_string("video/x-raw, framerate=30/1"))
    #caps_v4l2src.set_property('caps', Gst.Caps.from_string("video/x-raw,width=800,height=600,framerate=20/1"))
    #caps_v4l2src.set_property('caps', Gst.Caps.from_string("video/x-raw,width=640,height=480,framerate=30/1"))
    #caps_vidconvsrc.set_property('caps', Gst.Caps.from_string("video/x-raw(memory:NVMM)"))

    # Make the UDP sink
    updsink_port_num = 5400

    sink.set_property('host', '224.224.255.255')
    sink.set_property('port', updsink_port_num)
    sink.set_property('async', False)
    sink.set_property('sync', 1)

    source.set_property('device', args[1]) 
    streammux.set_property('width', 1920)
    streammux.set_property('height', 1080)
    streammux.set_property('batch-size', 1)
    streammux.set_property('batched-push-timeout', 4000000)
    
    pgie.set_property('config-file-path', "dstest1_pgie_config.txt")
    # Set sync = false to avoid late frame drops at the display-sink
    sink.set_property('sync', False)

    # added properties for Tracker
    #Set properties of tracker
    config = configparser.ConfigParser()
    config.read('dstest2_tracker_config_v2.txt')
    config.sections()

    for key in config['tracker']:
        if key == 'tracker-width' :
            tracker_width = config.getint('tracker', key)
            tracker.set_property('tracker-width', tracker_width)
        if key == 'tracker-height' :
            tracker_height = config.getint('tracker', key)
            tracker.set_property('tracker-height', tracker_height)
        if key == 'gpu-id' :
            tracker_gpu_id = config.getint('tracker', key)
            tracker.set_property('gpu_id', tracker_gpu_id)
        if key == 'll-lib-file' :
            tracker_ll_lib_file = config.get('tracker', key)
            tracker.set_property('ll-lib-file', tracker_ll_lib_file)
        if key == 'll-config-file' :
            tracker_ll_config_file = config.get('tracker', key)
            tracker.set_property('ll-config-file', tracker_ll_config_file)
        if key == 'enable-batch-process' :
            tracker_enable_batch_process = config.getint('tracker', key)
            tracker.set_property('enable_batch_process', tracker_enable_batch_process)

    print("Adding elements to Pipeline \n")
    pipeline.add(source)
    pipeline.add(caps_v4l2src)
    pipeline.add(vidconvsrc)
    pipeline.add(nvvidconvsrc)
    pipeline.add(caps_vidconvsrc)
    pipeline.add(streammux)
    pipeline.add(pgie)
    #added Tracker to PIPeline
    pipeline.add(tracker)
    #
    pipeline.add(nvvidconv)
    pipeline.add(nvosd)
    #ENCODE / RTP
    pipeline.add(encoder)
    pipeline.add(rtppay)
    pipeline.add(sink)
 
#   if is_aarch64():
#        pipeline.add(transform)

    # we link the elements together
    # v4l2src -> capsfilter -> videoconvert -> nvvideoconvert -> capsfilter ->
    # streammux -> nvinfer -> nvtracker -> nvvideoconvert -> nvosd
    # caps_vidconvsrc -> encoder -> rtppay -> udpsink
    print("Linking elements in the Pipeline \n")
    source.link(caps_v4l2src)
    caps_v4l2src.link(vidconvsrc)
    vidconvsrc.link(nvvidconvsrc)
    nvvidconvsrc.link(caps_vidconvsrc)

    sinkpad = streammux.get_request_pad("sink_0")
    if not sinkpad:
        sys.stderr.write(" Unable to get the sink pad of streammux \n")
    srcpad = caps_vidconvsrc.get_static_pad("src")
    if not srcpad:
        sys.stderr.write(" Unable to get source pad of caps_vidconvsrc \n")
    srcpad.link(sinkpad)
    streammux.link(pgie)
    #Link tracker to  pipeline 
    pgie.link(tracker)
    tracker.link(nvvidconv)
    #
    nvvidconv.link(nvosd)
    caps_vidconvsrc.link(encoder)
    encoder.link(rtppay)
    rtppay.link(sink)
    #-- DEACTIVATED CAMERA VIEW --
    #if is_aarch64():
        #remove view
        #nvosd.link(transform)
        #transform.link(sink)
    #else:
    #    nvosd.link(sink)

    # create an event loop and feed GStreamer bus messages to it
    loop = GObject.MainLoop()
    bus = pipeline.get_bus()
    bus.add_signal_watch()
    bus.connect ("message", bus_call, loop)

    # Start streaming
    rtsp_port_num = 8554
    
    server = GstRtspServer.RTSPServer.new()
    server.props.service = "%d" % rtsp_port_num
    server.attach(None)
    
    factory = GstRtspServer.RTSPMediaFactory.new()
    factory.set_launch( "( udpsrc name=pay0 port=%d buffer-size=524288 caps=\"application/x-rtp, media=video, clock-rate=90000, encoding-name=(string)%s, payload=96 \" )" % (updsink_port_num, "H264"))
    factory.set_shared(True)
    server.get_mount_points().add_factory("/ds-test", factory)
    
    print("\n *** DeepStream: Launched RTSP Streaming at rtsp://localhost:%d/ds-test ***\n\n" % rtsp_port_num)

    # Lets add probe to get informed of the meta data generated, we add probe to
    # the sink pad of the osd element, since by that time, the buffer would have
    # had got all the metadata.
    osdsinkpad = nvosd.get_static_pad("sink")
    if not osdsinkpad:
        sys.stderr.write(" Unable to get sink pad of nvosd \n")

    osdsinkpad.add_probe(Gst.PadProbeType.BUFFER, osd_sink_pad_buffer_probe, 0)

    # start play back and listen to events
    print("Starting pipeline \n")
    pipeline.set_state(Gst.State.PLAYING)
    try:
        loop.run()
    except:
        pass
    # cleanup
    pipeline.set_state(Gst.State.NULL)

if __name__ == '__main__':
    sys.exit(main(sys.argv))

Hi,
Please refer to the sample:

You would need to integrate it with deepstream-test1-usbcam

Hi DaneLLL,

Thanks for your fast reply.

As I wrote, I already tried to combine deepstream-test1-usbcam + deepstream-test1-rtsp-out, but I failed because I am not really sure what the correct “pipeline/linking” approach is.

deepstream-test1-rtsp-out is an example with an H.264 source, but the camera provides a raw video source. I of course removed the decoders at the start, and an RTSP stream is working, but it is really lagging.

I am quite sure my issue comes from wrong handling of the video encoding/conversion, so if someone can tell me what I did wrong, it would be very much appreciated.

Thanks.

No answer? No help?

Hi,
Since the samples are there, we would encourage users to do the integration. It should work by modifying the source in deepstream-test1-rtsp-out to:

v4l2src ! video/x-raw,width=_SOURCE_W_,height=_SOURCE_H_,format=_SOURCE_FMT_,framerate=_SOURCE_FR_ ! videoconvert ! nvvideoconvert ! video/x-raw(memory:NVMM),format=NV12 ! nvstreammux ! ...

@DaneLLL
Could you please explain how to read and translate your solution into Python code?
Especially the three dots ending this pipeline are confusing. How should it end?

@project2kq54
Would you share your final solution, please?

Hi,
In deepstream-test1-rtsp-out, the source is

filesrc ! h264parse ! nvv4l2decoder ! nvstreammux ! ...

You can refer to deepstream-test1-usbcam and change it to

v4l2src ! video/x-raw,width=_SOURCE_W_,height=_SOURCE_H_,format=_SOURCE_FMT_,framerate=_SOURCE_FR_ ! videoconvert ! nvvideoconvert ! video/x-raw(memory:NVMM),format=NV12 ! nvstreammux ! ...
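
Read as Python, that source description corresponds roughly to the element chain sketched below. This is only a sketch: the 640x480 YUY2 at 30 fps caps are example values standing in for _SOURCE_W_/_SOURCE_H_/_SOURCE_FMT_/_SOURCE_FR_ and must match what your camera actually delivers, and it assumes pipeline and streammux are already created as in the code earlier in this thread.

# Sketch only: the source chain suggested above, with example caps values.
# v4l2src ! caps ! videoconvert ! nvvideoconvert ! caps(NVMM, NV12) ! nvstreammux
source = Gst.ElementFactory.make("v4l2src", "usb-cam-source")
source.set_property("device", "/dev/video0")   # adjust to your camera node

caps_v4l2src = Gst.ElementFactory.make("capsfilter", "v4l2src_caps")
caps_v4l2src.set_property(
    "caps",
    Gst.Caps.from_string("video/x-raw,width=640,height=480,format=YUY2,framerate=30/1"))

vidconvsrc = Gst.ElementFactory.make("videoconvert", "convertor_src1")
nvvidconvsrc = Gst.ElementFactory.make("nvvideoconvert", "convertor_src2")

caps_vidconvsrc = Gst.ElementFactory.make("capsfilter", "nvmm_caps")
caps_vidconvsrc.set_property(
    "caps", Gst.Caps.from_string("video/x-raw(memory:NVMM),format=NV12"))

for elem in (source, caps_v4l2src, vidconvsrc, nvvidconvsrc, caps_vidconvsrc):
    pipeline.add(elem)

source.link(caps_v4l2src)
caps_v4l2src.link(vidconvsrc)
vidconvsrc.link(nvvidconvsrc)
nvvidconvsrc.link(caps_vidconvsrc)

# Hand the converted NVMM buffers to nvstreammux on a requested sink pad.
sinkpad = streammux.get_request_pad("sink_0")
caps_vidconvsrc.get_static_pad("src").link(sinkpad)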

Thank you for the reply.
I’m not getting it yet though, sorry.

Are you able to refer to actual Python code, please?

The way the pipeline is built in Python is different from the GStreamer command you are sharing.
If I understand properly, the pipeline elements should be as follows:
pipeline.add(source)
pipeline.add(caps_v4l2src)
pipeline.add(vidconvsrc)
pipeline.add(nvvidconvsrc)
pipeline.add(caps_vidconvsrc)
pipeline.add(streammux)
pipeline.add(nvvidconv)
pipeline.add(nvosd)
pipeline.add(encoder)
pipeline.add(rtppay)
pipeline.add(sink)

So my questions are:

  • Are the pipeline elements correct? The elements between source and streammux (the first six) should correspond to the pipeline you've shared.
  • Are the literals _SOURCE_W_, _SOURCE_H_, _SOURCE_FMT_, _SOURCE_FR_ meant to be substituted with actual values? If so, can you provide a Python example of doing that?

In general, can you refer to Python code rather than GStreamer commands? I feel like I am missing or mistaking something in the string parameters to the pipeline elements, so this generic suggestion is not helpful enough.

Many thanks!
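
For illustration, substituting concrete values for those placeholders in Python usually just means building the caps string yourself. The values below are only examples and need to match a mode your camera actually supports (v4l2-ctl --list-formats-ext lists them):

# Hypothetical values standing in for _SOURCE_W_/_SOURCE_H_/_SOURCE_FMT_/_SOURCE_FR_.
SOURCE_W, SOURCE_H, SOURCE_FMT, SOURCE_FR = 640, 480, "YUY2", "30/1"

caps_str = "video/x-raw,width={w},height={h},format={fmt},framerate={fr}".format(
    w=SOURCE_W, h=SOURCE_H, fmt=SOURCE_FMT, fr=SOURCE_FR)
caps_v4l2src.set_property("caps", Gst.Caps.from_string(caps_str))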

Hi,
Please check line #144 to #189 in

You need to apply the code to deepstream-test1-rtsp-out.
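
For anyone stitching the two samples together: in deepstream-test1-rtsp-out the encoder sits after the OSD, roughly streammux ! nvinfer ! nvvideoconvert ! nvdsosd ! nvvideoconvert ! capsfilter(I420) ! nvv4l2h264enc ! rtph264pay ! udpsink. Below is a condensed sketch of that tail, assuming the nvosd, encoder, rtppay and sink elements from the code earlier in this thread already exist and have been added to the pipeline.

# A second nvvideoconvert plus an I420 caps filter sit between the OSD and the
# encoder so the encoder receives a format it accepts.
nvvidconv_postosd = Gst.ElementFactory.make("nvvideoconvert", "convertor_postosd")
caps_i420 = Gst.ElementFactory.make("capsfilter", "encoder_caps")
caps_i420.set_property(
    "caps", Gst.Caps.from_string("video/x-raw(memory:NVMM), format=I420"))

pipeline.add(nvvidconv_postosd)
pipeline.add(caps_i420)

nvosd.link(nvvidconv_postosd)
nvvidconv_postosd.link(caps_i420)
caps_i420.link(encoder)        # not caps_vidconvsrc.link(encoder)
encoder.link(rtppay)
rtppay.link(sink)

Linked this way, the encoded stream carries the boxes drawn by nvdsosd. The original code tries to link the encoder directly off caps_vidconvsrc, which either fails (its src pad is already linked to streammux) or bypasses the inference branch entirely, so the detections never reach the RTSP output.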

I’ve tried to consolidate these two files without luck. Is there a version where this works?

Hi linux.matthew,

Please open a new topic with more details about your issue. Thanks.