DeepStream FPS drops to 0 with no error

Please provide complete information as applicable to your setup.

• Hardware Platform (Jetson / GPU)
Jetson and GPU

• DeepStream Version
6.2

• JetPack Version (valid for Jetson only)
5.1

• TensorRT Version
8.5.2.2

• NVIDIA GPU Driver Version (valid for GPU only)
525

• Issue Type( questions, new requirements, bugs)
Question

• Requirement details( This is for new requirement. Including the module name-for which plugin or for which sample application, the function description)
DeepStream FPS drops to 0 and the video sink gets stuck after processing roughly the first 10 frames. The model we use runs at 99% GPU utilization on the Jetson device and lower on a T4. I have noticed that GPU usage abruptly falls to 0 once the FPS hits 0, and the print statements in the probe function attached to the source pad of the primary inference element also stop printing. I am using a custom struct to pass information from the model out of that same probe function. Since there is no stack trace or error message available, it has become impossible to debug. The generated nsys report does not open in Nsight Systems either. Once I remove the probe function, the pipeline runs properly. The function is attached below:

import ctypes

import numpy as np
import pyds
from gi.repository import Gst


def sink_probe(pad, info, user_data):
    gst_buffer = info.get_buffer()
    if not gst_buffer:
        return Gst.PadProbeReturn.OK

    batch_meta = pyds.gst_buffer_get_nvds_batch_meta(hash(gst_buffer))
    frame_list = batch_meta.frame_meta_list

    while frame_list is not None:
        try:
            frame_meta = pyds.NvDsFrameMeta.cast(frame_list.data)
        except StopIteration:
            break  # 'continue' here would spin forever; bail out instead

        frame_user_meta_list = frame_meta.frame_user_meta_list

        while frame_user_meta_list is not None:
            try:
                frame_user_meta = pyds.NvDsUserMeta.cast(frame_user_meta_list.data)
            except StopIteration:
                break  # same: either advance the list or bail out, never 'continue'

            meta_type = frame_user_meta.base_meta.meta_type
            if meta_type == pyds.NVDSINFER_TENSOR_OUTPUT_META:
                tensor_meta = pyds.NvDsInferTensorMeta.cast(frame_user_meta.user_meta_data)
                layer = pyds.get_nvds_LayerInfo(tensor_meta, 0)
                ptr = ctypes.cast(pyds.get_ptr(layer.buffer), ctypes.POINTER(ctypes.c_float))
                detections = np.ctypeslib.as_array(ptr, shape=(64512, 6))
                # nms() is our own post-processing helper
                nms_dets = nms(detections=detections, image_size=(512, 512)).flatten().tolist()
                data = pyds.alloc_rapid_struct(frame_user_meta)
                data.num_detections = len(nms_dets) // 5
                data.detections = nms_dets
                frame_user_meta.user_meta_data = data
                frame_user_meta.base_meta.meta_type = pyds.NvDsMetaType.NVDS_USER_META

            try:
                frame_user_meta_list = frame_user_meta_list.next
            except StopIteration:
                break

        try:
            frame_list = frame_list.next
        except StopIteration:
            break

    return Gst.PadProbeReturn.OK
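As a side note, the ctypes-to-NumPy conversion used on `layer.buffer` above can be exercised in isolation. This sketch fabricates a tiny 2x6 float buffer in place of the real 64512x6 tensor output (the buffer and shapes here are illustrative only, not the actual pyds objects):

```python
import ctypes

import numpy as np

# Fabricated stand-in for the tensor output buffer: 2 detections x 6 floats.
# In the probe, the address would come from pyds.get_ptr(layer.buffer).
raw = (ctypes.c_float * 12)(*range(12))

# Cast to a float pointer and wrap it as a zero-copy NumPy view.
ptr = ctypes.cast(raw, ctypes.POINTER(ctypes.c_float))
detections = np.ctypeslib.as_array(ptr, shape=(2, 6))

print(detections.shape)  # (2, 6)
```

Note that the resulting array does not own its memory: it is only valid while the underlying buffer is alive, so call `detections.copy()` if the data must outlive the probe.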

Pybind11 for custom metadata

#include "bind_string_property_definitions.h"
#include "include/bindrapidmeta.hpp"

namespace py = pybind11;

namespace pydeepstream {
    void* copy_rapid_struct(void* data, void* user_meta) {
        NvDsUserMeta* source_meta = (NvDsUserMeta*) data;
        RapidOutputStruct* source_data = (RapidOutputStruct*) source_meta->user_meta_data;
        RapidOutputStruct* destination_data = (RapidOutputStruct*) g_malloc0(sizeof(RapidOutputStruct));

        int num_detections = source_data->num_detections;
        destination_data->num_detections = source_data->num_detections;

        if (num_detections > 0) {
            for (int i = 0; i < num_detections * 5; i++) {
                destination_data->detections.push_back(source_data->detections[i]);
            }
        }
        return destination_data;
    }

    void release_rapid_struct(void* data, void* user_data) {
        NvDsUserMeta* source_meta = (NvDsUserMeta*) data;

        if (source_meta != nullptr) {
            RapidOutputStruct* source_data = (RapidOutputStruct*) source_meta->user_meta_data;
            if (source_data != nullptr) {
                free(source_data);
            }
        }
    }

    void bindrapid(py::module &m) {
        py::class_<RapidOutputStruct>(
            m, "RapidOutputStruct", pydsdoc::rapid::RapidStructDoc::descr)
        .def(py::init<>())
        .def_readwrite("num_detections", &RapidOutputStruct::num_detections)
        .def_readwrite("detections", &RapidOutputStruct::detections)
        .def(
            "cast", [](void* data) {
                return (RapidOutputStruct*) data;
            },
            py::return_value_policy::reference,
            pydsdoc::rapid::RapidStructDoc::cast
        );
        m.def(
            "alloc_rapid_struct",
            [](NvDsUserMeta *meta) {
                auto* mem = (RapidOutputStruct* ) g_malloc0(sizeof(RapidOutputStruct));
                meta->base_meta.copy_func = (NvDsMetaCopyFunc) pydeepstream::copy_rapid_struct;
                meta->base_meta.release_func = (NvDsMetaReleaseFunc) pydeepstream::release_rapid_struct;
                return mem;
            },
            py::return_value_policy::reference,
            pydsdoc::methodsDoc::alloc_rapid_struct
        );
    }
}

So it gets stuck while running? Is this your own demo code, or is it modified from ours? Can this issue be reproduced with our demo?

Yes, it says 0 FPS while running. Something like this:

0:00:05.842002678  8249      0x2f6e490 INFO                 nvinfer gstnvinfer.cpp:680:gst_nvinfer_logger:<rapid-detector> NvDsInferContext[UID 1]: Info from NvDsInferContextImpl::generateBackendContext() <nvdsinfer_context_impl.cpp:2012> [UID = 1]: Use deserialized engine model: /home/ubuntu/video-analytics-v2/models/rapid.onnx_b1_gpu0_fp32.engine
0:00:05.850513395  8249      0x2f6e490 INFO                 nvinfer gstnvinfer_impl.cpp:328:notifyLoadModelStatus:<rapid-detector> [UID 1]: Load new model:configs/rapid.config.txt sucessfully


**PERF : FPS_0 (0.00)
**PERF : FPS_0 (0.00)
**PERF : FPS_0 (0.00)

This is my own code, which I wrote referring to the Python custom binding example. I don’t think this can be reproduced with the demo because of the extra angle parameter in our model’s output.

[UPDATE]
So after some trial and error, I found that the issue is with the release_rapid_struct function. RapidOutputStruct is defined as follows:

struct RapidOutputStruct {
    int num_detections;
    std::vector<float> detections = std::vector<float>(500);
};

Am I doing anything wrong when freeing the memory? (Pardon me, I don’t have much experience with C++.) nvdslogger outputs 0 FPS when this line is commented out: `meta->base_meta.release_func = (NvDsMetaReleaseFunc) pydeepstream::release_rapid_struct;`

We have already bound a similar API: custom_binding. Could you use our API to bind your data structure?

This is the API I referred to while creating my codebase. Since there are no strings inside the struct I’m using, I did not copy the release function as-is; instead, I free the entire struct itself.
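For anyone hitting the same hang: a likely culprit is that RapidOutputStruct contains a std::vector, so memory returned by g_malloc0() is not a constructed object (calling push_back on it, as copy_rapid_struct does, is undefined behavior), and free() never runs the destructor, leaking the vector's heap buffer. A minimal, self-contained sketch of a correct alloc/release pairing follows; plain calloc/free stand in for g_malloc0/g_free, and this is not the actual binding code:

```cpp
#include <cstdlib>
#include <new>
#include <vector>

// Stand-in for the thread's RapidOutputStruct: it owns a std::vector,
// so it must be properly constructed and destroyed, not just malloc'd/free'd.
struct RapidOutputStruct {
    int num_detections = 0;
    std::vector<float> detections = std::vector<float>(500);
};

// Allocate raw zeroed memory, then run the constructor with placement new
// so the vector's internal pointers are valid before any push_back.
RapidOutputStruct* alloc_rapid_struct() {
    void* mem = std::calloc(1, sizeof(RapidOutputStruct)); // like g_malloc0
    return new (mem) RapidOutputStruct();                  // placement new
}

// Run the destructor explicitly (releases the vector's heap buffer),
// then free the raw block with the matching allocator.
void release_rapid_struct(RapidOutputStruct* data) {
    if (data != nullptr) {
        data->~RapidOutputStruct(); // destroy the vector first
        std::free(data);            // like g_free, never plain free on g_malloc'd memory in general
    }
}
```

If the glib allocators are not required by the surrounding API, plain `new RapidOutputStruct()` and `delete data` achieve the same thing with less ceremony.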

There has been no update from you for a while, so we assume this is no longer an issue and are closing this topic. If you need further support, please open a new one. Thanks.

Can you provide us with a simplified version of your demo? We can try to run it in our environment.

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.