SRT H.264 video source

Hello,

I am currently trying to compare software- and hardware-based conversion of H.264 → RGB. For that I implemented an FFmpeg-based video source operator that receives an SRT stream and decodes the H.264.

Is it possible to profile the FFmpeg conversion using Nsight Systems? The process spawned for the ffmpeg command does not show any traces.

Is there a better way to do this? Is it possible to bypass the host machine and receive the frames directly on the device?

Thanks in advance

import subprocess

import cupy as cp
import nvtx

import holoscan as hs
from holoscan.core import Operator, OperatorSpec
from holoscan.gxf import Entity


class FFmpegSRTStreamSourceOp(Operator):
        
    def __init__(self, fragment, url, height, width, n_channels, *args, **kwargs):
        self.height = height
        self.width = width
        self.n_channels = n_channels
        self.url = url
        self.buffer_size = self.width*self.height*self.n_channels  # bytes per raw RGB frame
        self.ffmpeg_command = ['ffmpeg',
                               '-nostdin',                     # don't read commands from stdin
                               '-max_delay', '0',
                               '-y', '-vsync', '0',            # passthrough, no frame duplication/dropping
                               '-hwaccel_device', '0',         # use GPU 0
                               '-hwaccel', 'cuda',             # CUDA-accelerated decoding
                               '-fflags', 'nobuffer', '-flags', 'low_delay', '-strict', 'experimental',
                               '-i', url,                      # SRT input stream
                               '-pix_fmt', 'rgb24',            # convert decoded frames to packed RGB
                               '-s', f'{width}x{height}',
                               '-vf', 'setpts=0',              # zero out presentation timestamps
                               '-f', 'rawvideo', 'pipe:'       # write raw frames to stdout
                               ]
        super().__init__(fragment, *args, **kwargs)
        
    def setup(self, spec: OperatorSpec):
        spec.output("source")
        
    def start(self):
        # using subprocess and pipe to fetch frame data
        self.p = subprocess.Popen(self.ffmpeg_command, stdout=subprocess.PIPE, bufsize=10**8)

    @nvtx.annotate("compute", color="green")
    def compute(self, op_input, op_output, context):
        with nvtx.annotate("stdout.read", color="blue"):
            raw_bytes = self.p.stdout.read(self.width*self.height*self.n_channels)
        
        with nvtx.annotate("bytes_to_tensor", color="yellow"):
            tensor = cp.frombuffer(raw_bytes, cp.uint8)

            if tensor.size != self.buffer_size:
                # incomplete frame (e.g. ffmpeg exited or the stream ended): skip it
                return
            
            tensor = tensor.reshape(self.height, self.width, self.n_channels)
            
        entity = Entity(context)
        entity.add(hs.as_tensor(tensor))
        op_output.emit(entity, "source")

    def stop(self):
        self.p.kill()
        return super().stop()
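
For completeness, a minimal sketch of how an operator like this could be wired into an application; the sink operator, the SRT URL, and the frame size below are placeholders, and the CountCondition is only there so a profiling run terminates on its own (untested as written):

from holoscan.conditions import CountCondition
from holoscan.core import Application, Operator, OperatorSpec


class FrameSinkOp(Operator):
    # placeholder sink: receives the entities emitted by the source and counts them
    def __init__(self, fragment, *args, **kwargs):
        self.count = 0
        super().__init__(fragment, *args, **kwargs)

    def setup(self, spec: OperatorSpec):
        spec.input("in")

    def compute(self, op_input, op_output, context):
        op_input.receive("in")
        self.count += 1


class SRTDecodeApp(Application):
    def compose(self):
        source = FFmpegSRTStreamSourceOp(
            self, "srt://127.0.0.1:9999", 1080, 1920, 3,
            CountCondition(self, 1000),   # stop the source after 1000 frames
            name="source")
        sink = FrameSinkOp(self, name="sink")
        self.add_flow(source, sink, {("source", "in")})


if __name__ == "__main__":
    SRTDecodeApp().run()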

When I was reproducing the 'multi-endoscopy app' demo, I had a similar need: feeding the real-time video stream captured from a USB camera into the input of the AI model. At that step I ran into the problem that YUYV cannot be converted, so I would like to follow your idea and use a piece of code to adapt the video stream to a suitable RGB format.
Have you solved this problem? If so, could you please share how you did it?

Feel free to use parts of my operator for your use case. I noticed, however, that hardware acceleration is not actually used for decoding the H.264 in the example above: I forgot to add the -c:v h264_cuvid flag. That said, there is no noticeable difference between CPU and GPU decoding.
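
For reference, inside __init__ the command list would then look something like this; the -c:v flag has to come before -i so it selects the decoder for the input stream:

        self.ffmpeg_command = ['ffmpeg',
                               '-nostdin',
                               '-max_delay', '0',
                               '-y', '-vsync', '0',
                               '-hwaccel_device', '0',
                               '-hwaccel', 'cuda',
                               '-fflags', 'nobuffer', '-flags', 'low_delay', '-strict', 'experimental',
                               '-c:v', 'h264_cuvid',           # explicitly select the NVDEC-based H.264 decoder
                               '-i', url,
                               '-pix_fmt', 'rgb24',
                               '-s', f'{width}x{height}',
                               '-vf', 'setpts=0',
                               '-f', 'rawvideo', 'pipe:'
                               ]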

Do you know by any chance whether it is possible to keep the decoded frames on the GPU using -hwaccel_output_format cuda and convert them directly to a CuPy array?

Regarding your issue with the color space conversion, I never came across this problem.
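
One thing that might work, though, is a small conversion step before the frame is emitted. Here is a hedged sketch using OpenCV on the host, assuming the camera delivers packed YUYV (YUY2) frames; the function name and buffer layout are just assumptions for illustration, I have not tried this in the endoscopy app:

import cv2          # opencv-python
import cupy as cp
import numpy as np


def yuyv_to_rgb(raw_bytes, height, width):
    # packed YUYV uses 2 bytes per pixel, so the buffer is height x width x 2
    yuyv = np.frombuffer(raw_bytes, dtype=np.uint8).reshape(height, width, 2)
    # OpenCV converts YUYV -> RGB on the CPU
    rgb = cv2.cvtColor(yuyv, cv2.COLOR_YUV2RGB_YUYV)
    # move the result to the GPU so it can be wrapped with hs.as_tensor()
    return cp.asarray(rgb)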