[Test Request] - Usage of the POCL CUDA backend combined with the jetson-ffmpeg library for rapid multimedia transcoding and filtering

Hi everyone!

I’m looking for help with a very bare-bones initial test to use OpenCL ffmpeg filters on a Jetson Nano.

Very specifically, I’m hoping to achieve an ffmpeg command in the ballpark of:

ffmpeg -hwaccel cuda -init_hw_device opencl=ocl -filter_hw_device ocl -c:v hevc_cuvid -resize 1920x1080 -i INPUT.mp4 -vf "format=p010,hwupload,tonemap_opencl=tonemap=mobius:param=0.01:desat=0:r=tv:p=bt709:t=bt709:m=bt709:format=nv12,hwdownload,format=nv12" -c:a copy -c:s copy -c:v h264_nvenc OUTPUT.mp4

However, because nvenc/nvdec hasn’t been implemented in ffmpeg for the Nano yet, the command would instead use jocover’s jetson-ffmpeg library to expose the hardware encoder/decoder in ffmpeg, and use POCL to add OpenCL support:

ffmpeg -hwaccel cuda -init_hw_device opencl=ocl -filter_hw_device ocl -c:v hevc_nvmpi -resize 1920x1080 -i INPUT.mp4 -vf "format=p010,hwupload,tonemap_opencl=tonemap=mobius:param=0.01:desat=0:r=tv:p=bt709:t=bt709:m=bt709:format=nv12,hwdownload,format=nv12" -c:a copy -c:s copy -c:v h264_nvmpi  OUTPUT.mp4

This should use the hardware decoding block on any of the current Jetson models, keep the frames in the shared GPU/system RAM, apply OpenCL 1.2 filters, keep the result in GPU RAM, and encode to H.264 in an output file.
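For reference, my (untested) understanding of the build that second command would need is roughly the following; the --enable-nvmpi switch and the patch file come from jocover’s repository, so the exact steps may differ from that repo’s README:

# build jocover's nvmpi support library, patch ffmpeg, and enable nvmpi + OpenCL
git clone https://github.com/jocover/jetson-ffmpeg.git
cd jetson-ffmpeg && mkdir build && cd build
cmake .. && make && sudo make install && sudo ldconfig && cd ../..

git clone --depth=1 -b release/4.2 https://git.ffmpeg.org/ffmpeg.git
cd ffmpeg
git apply ../jetson-ffmpeg/ffmpeg_nvmpi.patch   # patch name per that repo; may have changed
./configure --enable-nvmpi --enable-opencl
make -j$(nproc)

./ffmpeg -decoders 2>/dev/null | grep nvmpi     # hevc_nvmpi should be listed
./ffmpeg -encoders 2>/dev/null | grep nvmpi     # h264_nvmpi should be listed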

I would test this myself on hardware, but I don’t have the spare budget to commit to the device if this isn’t going to work. [I’m a computer engineering student on a tight budget, looking for advice, and hoping someone with hardware might be able to help me out.]

Hi,
We have a package that enables hardware decoding in ffmpeg. Please check:
Jetson Nano FAQ
Q: Is hardware acceleration enabled in ffmpeg?


Hi Dane!

Thanks for the pair of links! I already knew about the decoder, but AFAIK the encoder has yet to be implemented outside of jocover’s git repository, correct?

Also, though it’s not an officially supported use case, do you believe that by using the POCL 1.6 library as an OpenCL-to-CUDA backend, we would be able to utilize ffmpeg’s OpenCL filters?
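For context, the POCL side of what I have in mind is roughly the following (a sketch only, untested on my end; ENABLE_CUDA is the documented CMake switch for POCL’s CUDA backend, but the tag and package names are my best guess):

sudo apt-get install -y build-essential cmake llvm-dev clang libclang-dev ocl-icd-opencl-dev clinfo
git clone https://github.com/pocl/pocl.git && cd pocl && git checkout v1.6   # tag name assumed
mkdir build && cd build
cmake -DENABLE_CUDA=ON ..      # builds the CUDA/PTX backend so the Nano's GPU appears as an OpenCL device
make -j$(nproc) && sudo make install
clinfo                         # a "pocl" platform with a CUDA device should now be listed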

Hi,
Yes, in the NVIDIA package, hardware encoding is not enabled.

Since we have independent NVENC and NVDEC hardware engines on Jetson platforms, we usually leverage those engines and leave the GPU for deep-learning inference. We don’t have experience enabling OpenCL in ffmpeg. Let’s see if others can share suggestions on this.

You may consider using jetson_multimedia_api. Hardware encoding/decoding can be done through the v4l2 interface. Please take a look at the document:
https://docs.nvidia.com/jetson/l4t-multimedia/index.html
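For example, the bundled samples exercise the V4L2 encoder directly; on a flashed device they are under /usr/src (the path and sample usage below are from memory, please check the document):

cd /usr/src/jetson_multimedia_api/samples/01_video_encode
sudo make
./video_encode input.yuv 1920 1080 H264 output.h264   # raw YUV in, H.264 elementary stream out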

Hi Dane,

Thanks for the reply! I’ve looked through the docs, but unfortunately one of the requirements for my project is the use of FFMPEG.

The pipeline would be to decode with the hardware decoder, do the filtering/image processing on the GPU (tone mapping and resizing), then pass that to the hardware encoder to be encoded to the relevant format [which would then be sent to a different device].

Essentially, between the current NVIDIA ffmpeg build and the version created by @jocover, I don’t think I need to worry about being able to leverage the built-in decode/encode engines.

It also seems that both @girgink and @znmeb have gotten OpenCL 1.2 support via the POCL/CUDA backend (per this thread).

POCL creates an OpenCL 1.2-compliant device, so if I can somehow confirm that POCL works with ffmpeg’s OpenCL filters, then life will be pretty sweet!
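If it helps, the sanity check I have in mind is just the following (assuming the ffmpeg in use was configured with --enable-opencl):

clinfo | grep -i "opencl 1.2"                    # POCL should expose the GPU as an OpenCL 1.2 device
ffmpeg -hide_banner -buildconf | grep -i opencl  # confirm ffmpeg itself was built with OpenCL support
ffmpeg -hide_banner -filters | grep opencl       # tonemap_opencl should show up in the filter list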

I have POCL working in a Docker image - the Docker context is here if you want to fork it:

https://github.com/edgyR/edgyR-containers/tree/travis-refactor/internal-jetson-pocl

I also have Csound and ChucK on that image, but the build does POCL first. It shouldn’t be too hard to add FFMPEG.
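If I do add it, the extra build step would probably look something like this (package names and configure flags off the top of my head, untested):

apt-get install -y nasm libx264-dev ocl-icd-opencl-dev opencl-headers
git clone --depth=1 https://git.ffmpeg.org/ffmpeg.git && cd ffmpeg
./configure --enable-gpl --enable-libx264 --enable-opencl
make -j$(nproc) && make install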

Hey mate!

Thanks for replying!

Have you noticed any issues with the OpenCL implementation? I’m a student, so I’m trying to get a general idea of whether what I want to do is even possible before spending part of my budget!

The image buffer size looks large enough (8096x8096 in 2D) per the clinfo-nano.txt output file.

If you don’t mind terribly, would it be possible to test this command on this file, or on any video file you might have on hand?

ffmpeg -init_hw_device opencl=ocl -filter_hw_device ocl -i INPUTFILE -vf "format=p010,hwupload,tonemap_opencl=tonemap=mobius:param=0.01:desat=0:r=tv:p=bt709:t=bt709:m=bt709:format=nv12,hwdownload,format=nv12" -c:a copy -c:s copy -c:v libx264 OUTPUTFILENAME.mp4

So long as the output differs in colour from the input, that means the filter is being initialized and I should be rocking and rolling!
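One quick way to eyeball the difference (just a suggestion; file names are placeholders) would be grabbing the same frame from each file and comparing them side by side:

ffmpeg -ss 10 -i INPUTFILE -frames:v 1 before.png
ffmpeg -ss 10 -i OUTPUTFILENAME.mp4 -frames:v 1 after.png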

Issue is open: https://github.com/edgyR/edgyR-containers/issues/23


Tracking the issue! Thanks for taking the time, mate!

Super excited to see the results.

FCLC