P4 and T4 Decoding on Windows Server 2016 - Low utilization and frame rate

tamirg · July 18, 2019, 11:31am

Environment:
OS: Windows server 2016
GPU: Tesla T4 and Tesla P4
Drivers: 411.98 and 412.36

We have ffmpeg with cuvid decoder enabled (the problem reproduces with other GPU decoders as well).
We run this command:

ffmpeg -c:v h264_cuvid -i <video file> -f null –

and observed the decoder utilization using this command

nvidia-smi.exe -q -l 1 | FINDSTR Decoder

Update:
Testing on a public video from:

The video is very short - create a video with x4 loop -

ffmpeg.exe -c:v h264_cuvid -stream_loop 4 -i video.mp4 video_loopX4.mp4

Comparing 2 driver versions:
Tesla P4:
Driver 411.98: ~358 fps, 87% decoder utilization
Driver 412.36: ~355 fps, 88% decoder utilization
Tesla T4:
Driver 411.98: ~434 fps, 34% decoder utilization
Driver 412.36: ~194 fps, 29% decoder utilization

Running the same test on Linux we are able to achieve 100% (P4) / 50% (T4) decoder utilization and much higher decode frame rate.

Tesla T4 - Linux (Ubuntu 18.04.1 LTS)
Driver 415.27: ~620 fps, 50% decoder utilization

Actual fps and decoder utilization vary when testing different input videos, but both GPUs are never able to achieve their decoding potential seen on Linux when using Windows server.
update: Tesla T4 decodes in lower fps after driver update.

mandar_godse · July 26, 2019, 11:36am

Hi Tamir,
Thanks for providing detailed information.
We are looking into this issue and get back to you if need any more details.

Thanks.

brainiarc7 · August 9, 2019, 10:59pm

Any updates on this issue?

mandar_godse · February 26, 2020, 10:21am

Hi.

Can you test with the latest driver available on nvidia.com (Official Drivers | NVIDIA) and confirm if the issue is fixed?
For reference, this is tracked internally as 200538703.

Thanks.

Topic		Replies	Views
Tesla P4 ffmpeg bad performance Tesla Boards	1	5061	April 6, 2018
FFMPEG Transcoding Perfromance not good on Tesla p4 GPU-Accelerated Libraries	1	2077	July 27, 2019
CPU reaches 100% usages when using multiples nvv4l2h264enc on Tesla T4 GPU DeepStream SDK	4	517	March 1, 2023
H264_nvenc The minimum required Nvidia driver for nvenc is 520.56.06 or newer Video Codec, PyNv & OFA ffmpeg , video , nvenc	3	8991	March 13, 2023
How to test the decode performance of T4? DeepStream SDK	7	715	November 23, 2021
Tesla P4 Low Speech When Render Video 720p More vGPU Forums	0	887	July 11, 2023
Difference in performace for parallell decode encode with ffmpeg h264_cuvid and h264_nvenc Tesla P100 GPU-Accelerated Libraries	0	1479	November 14, 2017
Nvidia codec sdk for 4K@60fps video Video Codec, PyNv & OFA gstreamer	0	633	March 5, 2021
Tesla T4 - Low FPS during video and overall session latency XenDesktop	3	3604	January 18, 2020
ffmpeg failed at encoding on Tesla T4 card Video Codec, PyNv & OFA	2	3028	December 28, 2019

P4 and T4 Decoding on Windows Server 2016 - Low utilization and frame rate

Related topics