NVENC H.264 Encoder MFT latency increases when framerate is limited

Hello!

I am developing a Windows application that does low-latency display streaming. I want the application to be GPU-agnostic, so instead of using the H.264 hardware encoder directly through the NVIDIA Video Codec SDK, I access it through a Media Foundation Transform. For display capture I am using the Windows Desktop Duplication API.
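
To be concrete, I obtain the encoder roughly like this (a simplified sketch of my setup code; error handling and media-type negotiation are omitted):

```cpp
// Sketch: enumerate hardware H.264 encoder MFTs and activate the first match.
// Assumes MFStartup() has already been called; error handling omitted.
#include <windows.h>
#include <mfapi.h>
#include <mftransform.h>

IMFTransform* CreateHardwareH264Encoder()
{
    MFT_REGISTER_TYPE_INFO outputType = { MFMediaType_Video, MFVideoFormat_H264 };

    IMFActivate** activates = nullptr;
    UINT32 count = 0;

    // Ask only for hardware encoders (NVENC, Quick Sync, ...), sorted by merit.
    MFTEnumEx(MFT_CATEGORY_VIDEO_ENCODER,
              MFT_ENUM_FLAG_HARDWARE | MFT_ENUM_FLAG_SORTANDFILTER,
              nullptr,           // accept any input type
              &outputType,       // must produce H.264
              &activates, &count);

    IMFTransform* transform = nullptr;
    if (count > 0)
        activates[0]->ActivateObject(IID_PPV_ARGS(&transform));

    for (UINT32 i = 0; i < count; ++i)
        activates[i]->Release();
    CoTaskMemFree(activates);

    return transform;   // caller releases; input/output media types still need to be set
}
```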

On my development machine I have a GTX 1050 Ti, which is able to encode a 1080p (-ish, actually 1920x1200) stream with around 3 ms of latency at best. However, after a while the encoder seemingly decides that since it is only being fed samples every 16 ms, it doesn't need to encode them as quickly as it otherwise would. Perhaps this gives better encoding quality or lower power consumption, or something along those lines.

To test this, I set up a switch that makes the application stop capturing frames through the Desktop Duplication API and instead feed the same frame into the encoder over and over, as fast as it can. While the switch is on, the encoding latency drops dramatically. After returning to normal operation, capturing the display every 16 ms and sending the frames to the transform, the encoder keeps processing frames at the lower latency for a while, but eventually returns to the higher latency.
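
The switch itself is conceptually nothing more than this (a rough sketch; CaptureDesktopFrame and EncodeSample are placeholders for my own code, not API names):

```cpp
// Sketch of the saturation test: while 'saturate' is set, the last captured
// sample is re-fed to the encoder in a tight loop instead of waiting for the
// Desktop Duplication API to deliver a new frame (~every 16 ms at 60 Hz).
#include <mfobjects.h>
#include <atomic>

IMFSample* CaptureDesktopFrame();       // placeholder: Desktop Duplication capture
void       EncodeSample(IMFSample*);    // placeholder: feed one sample to the MFT

std::atomic<bool> saturate{ false };    // toggled by a hotkey in the real app

void EncodeLoop()
{
    IMFSample* lastSample = nullptr;    // most recently captured frame

    for (;;)
    {
        if (!saturate || lastSample == nullptr)
        {
            // Normal path: block until a new desktop frame arrives.
            IMFSample* fresh = CaptureDesktopFrame();
            if (lastSample) lastSample->Release();
            lastSample = fresh;
        }
        // Saturated path: skip capture entirely and re-encode the same sample.
        EncodeSample(lastSample);
    }
}
```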

I’ve illustrated the problem with the following graph. I’m guessing the spikes must be I-frames, which take slightly longer to encode. The green area of the graph indicates a section where the encoder was artificially saturated.

http://i.xomf.com/msbnx.png

Could this be because I haven't set a low-latency preset on the encoder? Is there a way to instruct the encoder to use a low-latency mode through Media Foundation? I have tried changing some Media Foundation attributes (such as MF_LOW_LATENCY) as well as properties exposed by the ICodecAPI interface of the transform, but none of them has had any effect so far.
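
For reference, this is roughly what I have been trying (a simplified sketch; whether the hardware MFT honours any of these is exactly what I'm unsure about):

```cpp
// Sketch: hinting low latency to the encoder MFT. It is an assumption on my
// part that the NVIDIA MFT reads any of these; so far they have had no visible effect.
#include <mfapi.h>
#include <mftransform.h>
#include <icodecapi.h>
#include <codecapi.h>

void TryEnableLowLatency(IMFTransform* transform)
{
    // 1) The generic Media Foundation attribute on the MFT's attribute store.
    IMFAttributes* attrs = nullptr;
    if (SUCCEEDED(transform->GetAttributes(&attrs)))
    {
        attrs->SetUINT32(MF_LOW_LATENCY, TRUE);
        attrs->Release();
    }

    // 2) ICodecAPI properties exposed by the encoder.
    ICodecAPI* codecApi = nullptr;
    if (SUCCEEDED(transform->QueryInterface(IID_PPV_ARGS(&codecApi))))
    {
        VARIANT v = {};
        v.vt = VT_UI4;

        v.ulVal = 1;
        codecApi->SetValue(&CODECAPI_AVLowLatencyMode, &v);    // low-latency mode
        codecApi->SetValue(&CODECAPI_AVEncCommonRealTime, &v); // favour speed over quality

        v.ulVal = 0;
        codecApi->SetValue(&CODECAPI_AVEncMPVDefaultBPictureCount, &v); // no B-frames

        codecApi->Release();
    }
}
```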

A 12 ms latency is something I can live with, especially considering that systems with lower-end GPUs will probably see higher latency anyway. Nevertheless, it would be nice to have confirmation on whether anyone else has observed this behaviour, and whether anything can be done about it.

Hi KeloCube,

Could you provide detailed info on the following:

  1. Driver version
  2. Operating system

Thanks,
Ryan Park

Hi,

Thanks for the response! Silly of me not to include them right away. I'm running Windows 10 Pro 64-bit (10.0, Build 17134) with driver version 24.21.13.9882. I have GeForce Experience installed, which reports the version as 398.82. See the attached DxDiag output for the full details. Hopefully there's nothing too sensitive in there. :)

At the moment the integrated GPU is enabled (I was testing the Quick Sync implementation of H.264 encoding), but I was seeing the same problem with NVIDIA's encoder before enabling the iGPU.

I also put together a sample program that only encodes an empty surface. No desktop capture, no color conversion, just synchronous surface encoding. The same behaviour is still observed, although it takes the encoder much longer to start increasing the encoding latency, and it responds more quickly when the framerate is unthrottled. Average encode latency on my machine was about 2 ms, which is still far greater than what I got by modifying the native Encode SDK encoding sample to not read data from a file. That got me sub-millisecond latencies, though the surface being empty might have had something to do with it.
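
The core of the sample is roughly the following (a trimmed sketch of the blocking encode step; the full gist also sets up the D3D11 device manager and media types, and 'input' wraps a blank D3D11 texture in an IMFSample):

```cpp
// Sketch of the "synchronous" encode step used for timing. Hardware encoder
// MFTs are asynchronous MFTs, so we block on the transform's event generator
// ('events', obtained by QueryInterface from the MFT) and feed/drain it in
// lockstep. 'encoder' is already configured and streaming has been started.
#include <mfapi.h>
#include <mfidl.h>
#include <mftransform.h>
#include <chrono>
#include <cstdio>

void EncodeOnce(IMFTransform* encoder, IMFMediaEventGenerator* events, IMFSample* input)
{
    using clock = std::chrono::steady_clock;
    const auto start = clock::now();

    bool fed = false;
    bool drained = false;
    while (!drained)
    {
        IMFMediaEvent* ev = nullptr;
        if (FAILED(events->GetEvent(0, &ev)))   // blocking wait
            return;

        MediaEventType type = MEUnknown;
        ev->GetType(&type);
        ev->Release();

        if (type == METransformNeedInput && !fed)
        {
            encoder->ProcessInput(0, input, 0);
            fed = true;
        }
        else if (type == METransformHaveOutput)
        {
            MFT_OUTPUT_DATA_BUFFER out = {};
            DWORD status = 0;
            if (SUCCEEDED(encoder->ProcessOutput(0, 1, &out, &status)) && out.pSample)
                out.pSample->Release();         // the encoded H.264 sample
            if (out.pEvents)
                out.pEvents->Release();
            drained = true;
        }
    }

    const auto ms = std::chrono::duration<double, std::milli>(clock::now() - start);
    printf("encode latency: %.2f ms\n", ms.count());
}
```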

I figured the sample might be a useful starting point for anyone interested in MFT hardware encoding, so I hosted it on GitHub gists.

Also, I realized that the increasing latency is going to be a problem for my desktop streaming app after all, since the Desktop Duplication API only delivers a frame when the desktop contents actually change. This means the encoder can receive frames at an even lower rate than 60 Hz, pushing the latency above 16 ms.
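
For context, the capture side looks roughly like this (a simplified sketch; TryCaptureFrame is just my own helper, not an API name):

```cpp
// Sketch of why the input rate can drop below 60 Hz: AcquireNextFrame() only
// returns a frame when the desktop has changed, otherwise it times out.
// 'duplication' was obtained earlier via IDXGIOutput1::DuplicateOutput().
#include <d3d11.h>
#include <dxgi1_2.h>

bool TryCaptureFrame(IDXGIOutputDuplication* duplication, ID3D11Texture2D** texture)
{
    DXGI_OUTDUPL_FRAME_INFO info = {};
    IDXGIResource* resource = nullptr;

    // Wait up to ~16 ms for a changed frame; a static desktop simply times out,
    // so the encoder goes hungry for longer than one refresh interval.
    HRESULT hr = duplication->AcquireNextFrame(16, &info, &resource);
    if (hr == DXGI_ERROR_WAIT_TIMEOUT)
        return false;                  // nothing changed, nothing to encode
    if (FAILED(hr))
        return false;                  // device lost, access lost, etc.

    resource->QueryInterface(IID_PPV_ARGS(texture));
    resource->Release();

    // NOTE: in the real app the texture is copied before ReleaseFrame();
    // this sketch releases immediately for brevity.
    duplication->ReleaseFrame();
    return true;
}
```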
DxDiag.txt (103 KB)

Hi, any update on the matter? If there is no chance of this getting resolved, please let me know. Writing an NVIDIA-specific implementation is certainly a possibility, but I would like to explore this option first.

Hi,

We have filed a bug for the issue you reported. Our internal engineers are working on reproducing it now. I'll update you once progress is made.

Thanks,
Ryan Park