cuStreamSynchronize() pulls cpu usage to 100%

diederick · November 24, 2020, 3:39pm

I’m using nvdec to decode h264 and I noticed that the call to cuStreamSynchronize() pulls my CPU usage to 100%. I’ve used the NvDecoder.cpp as a reference where cuStreamSynchronize() is used as well. When I remove the call to cuStreamSynchronize() CPU usage drops to 2-7% for a 1280x720 video, which is what I would expect from decoding via a hardware-pipeline.

Do I need to call cuStreamSynchronize() after mapping/copying/unmapping decoded data into a GL texture? If I need the call to cuStreamSynchronize() I’m curious why it pulls my CPU usage to 100%.

My goal is to decode using nvdec/cuvid and copy decoded NV12 frames into OpenGL textures. I’m ensuring my video is using NV12 and mapping the decoded frames into two GL textures I create during the initialization phase; I create these textures in my pfnSequenceCallback. As the NvDecoder.cpp might not use a full GPU pipeline (e.g. data is copied from GPU > CPU) it might be OK to skip the call to cuStreamSynchronize() in an implementation that copies decoded frames into GL textures?

Topic		Replies	Views
NVDEC - Post decode performance issue Video Processing & Optical Flow	6	1512	May 14, 2020
Is it need to call cuStreamSynchronize() to push image on device memory to nve encoder? GPU-Accelerated Libraries	0	762	July 20, 2015
SM usage increases with nvv4l2decoder's drop-frame-interval set DeepStream SDK	4	696	October 12, 2021
NVJPEG -- a few questions abour decoupled decoding GPU-Accelerated Libraries nvjpeg	15	1775	July 4, 2023
Decoding, processing and displaying video w/ Nvidia Codec SDK & CUDA interop using OpenGL GLFW CUDA Programming and Performance cuda , opengl , video	4	1262	December 7, 2021
NVDEC internal synchronisation, unclear documentation Video Processing & Optical Flow	0	566	October 21, 2019
cuvid decoder occupy a whole core of cpu GPU-Accelerated Libraries	5	1151	July 13, 2017
Erratic NVDEC behavior only in fullscreen OpenGL General Topics and Other SDKs	0	746	September 20, 2017
How to decode multiple videos concurrently with NVENC? GPU-Accelerated Libraries	0	690	February 27, 2019
nvEncDestroyEncodercall hangs when nvencoder and Optix denoiser use the same CUDA context OptiX cuda	7	844	January 19, 2023

cuStreamSynchronize() pulls cpu usage to 100%

Related topics