I’m trying to decode several full HD (1920x1080) h264 videos at the same time using NVIDIA CUDA video decoder API.
Also I’m monitoring GPU status using GPUZ; my card is a GTX 570.
I’m experiencing a serious bottleneck in the Video Processor indicator at GPUZ ( I think it’s correspond to card’s Video Processor Engine (VPE) ).
While GPU load marks only 3% and memory usage is also low, VPE indicator marks 35 % only with one video, so decoding more than 3 signals will be a problem, because will overpass the 100 %.
I also tried a commercial CUDA H264 decoder called CoreAVC and result it’s the same, Video Engine indicator it’s very high.
My question essentially is why I having a bottleneck in VPE? While GPU load is so low and video engine so high, it’s an API bug ? How can I do to improve performance ?
Thanks in advance.