NvEncode fails cuda-memcheck

While trying to validate my program, cuda-memcheck reports issues inside the nvEncEncodePicture call.

I was wondering whether I might somehow be feeding the API misaligned buffers, or whether there is some other parameter I may have overlooked?

========= Invalid __global__ write of size 4
=========     at 0x00000cb8 in parallelReductionAdd
=========     by thread (0,0,0) in block (0,0,0)
=========     Address 0x7fd34fc00600 is out of bounds
=========     Saved host backtrace up to driver entry point at kernel launch time
=========     Host Frame:/usr/lib/x86_64-linux-gnu/libcuda.so.1 (cuLaunchGrid + 0x18a) [0x255a9a]
=========     Host Frame:/usr/lib/x86_64-linux-gnu/libnvcuvid.so.1 [0xd6eda]
=========     Host Frame:/usr/lib/x86_64-linux-gnu/libnvcuvid.so.1 [0xd7484]
=========     Host Frame:/usr/lib/x86_64-linux-gnu/libnvcuvid.so.1 [0x9a8b5]
=========     Host Frame:/usr/lib/x86_64-linux-gnu/libnvidia-encode.so.1 [0x9057]
=========     Host Frame:/usr/lib/x86_64-linux-gnu/libnvidia-encode.so.1 [0x9275]
=========     Host Frame:/usr/lib/x86_64-linux-gnu/libnvidia-encode.so.1 [0xb2f9]
=========     Host Frame:/usr/lib/x86_64-linux-gnu/libnvidia-encode.so.1 [0xb457]
=========     Host Frame:/usr/lib/x86_64-linux-gnu/libnvidia-encode.so.1 [0x78cb]
=========     Host Frame:/usr/lib/x86_64-linux-gnu/libnvidia-encode.so.1 [0x797e]
=========     Host Frame:/usr/lib/x86_64-linux-gnu/libnvidia-encode.so.1 [0x7e7b]
=========     Host Frame:/usr/lib/x86_64-linux-gnu/libnvidia-encode.so.1 [0x6b09]
=========     Host Frame:/usr/lib/x86_64-linux-gnu/libnvidia-encode.so.1 [0x12edb]
=========     my code locations here

I am using NvEncode 8.2 on Linux, and the session is initialized on a CUDA device.

Line #4 gives a good indication of the problem: the very first thread is already unable to write to memory.
Double-check the pointer being passed, the memory allocation, and indices going out of bounds…

Thanks saulocpp for your answer. I was worried about pointer alignment, but I am afraid this is an internal issue in the NvEncode API; after all, there is not much more I can do on my side than allocate a buffer of W x H x Channels and pass it to the API.

I continued investigating this bug, and I found something:
I tested the following geometries as input buffer sizes:

  {32, 32},      // this size locks the encoder
  {140, 140},
  {225, 225},    // this geometry crashes cuda-memcheck
  {250, 250},    // this geometry crashes cuda-memcheck
  {800, 600},
  {800 * 2, 600},
  {800, 600 * 2},
  {1920, 1080},
  {1920 * 2, 1080},
  {1920, 1080 * 2},
  {2560, 1440},
  {2560, 2560}

It seems that the internal implementation of nvEncEncodePicture is geometry-sensitive, and certain buffer sizes are simply not supported.

It would be great if this limitation were documented; so far there is no way to validate the input buffer sizes, and the API will just crash instead of gracefully reporting an error.