When I run my program under cuda-memcheck, it reports the following error inside the nvEncEncodePicture call.
Could I be feeding the API misaligned buffers, or is there some other parameter I have overlooked?
========= Invalid __global__ write of size 4
========= at 0x00000cb8 in parallelReductionAdd
========= by thread (0,0,0) in block (0,0,0)
========= Address 0x7fd34fc00600 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame:/usr/lib/x86_64-linux-gnu/libcuda.so.1 (cuLaunchGrid + 0x18a) [0x255a9a]
========= Host Frame:/usr/lib/x86_64-linux-gnu/libnvcuvid.so.1 [0xd6eda]
========= Host Frame:/usr/lib/x86_64-linux-gnu/libnvcuvid.so.1 [0xd7484]
========= Host Frame:/usr/lib/x86_64-linux-gnu/libnvcuvid.so.1 [0x9a8b5]
========= Host Frame:/usr/lib/x86_64-linux-gnu/libnvidia-encode.so.1 [0x9057]
========= Host Frame:/usr/lib/x86_64-linux-gnu/libnvidia-encode.so.1 [0x9275]
========= Host Frame:/usr/lib/x86_64-linux-gnu/libnvidia-encode.so.1 [0xb2f9]
========= Host Frame:/usr/lib/x86_64-linux-gnu/libnvidia-encode.so.1 [0xb457]
========= Host Frame:/usr/lib/x86_64-linux-gnu/libnvidia-encode.so.1 [0x78cb]
========= Host Frame:/usr/lib/x86_64-linux-gnu/libnvidia-encode.so.1 [0x797e]
========= Host Frame:/usr/lib/x86_64-linux-gnu/libnvidia-encode.so.1 [0x7e7b]
========= Host Frame:/usr/lib/x86_64-linux-gnu/libnvidia-encode.so.1 [0x6b09]
========= Host Frame:/usr/lib/x86_64-linux-gnu/libnvidia-encode.so.1 [0x12edb]
========= my code locations here
I am using nvEncode 8.2 on Linux, and the session is initialized on a CUDA device.
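
To rule out misalignment on my side, a check along these lines could be run on the device pointer and pitch before the buffer is handed to the encoder. This is only a sketch: the helper names `is_aligned` and `pitch_is_consistent` are hypothetical, and the 256-byte value in the usage note is an assumed alignment for illustration, not a documented NVENC requirement.

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical helper: true when `addr` is a multiple of `alignment` bytes.
// `alignment` must be a non-zero power of two; anything else returns false.
inline bool is_aligned(std::uintptr_t addr, std::size_t alignment) {
    const bool power_of_two = alignment != 0 && (alignment & (alignment - 1)) == 0;
    return power_of_two && (addr & (alignment - 1)) == 0;
}

// Hypothetical helper: true when a pitched 2D buffer is self-consistent,
// i.e. the pitch covers one full row and every row starts on an aligned address
// (assuming the base pointer itself is aligned).
inline bool pitch_is_consistent(std::size_t pitch, std::size_t row_bytes,
                                std::size_t alignment) {
    return pitch >= row_bytes && is_aligned(pitch, alignment);
}
```

For example, with an assumed alignment of 256 bytes, `is_aligned(0x7fd34fc00600, 256)` is true for the address in the report above, and `pitch_is_consistent(2048, 1920, 256)` is true for a 1920-byte row stored with a 2048-byte pitch. The first result only says that this address is 256-byte aligned. It does not say whether my input buffers are valid.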