I’m currently working on a very large CUDA application that allocates and frees millions of pitched allocations with cudaMallocPitch and cudaFree. I’m consistently hitting a crash inside cudaMallocPitch (unfortunately only after several hours of processing), with a stack trace ending in:
Program received signal SIGSEGV, Segmentation fault.
0x00007f4ecc468566 in ?? () from /usr/lib/libcuda.so.1
#1 0x00007f4ecc45bce3 in ?? () from /usr/lib/libcuda.so.1
#2 0x00007f4ecc45041a in ?? () from /usr/lib/libcuda.so.1
#3 0x00007f4ecc2040c3 in cudaMallocPitch () from /usr/local/cuda/lib/libcudart.so.2
I’m allocating a 2048x1556 plane, and I should have plenty of CUDA memory available. Any ideas on where to start chasing this? As I said, it takes hours and millions of allocations before the crash happens, so it’s tricky to track down. Are there any known problems I should look out for? I’m running driver version 177.67 on a 280 card. Thanks.
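For reference, here is a minimal sketch of the allocation pattern as a standalone stress test, which might help reproduce the problem outside the full application. It makes several assumptions that are not from the real code: the element type is float, each plane is freed right after it is allocated, the iteration counts are arbitrary, and the toolkit provides cudaMemGetInfo in the runtime API (older toolkits may only expose the driver API equivalent). The `check` helper is hypothetical.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical helper: report which call failed and on which iteration, then exit.
static void check(cudaError_t err, const char *what, unsigned long iter) {
    if (err != cudaSuccess) {
        std::fprintf(stderr, "%s failed at iteration %lu: %s\n",
                     what, iter, cudaGetErrorString(err));
        std::exit(EXIT_FAILURE);
    }
}

int main() {
    // Assumption: a 2048x1556 plane of float elements.
    const size_t widthBytes = 2048 * sizeof(float);
    const size_t height = 1556;
    const unsigned long iterations = 10000000UL;  // arbitrary
    const unsigned long reportEvery = 100000UL;   // arbitrary

    for (unsigned long i = 0; i < iterations; ++i) {
        void *plane = NULL;
        size_t pitch = 0;

        // Check every return code so a failure is reported where it first occurs.
        check(cudaMallocPitch(&plane, &pitch, widthBytes, height),
              "cudaMallocPitch", i);
        check(cudaFree(plane), "cudaFree", i);

        // Periodically log free device memory to spot a leak or steady decline.
        if (i % reportEvery == 0) {
            size_t freeBytes = 0, totalBytes = 0;
            check(cudaMemGetInfo(&freeBytes, &totalBytes), "cudaMemGetInfo", i);
            std::printf("iter %lu: %lu of %lu bytes free (pitch %lu)\n", i,
                        (unsigned long)freeBytes, (unsigned long)totalBytes,
                        (unsigned long)pitch);
        }
    }
    return 0;
}
```

If this loop runs cleanly for hours while the real application still crashes, that would point toward something in the application (for example a double free or a device pointer overwritten on the host) rather than the allocator alone.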