Is there a way to find out if the size of my kernel is getting too big?
I have a quite complex kernel with lots of branches, and compiling it fails with:
1>nvopencc ERROR: C:\CUDA\bin64/…/open64/lib//be.exe returned non-zero status -1073741571
The program compiles and works fine in emulation mode.
I am using CUDA 2.3 on 64-bit.
It also shows error 0x2:
1>nvopencc ERROR: C:\CUDA\bin64/…/open64/lib//be.exe returned non-zero status -1073741571
1># --error 0x2 --
LSChien
Are you sure the problem is due to the large size of your kernel?
You can comment out parts of your kernel and check whether the error still occurs.
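
One way to do that without deleting code is to wrap sections in preprocessor guards and rebuild with one section switched off at a time. This is only a rough sketch: `processElement`, `complexKernel`, and the `ENABLE_STAGE_*` switches are made-up names standing in for your real kernel, not anything from your code.

```cuda
// Hypothetical switches: set one to 0, rebuild, and see whether be.exe still fails.
#define ENABLE_STAGE_A 1
#define ENABLE_STAGE_B 1

// Stand-in for the per-element work of a branch-heavy kernel.
__host__ __device__ float processElement(float v)
{
#if ENABLE_STAGE_A
    if (v < 0.0f) v = -v;      // stage A: take the absolute value
#endif
#if ENABLE_STAGE_B
    if (v > 1.0f) v = 1.0f;    // stage B: clamp to at most 1
#endif
    return v;
}

// Illustrative kernel: each thread processes one element in place.
__global__ void complexKernel(float *data, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] = processElement(data[i]);
}
```

If the build succeeds with one stage disabled, the construct that upsets the compiler is probably inside that stage, and you can keep narrowing it down the same way.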
In fact, you can use loop unrolling to flatten a for-loop, and such unrolling
leads to very large binary code, so code size may not be your problem.
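
For reference, a minimal sketch of the kind of unrolling meant here, assuming a loop whose trip count is known at compile time. `TAPS`, `dotTaps`, and `dotKernel` are hypothetical names chosen for illustration.

```cuda
// Hypothetical fixed trip count; a larger value gives more unrolled code.
#define TAPS 8

// Dot product over TAPS elements; the pragma asks the compiler to
// replace the loop with TAPS copies of its body.
__host__ __device__ float dotTaps(const float *a, const float *b)
{
    float acc = 0.0f;
#pragma unroll
    for (int k = 0; k < TAPS; ++k)
        acc += a[k] * b[k];
    return acc;
}

// Illustrative kernel: thread i handles the i-th group of TAPS elements.
__global__ void dotKernel(const float *a, const float *b, float *out, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) out[i] = dotTaps(a + i * TAPS, b + i * TAPS);
}
```

Raising `TAPS` makes the generated code grow roughly in proportion, which is a simple way to see how much straight-line code the compiler accepts before suspecting size as the cause.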