I’m developing an image processing application under CUDA.
The program works correctly in device emulation mode (make emu=1, with or without dbg=1), producing the correct resulting image, while running on the device, the resulting image contains a “NaN” in every pixel.
This happens independently of the number of threads (I tried from 1 to 128).
Do you have any idea about what problem it could be?
I work with a GeForce 9800 GTX under Fedora 9. The host is an AMD 64 3200+.
well, then I would at first try to write a constant value to your pixels, then a value dependend on threadIdx. If that is all ok, you probably have an error in your calculation code.
In emulation mode, TransVol contains only 0.0’s, as expected. In device mode (release or debug) it contains only nan’s. I have really no idea, why it happens!
At the moment, NVoxelsX = NVoxelsY =500, NVoxelsZ =1. The returned value for VolPitch is 64.