Hi all, I’m still very green at CUDA programming.
I tried to parallelize some modules using the usual CUDA steps:
- allocate memory on the device (cudaMalloc)
- copy the data from the host to the device (cudaMemcpy)
- do whatever computation I need on the device
- copy the result back to the host (cudaMemcpy)
- free the allocated device memory (cudaFree)
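For reference, the steps above usually look something like the minimal sketch below. It is not the actual kernel.cu: the kernel `scale`, the function `double_on_device`, the `CHECK` macro, and the size `N` are all made-up names for illustration. Checking every return code this way at least shows whether a CUDA call failed before the crash.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical helper: print the CUDA error and abort if a runtime call fails.
#define CHECK(call)                                               \
    do {                                                          \
        cudaError_t err_ = (call);                                \
        if (err_ != cudaSuccess) {                                \
            fprintf(stderr, "%s:%d: %s\n", __FILE__, __LINE__,    \
                    cudaGetErrorString(err_));                    \
            exit(EXIT_FAILURE);                                   \
        }                                                         \
    } while (0)

// Hypothetical kernel: doubles each element of the array.
__global__ void scale(float *d_data, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)                 // guard: the last block may have spare threads
        d_data[i] *= 2.0f;
}

// Runs the five steps on a host array of n floats (assumed n > 0).
void double_on_device(float *h_data, int n)
{
    const size_t bytes = (size_t)n * sizeof(float);
    float *d_data = NULL;

    // 1. allocate memory on the device
    CHECK(cudaMalloc((void **)&d_data, bytes));

    // 2. copy host -> device (destination first, then source)
    CHECK(cudaMemcpy(d_data, h_data, bytes, cudaMemcpyHostToDevice));

    // 3. compute on the device; d_data is only dereferenced inside the kernel
    const int threads = 256;
    const int blocks = (n + threads - 1) / threads;
    scale<<<blocks, threads>>>(d_data, n);
    CHECK(cudaGetLastError());        // catches launch errors
    CHECK(cudaDeviceSynchronize());   // catches errors during execution

    // 4. copy device -> host
    CHECK(cudaMemcpy(h_data, d_data, bytes, cudaMemcpyDeviceToHost));

    // 5. free the device memory
    CHECK(cudaFree(d_data));
}

int main(void)
{
    const int N = 1024;
    float *h_data = (float *)malloc(N * sizeof(float));
    if (h_data == NULL) {
        fprintf(stderr, "host malloc failed\n");
        return EXIT_FAILURE;
    }
    for (int i = 0; i < N; ++i)
        h_data[i] = (float)i;

    double_on_device(h_data, N);

    printf("h_data[10] = %f\n", h_data[10]);
    free(h_data);
    return 0;
}
```

In this sketch the host code only ever reads and writes `h_data`, and `d_data` is touched only by the kernel and the cudaMemcpy calls.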
I’ve successfully compiled kernel.cu (producing kernel.o), but when I run the executable
I get a Segmentation Fault error.
Can you help me with this? What are the most common causes of a “Segmentation Fault” in a CUDA program?
Are they the same as in plain C?
Thanks in advance…
I appreciate your help…