Friends I am trying to change my c++ code to Cuda but I am not getting the desired output and also there is no error from Cuda error check.
I have tried my best. Its FWI code where I am calling 2 kernels, once for calculation other for updating my values. Please help…
My code:-//the commented out for loops is code that I have converted to Cuda.
If we uncomment the for loops after the kernel calls and comment out the kernel calls and also the Cuda memcpy in the end (for vz array) then we will get to see the correct output.
My Cuda Code where Conversion took place
Please have a look and let me know if you need any more information of any sort.
NOTE: Those who are trying to run the code, there may be an error while deallocation at the end of program. Please ignore it as it has no effect on our output.