Hi all,
I have mentioned in previous post about my fortran code generating NaN (not a n number) error in the middle if execution. I have used cudamemcheck tool to diagnose. I am not familiar with it so I am posting what I get when running memcheck on my executable code (Quick5.exe): 12 severe errors.
I am compiling the code using PGF 13.9 fortran compiler (and cuda toolkit 5.0) with micro-soft VS 2010.
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\cudart64_50_35.dll (cudaLaunch + 0x1a9) [0x234c9]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (kernels_getreynvarqnj_kernel_ + 0x2a0) [0x4a510]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (reyneq3_ + 0x1dc0) [0x87a50]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (vcycle_ + 0x3c29) [0x98239]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (fullmult_ + 0x74d) [0x989fd]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (initcasepreadapt_ + 0x2e8) [0x6bb58]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (MAIN_ + 0x7ca4) [0x67954]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (main + 0x70) [0x10e0]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (__tmainCRTStartup + 0x136) [0x11e6e6]
========= Host Frame:C:\Windows\system32\KERNEL32.DLL (BaseThreadInitThunk + 0x1a) [0x1832]
========= Host Frame:C:\Windows\SYSTEM32\ntdll.dll (RtlUserThreadStart + 0x21) [0x5d609]
========= Invalid local write of size 8
========= at 0x00000190 in kernels_getreynvarqnj_kernel_
========= by thread (4,12,0) in block (0,2,0)
========= Address 0x00fffc08 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame:C:\Windows\SYSTEM32\nvcuda.dll (cuLaunchKernel + 0x1b2) [0xe042]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\cudart64_50_35.dll [0x3706]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\cudart64_50_35.dll (cudaLaunch + 0x1a9) [0x234c9]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (kernels_getreynvarqnj_kernel_ + 0x2a0) [0x4a510]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (reyneq3_ + 0x1dc0) [0x87a50]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (vcycle_ + 0x3c29) [0x98239]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (fullmult_ + 0x74d) [0x989fd]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (initcasepreadapt_ + 0x2e8) [0x6bb58]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (MAIN_ + 0x7ca4) [0x67954]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (main + 0x70) [0x10e0]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (__tmainCRTStartup + 0x136) [0x11e6e6]
========= Host Frame:C:\Windows\system32\KERNEL32.DLL (BaseThreadInitThunk + 0x1a) [0x1832]
========= Host Frame:C:\Windows\SYSTEM32\ntdll.dll (RtlUserThreadStart + 0x21) [0x5d609]
========= Invalid local write of size 8
========= at 0x00000190 in kernels_getreynvarqnj_kernel_
========= by thread (3,12,0) in block (0,2,0)
========= Address 0x00fffc08 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame:C:\Windows\SYSTEM32\nvcuda.dll (cuLaunchKernel + 0x1b2) [0xe042]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\cudart64_50_35.dll [0x3706]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\cudart64_50_35.dll (cudaLaunch + 0x1a9) [0x234c9]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (kernels_getreynvarqnj_kernel_ + 0x2a0) [0x4a510]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (reyneq3_ + 0x1dc0) [0x87a50]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (vcycle_ + 0x3c29) [0x98239]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (fullmult_ + 0x74d) [0x989fd]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (initcasepreadapt_ + 0x2e8) [0x6bb58]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (MAIN_ + 0x7ca4) [0x67954]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (main + 0x70) [0x10e0]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (main + 0x70) [0x10e0]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (__tmainCRTStartup + 0x136) [0x11e6e6]
========= Host Frame:C:\Windows\system32\KERNEL32.DLL (BaseThreadInitThunk + 0x1a) [0x1832]
========= Host Frame:C:\Windows\SYSTEM32\ntdll.dll (RtlUserThreadStart + 0x21) [0x5d609]
========= Invalid local write of size 8
========= at 0x00000190 in kernels_getreynvarqnj_kernel_
========= by thread (2,12,0) in block (0,2,0)
========= Address 0x00fffc08 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame:C:\Windows\SYSTEM32\nvcuda.dll (cuLaunchKernel + 0x1b2) [0xe042]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\cudart64_50_35.dll [0x3706]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\cudart64_50_35.dll (cudaLaunch + 0x1a9) [0x234c9]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (kernels_getreynvarqnj_kernel_ + 0x2a0) [0x4a510]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (reyneq3_ + 0x1dc0) [0x87a50]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (vcycle_ + 0x3c29) [0x98239]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (fullmult_ + 0x74d) [0x989fd]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (initcasepreadapt_ + 0x2e8) [0x6bb58]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (MAIN_ + 0x7ca4) [0x67954]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (main + 0x70) [0x10e0]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (__tmainCRTStartup + 0x136) [0x11e6e6]
========= Host Frame:C:\Windows\system32\KERNEL32.DLL (BaseThreadInitThunk + 0x1a) [0x1832]
========= Host Frame:C:\Windows\SYSTEM32\ntdll.dll (RtlUserThreadStart + 0x21) [0x5d609]
========= Invalid local write of size 8
========= at 0x00000190 in kernels_getreynvarqnj_kernel_
========= by thread (1,12,0) in block (0,2,0)
========= Address 0x00fffc08 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame:C:\Windows\SYSTEM32\nvcuda.dll (cuLaunchKernel + 0x1b2) [0xe042]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\cudart64_50_35.dll [0x3706]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\cudart64_50_35.dll (cudaLaunch + 0x1a9) [0x234c9]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (kernels_getreynvarqnj_kernel_ + 0x2a0) [0x4a510]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (reyneq3_ + 0x1dc0) [0x87a50]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (vcycle_ + 0x3c29) [0x98239]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (fullmult_ + 0x74d) [0x989fd]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (initcasepreadapt_ + 0x2e8) [0x6bb58]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (MAIN_ + 0x7ca4) [0x67954]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (main + 0x70) [0x10e0]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (__tmainCRTStartup + 0x136) [0x11e6e6]
========= Host Frame:C:\Windows\system32\KERNEL32.DLL (BaseThreadInitThunk + 0x1a) [0x1832]
========= Host Frame:C:\Windows\SYSTEM32\ntdll.dll (RtlUserThreadStart + 0x21) [0x5d609]
========= Invalid local write of size 8
========= at 0x00000190 in kernels_getreynvarqnj_kernel_
========= by thread (0,12,0) in block (0,2,0)
========= Address 0x00fffc08 is out of bounds
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame:C:\Windows\SYSTEM32\nvcuda.dll (cuLaunchKernel + 0x1b2) [0xe042]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\cudart64_50_35.dll [0x3706]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\cudart64_50_35.dll (cudaLaunch + 0x1a9) [0x234c9]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (kernels_getreynvarqnj_kernel_ + 0x2a0) [0x4a510]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (reyneq3_ + 0x1dc0) [0x87a50]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (vcycle_ + 0x3c29) [0x98239]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (fullmult_ + 0x74d) [0x989fd]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (initcasepreadapt_ + 0x2e8) [0x6bb58]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (MAIN_ + 0x7ca4) [0x67954]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (main + 0x70) [0x10e0]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (__tmainCRTStartup + 0x136) [0x11e6e6]
========= Host Frame:C:\Windows\system32\KERNEL32.DLL (BaseThreadInitThunk + 0x1a) [0x1832]
========= Host Frame:C:\Windows\SYSTEM32\ntdll.dll (RtlUserThreadStart + 0x21) [0x5d609]
========= Program hit error 30 on CUDA API call to cudaThreadSynchronize
========= Saved host backtrace up to driver entry point at error
========= Host Frame:C:\Windows\SYSTEM32\nvcuda.dll (cuProfilerStop + 0xa0432) [0xbfc12]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\cudart64_50_35.dll (cudaThreadSynchronize + 0x218) [0x1e1b8]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (cudathreadsynchronize_ + 0x12) [0xaa312]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (reyneq3_ + 0x1dc8) [0x87a58]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (vcycle_ + 0x3c29) [0x98239]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (fullmult_ + 0x74d) [0x989fd]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (initcasepreadapt_ + 0x2e8) [0x6bb58]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (MAIN_ + 0x7ca4) [0x67954]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (main + 0x70) [0x10e0]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (__tmainCRTStartup + 0x136) [0x11e6e6]
========= Host Frame:C:\Windows\system32\KERNEL32.DLL (BaseThreadInitThunk + 0x1a) [0x1832]
========= Host Frame:C:\Windows\SYSTEM32\ntdll.dll (RtlUserThreadStart + 0x21) [0x5d609]
========= Program hit error 30 on CUDA API call to cudaLaunch
========= Saved host backtrace up to driver entry point at error
========= Host Frame:C:\Windows\SYSTEM32\nvcuda.dll (cuProfilerStop + 0xa0432) [0xbfc12]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\cudart64_50_35.dll (cudaLaunch + 0x2a5) [0x235c5]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (kernels_getreynvarak_kernel_ + 0x36e) [0x4a88e]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (reyneq3_ + 0x207e) [0x87d0e]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (vcycle_ + 0x3c29) [0x98239]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (fullmult_ + 0x74d) [0x989fd]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (initcasepreadapt_ + 0x2e8) [0x6bb58]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (MAIN_ + 0x7ca4) [0x67954]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (main + 0x70) [0x10e0]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (__tmainCRTStartup + 0x136) [0x11e6e6]
========= Host Frame:C:\Windows\system32\KERNEL32.DLL (BaseThreadInitThunk + 0x1a) [0x1832]
========= Host Frame:C:\Windows\SYSTEM32\ntdll.dll (RtlUserThreadStart + 0x21) [0x5d609]
========= Program hit error 30 on CUDA API call to cudaThreadSynchronize
========= Saved host backtrace up to driver entry point at error
========= Host Frame:C:\Windows\SYSTEM32\nvcuda.dll (cuProfilerStop + 0xa0432) [0xbfc12]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\cudart64_50_35.dll (cudaLaunch + 0x2a5) [0x235c5]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (kernels_getreynvarak_kernel_ + 0x36e) [0x4a88e]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (reyneq3_ + 0x207e) [0x87d0e]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (vcycle_ + 0x3c29) [0x98239]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (fullmult_ + 0x74d) [0x989fd]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (initcasepreadapt_ + 0x2e8) [0x6bb58]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (MAIN_ + 0x7ca4) [0x67954]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (main + 0x70) [0x10e0]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (__tmainCRTStartup + 0x136) [0x11e6e6]
========= Host Frame:C:\Windows\system32\KERNEL32.DLL (BaseThreadInitThunk + 0x1a) [0x1832]
========= Host Frame:C:\Windows\SYSTEM32\ntdll.dll (RtlUserThreadStart + 0x21) [0x5d609]
========= Program hit error 30 on CUDA API call to cudaThreadSynchronize
========= Saved host backtrace up to driver entry point at error
========= Host Frame:C:\Windows\SYSTEM32\nvcuda.dll (cuProfilerStop + 0xa0432) [0xbfc12]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\cudart64_50_35.dll (cudaThreadSynchronize + 0x218) [0x1e1b8]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (cudathreadsynchronize_ + 0x12) [0xaa312]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (reyneq3_ + 0x2086) [0x87d16]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (vcycle_ + 0x3c29) [0x98239]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (fullmult_ + 0x74d) [0x989fd]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (initcasepreadapt_ + 0x2e8) [0x6bb58]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (MAIN_ + 0x7ca4) [0x67954]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (main + 0x70) [0x10e0]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (__tmainCRTStartup + 0x136) [0x11e6e6]
========= Host Frame:C:\Windows\system32\KERNEL32.DLL (BaseThreadInitThunk + 0x1a) [0x1832]
========= Host Frame:C:\Windows\SYSTEM32\ntdll.dll (RtlUserThreadStart + 0x21) [0x5d609]
========= Program hit error 30 on CUDA API call to cudaMemcpy
========= Saved host backtrace up to driver entry point at error
========= Host Frame:C:\Windows\SYSTEM32\nvcuda.dll (cuProfilerStop + 0xa0432) [0xbfc12]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\cudart64_50_35.dll (cudaMemcpy + 0x2ae) [0x27dae]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (pgf90_dev_copyout + 0x4c) [0xa727c]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (reyneq3_ + 0x21d3) [0x87e63]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (vcycle_ + 0x3c29) [0x98239]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (fullmult_ + 0x74d) [0x989fd]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (initcasepreadapt_ + 0x2e8) [0x6bb58]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (MAIN_ + 0x7ca4) [0x67954]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (main + 0x70) [0x10e0]
========= Host Frame:C:\Users\Dolf\Desktop\quick 5 test results\run\Quick5.exe (__tmainCRTStartup + 0x136) [0x11e6e6]
========= Host Frame:C:\Windows\system32\KERNEL32.DLL (BaseThreadInitThunk + 0x1a) [0x1832]
========= Host Frame:C:\Windows\SYSTEM32\ntdll.dll (RtlUserThreadStart + 0x21) [0x5d609]
========= ERROR SUMMARY: 12 errors
any ideas? which ones are the 12 errors I need to fix?
thanks,
Dolf