I’m new to CUDA Fortran. I’m testing CUDA Fortran and comparing with CUDA C. According to my test, for a same problem, it seems CUDA Fortran is about 2 times slower than its CUDA C version.
I don’t know what’s wrong with my CUDA Fortran code. Maybe I missed something when compiling the CUDA Fortran version. I used the following command: pgfortran -fast xxx.cuf
Anyone has the same problem as me?