Originally published at: https://developer.nvidia.com/blog/easy-introduction-cuda-fortran/
CUDA Fortran for Scientists and Engineers shows how high-performance application developers can leverage the power of GPUs using Fortran. This post is the first in a series on CUDA Fortran, which is the Fortran interface to the CUDA parallel computing platform. If you are familiar with CUDA C, then you are already well on your…
Hi all,
I am a beginner to CUDA (CUDA Fortran). I have installed PGI 13.9, but when I try to debug even a very simple CUDA Fortran code I get plenty of errors such as the ones below:
Error 1 unresolved external symbol cudaSetupArgument referenced in function mathops_saxpy_ saxp.obj
Error 2 unresolved external symbol cudaLaunch referenced in function mathops_saxpy_ saxp.obj
Error 3 unresolved external symbol __cudaRegisterFatBinary referenced in function mathops_saxpy_ saxp.obj
Error 4 unresolved external symbol __cudaRegisterFunction referenced in function mathops_saxpy_ saxp.obj
Error 5 unresolved external symbol __cudaUnregisterFatBinary referenced in function mathops_saxpy_ saxp.obj
Error 6 unresolved external symbol pgf90_dev_auto_alloc04 referenced in function MAIN_ saxp.obj
Error 7 unresolved external symbol pgf90_dev_copyin referenced in function MAIN_ saxp.obj
...
Error 12 unresolved external symbol CUDAFOR saxp.obj
Furthermore, when I check the project properties I see that CUDA Fortran is not even enabled; when I enable it and debug again, I get the same errors. I would appreciate it if someone could help me with this problem.
Thanks,
Reza
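For reference, those unresolved symbols (cudaLaunch, __cudaRegisterFatBinary, and so on) are CUDA runtime entry points, so linker errors like these usually mean the file was compiled or linked without CUDA Fortran support enabled. A minimal sketch of a command-line build that does pull in the CUDA runtime, assuming the single-file saxpy.cuf example from the post:

pgf90 -Mcuda -o saxpy saxpy.cuf

In PGI Visual Fortran the rough equivalent is turning on the CUDA Fortran option in the project's Fortran language properties (the exact property name may differ between PVF versions).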
Hi,
I tried to run this script and it returned 'Max error: 2.0000'.
Where does this error come from?
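For context, in the post's saxpy example the host arrays are initialized to x = 1.0 and y = 2.0 with a = 2.0, so every element of y should come back as 4.0, and the program prints maxval(abs(y-4.0)). A max error of 2.0 therefore means y came back unchanged, i.e. the kernel never actually ran. A minimal sketch of a check after the launch (istat is assumed to be declared as an integer; the error-query functions come from the cudafor module):

call saxpy<<<grid, tBlock>>>(x_d, y_d, a)
istat = cudaGetLastError()                      ! did the launch itself fail?
if (istat /= cudaSuccess) print *, trim(cudaGetErrorString(istat))
y = y_d                                         ! copy the result back to the host
write(*,*) 'Max error: ', maxval(abs(y-4.0))    ! 2.0 here means y is still 2.0

A frequent cause is building for a compute capability that does not match the GPU, as the follow-up below about -Mcuda=cc60 suggests.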
From cudaDeviceProp in CUDA Fortran I got:
Device Number: 0
Device name: GeForce GTX 1060 3GB
Memory Clock Rate (KHz): 4004000
Memory Bus Width (bits): 192
Peak Memory Bandwidth (GB/s): 192.19
This works when I use pgf90 -Mcuda=cc60 -o saxpy saxpy.cuf to compile.
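For reference, output like this can be produced with a device-query program along the lines of the one in the post; a minimal sketch (field names taken from the cudaDeviceProp type in the cudafor module):

program deviceQuery
  use cudafor
  implicit none
  type (cudaDeviceProp) :: prop
  integer :: istat
  istat = cudaGetDeviceProperties(prop, 0)
  write(*,"('  Device name: ',a)") trim(prop%name)
  write(*,"('  Memory Clock Rate (KHz): ',i0)") prop%memoryClockRate
  write(*,"('  Memory Bus Width (bits): ',i0)") prop%memoryBusWidth
  write(*,"('  Peak Memory Bandwidth (GB/s): ',f7.2)") &
       2.0 * prop%memoryClockRate * (prop%memoryBusWidth/8) / 10.0**6
end program deviceQuery

The peak-bandwidth line is just 2 x memory clock x bus width in bytes, which for the numbers above works out to the 192.19 GB/s shown.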
Thanks! I had the exact same problem, and your solution fixed it.
What we really need to discuss is why the "results" differ depending on the compute capability (or compiler version).
Hi, I am experiencing the same problem you did. It seems to me that the kernel subroutine never returns any values to the host. I have tried your suggestion, but it is still not working.
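One way to check whether the kernel is actually executing is to synchronize after the launch and inspect the error status; a minimal sketch, assuming the saxpy example from the post and an integer istat:

call saxpy<<<grid, tBlock>>>(x_d, y_d, a)
istat = cudaDeviceSynchronize()    ! wait for the kernel; returns launch/execution errors
if (istat /= cudaSuccess) print *, 'Kernel error: ', trim(cudaGetErrorString(istat))

If this reports something like an invalid device function or missing kernel image, the code was most likely built for the wrong compute capability (see the -Mcuda=ccXY suggestion above).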
I have a question about the p2pBandwidth code on page 128. Lines 52-53 and lines 55-56 are the same. Is that right?
50 do i = 0, nDevices-1
51    if (i == j) cycle
52    istat = cudaMemcpyPeer(distArray(j)%a_d, j, &
53         distArray(i)%a_d, i, N)
54    istat = cudaEventRecord(startEvent, 0)
55    istat = cudaMemcpyPeer(distArray(j)%a_d, j, &
56         distArray(i)%a_d, i, N)
57    istat = cudaEventRecord(stopEvent, 0)
58    istat = cudaEventSynchronize(stopEvent)
59    istat = cudaEventElapsedTime(time, &
60         startEvent, stopEvent)
I think lines 52-53 should be removed. Maybe I am wrong; if not, could you give me an explanation?
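For reference, a common pattern in bandwidth tests, and possibly what the book intends here (that is an assumption, not a confirmation), is to issue one untimed warm-up transfer before the timed one so that any one-time peer-access setup cost is excluded from the measurement. A sketch of that reading, using the same names as the snippet above:

! untimed warm-up copy: absorbs any one-time setup cost of the peer path
istat = cudaMemcpyPeer(distArray(j)%a_d, j, distArray(i)%a_d, i, N)
! timed copy, bracketed by the events
istat = cudaEventRecord(startEvent, 0)
istat = cudaMemcpyPeer(distArray(j)%a_d, j, distArray(i)%a_d, i, N)
istat = cudaEventRecord(stopEvent, 0)
istat = cudaEventSynchronize(stopEvent)
istat = cudaEventElapsedTime(time, startEvent, stopEvent)

On that reading, lines 52-53 are not redundant even though they repeat lines 55-56.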