Recently I have been in the mix of developing a almost complete FORTRAN wrapper for the cuda runtime api host code using the FORTRAN 2003 ISO C bind intrinsics. But as some of you may know that the pointers in FORTRAN are just for memory addressing , you cannot perform arithmetic operations on them for dynamic addressing. Hence, Now I am stuck on how to replicate the cuda streams feature equivalent to the one shown in programming guide 2.3 page 34 (section 184.108.40.206) . Does anyone know how to go about replicating that via a wrapper in FORTRAN ?
I looked at the pinned memory example on Nvidia’s website; it doesn’t tell anything about streams as it just calls CUBLAS.
Anyone from Nvidia/here at forums help me out here please ? I plan to release this whole wrapper as a library for all but am stuck rite now :( because of this problem…
thanks , merry xmas.