I want to use the cublas routines from Portland fortran or in some caes intel. The fotran.c fail to compile. I do not have g95 available. I have gcc suite 4.8.0 (gcc/gfortran), Portland 13.3 and Intel 2013.3 available and new Kepler architecture cards. The SDK examples work fine, Portland accelerator work fine. However, I would like to be able to call sgemm and dgemm (c and z too) from Fortran compiled with the Portland compiler.
How do I build the fortran.c wrapper so I can get a library that I can link in and call ?gemm from fortran using Portland or Intel ?
The performance in the matrix multiply examples in the SDK yield over one Tflops/s which is quite good.
Hence my wish to call the highly optimized blas routines.