I’m trying the example programs from the June 2009 PGInsider article but having problems with the third example program, C version. I use ‘-ta=nvidia,host’ when compiling but compiler info messages show only a host version of the ‘smooth’ function is generated. When run, the program has numeric problems (most values fail verification tests).
The Fortran unified version builds both host and GPU kernels and satisfies validation tests. When built with just ‘-ta=nvidia’ the C version builds a GPU kernel and satisfies validation tests.
This is 9.0-3 on Linux (Fedora 10), with a cc11 device.
Any suggestions to debug where this is going wrong, or a workaround for generating working unified binaries?