'No binary for GPU' error occurred with CUDACast #3 sample

Hello

I’m trying to use OpenACC offloading to an NVIDIA GPU (with CUDA).
I downloaded the sample code from CUDACast #3 on YouTube.
(cudacasts/ep3-first-openacc-program at master · NVIDIA-developer-blog/cudacasts · GitHub)

I compiled the code with the pgcc compiler:
: $ pgcc -acc -Minfo=accel -ta=nvidia,cuda7.0 -o laplace2d_acc laplace2d.c

But when I ran it, I got the error message below:
: call to cuModuleLoadData returned error 209: No binary for GPU

I found someone who had a similar error, so I changed ‘double’ to ‘float’.
But the error still occurs…
(and I HAVE TO WORK IN UBUNTU)
(cuModuleLoadData error 209)


I installed CUDA 7.0, and I checked that it works by running the deviceQuery example.


What do I have to do now? I need your help.


This is my desktop spec.

  • Intel x86_64
  • Ubuntu 14.04.2 LTS
  • NVIDIA GeForce GTX 960 (CUDA compute capability 5.2)
  • installed CUDA : CUDA 7.0
  • compiler : /opt/pgi/linux86-64/15.4 (TRIAL LICENSE)

This is the compile and run log:

openacc_test $ pgcc -acc -Minfo=accel -o laplace2d_acc laplace2d.c
main:
49, Generating copy(A[:][:])
Generating create(Anew[:][:])
55, Generating Tesla code
56, Loop carried scalar dependence for error_num at line 62
Scalar last value needed after loop for error_num at line 76
Accelerator restriction: scalar variable live-out from loop: error_num
Accelerator scalar kernel generated
58, Loop carried scalar dependence for error_num at line 62
Scalar last value needed after loop for error_num at line 76
Accelerator restriction: scalar variable live-out from loop: error_num
67, Generating Tesla code
68, Loop is parallelizable
70, Loop is parallelizable
Accelerator kernel generated
68, #pragma acc loop gang /* blockIdx.y */
70, #pragma acc loop gang, vector(128) /* blockIdx.x threadIdx.x */
openacc_test $ ./laplace2d_acc
Jacobi relaxation Calculation: 4096 x 4096 mesh
call to cuModuleLoadData returned error 209: No binary for GPU
openacc_test $
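
For reference, the accelerated part of the code looks roughly like this. This is only my paraphrase of the CUDACast laplace2d.c after the double-to-float change (smaller mesh, approximate variable names), so line numbers won't match the -Minfo output above exactly:

/* Paraphrased sketch of the CUDACast laplace2d example, not the exact file:
   mesh size reduced and variable names approximated. */
#include <math.h>
#include <stdio.h>

#define NN 1024
#define NM 1024

static float A[NN][NM], Anew[NN][NM];

int main(void)
{
    const float tol = 1.0e-4f;
    const int iter_max = 1000;
    float error = 1.0f;
    int iter = 0;

    /* Boundary condition: left edge held at 1, interior starts at 0. */
    for (int j = 0; j < NN; j++)
        A[j][0] = 1.0f;

    printf("Jacobi relaxation Calculation: %d x %d mesh\n", NN, NM);

    /* Keep A resident on the GPU for the whole solve; Anew is device-only
       scratch space. This corresponds to the "Generating copy(A[:][:])" /
       "Generating create(Anew[:][:])" messages in the -Minfo output. */
    #pragma acc data copy(A) create(Anew)
    while (error > tol && iter < iter_max)
    {
        error = 0.0f;

        /* Stencil sweep: compute the new value of every interior point and
           track the largest change (the scalar the compiler warns about). */
        #pragma acc kernels
        for (int j = 1; j < NN - 1; j++)
            for (int i = 1; i < NM - 1; i++)
            {
                Anew[j][i] = 0.25f * (A[j][i+1] + A[j][i-1]
                                    + A[j-1][i] + A[j+1][i]);
                error = fmaxf(error, fabsf(Anew[j][i] - A[j][i]));
            }

        /* Copy the updated grid back into A for the next iteration. */
        #pragma acc kernels
        for (int j = 1; j < NN - 1; j++)
            for (int i = 1; i < NM - 1; i++)
                A[j][i] = Anew[j][i];

        if (iter % 100 == 0)
            printf("%5d, %0.6f\n", iter, error);
        iter++;
    }

    return 0;
}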

Hi soongk.lee,

The problem here is that you’re using a GTX 960. This card uses the Maxwell architecture (compute capability 5.2). Since we officially only support the NVIDIA Tesla product line and Tesla doesn’t have a Maxwell-based product, we didn’t produce device code capable of running on a Maxwell.
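
If you want to double-check what the compiler sees on your machine, the pgaccelinfo utility that ships with the PGI compilers prints the device properties, including the card's compute capability:

$ pgaccelinfo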

However, we had enough folks using Maxwell that we went ahead and started adding support in our recent 15.7 release.

Which version of the compiler are you using? Can you try using 15.7?
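
With 15.7 you can also ask for Maxwell device code explicitly. Assuming the cc50 sub-option is available in your install, something along the lines of:

$ pgcc -acc -Minfo=accel -ta=tesla:cc50,cuda7.0 -o laplace2d_acc laplace2d.c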

  • Mat

Thank you mkclog.

I compiled the code with the 15.7 version, and it works!
(I was using version 15.4 when I had the trouble.)

Thank you very much :)