gcc passing compiler options to nvcc release 8.0, V8.0.26 - cudafe died signal 11

cuda-coda · September 29, 2016, 6:49pm

cudafe dies during gcc compilation with nvcc error signal 11 (Invalid memory reference)

I have tried to solve the problem by passing the following:-

gcc ADD_CFLAGS = -m64 -mavx2 -mfma -o -shared -pipe -time -mtune=native -fPIC -std=c++11 -Dnvcc–compiler-options=‘–nvlink-options --gpu-architecture=compute_52 --gpu-code=sm52 --shared --relocatable-device-code=true–compile’–ptxas-options=‘–allow-expensive-optimizations --gpu-name sm52 -m64’

I am on RHEL7 SL7 with an Intel Broadwell Core i7 5960 Extreme Edition CPU and triple head Nvidia GTX90 SLI bridged Maxwell video cards

I added the above flags to the config.mk file trying to solve the problem, but I think I need them anyway to optimize the build.

Is this related to earlier bugs reported with cuda?
If so it was supposed to be solved with subsequent releases.

The problem occurred also without the -std=c++11 flag.

How do we avoid invalid memory references such as this?

njuffa · September 29, 2016, 6:54pm

Assuming the toolchain is installed correctly (no corrupt or missing files), a segfault in the compiler during compilation is never a reasonable response, as opposed to an orderly abnormal termination with an appropriate error message. It should always be considered a bug.

I would suggest filing a bug report with NVIDIA, using the form linked from the CUDA registered developer website. In my experience, there are rarely workarounds for bugs of this nature, but if you file a bug there is a chance the compiler team has a recommendation as to how to avoid it.

cuda-coda · September 29, 2016, 6:58pm

Thanks for the quick reply.
I understand it could well be a bug and may consider filing a bug report.

The error message I receive is this:-

nvcc error : ‘cudafe’ died due to signal 11 (Invalid memory reference)
nvcc error : ‘cudafe’ core dumped

From what I have read this has been an ongoing issue since Cuda-2.0
At release 8.0 I would have thought the problem was solved by now.

Any further ideas or suggestions ?

Robert_Crovella · September 29, 2016, 7:00pm

8.0.26 is CUDA 8 RC

before you file a bug, update your system to CUDA 8 (production release) which should be 8.0.44 or something like that.

The error message you are reporting is a generic one that could happen with various different compiler issues. It’s almost certainly not due to a single issue that has been around since CUDA 2.0 and never fixed. So just because you are finding reports of that error message dating back to CUDA 2.0 does not mean that you are having the same underlying issue in the compiler that has never been fixed.

cuda-coda · September 29, 2016, 7:06pm

Thanks txbob, I did think of the same solution and did:-

yum reinstall cuda

I get the follwoing query result

$ nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2016 NVIDIA Corporation
Built on Wed_May__4_21:01:56_CDT_2016
Cuda compilation tools, release 8.0, V8.0.26

When I do:-

yum update cuda

I get No packages marked for update

I will check for 8.0.44

Robert_Crovella · September 29, 2016, 7:09pm

so obviously your method is broken

If you want to install the latest version of cuda, I suggest you go to:

[url]http://www.nvidia.com/getcuda[/url]

find the installation guide appropriate for your OS (i.e. the linux install guide), and follow the instructions there.

cuda-coda · September 29, 2016, 7:12pm

Thanks again txbob.

would it be safest to do:-

yum remove cuda

prior to install 8.0.44 ?

cuda-coda · September 29, 2016, 7:51pm

I did
$sudo yum remove cuda

and then followed the install instructions for cuda-8.0.44

$ nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2016 NVIDIA Corporation
Built on Sun_Sep__4_22:14:01_CDT_2016
Cuda compilation tools, release 8.0, V8.0.44

The same build error persists

nvcc error : ‘cudafe’ died due to signal 11 (Invalid memory reference)
nvcc error : ‘cudafe’ core dumped

Any other ideas?

njuffa · September 29, 2016, 8:07pm

As a sanity check: are you able to successfully build the example programs that ship with CUDA? If so, that means the installed toolchain is functional, and it is extremely likely you are hitting a bug in the CUDAFE component of the CUDA toolchain.

For filing a bug report with NVIDIA, you would want to prepare the smallest possible self-contained code that reproduces the issue and attach that to the bug report. A single, short, source code file plus the nvcc commandline invocation that triggers the segfault would be ideal for this purpose.

Robert_Crovella · September 29, 2016, 8:37pm

agreed, do what njuffa said, then file a bug

Topic		Replies	Views
compile error using CUDA 2.0 'cudafe' died due to signal 11 CUDA Programming and Performance	5	15315	September 11, 2008
'cicc' compilation error and debug flag CUDA Programming and Performance	25	14629	May 23, 2023
nvcc 3.1 compiler error CUDA Programming and Performance	3	9163	July 14, 2010
CentOS 5.5+CUDA3.2rc: 'cudafe' died due to signal 11 rock solid ICE on boost 1.33.1 posix_ti CUDA Programming and Performance	11	3447	November 12, 2010
CUDACOMPILE : nvcc error : 'cudafe++' died with status 0xC0000409 CUDA NVCC Compiler	16	5383	June 3, 2024
permanent CUDAFE crashes due to 0x0 memory reference CUDA Programming and Performance	6	1599	January 19, 2015
'cudafe++' died with status 0xC0000005 (ACCESS_VIOLATION) CUDA Programming and Performance	17	10644	February 27, 2024
Nvcc error : 'cicc' died with status 0xC0000005 - Only in DEBUG mode CUDA NVCC Compiler	7	2559	April 30, 2024
Windows command line NVCC compilation error 'cudafe++' died with status 0xC0000005 (ACCESS_VIOLATION) CUDA Programming and Performance cuda , nvcc	1	1326	January 26, 2024
Error 0xC0000005 with cudafe++ with a very simple code CUDA Programming and Performance	11	61	June 12, 2025

gcc passing compiler options to nvcc release 8.0, V8.0.26 - cudafe died signal 11

Related topics