I tried to compile the bf16TensorCoreGemm CUDA 11.2 sample (located in 0_Simple folder) using make. Following are the error messages:
>>> GCC Version is greater or equal to 5.0.0 <<<
//usr/lib/cuda/bin/nvcc -ccbin g++ -I…/…/common/inc -m64 --std=c++11 --threads 0 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_86,code=compute_86 -o bf16TensorCoreGemm.o -c bf16TensorCoreGemm.cu
g++: internal compiler error: Segmentation fault signal terminated program cc1plus
Please submit a full bug report,
with preprocessed source if appropriate.
See <file:///usr/share/doc/gcc-10/README.Bugs> for instructions.
make: *** [Makefile:357: bf16TensorCoreGemm.o] Error 255
Is it known to be unable to compile this sample in CUDA 11.2? Do I have any chance to fix this issue on my side? How much will it cost me to fix it?
I did not use j8. Just make. VERBOSE=1 did not give any information.
Can you compile on your machine? If you can, how many memory do your machine have? Mine is 32GB.
I can see the similar issue when using gcc 10.3 , see Installation Guide Linux :: CUDA Toolkit Documentation for CUDA 11.2 , it is possible that gcc10 is not formally supported .
I can see it passes on gcc9.3 . Please try to use gcc-9 see if that works for you or try to update to latest CUDA versions .