--fatbin segfaults

zanderso · February 23, 2009, 1:19am

I’d like to use the --fatbin argument to nvcc mentioned in the man page, but doing so always causes nvcc to segfault.

I’m doing:

$ nvcc --ptx incrementArrays.cu

$ nvcc --fatbin incrementArrays.ptx

Segmentation fault

Where incrementArrays.cu is:

// incrementArray.cu

#include <assert.h>

#include <stdio.h>

void incrementArrayOnHost(float *a, int N)

{

  int i;

  for (i=0; i < N; i++) a[i] = a[i]+1.f;

}

__global__ void incrementArrayOnDevice(float *a, int N)

{

  int idx = blockIdx.x*blockDim.x + threadIdx.x;

  if (idx<N) a[idx] = a[idx]+1.f;

}

int main(void)

{

  float *a_h, *b_h;		   // pointers to host memory

  float *a_d;				 // pointer to device memory

  int i, N = 10;

  size_t size = N*sizeof(float);

  // allocate arrays on host

  a_h = (float *)malloc(size);

  b_h = (float *)malloc(size);

  // allocate array on device 

  cudaMalloc((void **) &a_d, size);

  // initialization of host data

  for (i=0; i<N; i++) a_h[i] = (float)i;

  // copy data from host to device

  cudaMemcpy(a_d, a_h, sizeof(float)*N, cudaMemcpyHostToDevice);

  // do calculation on host

  incrementArrayOnHost(a_h, N);

  // do calculation on device:

  // Part 1 of 2. Compute execution configuration

  int blockSize = 4;

  int nBlocks = N/blockSize + (N%blockSize == 0?0:1);

  // Part 2 of 2. Call incrementArrayOnDevice kernel 

  incrementArrayOnDevice <<< nBlocks, blockSize >>> (a_d, N);

  // Retrieve result from device and store in b_h

  cudaMemcpy(b_h, a_d, sizeof(float)*N, cudaMemcpyDeviceToHost);

  // check results

  for (i=0; i<N; i++) assert(a_h[i] == b_h[i]);

  // cleanup

  printf("Success\n");

  free(a_h); free(b_h); cudaFree(a_d); 

  return 0;

}

For reference:

$ nvcc --version

nvcc: NVIDIA ® Cuda compiler driver

Built on Wed_Dec__3_18:29:25_PST_2008

Cuda compilation tools, release 2.1, V0.2.1221

$ uname -a

Linux fortitude 2.6.27-9-generic #1 SMP Thu Nov 20 21:57:00 UTC 2008 i686 GNU/Linux

zanderso · February 25, 2009, 8:08am

As a sanity check, could someone do me the favor of checking that the segfault also happens on their machine? Also, if this is not the right place for this sort of inquiry, could someone please point me in the right direction?

Thanks!

Topic		Replies	Views
-deviceemu crashes on Vista32 CUDA Programming and Performance	0	2030	August 18, 2009
nvcc segfault in Code_Expansion phase CUDA Programming and Performance	0	1594	June 24, 2010
Beginner at Cuda seg faulting CUDA Programming and Performance	0	429	August 31, 2016
NVCC Segfault on boost::format in Host side code in .cu file CUDA Programming and Performance	8	2275	February 8, 2011
Segmentation fault while passing variables from different modules to kernel nvc, nvc++ and nvfortran cuda , kernel , cuda-gdb	4	1131	August 16, 2021
Beginner at Cuda seg faulting CUDA Programming and Performance	2	444	August 31, 2016
first install of cuda CUDA Setup and Installation	6	7650	February 12, 2017
nvcc Segfault CUDA Programming and Performance	6	11415	October 14, 2010
CUDA FORTRAN examples don't work for PGI19.4 Legacy PGI Compilers	7	3720	May 8, 2019
ptxas segfault crashing the compiler on large kernel CUDA Programming and Performance	10	9607	April 4, 2008

--fatbin segfaults

Related topics