unresolved external symbol _main referenced in function ___tmainCRTStartup

coldneut · January 24, 2010, 5:56pm

I am struggling to get my 1st CUDA program to build with Visual Studio 2008 under Windows 7. It compiles OK but gives the linking error:

1>LIBCMTD.lib(crt0.obj) : error LNK2019: unresolved external symbol _main referenced in function ___tmainCRTStartup

The code is taken from the “My first CUDA program” tutorial:

// CUDAVB2008Test.cpp : Defines the entry point for the console application.

#include “stdafx.h”
#include <stdio.h>
#include <cuda.h>
// Kernel that executes on the CUDA device
global void square_array(float *a, int N)
{ int idx = blockIdx.x * blockDim.x + threadIdx.x;
if (idx<N) a[idx] = a[idx] * a[idx];
}
// main routine that executes on the host
int main(void)
{ float *a_h, *a_d; // Pointer to host & device arrays
const int N = 10; // Number of elements in arrays
size_t size = N * sizeof(float);
a_h = (float *)malloc(size); // Allocate array on host
cudaMalloc((void **) &a_d, size); // Allocate array on device
// Initialize host array and copy it to CUDA device
for (int i=0; i<N; i++) a_h[i] = (float)i;
cudaMemcpy(a_d, a_h, size, cudaMemcpyHostToDevice);
// Do calculation on device:
int block_size = 4;
int n_blocks = N/block_size + (N%block_size == 0 ? 0:1);
square_array <<< n_blocks, block_size >>> (a_d, N);
// Retrieve result from device and store it in host array
cudaMemcpy(a_h, a_d, sizeof(float)*N, cudaMemcpyDeviceToHost);
// Print results
for (int i=0; i<N; i++) printf(“%d %f\n”, i, a_h[i]);
// Cleanup
free(a_h); cudaFree(a_d);
}

Can anyone steer me on where to go from here?

iceberg · January 25, 2010, 4:55am

I am struggling to get my 1st CUDA program to build with Visual Studio 2008 under Windows 7. It compiles OK but gives the linking error:

1>LIBCMTD.lib(crt0.obj) : error LNK2019: unresolved external symbol _main referenced in function ___tmainCRTStartup

The code is taken from the “My first CUDA program” tutorial:

// CUDAVB2008Test.cpp : Defines the entry point for the console application.

include “stdafx.h”

include <stdio.h>

include <cuda.h>

// Kernel that executes on the CUDA device

global void square_array(float *a, int N)

{ int idx = blockIdx.x * blockDim.x + threadIdx.x;

if (idx<N) a[idx] = a[idx] * a[idx];

}

// main routine that executes on the host

int main(void)

{ float *a_h, *a_d; // Pointer to host & device arrays

const int N = 10; // Number of elements in arrays

size_t size = N * sizeof(float);

a_h = (float *)malloc(size); // Allocate array on host

cudaMalloc((void **) &a_d, size); // Allocate array on device

// Initialize host array and copy it to CUDA device

for (int i=0; i<N; i++) a_h[i] = (float)i;

cudaMemcpy(a_d, a_h, size, cudaMemcpyHostToDevice);

// Do calculation on device:

int block_size = 4;

int n_blocks = N/block_size + (N%block_size == 0 ? 0:1);

square_array <<< n_blocks, block_size >>> (a_d, N);

// Retrieve result from device and store it in host array

cudaMemcpy(a_h, a_d, sizeof(float)*N, cudaMemcpyDeviceToHost);

// Print results

for (int i=0; i<N; i++) printf(“%d %f\n”, i, a_h[i]);

// Cleanup

free(a_h); cudaFree(a_d);

}

Can anyone steer me on where to go from here?

You should change the file extension .cpp into .cu to ensure using the nvcc compiler.

coldneut · January 25, 2010, 11:33pm

The file extension is .cu contrary to what the the comment statement says.

avidday · January 25, 2010, 11:49pm

You really do have to rename the input to have .cu extension. nvcc uses the file extension to determine how to process the input code. It will not correct parse and compile the device code in your input file unless it has the extension .cu.

coldneut · January 26, 2010, 12:17am

The input file name does have the extension .cu

I just didn’t change the extension in the comment which has no effect

AKazak · February 21, 2011, 11:27am

If try to compile this following code:

// cuda_example3.cu : Defines the entry point for the console application.

//

#include "stdafx.h"

#include <stdio.h>

#include <cuda.h>

// Kernel that executes on the CUDA device

__global__ void square_array( float *a, int N )

{

    int idx = blockIdx.x * blockDim.x + threadIdx.x;

    if ( idx < N )

        a[idx] = a[idx] * a[idx];

}

// main routine that executes on the host

int main( void )

{

    float *a_h, *a_d; // Pointer to host & device arrays

    const int N = 10; // Number of elements in arrays

    size_t size = N * sizeof( float );

    a_h = (float *)malloc( size );    // Allocate array on host

    cudaMalloc( (void **)&a_d, size ); // Allocate array on device

    // Initialize host array and copy it to CUDA device

    for ( int i = 0; i < N; i++ )

        a_h[i] = (float)i;

    cudaMemcpy( a_d, a_h, size, cudaMemcpyHostToDevice );

    // Do calculation on device:

    int block_size = 4;

    int n_blocks   = N / block_size + ( N % block_size == 0 ? 0 : 1 );

    square_array <<< n_blocks, block_size >>> ( a_d, N );

    // Retrieve result from device and store it in host array

    cudaMemcpy( a_h, a_d, sizeof( float ) * N, cudaMemcpyDeviceToHost );

    // Print results

    for ( int i = 0; i < N; i++ )

        printf( "%d %f\n", i, a_h[i] ); // Cleanup

    free( a_h );

    cudaFree( a_d );

}

I get the same error after got updated to CUDA Toolkit 3.2:

1>------ Build started: Project: CT, Configuration: Debug Win32 ------

1>Compiling with CUDA Build Rule…

1>“C:\Program Files (x86)\NVIDIA GPU Computing Toolkit\CUDA\v3.2\bin\nvcc.exe” -arch sm_10 -ccbin “C:\Program Files (x86)\Microsoft Visual Studio 9.0\VC\bin” -I"C:\Program Files (x86)\NVIDIA GPU Computing Toolkit\CUDA\v3.2\include" -maxrregcount=32 -m32 -cubin -o “D:\Ð”Ð¾ÐºÑƒÐ¼ÐµÐ½Ñ‚Ñ‹\Visual Studio 2008\Projects\CT\Debug/CUDA_Test_1.cubin” CUDA_Test_1.cu

1>CUDA_Test_1.cu

1>CUDA_Test_1.cu

1>tmpxft_00001908_00000000-3_CUDA_Test_1.cudafe1.gpu

1>tmpxft_00001908_00000000-10_CUDA_Test_1.cudafe2.gpu

1>Linking…

1>MSVCRTD.lib(crtexe.obj) : error LNK2019: unresolved external symbol _main referenced in function ___tmainCRTStartup

1>D:\Ð”Ð¾ÐºÑƒÐ¼ÐµÐ½Ñ‚Ñ‹\Visual Studio 2008\Projects\CT\Debug\CT.exe : fatal error LNK1120: 1 unresolved externals

1>Build log was saved at “file://d:\Ð”Ð¾ÐºÑƒÐ¼ÐµÐ½Ñ‚Ñ‹\Visual Studio 2008\Projects\CT\Debug\BuildLog.htm”

1>CT - 2 error(s), 0 warning(s)

========== Build: 0 succeeded, 1 failed, 0 up-to-date, 0 skipped ==========

What else can I try?

tera · February 22, 2011, 1:25pm

I have no experience on Windows, but leaving out the [font=“Courier New”]#include “stdafx.h”[/font] looks like a hot candidate.

AKazak · February 22, 2011, 7:37pm

It seems that some important changes have been made in toolkit version 3.2, isn’t it?
Is it mandatory that host code should be place in cpp and kernels should be saved in cu?

Topic		Replies	Views
Build Error MSB3721 When calling object method within kernel, using compiler directives CUDA Programming and Performance	9	5727	November 18, 2015
Compiling CUDA in matlab. CUDA Programming and Performance	6	9186	January 10, 2011
error LNK2019: unresolved external symbol CUDA Programming and Performance	15	140680	June 3, 2020
Embarrassing: Hello World no go CUDA Programming and Performance	5	2360	November 8, 2011
linking problem undefined reference to CUDA Programming and Performance	8	13132	February 15, 2010
Undefined reference to library CUDA Programming and Performance	4	4456	November 18, 2007
Simple CUDA Wizard for Visual Studio 2005 CUDA Programming and Performance	100	174509	April 8, 2012
Noob Q: How to extern c function? CUDA Programming and Performance	19	23626	June 30, 2010
Compiling / linking CUDA apps? CUDA Programming and Performance	8	4815	September 21, 2009
Qtcreator , windows and cuda CUDA Setup and Installation opencv	5	4204	March 6, 2017

unresolved external symbol _main referenced in function ___tmainCRTStartup

Related topics