threadIdx undeclared - Compile Problem NVCC NVCC is not reconizing the builtin blockIdx, blockDim, n

martin1yness · September 24, 2009, 2:45pm

I’m trying to write my very first CUDA application in C, i’ve been building parts slowly to avoid multitudes of mistakes and compiling after each change to ensure there is working code.

The issue i’m having is that once i try to invoke a Kernel or even just define a kernel with a threadIdx.x call, the nvcc compiler complains that threadIdx is undeclared (first use in this function).

I’m importing <cuda_runtime_api.h> and i’ve tried importing <cuda.h>… here is my complete source code:

[codebox]

/**

I am a simple program designed to run
on the GPU
@author Martin Dale Lyness martin.lyness@gmail.com

*/

#include<cuda.h>

#include<cuda_runtime_api.h>

#include<stdlib.h>

#include<stdio.h>

#include<time.h>

#include “fp_shared.h”

int main(int argc, void ** argv);

/**

A GPU Kernel that computes the average of a set of
exactly 9 points effectively downsampling a data set.

*/

global void ComputeAverage(int resolution, float * points, float * pointsNew) {

    int idx = blockIdx.x * blockDim.x + threadIdx.x;

}

int main(int argc, void ** argv) {

    int i;

    FILE *inputFp, *outputFp;

    float *points, *pointsNew;

    cudaError_t memCopyToError, memCopyFromError;

printf(“Hello, I am going to downsample data file ‘%s’\n”, INPUT_FILE_NAME);

time_t secondsStart, secondsRead, secondsInit, secondsGPU, secondsWrite;

    time(&secondsStart);

points = (float*) malloc(sizeof(float) * DIMENSIONS);

    pointsNew = (float*) malloc(sizeof(float) * (DIMENSIONS/9));

    inputFp = fopen(INPUT_FILE_NAME, "r+");

    if(inputFp==NULL) perror("Input file doesn't exist yet, run input generator first!");

    else {

            i = 0;

            while(feof(inputFp)==0) {

                    fscanf(inputFp, "%f", points + i++);

            }

            fclose(inputFp);

    }

    time(&secondsRead);

float *d_points, *d_pointsNew;

    memCopyToError = cudaMalloc((void**) &d_points, sizeof(float) * DIMENSIONS);

    if(memCopyToError != cudaSuccess) printf("CUDA Error: %s\n", cudaGetErrorString(memCopyToError));

    memCopyToError = cudaMalloc((void**) &d_pointsNew, sizeof(float) * (DIMENSIONS/9));

    if(memCopyToError != cudaSuccess) printf("CUDA Error: %s\n", cudaGetErrorString(memCopyToError));

    memCopyToError = cudaMemcpy(d_points, points, sizeof(float) * DIMENSIONS, cudaMemcpyHostToDevice);

    if(memCopyToError != cudaSuccess) printf("CUDA Error: %s\n", cudaGetErrorString(memCopyToError));

time(&secondsInit);

int block_size = DIMENSIONS / 2;

    int n_blocks = DIMENSIONS / 2;

// ComputeAverage<<< block_size, n_blocks >>> (DIMENSIONS, d_points, d_pointsNew);

time(&secondsGPU);

printf(“All done, check out file ‘%s’\n”, OUTPUT_FILE_NAME);

    printf("\tStats\n");

    printf("Took %d seconds to read file to memory...\n", secondsRead - secondsStart);

    printf("Took %d seconds to initialize GPU memory...\n", secondsInit - secondsStart);

    printf("Took %d seconds to execute GPU calculations...\n", secondsGPU - secondsStart);

    free(points);

    free(pointsNew);

    cudaFree(d_points);

    cudaFree(d_pointsNew);

    return 0;

}

[/codebox]

And on compile this outputs: nvcc main.c

main.c: In function Ã¢ComputeAverageÃ¢:

main.c:19: error: Ã¢blockIdxÃ¢ undeclared (first use in this function)

main.c:19: error: (Each undeclared identifier is reported only once

main.c:19: error: for each function it appears in.)

main.c:19: error: Ã¢blockDimÃ¢ undeclared (first use in this function)

main.c:19: error: Ã¢threadIdxÃ¢ undeclared (first use in this function)

I am most certainly missing something simple… I found the threadIdx to be defined in one of the header files includeind in the cuda distribution, i think it was device_types.h or something but including that didn’t seem to fix the issue either, and nowhere have i ever seen any other headers included in a source file for a cuda application.

LSChien · September 24, 2009, 3:04pm

I copy your code to test.cu and use nvcc to compile it. It is O.K. without no error.

I think that you may save the code as .cpp file and use C++ compiler to compile it.

This is invalid since you define kernel function, you must use nvcc to compile it

mfatica · September 24, 2009, 5:19pm

The file with CUDA kernels needs to have a suffix .cu

Gaurav_Garg · September 24, 2009, 5:32pm

I am also getting the same error. It was working fine on my OpenSuse 10.3 machine, but compiling the same program on Fedora 10 gives this error.
Also, it works perfectly fine in release mode, the problem comes only if I try to build it in debug mode.

Topic		Replies	Views
undeclared identifier using CUDA CUDA Programming and Performance	9	7820	March 20, 2009
a very simple problem CUDA Programming and Performance	1	4134	June 4, 2008
a very simple problem CUDA Programming and Performance	1	2397	June 4, 2008
threadIdx undeclared identifier CUDA Programming and Performance	7	20214	November 9, 2009
visual studio gives undeclared indentifier for threadIdx, blockIdx, blockDim et al CUDA Programming and Performance	1	896	November 16, 2010
error C2065: 'blockIdx' : undeclared identifier CUDA Programming and Performance	7	8173	June 4, 2010
a very simple problem CUDA Programming and Performance	2	1463	June 4, 2008
[SOLVED] Code not compiling for mysterious reason CUDA Programming and Performance	3	5667	December 5, 2017
Cuda: error C2065:"threadIdx':undeclared identifier CUDA Programming and Performance	2	2759	June 6, 2012
Can someone help me? CUDA Programming and Performance	4	2400	October 31, 2008

threadIdx undeclared - Compile Problem NVCC NVCC is not reconizing the builtin blockIdx, blockDim, n

Related topics