The simplest way to compile a .cu file (looking for "hello world" example for compilation

arabarra · February 21, 2011, 10:35am

Hi,

I’ve recently started to work with CUDA, and I surprised about the difficulty on finding some kind of “getting started” information about the compilation process.

Apparently, from what I take from the online material and contributions to different forums, the most general trend seems to be to adapt some makefile from the SDK.

I’m not certain about why building a makefile should be necessary. In other words: if I have a small .cu code, what is wrong about compiling it with a simple call to

nvcc -o EXECUTABLE_NAME “…/SOURCE_NAME.cu”-I/usr/local/cuda/include -lcudart -L/usr/local/cuda/lib

I guess there must be some kind of problem with this way, as my programs do work, but seem to be rather underperforming… I guess there is a minimum set of flags that I need in order to ensure adaption to the local architecture.

Any suggestion is very welcome…

avidday · February 21, 2011, 11:07am

It isn’t - you don’t need a makefile for trivial compilation.

Nothing. it can be simpler than that:

nvcc -o executable source.cu

will build an executable that will run on any CUDA compatible card, as long as source.cu doesn’t have any external dependencies outside of CUDA or the standard C/C++ library.

I doubt that has anything to do with compilation.

There isn’t. Almost without exception, nvcc compilation flags only turn on specific architectural features at the PTX generation stage (things like double precision, atomic memory operations, C++ runtime support, in kernel printf support, etc). If you don’t use those features, the PTX code produced will be almost identical. If you try and use those features without the correct flags, the compiler will generate warnings or errors. nvcc uses very aggressive optimization settings during C compilation, and the PTX assembler and driver have a lot of internal architecture specific optimisations over which there is basically no programmer control. About the only real compilation options that can effect performance are floating point compliance settings (ie. whether to use exact or fast versions of some math library functions and operands), and register usage limits. Both are discussed in some detail in the programming guide.

arabarra · February 21, 2011, 12:01pm

Hi avidday,

thanks! That really helped… now I guess I can come back to the programming guide with a clearer idea of what to look for.

Actually my problem was that interruption of coda execution at runtime seems to lead CUDA to fail to release memory at the GPU. As I had read that this feature should be solved for newest versions of CUDA (and I work with 3.2), I thought this behavior was to be corrected using some special compilation flag.

Topic		Replies	Views
Compiling .cu and .c files with multiple targets How can I make a Makefile to achieve that CUDA Programming and Performance	9	6111	June 20, 2010
Help making a simple Makefile CUDA Programming and Performance	1	10747	October 9, 2009
Need help with adding CUDA to MakeFile CUDA Programming and Performance	8	8963	January 13, 2012
NVCC at Runtime - End User Friendly Configuration Compiling GPU code without requiring Visual Studio CUDA Programming and Performance	16	10440	June 19, 2009
Big newbie having questions about GPU computing CUDA Programming and Performance	5	15761	May 20, 2007
Slow compile and cudaMalloc CUDA Programming and Performance	8	3700	February 2, 2011
Compiling C and CUDA code Problems linking CUDA code and C code CUDA Programming and Performance	7	19119	November 4, 2011
Integrating a kernel into a pre-existing makefile CUDA Programming and Performance	7	10925	March 13, 2009
Understanding code optimization resulting from the --gpu-architecture, --gpu-code and --generate-code flags CUDA NVCC Compiler	1	911	May 31, 2024
Compile .cu like .cpp CUDA Programming and Performance	7	3111	October 14, 2016

The simplest way to compile a .cu file (looking for "hello world" example for compilation

Related topics