An sprintf() which works in your kernel? It's almost here, help beta-test it

epk · April 24, 2022, 9:17pm

So, anybody who’s ever used printf() to debug GPU kernels must know these frustration:

If you print something, then print again, the lines won’t appear together since other threads’ printf()'s will likely come in-between;
Which means that you must combine all of your printing into a single instruction;
But you can’t do that for a variable-size structure;
… and you pine for having a sprintf() (or a C+±style stringstream).

And that’s not all: What if you want to write a printf wrapper, which, say, identifiers the current thread? You can write a varargs function in CUDA… but unfortunately, there is no vprintf which you can call inside your wrapper. So, you’re stuck with writing a macro. Blech :-(

Finally, maybe you want to flex your printf muscles: printf("%.*s\n", my_string) for example. or printf("%z\n", my_size); . Tough cookies, that’s not supported. Not to mention extra features outside of ISO C, like the super-useful support printing in binary.

It’s weird that CUDA has been around for, what, 13 years now, and nobody’s offered this (AFAICT). So, that period is now - almost - over. I’ve recently pushed an implementation of most of the printf() family of functions to the development branch of my cuda-kat library.

In a way, this is pretty mature code: It’s a porting of this stand-alone printf library for embedded systems, so it’s inherited a rather extensive set of unit tests. But even though these now pass when running in GPU kernels - that doesn’t test the behavior in a massively parallel environment.

So: I need some beta testers to try this out. So if you’re doing some kernel development work, and occasionally debug-print stuff… please consider giving it a spin.

Bugs/suggestions can obviously be filed either on the cuda-kat issue page or here.

elin.lundin92 · October 3, 2023, 9:46am

Hi. This would be very useful :) Did it get released in the Cuda Toolkit already?

Robert_Crovella · October 3, 2023, 1:36pm

It’s not part of the CUDA toolkit delivered by NVIDIA. It is part of this (<- click link).

epk · October 3, 2023, 5:50pm

It’s an independently-developed library. I really think something like this should have been part of CUDA itself, and I would definitely be open to collaborating with NVIDIA on beefing up the internal printf with the features of this library. Unfortunately, as I have experienced with my CUDA C++ API wrappers - NVIDIA is not too keen on such collaborations.

But - who knows? Maybe they might change their mind somehow. Hope springs eternal etc.

For now, I depend on satisfied users spreading the word about these libraries.

Robert_Crovella · October 3, 2023, 6:00pm

I tried clicking on the printf.cuh link on this page (i.e. this link) and got a 404 error.

epk · October 3, 2023, 6:31pm

Yeah, that’s an artifact of printf.cuh only existing on the development branch, and links in README.md not being branch-relative…

this is the direct link:

github.com

eyalroz/cuda-kat/blob/development/src/kat/on_device/c_standard_library/printf.cuh

/**
 * @author (c) Eyal Rozenberg <eyalroz1@gmx.com>
 *             2021-2022, Haifa, Palestine/Israel
 * @author (c) Marco Paland (info@paland.com)
 *             2014-2019, PALANDesign Hannover, Germany
 *
 * @note Others have made smaller contributions to this file: see the
 * contributors page at https://github.com/eyalroz/printf/graphs/contributors
 * or ask one of the authors.
 *
 * @brief An implementation of the printf family of functions (including 
 * (`(v)printf`, `(v)s(n)printf` etc.) for use with CUDA.
 *
 * @note These functions are not to be used for high-performance work. 
 * For high-performance work on strings in kernels, determine exactly what
 * formatting is necessary and write code to perform _only_ that - do not
 * use a kitchen-sink `printf()` like function with runtime parsing of a
 * format string.
 *
 * @note Unlike the original `printf` library (@see

This file has been truncated. show original

Topic		Replies	Views
Make a wish: What would you like in a CUDA on-device formatted printing library? CUDA Programming and Performance	0	449	October 21, 2021
printf in device kernels and <stdio.h> CUDA Programming and Performance	3	10459	September 24, 2011
printf in OpenCL? CUDA Programming and Performance	3	10733	February 5, 2011
printf inside a kernel is not working nVIDIA Quadro 4000 CUDA Programming and Performance	2	3707	November 7, 2011
printf vs cuPrintf in kernels CUDA Programming and Performance	2	5838	February 5, 2013
Templated kernels and printf CUDA Programming and Performance	5	9566	December 20, 2008
Debugging and printf in kernel CUDA Programming and Performance	6	2414	April 19, 2023
Printf does not work in emulation mode /tmp/xxxxxxxx_stub.c: no such file or directory CUDA Programming and Performance	7	2404	February 1, 2010
A simple question about printf() inside a kernel with no convincing answer on google or nvidia docs CUDA Programming and Performance	8	5505	August 4, 2019
just for fun! my own implementation of 'cuPrintf()' enabling output debug message from k CUDA Programming and Performance	3	2542	March 31, 2010

An sprintf() which works in your kernel? It's almost here, help beta-test it

Related topics