GPU assembly

turboscrew · July 10, 2013, 2:21pm

Are there any tutorials, manuals or other material about GPU programming in assembly?
I’d be interested to get the idea how it works.
How does the communication between the GPU and host system work?

I understand that the assembly and architecture of different GPUs are quite different, but
I’d still like to see how it goes from the bare iron programmer’s P.O.V.
I guess any common GPU would do.

The only GPGPU-capable (CUDA 1.1) NVIDIA thing I own at the moment is old
NVIDIA GeForce 9300 GE (Lumenex).
It would be nice if there is something about GPU at least somewhat close to that so
I could maybe clown around with it a little, but other GPUs are fine too.

Cuda, OpenCL etc material is easy to find, but I haven’t found anything about the assembly.

[edit]

Funny that there seems to be prettu much nothing about GeForce 9300 GE in english, but in some other languages there is (too bad I only read finnish, english and sweedish well enough).
Like Laptops y Tarjetas Gráficas GeForce RTX Serie 20

seibert · July 12, 2013, 10:49pm

The closest that you can easily get to assembly on NVIDIA GPUs is PTX, which is a virtual assembly language that is compiled by the CUDA driver to the machine code of your GPU before execution. There is a manual in the CUDA toolkit about PTX.

turboscrew · July 14, 2013, 12:37pm

OK. Thanks.

Greg · September 11, 2013, 3:24am

The CUDA Binary Utilities document has a list of the assembly instructions for Compute Capability 1.2 and above.
[url]CUDA Binary Utilities :: CUDA Toolkit Documentation

The Parallel Thread Execution ISA Version 3.2 (PTX) has information on the PTX intermediate language which has a very close mapping to the final assembly instructions.
[url]PTX ISA :: CUDA Toolkit Documentation

The best approach for learning how the GPU works is to use the Nsight VSE CUDA debugger and cuda-gdb and single step the assembly for different programs. If you are not set up to debug then simply writing small sample programs and using cuobjdump or nvdisasm to list the PTX and SASS (assembly) is fairly easy way to learn.

Topic		Replies	Views
Lower Level CUDA NVasc CUDA Programming and Performance	20	17998	July 10, 2007
Programming CUDA at 'assembler' level? CUDA Programming and Performance	9	13493	November 7, 2010
Getting Started CUDA Programming and Performance	1	5002	January 10, 2008
Available PTX assembly instructions CUDA Programming and Performance kernel	3	486	October 12, 2021
Big newbie having questions about GPU computing CUDA Programming and Performance	5	15761	May 20, 2007
Document about NVIDIA GPU native instruction set? CUDA Programming and Performance	4	14814	December 6, 2018
Nvidia Kepler assembly language CUDA Programming and Performance	4	1141	June 25, 2019
assembler CUDA ? CUDA Programming and Performance	4	1062	June 23, 2012
GPU Info/Tutorial and OpenGL Process CUDA Programming and Performance	3	3738	January 16, 2008
Problem Structure for GPU Programming CUDA Programming and Performance	1	2647	July 13, 2009

GPU assembly

Related topics