How to find out how many ptx instructions are in the kernel ? Keeping in mind the 2 million ptx inst

Romant · September 17, 2009, 11:32pm

The question is in the subject …
My kernel is growing considerably, so it would be nice to know how far its size is from the limit.

Ailleur · September 18, 2009, 12:41am

nvcc -ptx <file>.cu
will generate the PTX file. I'm not sure whether the limit applies before or after optimization (which is not reflected in the PTX).
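
Once the PTX file exists, a rough instruction count can be taken by counting statement lines. The following is only a minimal sketch in Python. It assumes compiler-generated PTX with one statement per line, where directives start with "." and instruction statements end with ";". The function name and the sample text are made up for illustration, and the result says nothing about the size of the final machine code.

```python
import re

def count_ptx_instructions(ptx_text):
    """Roughly count instruction statements in compiler-generated PTX text."""
    # Strip /* ... */ and // ... comments first.
    text = re.sub(r"/\*.*?\*/", "", ptx_text, flags=re.S)
    text = re.sub(r"//[^\n]*", "", text)
    count = 0
    for line in text.splitlines():
        # Drop any leading labels such as "$Lt_0_1:".
        stmt = re.sub(r"^(?:[\w$]+:\s*)+", "", line.strip())
        # Statements end with ';'; those starting with '.' are directives.
        if stmt.endswith(";") and not stmt.startswith("."):
            count += 1
    return count

# Hypothetical sample, written by hand for this sketch.
sample = """
    .version 1.4
    .target sm_10
    .entry _Z3fooPi (
        .param .u32 __cudaparm_a)
    {
    .reg .u32 %r<4>;
    .loc 14 3 0
$LBB1:
    ld.param.u32 %r1, [__cudaparm_a];   // load argument
    mul.lo.s32 %r2, %r1, %r1;
$Lt_0_1: add.u32 %r3, %r2, 1;
    @%p1 bra $Lt_0_1;
    exit;
    }
"""

print(count_ptx_instructions(sample))
```

To try it on a real file, read the file's contents into a string and pass that to count_ptx_instructions. Hand-written PTX with several statements on one line would be undercounted by this approach.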

Romant · September 18, 2009, 1:40am

How do I interpret it?

Is each line (like “mul.lo.s32 %r1578, %r19, %r1577;”) a single instruction?

Also, PTX is not optimized, so the final result can differ significantly …

It is possible to generate a .cubin and check out the bincode { … } section, which contains a list of 32-bit integers. Are these integers actual instructions? And if so, how many bits (32 or 64) does each instruction contain?

Lots of questions, heh …

tmurray · September 18, 2009, 2:24am

I don’t think the 2 million instruction limit is a PTX instruction limit…

Romant · September 18, 2009, 2:36am

Well, how to estimate how much is too much ? :-)

My cubin file (bincode section) contains 792 lines of four words each, like this: 0x307ccbfd 0x6c20c7c8 0x30000003 0x00000280

Is each line an instruction? Or is each 32-bit hex value an instruction?
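
However the words group into instructions, the code size in bytes can be read off the text directly. Here is a rough Python sketch that counts the 32-bit hex words inside each bincode { … } section. It assumes the text cubin layout described in this thread, and the function name and sample are hypothetical.

```python
import re

def bincode_sizes(cubin_text):
    """Return the size in bytes of each bincode { ... } section found."""
    sizes = []
    for body in re.findall(r"bincode\s*\{([^}]*)\}", cubin_text):
        # Each token such as 0x307ccbfd is one 32-bit (4-byte) word.
        words = re.findall(r"0x[0-9a-fA-F]{1,8}\b", body)
        sizes.append(4 * len(words))
    return sizes

# Hypothetical two-line section reusing the words quoted above.
sample = """
bincode {
    0x307ccbfd 0x6c20c7c8 0x30000003 0x00000280
    0x307ccbfd 0x6c20c7c8 0x30000003 0x00000280
}
"""

print(bincode_sizes(sample))
```

The function returns one size per bincode section, so a file with several kernels yields several numbers.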

Tobi_W · September 18, 2009, 6:26am

The visual profiler counts the instructions executed by a kernel. Maybe you could use this as a hint…

_Big_Mac · September 18, 2009, 9:03am

Nope, a

for (i = 0; i < 1000000; ++i)
    a++;

will be counted as a million instructions in the profiler (actually, closer to 4 million probably) yet it’s about four or five PTX instructions. The limit is for code length, not # of executed instructions.

Romant · September 18, 2009, 12:05pm

Yeah, executed instructions are not what I’m trying to find out …

So the only way is to examine the .cubin?

Sylvain_Collange · September 18, 2009, 2:23pm

Most instructions are 64 bits wide, i.e. two of those 32-bit words each. So your 792 lines of four words contain at least 1584 instructions.

I suspect the 2M-instruction limit is actually a 16MB-cubin limit.
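
The arithmetic behind both numbers, as a small Python sketch. It takes the figures quoted in this thread, treats the 64-bit instruction width as an assumption, and reads "16MB" as 16 × 1024 × 1024 bytes. The limit itself is only a suspicion, not a documented value.

```python
WORD_BYTES = 4        # each hex value in the bincode section is 32 bits
WORDS_PER_LINE = 4    # the cubin lists four words per line
INSTR_BYTES = 8       # assumption: an instruction is at most 64 bits wide

lines = 792
code_bytes = lines * WORDS_PER_LINE * WORD_BYTES
# Shorter (32-bit) instructions would only raise the count, hence "at least".
min_instructions = code_bytes // INSTR_BYTES

limit_bytes = 16 * 1024 * 1024          # suspected cubin size limit
limit_instructions = limit_bytes // INSTR_BYTES

print(code_bytes, min_instructions, limit_instructions)
print(f"{100 * code_bytes / limit_bytes:.3f}% of the suspected limit")
```

With 64-bit instructions, 16 MB works out to roughly 2 million instructions, which would explain the figure in the thread title.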

Anyway, the compiler will probably die well before reaching the million-instruction range… Kernels with ~100,000 instructions already take hours to compile.

Romant · September 18, 2009, 3:21pm

The distance between 1,500 instructions and 100,000 instructions is not that big … should I be prepared for exponential growth in kernel compilation time?

_Big_Mac · September 18, 2009, 6:37pm

Geez, what are you coding anyway? :)

Romant · September 18, 2009, 6:55pm

In short: I'm designing a problem solver based on genetic programming, and one particular problem may require a significant amount of code.

I don’t think I’ll get to 100,000 instructions; however, 1,500 is definitely not the limit, so I would like to know more about how big kernels behave.
