Interesting, i do have a good amount of sins and cosines in one function, that could be a culprit. Haven’t gotten a chance to look at the PTX yet as i’m about to run out the door, but I did look at -v -mem:
[codebox]ptxas info : Compiling entry function ‘_Z8MCkerneltPK6xsinfoiP7neutron’
ptxas info : Used 31 registers, 168+0 bytes lmem, 2080+16 bytes smem, 48632 bytes cmem[0], 256 bytes cmem[1]
Memory space statistics for ‘OCG memory pool for function _Z8MCkerneltPK6xsinfoiP7neutron’
============================================================
==============================
Page size : 0x1000 bytes
Total allocated : 0x22e4f00 bytes
Total available : 0x1bb240 bytes
Nrof small block pages : 3648
Nrof large block pages : 3182
Longest free list size : 1
Average free list size : 0
Memory space statistics for ‘Top level ptxas memory pool’
=========================================================
Page size : 0x1000 bytes
Total allocated : 0x58f88 bytes
Total available : 0x26318 bytes
Nrof small block pages : 78
Nrof large block pages : 3
Longest free list size : 1
Average free list size : 0
Memory space statistics for ‘Permanent OCG memory pool’
=======================================================
Page size : 0x1000 bytes
Total allocated : 0xc0160 bytes
Total available : 0x5d90 bytes
Nrof small block pages : 5
Nrof large block pages : 35
Longest free list size : 1
Average free list size : 0
Memory space statistics for ‘PTX parsing state’
===============================================
Page size : 0x1000 bytes
Total allocated : 0x23cef8 bytes
Total available : 0x1da88 bytes
Nrof small block pages : 535
Nrof large block pages : 8
Longest free list size : 1
Average free list size : 0
Memory space statistics for ‘Command option parser’
===================================================
Page size : 0x1000 bytes
Total allocated : 0x9108 bytes
Total available : 0x7038 bytes
Nrof small block pages : 9
Nrof large block pages : 0
[/codebox]
note: this was edited because I realized i didn’t include -arch sm_13 on the command line. Its been updated.