Register use by wmma

I need to use wmma m8n32k16 api for some calculation, so I just wander how many register will be used by wmma operation per thread?

You can look up the number of registers here for your data type: PTX ISA 8.5

Please be aware that PTX is an intermediate format; probably MMA instructions are used internally.

Thanks!

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.