I need to use the wmma m8n32k16 API for some calculations, so I wonder how many registers the wmma operation uses per thread?
You can look up the number of registers here for your data type: PTX ISA 8.5
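Please be aware that PTX is an intermediate format, so the counts listed there describe the PTX-level fragments. The compiler probably uses MMA instructions internally, and the final per-thread register usage of the compiled kernel may differ.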
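If you want to check the register usage of the compiled kernel rather than the PTX-level fragment sizes, one option is to ask the runtime with `cudaFuncGetAttributes`. This is only a minimal sketch: the kernel name `wmma_m8n32k16`, the `half` inputs with a `float` accumulator, and the matrix layouts are my assumptions, not something from your code, and the number you get will depend on your GPU architecture, compiler version, and whatever else your kernel does.

```cuda
#include <cstdio>
#include <cuda_fp16.h>
#include <cuda_runtime.h>
#include <mma.h>

using namespace nvcuda;

// Hypothetical test kernel: one warp computes a single 8x32 tile D = A * B.
// Assumed types: half inputs, float accumulator.
__global__ void wmma_m8n32k16(const half *a, const half *b, float *d) {
    wmma::fragment<wmma::matrix_a, 8, 32, 16, half, wmma::row_major> a_frag;
    wmma::fragment<wmma::matrix_b, 8, 32, 16, half, wmma::col_major> b_frag;
    wmma::fragment<wmma::accumulator, 8, 32, 16, float> acc_frag;

    wmma::fill_fragment(acc_frag, 0.0f);
    wmma::load_matrix_sync(a_frag, a, 16);  // A is 8x16, row-major, ld = 16
    wmma::load_matrix_sync(b_frag, b, 16);  // B is 16x32, column-major, ld = 16
    wmma::mma_sync(acc_frag, a_frag, b_frag, acc_frag);
    wmma::store_matrix_sync(d, acc_frag, 32, wmma::mem_row_major);  // D is 8x32, ld = 32
}

int main() {
    // Query the compiled kernel's resource usage; the kernel is not launched.
    cudaFuncAttributes attr;
    cudaError_t err = cudaFuncGetAttributes(&attr, wmma_m8n32k16);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaFuncGetAttributes failed: %s\n", cudaGetErrorString(err));
        return 1;
    }
    printf("registers per thread: %d\n", attr.numRegs);
    return 0;
}
```

Build it for a Tensor Core capable target, for example `nvcc -arch=sm_70 file.cu`. Adding `-Xptxas -v` to the nvcc command line also prints the register count per kernel at compile time, which is the quickest way to see how the number changes with data type or architecture.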
Thanks!