| Topic | Replies | Views | Activity |
|---|---|---|---|
| Why compiler don't use registers to store my data? | 43 | 483 | December 7, 2024 |
| Error or incomprehension, MMa ptx mixed precision Bfloat16 rtx3080 | 20 | 2693 | October 12, 2021 |
| Complete minimal ptx example for: mma.sync.aligned.m16n8k16.row.col.f32.f16.f16.f32 | 3 | 1007 | October 25, 2024 |
| Register in kernel | 3 | 82 | November 17, 2024 |
| About compute accuracy | 22 | 248 | February 10, 2025 |
| .reg f16x2 %Rb<1> in ISA example: mma.sync.aligned.m16n8k16.row.col.f32.f16.f16.f32 | 1 | 77 | October 25, 2024 |
| Padding of mma operation | 20 | 380 | December 19, 2024 |
| Wrong answer with mma.sync.aligned.m8n8k4 | 8 | 1542 | April 17, 2023 |
| # of registers in different for different datatypes | 3 | 623 | January 21, 2020 |
| Problem with the instruction "mma.sync.aligned.m16n8k16.row.col.f32.f16.f16.f32" | 3 | 2001 | October 12, 2021 |