Why are there three GEMMs in CUTLASS?


I understand parts 2 and 3; those are just k_iter 0 and the range [1, k_end). But what is part 1? Why do we loop over k_block?

Please don’t post pictures of code on these forums.