Does using cp.async.bulk in TMA operations always require cp.async.bulk.commit_group? Or is commit_group not needed if there is only a single bulk operation without multi-stage transfers?