Hi,
When compiling the following code on Thor:
__global__ void tcgen05_dummy(const int num_iters) {
__shared__ uint32_t smem[1024];
asm volatile(
"tcgen05.alloc.cta_group::2.sync.aligned.shared::cta.b32 [%0], %1;"
:
: "r"(smem[0]), "n"(TMEM_COLUMNS));
}
NVCC reports the following errors:
error : Instruction 'tcgen05.alloc' not supported on .target 'sm_110'
error : Feature '.cta_group::2' not supported on .target 'sm_110'
CUDA Version:
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2025 NVIDIA Corporation
Built on Tue_Dec_16_07:27:17_PM_PST_2025
Cuda compilation tools, release 13.1, V13.1.115
Build cuda_13.1.r13.1/compiler.37061995_0
Compilation command:
nvcc -arch=sm_110a -O2 tcgen05_dummy.cu
The tcgen05 instructions are supported on Thor, right?
-
Thanks.