Hi everyone,
I’m exploring an experimental approach to formally verifying CUDA kernels that rely on cuda::barrier/mbarrier and async copy (agentically). Is anyone interested on working on this with me? Would people find it useful if it worked?
Hi everyone,
I’m exploring an experimental approach to formally verifying CUDA kernels that rely on cuda::barrier/mbarrier and async copy (agentically). Is anyone interested on working on this with me? Would people find it useful if it worked?