CUDA 12/13 `-arch` flag no longer produces "universal" binaries