Emulate another Compute Capability for debugging

Hi,

I’m building a GPU application using custom CUDA kernels and cuBLAS functions. A user reported an issue on an Ampere GPU, but I do not currently have access to a GPU of that architecture. Is there any tool that can emulate a higher Compute Capability or a newer architecture?

Thanks,

Guillaume