I have the same problem with Almalinux 8.5 (RHEL based). I have two A6000 GPUs. I have installed different nvidia-drivers (460+) and always I have duplicate processes on both GPUs.
I don’t know if this is expected behaviour, but I suspect that I have access only to the half GPU memory (ie. only in one card). Always both GPUs have the same memory allocated.
In the python code only the second GPU is being utilized, but as you can see memory is being allocated on the first one also. As a consequence, if I want to run two processes, each one on different GPU, the memory is consumed fast because the duplicate process allocates memory on the other GPU.