I have two RTX A6000 (Ampere) GPUs that we previously ran in headless vGPU mode. That worked, but we let the project they were used for die out. A couple of years later we want to use them for a local LLM, without virtualization.

The cards are still in headless mode. When I install one in the Ubuntu machine, it shows up as
03:00.0 3D controller: NVIDIA Corporation GA102GL [RTX A6000] (rev a1)
and is not recognized as a usable GPU. After disabling nouveau and installing the 580 driver, nvidia-smi reports that no compatible device was found. I’m presuming this is due to its previous use in headless mode. When I run the mode selector tool to switch the card to the display-enabled mode with 256MB BAR1, it fails with this message:
terminate called after throwing an instance of 'std::runtime_error'
what(): The PCI BAR assignment for the processed device is invalid.
Please check with NVIDIA web site for possible SBIOS Setup setting
to fit with the processed device.
I have booted with the kernel parameter pci=realloc and then with pci=realloc=off, and neither allows me to enable the GPU.
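In case it helps with diagnosis, here is a minimal Python sketch I can run to dump the regions the kernel assigned to the card. It reads the sysfs resource file, where each line holds a start address, an end address, and flags in hex. The device path assumes PCI domain 0000 for the 03:00.0 address above, and the helper names parse_pci_resources and describe are my own, not part of any NVIDIA tool.

```python
from pathlib import Path

# Flag bits as defined in the Linux kernel's include/linux/ioport.h.
IORESOURCE_IO = 0x00000100
IORESOURCE_MEM = 0x00000200
IORESOURCE_PREFETCH = 0x00002000
IORESOURCE_MEM_64 = 0x00100000

# Assumption: the card sits in PCI domain 0000 at 03:00.0.
DEVICE_RESOURCE = Path("/sys/bus/pci/devices/0000:03:00.0/resource")


def parse_pci_resources(text):
    """Parse a sysfs PCI 'resource' file: one 'start end flags' line per region."""
    regions = []
    for index, line in enumerate(text.splitlines()):
        fields = line.split()
        if len(fields) != 3:
            continue  # skip blank or malformed lines
        start, end, flags = (int(field, 16) for field in fields)
        # An all-zero line means the slot is unused or has no address assigned.
        if start == 0 and end == 0 and flags == 0:
            size = 0
        else:
            size = end - start + 1
        regions.append({"index": index, "start": start, "end": end,
                        "size": size, "flags": flags})
    return regions


def describe(region):
    """Return a one-line human-readable summary of a parsed region."""
    if region["size"] == 0:
        return f"region {region['index']}: unused or unassigned"
    flags = region["flags"]
    if flags & IORESOURCE_IO:
        kind = "I/O"
    elif flags & IORESOURCE_MEM:
        kind = "memory"
    else:
        kind = "other"
    attrs = []
    if flags & IORESOURCE_MEM_64:
        attrs.append("64-bit")
    if flags & IORESOURCE_PREFETCH:
        attrs.append("prefetchable")
    suffix = f" ({', '.join(attrs)})" if attrs else ""
    size_mib = region["size"] / (1024 * 1024)
    return (f"region {region['index']}: {kind}{suffix}, "
            f"{size_mib:.2f} MiB at 0x{region['start']:x}")


if __name__ == "__main__":
    if DEVICE_RESOURCE.exists():
        for region in parse_pci_resources(DEVICE_RESOURCE.read_text()):
            print(describe(region))
    else:
        print(f"{DEVICE_RESOURCE} not found; adjust the PCI address")
```

Lines 0 to 5 of that file correspond to the six BAR slots. A slot that prints as unused or unassigned could simply be unused, so I would compare the output against what the card is expected to expose before drawing conclusions.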
I’m running this on a Dell Precision Tower 7910 with a single Xeon E5-2643 v4, 64GB of DDR4 RAM, and a 1200 W PSU.
What do I need to do to get this GPU operational? I don’t care whether it’s headless or has its display ports enabled; the machine it’s in won’t be using a display.