with a rtx 5070 Ti and rtx 3050 for graphics I have the following errors…
Software I use, such as Gromacs, NAMD, GAMESS…etc wil compile fine with 575 or 570 drivers, and cuda-12.9. However, if I do not use the nvidia-57x-open, the software will not run, always GPU detection erors. When I use these drivers, graphics do not work through the 3050. I am then reduced to a command line terminal, the errors are all through xorg windows problems. If I revert back to nvidia-57x, the graphics work fine, but execution of any software outside of graphics gives device not found errors. Nvidia (sorry I miss posted on the 575 discussion feed) detects the rtx 3050 on either drive set, but only detects the rtx 5070 Ti on the open set. the normal lspci on the open detects both, but again only says nvidia device “0000:01:00.0 VGA compatible controller: NVIDIA Corporation Device 2c05 (rev a1) (prog-if 00 [VGA controller])” With the open, the xorg error is "
[ 578.225] (II) NVIDIA(G0): [DRI2] VDPAU driver: nvidia
[ 578.225] (EE)
[ 578.225] (EE) Backtrace:
[ 578.225] (EE) 0: /usr/lib/xorg/Xorg (OsLookupColor+0x14c) [0x64c2bc60803c]
[ 578.225] (EE) 1: /lib/x86_64-linux-gnu/libc.so.6 (__sigaction+0x50) [0x7cda06045330]
[ 578.225] (EE) 2: /usr/lib/xorg/Xorg (InitOutput+0x4aa) [0x64c2bc4ce07a]
[ 578.225] (EE) 3: /usr/lib/xorg/Xorg (InitFonts+0x1d0) [0x64c2bc48ccf0]
[ 578.226] (EE) 4: /lib/x86_64-linux-gnu/libc.so.6 (__libc_init_first+0x8a) [0x7cda0602a1ca]
[ 578.226] (EE) 5: /lib/x86_64-linux-gnu/libc.so.6 (__libc_start_main+0x8b) [0x7cda0602a28b]
[ 578.226] (EE) 6: /usr/lib/xorg/Xorg (_start+0x25) [0x64c2bc475395]
[ 578.226] (EE)
[ 578.226] (EE) Segmentation fault at address 0x10
[ 578.226] (EE)
Fatal server error:
[ 578.226] (EE) Caught signal 11 (Segmentation fault). Server aborting
[ 578.226] (EE)
"
with the non open the errors from software are all the same an example,
"
CUDA compiler: /usr/local/cuda-12.9/bin/nvcc nvcc: NVIDIA (R) Cuda compiler driver;Copyright (c) 2005-2025 NVIDIA Corporation;Built on Wed_Apr__9_19:24:57_PDT_2025;Cuda compilation tools, release 12.9, V12.9.41;Build cuda_12.9.r12.9/compiler.35813241_0
CUDA compiler flags: -O3 -DNDEBUG
CUDA driver: 12.90
CUDA runtime: 12.90
Running on 1 node with total 24 cores, 32 processing units (GPU detection failed: invalid device ordinal)
"
Interestingly it should have detected at least the 3050, but does not.
I wounder if anyone has a workaround, ie how to get the xorg working with the open driver set, or most likely not, get software runing under the nvidia-575 or 570. I have also tried the cuda modules from nvidia, however they do not support blackwell, and they refer you to the nvidia-570 or 575 open. It is also more problematic, as the new mother boards from almost all vendors do not allow choosing the pci slot anymore, and default to the first slot, as I guess it was necessary to remove 10 lines of assembly code in everyone’s bios. And, graphics wise, new blackwell and nvidia drivers will not go back beyond 2050 cards, making it necessary to have at least a 2050 with any blackwell. Combined, this makes it almost impossible to use commercial gaming GPUs in laboratory settings outside of lone serve set ups, also making it necessary to buy 2x the PCs so people can use graphics as well. It seems weird to me. Literally, really simple fixes are all gone. I can also not find anything on the xorg error, other than the C and cuda compiler are most likely competing for the same memory, with no solutions.