Dear all,
I write here after reading and trying all the related threads I could find about this.
I have an HPE ProLiant DL380 Gen10 server with an integrated graphics card (Matrox MGA G200eH3), running Ubuntu 20.04, and an NVIDIA Quadro RTX 4000 (we’ve tested an RTX 6000 as well).
We installed the NVIDIA 470 drivers, and no issues were detected during installation.
Problems encountered so far:
- The xrandr command never shows the DisplayPort outputs of the NVIDIA card; we can only see the integrated VGA output.
- When using nvidia-smi, the GPU load is always at 0%. We tried software that we know uses the GPU successfully on other servers, but on this server everything runs on the CPU.
- We had several black screen problems, but we were able to sort them out by deleting the file /etc/X11/xorg.conf (a fix that we saw in another thread). Before this, we had to purge the drivers for the server to restart.
- All nomodeset parameters found in different files were deleted.
- Server is NOT in secure boot mode.
- No stray blacklist files were found (/lib/modprobe.d/blacklist-nvidia.conf, /etc/modprobe.d/blacklist-nvidia.conf).
- The rest of the commands seem to show no problem: ‘lshw -c video’ shows both cards, nvidia-settings seems to work fine, prime-select query always shows ‘nvidia’, and lspci shows the nvidia modules…
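In case anyone wants to repeat the nomodeset and blacklist checks from the list above, this is roughly how they can be scripted. It is only a minimal sketch: the function names has_nomodeset and find_nvidia_blacklists are made up for illustration, and the grep pattern assumes the blacklist entries start with the word "blacklist" followed by a module name beginning with "nvidia".

```shell
#!/bin/sh
# Sketch only: these helper names are hypothetical, not part of any NVIDIA or Ubuntu tool.

# Succeeds (exit status 0) if the given kernel command line contains "nomodeset".
has_nomodeset() {
    case " $1 " in
        *" nomodeset "*) return 0 ;;
        *) return 1 ;;
    esac
}

# Prints every *.conf file in the given directories that blacklists a module
# whose name starts with "nvidia". Directories that do not exist are skipped.
find_nvidia_blacklists() {
    for dir in "$@"; do
        [ -d "$dir" ] || continue
        grep -lE '^[[:space:]]*blacklist[[:space:]]+nvidia' "$dir"/*.conf 2>/dev/null
    done
    return 0
}
```

On the server this could be used as `has_nomodeset "$(cat /proc/cmdline)" && echo "nomodeset still active"` and `find_nvidia_blacklists /etc/modprobe.d /lib/modprobe.d`; no output from the second command would match what we saw.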
I attach some of the outputs here:
innovation@innovation-holographics:/usr/sbin$ sudo lshw -c video
*-display
description: VGA compatible controller
product: MGA G200eH3
vendor: Matrox Electronics Systems Ltd.
physical id: 0.1
bus info: pci@0000:01:00.1
version: 02
width: 32 bits
clock: 33MHz
capabilities: pm msi pciexpress vga_controller bus_master cap_list rom
configuration: driver=mgag200 latency=0
resources: irq:17 memory:d8000000-d8ffffff memory:d9b98000-d9b9bfff memory:d9000000-d97fffff memory:c0000-dffff
*-display
description: VGA compatible controller
product: TU104GL [Quadro RTX 4000]
vendor: NVIDIA Corporation
physical id: 0
bus info: pci@0000:86:00.0
version: a1
width: 64 bits
clock: 33MHz
capabilities: pm msi pciexpress vga_controller bus_master cap_list rom
configuration: driver=nvidia latency=0
resources: iomemory:d7f0-d7ef iomemory:d7f0-d7ef irq:433 memory:f0000000-f0ffffff memory:d7fe0000000-d7fefffffff memory:d7ff0000000-d7ff1ffffff ioport:a000(size=128) memory:f1080000-f10fffff
innovation@innovation-holographics:/usr/sbin$ lspci -k |grep nvidia
Kernel driver in use: nvidia
Kernel modules: nvidiafb, nouveau, nvidia_drm, nvidia
Kernel driver in use: nvidia-gpu
Kernel modules: i2c_nvidia_gpu
innovation@innovation-holographics:/usr/sbin$ nvidia-smi
Tue Nov 30 10:07:20 2021
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 470.86       Driver Version: 470.86       CUDA Version: 11.4     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  Quadro RTX 4000     Off  | 00000000:86:00.0 Off |                  N/A |
| 30%   34C    P8     4W / 125W |     10MiB /  7982MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|    0   N/A  N/A      2383      G   /usr/lib/xorg/Xorg                  4MiB |
|    0   N/A  N/A      3332      G   /usr/lib/xorg/Xorg                  4MiB |
+-----------------------------------------------------------------------------+
innovation@innovation-holographics:/usr/sbin$ xrandr
Screen 0: minimum 320 x 200, current 1920 x 1080, maximum 4096 x 4096
VGA-1 connected primary 1920x1080+0+0 (normal left inverted right x axis y axis) 596mm x 335mm
1600x900 60.00
1280x1024 75.02 60.02
1152x864 75.00
1024x768 75.03 60.00
800x600 75.00 60.32
640x480 75.00 59.94
1920x1080_60.00 59.96*
innovation@innovation-holographics:/usr/sbin$ prime-select query
nvidia
Thanks for any help, and sorry if this is an obvious problem, but we’ve tested everything we could find in these threads, with no success…