Black screen, unstable or not second monitor RTX2070, Ubuntu

In the last couple of days my two monitor setup stopped working, where the second monitor switches off, does not render. Now i am just seeing a black screen with a cursor.

I have tried downgrading, reinstalling the driver and also explicitly disabling nouveau but nothing makes a difference. I also removed the latest OEM kernel 6.0

GPU :
09:00.0 VGA compatible controller [0300]: NVIDIA Corporation TU106 [GeForce RTX 2070] [10de:1f02] (rev a1) (prog-if 00 [VGA controller])

Output of dmesg | grep nvidia


[    8.610542] kernel: nvidiafb 0000:09:00.0: BAR 3: can't reserve [mem 0xe0000000-0xe1ffffff 64bit pref]
[    8.610544] kernel: nvidiafb: cannot request PCI regions
[    8.800790] kernel: input: HDA NVidia HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input14
[    8.800843] kernel: input: HDA NVidia HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input15
[    8.800883] kernel: input: HDA NVidia HDMI/DP,pcm=8 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input16
[    8.800919] kernel: input: HDA NVidia HDMI/DP,pcm=9 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input17
[    8.800963] kernel: input: HDA NVidia HDMI/DP,pcm=10 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input18
[    8.801002] kernel: input: HDA NVidia HDMI/DP,pcm=11 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input19
[    8.801041] kernel: input: HDA NVidia HDMI/DP,pcm=12 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input20
[    8.874116] kernel: audit: type=1400 audit(1676286052.660:8): apparmor="STATUS" operation="profile_load" profile="unconfined" name="nvidia_modprobe" pid=1137 comm="apparmor_parser"
[    8.874118] kernel: audit: type=1400 audit(1676286052.660:9): apparmor="STATUS" operation="profile_load" profile="unconfined" name="nvidia_modprobe//kmod" pid=1137 comm="apparmor_parser"
[    9.642696] kernel: nvidia-gpu 0000:09:00.3: i2c timeout error e0000000

lsb_release -a

|Description:|Ubuntu 22.04.1 LTS|
|---|---|
|Release:|22.04|

uname -r

5.15.0-56-lowlatency

Can you please suggest what else I can try to unblock the syste?

Please blacklist nvidiafb, that’s no part of the nvidia driver and is blocking it.
Looking at your bug report log in the other thread about your monitor issues, this looks like a cable/monitor issue. The stable monitor runs at 165MHz, the unstable at 300MHz. It sometimes vanishes completely, sometimes it’s there but the edid can’t be read. Please swap cables between the monitors and check whether the issue moves with it.
Furthermore, you seem to have some desktop plugin running that’s hammering the xserver with xrandr queries every second. Rather find and disable it.

hi generix

thank you for the reply. Unfortunately blacklisting nvidiafb makes no difference…

this is the output from my last boot dmesg logs

cat /var/log/dmesg | grep -i nvidia


 [    8.155066] kernel: input: HDA NVidia HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input14
[    8.155105] kernel: input: HDA NVidia HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input15
[    8.155125] kernel: input: HDA NVidia HDMI/DP,pcm=8 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input16
[    8.155146] kernel: input: HDA NVidia HDMI/DP,pcm=9 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input17
[    8.155174] kernel: input: HDA NVidia HDMI/DP,pcm=10 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input18
[    8.155194] kernel: input: HDA NVidia HDMI/DP,pcm=11 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input19
[    8.155221] kernel: input: HDA NVidia HDMI/DP,pcm=12 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input20
[    8.274430] kernel: audit: type=1400 audit(1676301524.059:7): apparmor="STATUS" operation="profile_load" profile="unconfined" name="nvidia_modprobe" pid=1125 comm="apparmor_parser"
[    8.274432] kernel: audit: type=1400 audit(1676301524.059:8): apparmor="STATUS" operation="profile_load" profile="unconfined" name="nvidia_modprobe//kmod" pid=1125 comm="apparmor_parser"
[    9.044788] kernel: nvidia-gpu 0000:09:00.3: i2c timeout error e0000000

the blacklisting works as you can there is no output from nvidiafb anymore and the /etc/modprobe.d/blacklist-nvidiafb.conf file is in place

I have also reinstalled the driver version 525 and run updateramfs for all kernels … still stuck.

Any other idea on what to look into?

Please run nvidia-bug-report.sh as root and attach the resulting nvidia-bug-report.log.gz file to your post.

nvidia-bug-report.log.gz (104.2 KB)

There are no nvidia kernel modules installed. Please try reinstalling kernel headers
sudo apt install --reinstall linux-headers-$(uname -r)
and post the output of
dkms status
dpkg -l |grep nvidia

dkms status

nvidia/525.78.01, 5.15.0-60-generic, x86_64: installed
v4l2loopback/0.12.5, 5.15.0-43-generic, x86_64: installed
v4l2loopback/0.12.5, 5.15.0-60-generic, x86_64: installed

The above also shows some warning such as grep: /lib/modules/5.15.0-43-generic/build.config: no such file or directory

dpkg output in file

dpkgGrepNvidia.log (8.0 KB)

Since the dkms status now shows the nvidia modules as installed for your kernel, please reboot and check if the modules load.

lsmod | grep nvidia shows …

i2c_nvidia_gpu 16384 0

The UX window manager/monitors are still not loading. I get a cursor with a black screen and to access the shell i am still CTRL + ALT + F3 and login there.

Please create a new nvidia-bug-report.log.

nvidia-bug-report.log.gz (125.0 KB)

You now booted into an old kernel 5.15.0-56, the correct one is 5.15.0-60. Please select that in grub.

Thank you. I am able to boot with the 60-generic kernel. I swapped monitors around and now they both work. For some reason they stopped working when connected with the other cables?

How do i identify really what is causing this?

Also you wrote earlier
“you seem to have some desktop plugin running that’s hammering the xserver with xrandr queries every second.”

Can you please point out where you spotted this from so i can further look into it?

In the xorg logs and in journal,
(journalctl -b -0 _COMM=Xorg)

Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): LG Electronics W2486 (DFP-6): connected
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): LG Electronics W2486 (DFP-6): Internal TMDS
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): LG Electronics W2486 (DFP-6): 165.0 MHz maximum pixel clock
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0):
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-0: disconnected
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-0: Internal DisplayPort
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-0: 2660.0 MHz maximum pixel clock
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0):
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-1: disconnected
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-1: Internal TMDS
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-1: 165.0 MHz maximum pixel clock
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0):
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-2: disconnected
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-2: Internal DisplayPort
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-2: 2660.0 MHz maximum pixel clock
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0):
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-3: disconnected
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-3: Internal TMDS
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-3: 165.0 MHz maximum pixel clock
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0):
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-4: disconnected
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-4: Internal TMDS
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-4: 165.0 MHz maximum pixel clock
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0):
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-5: disconnected
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-5: Internal DisplayPort
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-5: 2660.0 MHz maximum pixel clock
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0):
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): LG Electronics W2486 (DFP-6): connected
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): LG Electronics W2486 (DFP-6): Internal TMDS
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): LG Electronics W2486 (DFP-6): 165.0 MHz maximum pixel clock

repeated over and over. The xserver logs that whenever xrandr is run.

What kind of cables are those exactly? DP2HDMI?

No the cables are just HDMI, no conversion https://www.lg.com/hk_en/monitor/lg-W2486L-PF

I can only speculate. While the monitors are the same, they might have different firmware versions, thus using different connection speeds. One cable might be flakey so only works at the lower speed monitor. Another idea might be the connectors became dirty and just pulling out/plugging back in the cables made it work again.

Hi generix, not sure if it’s proper way to ask help, been quite frustrated with my Ubuntu 22.04, I read a lot your threads here, but seems still I couldn’t fix my issue.
thanks for your time.

Env

Ubuntu 22.04
Secure boot disabled
Quadro T2000 with Max-Q Design/PCIe/SSE2 / Quadro T2000 with Ma…
LG using HDMI, DELL using DP, via dock station connected to laptop( confirmed same cable connection working for my another Win laptop, I can use 2 external monitors)

Issue Desc

Black screen issue when connect all my two external monior (it can detect all 3 monitors 1 laoptop, 2 external;)


Remove DELL(DP cable) from docker station, LG works(HDMI)
Tried tweaking refresh rate and resolution, not seems work.

Extra Info

sudo apt install linux-headers-$(uname -r)
0 to upgrade, 0 to newly install, 0 to remove and 0 not to upgrade.
sudo ubuntu-drivers autoinstall
All the available drivers are already installed.

nvidia-settings

(nvidia-settings:14952): GLib-GObject-CRITICAL **: 08:39:50.588: g_object_unref: assertion ‘G_IS_OBJECT (object)’ failed
** Message: 08:39:50.619: PRIME: Requires offloading
** Message: 08:39:50.619: PRIME: is it supported? yes
** Message: 08:39:50.639: PRIME: Usage: /usr/bin/prime-select nvidia|intel|on-demand|query
** Message: 08:39:50.639: PRIME: on-demand mode: “1”
** Message: 08:39:50.639: PRIME: is “on-demand” mode supported? yes

log
nvidia-bug-report.log.gz (455.4 KB)

tx!

 (--) NVIDIA(0):     DFP-0.2
 (--) NVIDIA(0):     DFP-0.3.1
[...]
 (WW) NVIDIA(0): No valid modes for
 (WW) NVIDIA(0):     "DFP-0.2:nvidia-auto-select,DFP-0.3.1:nvidia-auto-select";
 (WW) NVIDIA(0):     removing.

The dock has a slightly odd topology, the DP connector to the dell looks like daisy-chained to another connector. The nvidia driver has a bug with that. What dock model is this, does it have a third monitor connector?
Please try using the latest 550 driver, if that doesn’t help, please set mode debugging by creating
/etc/X11/xorg.conf.d/nv-modebug.conf

Section "OutputClass"
    Identifier "nvidia-modedebug"
    MatchDriver "nvidia-drm"
    Driver "nvidia"
    Option "ModeDebug" "true"
EndSection

Afterwards, please create a new nvidia-bug-report.log.

Despite the saldom occasions when i had to remove and add cables again, now I am again experiencing the same problem on my system. I am not sure what is happening but if i keep removing and adding cables again eventually they will indeed break but currently this is the only way to get both screens to work again.
Again details of my system

Xrand_20250320.log (12.3 KB)

the above shows the monitor attached on the HDMI and one through DP. The one on the DP is not working, despite being detected, it is showing a black screen and the led is blinking.

Attaching
nvidia-bug-report.log.gz (790.7 KB)

Also from dmesg it shows

$ sudo dmesg | grep nvidia
[ 9.171808] nvidia: loading out-of-tree module taints kernel.
[ 9.171816] nvidia: module license ‘NVIDIA’ taints kernel.
[ 9.190704] nvidia: module verification failed: signature and/or required key missing - tainting kernel
[ 9.313346] nvidia-nvlink: Nvlink Core is being initialized, major device number 235
[ 9.315249] nvidia 0000:09:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=io+mem
[ 9.367063] nvidia-modeset: Loading NVIDIA Kernel Mode Setting Driver for UNIX platforms 535.183.01 Sun May 12 19:31:08 UTC 2024
[ 9.368795] [drm] [nvidia-drm] [GPU ID 0x00000900] Loading driver
[ 10.434650] audit: type=1400 audit(1742443846.964:7): apparmor=“STATUS” operation=“profile_load” profile=“unconfined” name=“nvidia_modprobe” pid=1498 comm=“apparmor_parser”
[ 10.434653] audit: type=1400 audit(1742443846.964:8): apparmor=“STATUS” operation=“profile_load” profile=“unconfined” name=“nvidia_modprobe//kmod” pid=1498 comm=“apparmor_parser”
[ 10.884198] nvidia-modeset: WARNING: GPU:0: Unable to read EDID for display device HDMI-0
[ 11.159134] nvidia-modeset: WARNING: GPU:0: Unable to read EDID for display device HDMI-0
[ 11.178694] [drm] Initialized nvidia-drm 0.0.0 20160202 for 0000:09:00.0 on minor 0
[ 11.184488] nvidia_uvm: module uses symbols nvUvmInterfaceDisableAccessCntr from proprietary module nvidia, inheriting taint.
[ 11.208232] nvidia-uvm: Loaded the UVM driver, major device number 511.
[ 11.338859] nvidia-modeset: WARNING: GPU:0: Unable to read EDID for display device HDMI-0
[ 804.578017] [drm:nv_drm_master_set [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000900] Failed to grab modeset ownership
[ 805.936046] [drm:nv_drm_master_set [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000900] Failed to grab modeset ownership
[ 805.936099] [drm:nv_drm_master_set [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000900] Failed to grab modeset ownership
[ 805.936125] [drm:nv_drm_master_set [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000900] Failed to grab modeset ownership
[ 805.936152] [drm:nv_drm_master_set [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000900] Failed to grab modeset ownership
[ 805.945724] [drm:nv_drm_master_set [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000900] Failed to grab modeset ownership
[ 805.945772] [drm:nv_drm_master_set [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000900] Failed to grab modeset ownership
[ 805.945804] [drm:nv_drm_master_set [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000900] Failed to grab modeset ownership
[ 805.945829] [drm:nv_drm_master_set [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000900] Failed to grab modeset ownership
[ 805.955836] [drm:nv_drm_master_set [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000900] Failed to grab modeset ownership
[ 805.955879] [drm:nv_drm_master_set [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000900] Failed to grab modeset ownership
[ 805.955907] [drm:nv_drm_master_set [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000900] Failed to grab modeset ownership
[ 805.955930] [drm:nv_drm_master_set [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000900] Failed to grab modeset ownership
[ 819.664340] nvidia-modeset: WARNING: GPU:0: Unable to read EDID for display device LG Electronics W2486 (DP-3)
[ 837.995090] nvidia-modeset: WARNING: GPU:0: Unable to read EDID for display device LG Electronics W2486 (DP-3)
[ 5612.857324] [drm:nv_drm_master_set [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000900] Failed to grab modeset ownership
[ 5612.857381] [drm:nv_drm_master_set [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000900] Failed to grab modeset ownership
[ 5612.857403] [drm:nv_drm_master_set [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000900] Failed to grab modeset ownership
[ 5612.857424] [drm:nv_drm_master_set [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000900] Failed to grab modeset ownership
[ 7557.834135] [drm:nv_drm_master_set [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000900] Failed to grab modeset ownership
[ 7557.834204] [drm:nv_drm_master_set [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000900] Failed to grab modeset ownership
[ 7557.834228] [drm:nv_drm_master_set [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000900] Failed to grab modeset ownership
[ 7557.834249] [drm:nv_drm_master_set [nvidia_drm]] ERROR [nvidia-drm] [GPU ID 0x00000900] Failed to grab modeset ownership

I assume the main issue in this specific time is Unable to read EDID, but what i would like to know is

  • is the driver likely the cause of these detection problems?
  • is there a way to restart the nvidia detection without restarting X, ie force through CLI the unplugging and plugging back in of the cables
  • i can see the display if i edit the displays config through Gnome UX but if i try to join the dead one it doesnt do and stays dead.

Unfortunately where i live there is lack of PC parts, i could not find a DP → HDMI cable at all and i cannot connect the second monitor with HDMI to HDMI because there is only one port on the card