Black screen, unstable or missing second monitor, RTX 2070, Ubuntu

Over the last couple of days my two-monitor setup stopped working: the second monitor switches off and does not render. Now I am just seeing a black screen with a cursor.

I have tried downgrading and reinstalling the driver, and also explicitly disabling nouveau, but nothing makes a difference. I also removed the latest OEM kernel 6.0.

GPU :
09:00.0 VGA compatible controller [0300]: NVIDIA Corporation TU106 [GeForce RTX 2070] [10de:1f02] (rev a1) (prog-if 00 [VGA controller])

Output of dmesg | grep nvidia


[    8.610542] kernel: nvidiafb 0000:09:00.0: BAR 3: can't reserve [mem 0xe0000000-0xe1ffffff 64bit pref]
[    8.610544] kernel: nvidiafb: cannot request PCI regions
[    8.800790] kernel: input: HDA NVidia HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input14
[    8.800843] kernel: input: HDA NVidia HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input15
[    8.800883] kernel: input: HDA NVidia HDMI/DP,pcm=8 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input16
[    8.800919] kernel: input: HDA NVidia HDMI/DP,pcm=9 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input17
[    8.800963] kernel: input: HDA NVidia HDMI/DP,pcm=10 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input18
[    8.801002] kernel: input: HDA NVidia HDMI/DP,pcm=11 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input19
[    8.801041] kernel: input: HDA NVidia HDMI/DP,pcm=12 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input20
[    8.874116] kernel: audit: type=1400 audit(1676286052.660:8): apparmor="STATUS" operation="profile_load" profile="unconfined" name="nvidia_modprobe" pid=1137 comm="apparmor_parser"
[    8.874118] kernel: audit: type=1400 audit(1676286052.660:9): apparmor="STATUS" operation="profile_load" profile="unconfined" name="nvidia_modprobe//kmod" pid=1137 comm="apparmor_parser"
[    9.642696] kernel: nvidia-gpu 0000:09:00.3: i2c timeout error e0000000

lsb_release -a

Description: Ubuntu 22.04.1 LTS
Release: 22.04

uname -r

5.15.0-56-lowlatency

Can you please suggest what else I can try to unblock the system?

Please blacklist nvidiafb; it’s not part of the nvidia driver and is blocking it.
Looking at your bug report log in the other thread about your monitor issues, this looks like a cable/monitor issue. The stable monitor runs at 165MHz, the unstable at 300MHz. It sometimes vanishes completely, sometimes it’s there but the edid can’t be read. Please swap cables between the monitors and check whether the issue moves with it.
Furthermore, you seem to have some desktop plugin running that’s hammering the X server with xrandr queries every second. You should find and disable it.
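For reference, blacklisting a kernel module on Ubuntu usually means adding a one-line file under /etc/modprobe.d and then rebuilding the initramfs. A minimal sketch follows; the function name blacklist_nvidiafb and the file name are made up for illustration, and the target directory is a parameter so the function can be tried on a scratch directory first.

```shell
# Hypothetical helper: write a modprobe blacklist entry for nvidiafb.
# $1 is the modprobe configuration directory (/etc/modprobe.d on a real
# system, which needs root). The file name is an arbitrary choice.
blacklist_nvidiafb() {
    conf_dir="$1"
    printf 'blacklist nvidiafb\n' > "$conf_dir/blacklist-nvidiafb.conf"
}
```

On a real system this would be run as root with /etc/modprobe.d as the argument, followed by `sudo update-initramfs -u` and a reboot so that the entry also applies inside the initramfs.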

Hi generix,

thank you for the reply. Unfortunately, blacklisting nvidiafb makes no difference…

This is the output from the dmesg log of my last boot:

cat /var/log/dmesg | grep -i nvidia


[    8.155066] kernel: input: HDA NVidia HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input14
[    8.155105] kernel: input: HDA NVidia HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input15
[    8.155125] kernel: input: HDA NVidia HDMI/DP,pcm=8 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input16
[    8.155146] kernel: input: HDA NVidia HDMI/DP,pcm=9 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input17
[    8.155174] kernel: input: HDA NVidia HDMI/DP,pcm=10 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input18
[    8.155194] kernel: input: HDA NVidia HDMI/DP,pcm=11 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input19
[    8.155221] kernel: input: HDA NVidia HDMI/DP,pcm=12 as /devices/pci0000:00/0000:00:03.1/0000:09:00.1/sound/card0/input20
[    8.274430] kernel: audit: type=1400 audit(1676301524.059:7): apparmor="STATUS" operation="profile_load" profile="unconfined" name="nvidia_modprobe" pid=1125 comm="apparmor_parser"
[    8.274432] kernel: audit: type=1400 audit(1676301524.059:8): apparmor="STATUS" operation="profile_load" profile="unconfined" name="nvidia_modprobe//kmod" pid=1125 comm="apparmor_parser"
[    9.044788] kernel: nvidia-gpu 0000:09:00.3: i2c timeout error e0000000

The blacklisting works, as you can see: there is no output from nvidiafb anymore, and the /etc/modprobe.d/blacklist-nvidiafb.conf file is in place.

I have also reinstalled driver version 525 and run update-initramfs for all kernels … still stuck.

Any other idea on what to look into?

Please run nvidia-bug-report.sh as root and attach the resulting nvidia-bug-report.log.gz file to your post.

nvidia-bug-report.log.gz (104.2 KB)

There are no nvidia kernel modules installed. Please try reinstalling kernel headers
sudo apt install --reinstall linux-headers-$(uname -r)
and post the output of
dkms status
dpkg -l |grep nvidia

dkms status

nvidia/525.78.01, 5.15.0-60-generic, x86_64: installed
v4l2loopback/0.12.5, 5.15.0-43-generic, x86_64: installed
v4l2loopback/0.12.5, 5.15.0-60-generic, x86_64: installed

The above also shows some warnings, such as grep: /lib/modules/5.15.0-43-generic/build.config: no such file or directory

dpkg output in file

dpkgGrepNvidia.log (8.0 KB)

Since the dkms status now shows the nvidia modules as installed for your kernel, please reboot and check if the modules load.

lsmod | grep nvidia shows …

i2c_nvidia_gpu 16384 0

The UX window manager/monitors are still not loading. I get a black screen with a cursor, and to access the shell I still have to press CTRL + ALT + F3 and log in there.
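A note on reading that lsmod output: i2c_nvidia_gpu is a separate in-kernel I2C driver, not part of the proprietary NVIDIA driver, so seeing only that line means the graphics modules did not load. A small sketch of that check, with the function name nvidia_loaded being hypothetical:

```shell
# Hypothetical helper: reads `lsmod` output on stdin and succeeds only if
# the core "nvidia" module itself is listed in the first column.
# Modules such as i2c_nvidia_gpu do not count.
nvidia_loaded() {
    awk '$1 == "nvidia" { found = 1 } END { exit !found }'
}
```

It could be used as `lsmod | nvidia_loaded || echo "nvidia module not loaded"`.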

Please create a new nvidia-bug-report.log.

nvidia-bug-report.log.gz (125.0 KB)

You have now booted into an old kernel, 5.15.0-56; the correct one is 5.15.0-60. Please select that in GRUB.
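The mismatch can be spotted by comparing the running kernel against the kernels that dkms built the nvidia module for. A rough sketch, assuming the `dkms status` line format shown earlier in this thread (module/version, kernel, arch: state); the function name nvidia_built_for is hypothetical:

```shell
# Hypothetical helper: reads `dkms status` output on stdin and succeeds if
# an nvidia module is reported as installed for the kernel given in $1.
# Assumes lines of the form:
#   nvidia/525.78.01, 5.15.0-60-generic, x86_64: installed
nvidia_built_for() {
    awk -F', ' -v k="$1" '
        $1 ~ /^nvidia\// && $2 == k && $3 ~ /: installed/ { found = 1 }
        END { exit !found }
    '
}
```

For example, `dkms status | nvidia_built_for "$(uname -r)" || echo "no nvidia module for the running kernel"` would have flagged the 5.15.0-56 boot.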

Thank you. I am able to boot with the 60-generic kernel. I swapped monitors around and now they both work. For some reason they stopped working when connected with the other cables?

How do I identify what is really causing this?

Also you wrote earlier
“you seem to have some desktop plugin running that’s hammering the xserver with xrandr queries every second.”

Can you please point out where you spotted this so I can look into it further?

In the xorg logs and in journal,
(journalctl -b -0 _COMM=Xorg)

Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): LG Electronics W2486 (DFP-6): connected
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): LG Electronics W2486 (DFP-6): Internal TMDS
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): LG Electronics W2486 (DFP-6): 165.0 MHz maximum pixel clock
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0):
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-0: disconnected
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-0: Internal DisplayPort
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-0: 2660.0 MHz maximum pixel clock
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0):
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-1: disconnected
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-1: Internal TMDS
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-1: 165.0 MHz maximum pixel clock
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0):
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-2: disconnected
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-2: Internal DisplayPort
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-2: 2660.0 MHz maximum pixel clock
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0):
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-3: disconnected
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-3: Internal TMDS
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-3: 165.0 MHz maximum pixel clock
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0):
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-4: disconnected
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-4: Internal TMDS
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-4: 165.0 MHz maximum pixel clock
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0):
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-5: disconnected
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-5: Internal DisplayPort
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): DFP-5: 2660.0 MHz maximum pixel clock
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0):
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): LG Electronics W2486 (DFP-6): connected
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): LG Electronics W2486 (DFP-6): Internal TMDS
Sep 15 05:57:34 bizmate-i7 /usr/libexec/gdm-x-session[4584]: (--) NVIDIA(GPU-0): LG Electronics W2486 (DFP-6): 165.0 MHz maximum pixel clock

This is repeated over and over. The X server logs this block whenever xrandr is run.
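To get a feel for how often this happens, the "connected" reports can be counted per timestamp. A minimal sketch, assuming the journal line layout shown above (month, day and time as the first three fields); the function name count_probes is made up for illustration:

```shell
# Hypothetical helper: reads Xorg journal lines on stdin and prints, for
# each timestamp, how many outputs were reported as "connected".
# Lines ending in ": disconnected" are not counted.
count_probes() {
    awk '/: connected$/ { n[$1 " " $2 " " $3]++ }
         END { for (t in n) print n[t], t }'
}
```

Used as `journalctl -b -0 _COMM=Xorg | count_probes`, a long list of consecutive seconds for a single attached monitor would suggest that something is polling xrandr continuously.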

What kind of cables are those exactly? DP2HDMI?

No, the cables are plain HDMI, no conversion: https://www.lg.com/hk_en/monitor/lg-W2486L-PF

I can only speculate. While the monitors are the same model, they might have different firmware versions and thus use different connection speeds. One cable might be flaky, so it only works with the lower-speed monitor. Another possibility is that the connectors had become dirty and just pulling out and plugging back in the cables made it work again.

Hi generix, I’m not sure if this is the proper way to ask for help. I’ve been quite frustrated with my Ubuntu 22.04; I have read a lot of your threads here, but I still couldn’t fix my issue.
Thanks for your time.

Env

Ubuntu 22.04
Secure boot disabled
Quadro T2000 with Max-Q Design/PCIe/SSE2 / Quadro T2000 with Ma…
LG using HDMI, DELL using DP, both via a docking station connected to the laptop (confirmed that the same cable connections work with my other Windows laptop, where I can use both external monitors)

Issue Desc

Black screen when I connect both external monitors (all 3 monitors are detected: 1 laptop, 2 external).

If I remove the DELL (DP cable) from the docking station, the LG (HDMI) works.
Tried tweaking the refresh rate and resolution; that does not seem to help.

Extra Info

sudo apt install linux-headers-$(uname -r)
0 to upgrade, 0 to newly install, 0 to remove and 0 not to upgrade.
sudo ubuntu-drivers autoinstall
All the available drivers are already installed.

nvidia-settings

(nvidia-settings:14952): GLib-GObject-CRITICAL **: 08:39:50.588: g_object_unref: assertion ‘G_IS_OBJECT (object)’ failed
** Message: 08:39:50.619: PRIME: Requires offloading
** Message: 08:39:50.619: PRIME: is it supported? yes
** Message: 08:39:50.639: PRIME: Usage: /usr/bin/prime-select nvidia|intel|on-demand|query
** Message: 08:39:50.639: PRIME: on-demand mode: “1”
** Message: 08:39:50.639: PRIME: is “on-demand” mode supported? yes

log
nvidia-bug-report.log.gz (455.4 KB)

tx!

 (--) NVIDIA(0):     DFP-0.2
 (--) NVIDIA(0):     DFP-0.3.1
[...]
 (WW) NVIDIA(0): No valid modes for
 (WW) NVIDIA(0):     "DFP-0.2:nvidia-auto-select,DFP-0.3.1:nvidia-auto-select";
 (WW) NVIDIA(0):     removing.

The dock has a slightly odd topology: the DP connector to the Dell looks like it is daisy-chained to another connector. The nvidia driver has a bug with that. What dock model is this, and does it have a third monitor connector?
Please try the latest 550 driver. If that doesn’t help, please enable mode debugging by creating
/etc/X11/xorg.conf.d/nv-modebug.conf
with the following content:

Section "OutputClass"
    Identifier "nvidia-modedebug"
    MatchDriver "nvidia-drm"
    Driver "nvidia"
    Option "ModeDebug" "true"
EndSection

Afterwards, please create a new nvidia-bug-report.log.