NVIDIA GPU is not loaded in battery

Hello,

I have hp omen 16 with nvidia geforce rtx 4060 gpu and version 550.67 driver is installed on Manjaro Linux. When the laptop is not charging, computer stucks after entering username and password, then login. Also, when it is used while charging, the laptop is stucks in the same way when it is unplugged, whether it is replugged at any time or not at all, and when the computer is shutdown. Apart from this, even though the graphics card is loaded without any problems, when you start the game, you get a screen flickering problem, and after sometimes game crashes, it gives the error “Failed to initialize Vulkan. Please make sure your driver and GPU support Vulkan”.

Recieved error examples;
when running nvidia-smi → “Unable to determine the device handle for GPU0000:01:00.0: Unknown Error”
GPU:0: Unable to read EDID for display device DP-4

is there anyway to solve the problem.

nvidia-bug-report.log.gz (166.6 KB)

[   22.402361] pcieport 0000:00:01.0: AER: Uncorrected (Non-Fatal) error message received from 0000:00:01.0
[   22.402382] pcieport 0000:00:01.0: PCIe Bus Error: severity=Uncorrected (Non-Fatal), type=Transaction Layer, (Requester ID)
[   22.402386] pcieport 0000:00:01.0:   device [8086:460d] error status/mask=00100000/00010000
[   22.402391] pcieport 0000:00:01.0:    [20] UnsupReq               (First)
[   22.402396] pcieport 0000:00:01.0: AER:   TLP Header: 34000000 01000010 00000000 00000000
[   22.402405] nvidia 0000:01:00.0: AER: can't recover (no error_detected callback)
[   22.402408] snd_hda_intel 0000:01:00.1: AER: can't recover (no error_detected callback)
[   22.402460] pcieport 0000:00:01.0: AER: device recovery failed
[   22.616484] NVRM: GPU at PCI:0000:01:00: GPU-7878ea61-a426-7d5b-8d11-b6ae4e1bbd5e
[   22.616496] NVRM: Xid (PCI:0000:01:00): 79, pid='<unknown>', name=<unknown>, GPU has fallen off the bus.
[   22.616503] NVRM: GPU 0000:01:00.0: GPU has fallen off the bus.
[   22.616518] NVRM: A GPU crash dump has been created. If possible, please run
               NVRM: nvidia-bug-report.sh as root to collect this data before
               NVRM: the NVIDIA kernel module is unloaded.
[   26.883097] NVRM: Error in service of callback 

That’s not looking good, the notebook might be broken. Did you already check for a bios update? Does it reliably work in Windows?

I checked playing game couple of hours in Windows, I don’t see any problem. At first I get blue screen, but later there was no problem.

That’s rather unspecific, might still be a broken gpu, bluescreening while it’s cold, working when it’s warm.
Please check if you can work around it setting kernel parameter pcie_aspm=off