Quadro T2000/Ubuntu 18.04 starts idling after 30 seconds of load on latest driver 440.59

I am running configuration from $SUBJECT (configured with prime-select nvidia).

Everything seems to be working fine-ish, but once I start executing any GPU heavy process (for example glmark2) it goes from:

nvidia-smi -q -d PERFORMANCE

==============NVSMI LOG==============

Timestamp                           : Mon Feb 10 10:56:45 2020
Driver Version                      : 440.59
CUDA Version                        : 10.2

Attached GPUs                       : 1
GPU 00000000:01:00.0
    Performance State               : P0
    Clocks Throttle Reasons
        Idle                        : Not Active
        Applications Clocks Setting : Not Active
        SW Power Cap                : Not Active
        HW Slowdown                 : Not Active
            HW Thermal Slowdown     : Not Active
            HW Power Brake Slowdown : Not Active
        Sync Boost                  : Not Active
        SW Thermal Slowdown         : Not Active
        Display Clock Setting       : Not Active

to:

$ nvidia-smi -q -d PERFORMANCE

==============NVSMI LOG==============

Timestamp                           : Mon Feb 10 10:57:15 2020
Driver Version                      : 440.59
CUDA Version                        : 10.2

Attached GPUs                       : 1
GPU 00000000:01:00.0
    Performance State               : P0
    Clocks Throttle Reasons
        Idle                        : Active
        Applications Clocks Setting : Not Active
        SW Power Cap                : Not Active
        HW Slowdown                 : Not Active
            HW Thermal Slowdown     : Not Active
            HW Power Brake Slowdown : Not Active
        Sync Boost                  : Not Active
        SW Thermal Slowdown         : Not Active
        Display Clock Setting       : Not Active

There is not thermal throttling going on since temperatures are looking fine.

I have read on forums there that it might be caused due to HardDPMS, so I have tried to disable DPMS altogether, but it has no effect.

Section "Files"
    ModulePath     "/usr/lib/x86_64-linux-gnu/nvidia/xorg/"
    ModulePath     "/usr/lib/xorg/modules"
EndSection

Section "Extensions"
    Option      "DPMS" "Disable"
EndSection

Section "ServerLayout"
    Identifier "layout"
    Screen 0 "nvidia"
    Inactive "intel"
EndSection

Section "Device"
    Identifier "nvidia"
    Driver "nvidia"
    BusID "PCI:1:0:0"
EndSection

Section "Screen"
    Identifier "nvidia"
    Device "nvidia"
    Option "AllowEmptyInitialConfiguration"
EndSection

Section "Device"
    Identifier "intel"
    Driver "modesetting"
EndSection

Section "Screen"
    Identifier "intel"
    Device "intel"
EndSection

Once throttling kicks in only logout or couple of AC disconnects help.

EDIT: this happens with trying to run with prime offload configuration as well.

I have tried to set HardDPMS to false in Device Section in xorg, based on this question [1] but it did not produce any difference.

[1] https://devtalk.nvidia.com/default/topic/1067676/440-36-with-bumblebee-drops-to-1-fps-after-running-for-10-minutes/

EDIT2: I have sent my report over the mail to linux-bugs@nvidia.com

Hi Pavel,

I have file a bug 2848108 internally for tracking purpose.
Just wanted to check if issue is reproducible with old driver version.

Hello Amrits,

I have tried

440.33, 440.59, 435.21, 418 (not sure about minor version of this one) - All same behavior

I will gladly provide any extra logs or follow any instructions you provide.

Hi Pavel,

We have been able to reproduce issue locally and our engineering team is working on it. Will keep you updated on the same.