Cannot get CUDA to work on OpenSUSE Tumbleweed-Slowroll

I installed CUDA libraries on my system using the .run file for Leap 15, as specified on this thread . I downloaded the 12.4.0_550.54.14 version as my current driver is 550, i’ve then unmarked the driver and module installation as i’ve it already installed from the package manager, i’ve then checked PATH and ldconfig and added the missing paths where needed, and also downloaded CuDNN and copied the libs, however nothing can detect it.

Tensorflow was giving me CUDA_ERROR_NOT_INITIALIZED, while Blender was saying i had no compatible GPU. As mentioned in this thread, the open kernel module has to be the same version as the CUDA component, so i uninstalled the open-source module and installed the old proprietary one. However, after that, CUDA still can’t work. Now Tensorflow gives me CUDA_ERROR_UNKNOWN and Blender still says i have no compatible GPU.

So i’ve asked in the OpenSUSE Forums about this problem, made some diagnostics and checked the libraries are installed and detectable, but programs still can’t detect them and neither the drive can initialize them. Another user told me he got issues using NVIDIA alongside AMD. I’ve checked it on Windows aswell and CUDA does works (also alongside AMD HIP), but i would like to have it working on Linux.

My system specifications:

 (⌚qui jun-6 9:38:57)-(🦊joseskvolpe:~)-( 304K:62)
$ neofetch
              _aaaymQQmwaaa,                 joseskvolpe@ProtoFOX 
          ,wWQQQD????????$QQQQa,.            -------------------- 
       _wQQB?"              ??QQQa,          OS: openSUSE Tumbleweed-Slowroll x86_64 
     sQQD^                      ?QQ6\        Host: Nitro AN515-47 V1.14 
    yWW'                          4QQg       Kernel: 6.9.5-1-default 
  ,QQD          .aaaaaaaa          ^4Q6      Uptime: 55 mins 
 ,mQP        _wWQW?????YWWQa,        4Qm     Packages: 3449 (rpm), 44 (flatpak) 
 jQ@        wWW?'        ^4QQc       ^$QL    Shell: bash 5.2.26 
,QQ'       jWW'            )QW\       ]QQ    Resolution: 1920x1080 
|QQ       ,QW'              ]QQ       ^QQ|   DE: Plasma 6.0.5 
|QQ       |QQ               ]QQ        QQ|   WM: kwin 
|QQ        4Qg              ]QQ       .QQ|   Theme: [Plasma], X-Vulpus-DarkRed [GTK2/3] 
'QQ6       '$WQac.         _QQ(       jQQ    Icons: [Plasma], Vulpinity [GTK2/3] 
 ]QQw        "?QWQQf      _mQP       ,QQ(    Terminal: yakuake 
  4QQga                  wQQP       ,mQ?     CPU: AMD Ryzen 5 7535HS with Radeon Graphics (12) @ 4.603GHz 
   4QQQga,            saQWP'       jQQf      GPU: AMD ATI Radeon 680M 
    ?QQQQQQwaaaaaaaayWWW?'       _mQ@'       GPU: NVIDIA GeForce RTX 3050 Mobile 
      ?WQQQP?9VWUV???^        _amQP^         Memory: 6362MiB / 15171MiB 
        "4QQQaa,          ,awQQQ?^
           "?VQQQQQQQQQQQQQQP?'                                      
                                                                     

inxi:

$ inxi -GSaxxz
System:
  Kernel: 6.9.5-1-default arch: x86_64 bits: 64 compiler: gcc v: 13.3.0
    clocksource: tsc avail: hpet,acpi_pm
    parameters: BOOT_IMAGE=/vmlinuz-6.9.5-1-default
    root=/dev/mapper/OpenSUSE-SYSTEM splash=silent resume=/dev/OpenSUSE/SWAP
    quiet pcie_aspm=force acpi_backlight=native security=apparmor rd.shell=0
    mitigations=auto
  Desktop: KDE Plasma v: 6.0.5 tk: Qt v: N/A info: frameworks v: 6.2.0
    wm: kwin_wayland tools: avail: xscreensaver vt: 2 dm: SDDM Distro: openSUSE
    Tumbleweed-Slowroll 20240605
Graphics:
  Device-1: NVIDIA GA107M [GeForce RTX 3050 Mobile]
    vendor: Acer Incorporated ALI driver: nvidia v: 550.90.07
    alternate: nouveau,nvidia_drm non-free: 550.xx+ status: current (as of
    2024-04; EOL~2026-12-xx) arch: Ampere code: GAxxx process: TSMC n7 (7nm)
    built: 2020-2023 pcie: gen: 1 speed: 2.5 GT/s lanes: 8 link-max: gen: 4
    speed: 16 GT/s lanes: 16 ports: active: none off: HDMI-A-1 empty: none
    bus-ID: 01:00.0 chip-ID: 10de:25a2 class-ID: 0300
  Device-2: AMD Rembrandt [Radeon 680M] vendor: Acer Incorporated ALI
    driver: amdgpu v: kernel arch: RDNA-2 code: Navi-2x process: TSMC n7 (7nm)
    built: 2020-22 pcie: gen: 4 speed: 16 GT/s lanes: 16 ports:
    active: DP-3,eDP-1 empty: DP-1, DP-2, DP-4, DP-5, DP-6, DP-7, DP-8,
    Writeback-1 bus-ID: 75:00.0 chip-ID: 1002:1681 class-ID: 0300 temp: 47.0 C
  Device-3: Chicony ACER HD User Facing driver: uvcvideo type: USB rev: 2.0
    speed: 480 Mb/s lanes: 1 mode: 2.0 bus-ID: 5-1:2 chip-ID: 04f2:b76f
    class-ID: fe01 serial: <filter>
  Display: wayland server: X.org v: 1.21.1.12 with: Xwayland v: 24.1.0
    compositor: kwin_wayland driver: X: loaded: modesetting dri: radeonsi
    gpu: nvidia,amdgpu d-rect: 5376x3024 display-ID: 0
  Monitor-1: DP-3 pos: primary,top-left res: 1920x1080 size: N/A modes: N/A
  Monitor-2: HDMI-A-1 pos: bottom-c res: 1920x1080 size: N/A modes: N/A
  Monitor-3: eDP-1 pos: middle-r res: 1536x864 size: N/A modes: N/A
  API: EGL v: 1.5 hw: drv: nvidia drv: amd radeonsi platforms: device: 0
    drv: nvidia device: 1 drv: radeonsi device: 3 drv: swrast surfaceless:
    drv: nvidia wayland: drv: radeonsi x11: drv: radeonsi
    inactive: gbm,device-2
  API: OpenGL v: 4.6.0 compat-v: 4.5 vendor: amd mesa v: 24.0.8 glx-v: 1.4
    direct-render: yes renderer: AMD Radeon 660M (radeonsi rembrandt LLVM
    18.1.6 DRM 3.57 6.9.5-1-default) device-ID: 1002:1681 memory: 500 MiB
    unified: no display-ID: :0.0
  API: Vulkan v: 1.3.283 layers: 2 device: 0 type: integrated-gpu name: AMD
    Radeon 660M (RADV REMBRANDT) driver: N/A device-ID: 1002:1681
    surfaces: xcb,xlib,wayland device: 1 type: discrete-gpu name: NVIDIA
    GeForce RTX 3050 Laptop GPU driver: N/A device-ID: 10de:25a2
    surfaces: xcb,xlib,wayland

¿Something i’m missing?

Please run nvidia-bug-report.sh as root and attach the resulting nvidia-bug-report.log.gz file to your post.

Here

nvidia-bug-report.log.gz (948.0 KB)

Driver install looks fine but nvidia-smi is reporting an error
Addressing Mode : Unknown Error
Please try disabling the GSP firmware by adding kernel parameter
nvidia.NVreg_EnableGpuFirmware=0

Using this option caused a scary behaviour. It took longer to boot, then WiFi wasn’t working, CUDA worked but crashed after using Blender (not the first time). After rebooting, WiFi still wasn’t working and mouse wasn’t working aswell, had to power off then turn the laptop back on to fix it.

¡Solved! Thanks to malcolmlewis.

There was conflicting parameters in /etc/modprobe.d. Had to remove them and add these parameters on a new 50-nvidia.conf file:

/etc/modprobe.d/50-nvidia.conf

options nvidia-drm modeset=1
options nvidia "NVreg_DynamicPowerManagement=0x02"

Then run dracut

dracut -f --regenerate-all

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.