How to enable HMM on Debian?

clement.cc · December 11, 2024, 10:06pm

I’m on GCP instance with T4 GPU. Booted up Ubuntu. Installed nvidia-open. HMM is enabled. But a customer is using Debian. So I tried Debian on exact same hardware. HMM won’t enable no matter what I do. I tried kernel 6.9 and even 6.11. Different flavors of the kernel. Even compiling my own kernel from latest unstable. Is there an official document about what exactly is required? Which kernel option needs to be enabled? Any way to enable logs to show why it decided not to enable HMM?

[ 0.000000] Linux version 6.11.5+bpo-cloud-amd64 (debian-kernel@lists.debian.org) (x86_64-linux-gnu-gcc-12
(Debian 12.2.0-14) 12.2.0, GNU ld (GNU Binutils for Debian) 2.40) #1 SMP PREEMPT_DYNAMIC Debian 6.11.5-1~bpo12
+1 (2024-11-11)
…
[ 4.933991] NVRM: loading NVIDIA UNIX Open Kernel Module for x86_64 565.57.01 Release Build (dvs-builder@U16-A24-9-2) Thu Oct 10 12:15:00 UTC 2024
[ 6.893551] nvidia-modeset: Loading NVIDIA UNIX Open Kernel Mode Setting Driver for x86_64 565.57.01 Release Build (dvs-builder@U16-A24-9-2) Thu Oct 10 12:03:51 UTC 2024
[ 8.338626] process ‘subagents/fluent-bit/bin/fluent-bit’ started with executable stack
[ 8.754027] nvidia-uvm: Loaded the UVM driver, major device number 240.

==============NVSMI LOG==============

Timestamp : Tue Dec 10 21:50:19 2024
Driver Version : 565.57.01
CUDA Version : 12.7

Attached GPUs : 1
GPU 00000000:00:04.0
Product Name : Tesla T4
Product Brand : NVIDIA
Product Architecture : Turing
Display Mode : Enabled
Display Active : Disabled
Persistence Mode : Enabled
Addressing Mode : None

clement.cc · December 11, 2024, 11:40pm

Answering my own question. This is the requirement. open-gpu-kernel-modules/kernel-open/nvidia-uvm/uvm_linux.h at 9d0b0414a5304c3679c5db9d44d2afba8e58cc1b · NVIDIA/open-gpu-kernel-modules · GitHub

CONFIG_HMM_MIRROR and CONFIG_DEVICE_PRIVATE.

Success is not possible on Debian because they don’t enable this even on the latest unstable kernel. Everything works after I compiled my own kernel with these enabled. Chapter 4. Common kernel-related tasks

clement.cc · December 11, 2024, 11:43pm

Note to others following this path. Make sure you download linux-config-xxx package to use as a starting point for your kernel config. Otherwise your custom kernel might not be compatible with the rest of your Debian system.

Topic		Replies	Views
HMM Linux Kernel Support AMA with CUDA 12 Team kb	4	939	July 26, 2023
Issue Activating HMM Feature on NVIDIA RTX A4500 with CUDA Toolkit 12.4 on Debian Bookworm CUDA Setup and Installation	6	637	December 12, 2024
Heterogeneous Memory Support (HMM) in NVIDIA UVM driver and Linux 4.14 Linux	8	4614	March 19, 2023
Problems under Debian 11 Kernel 5.19.x Linux boot , kernel , kb	5	1615	January 4, 2023
Understanding open driver error load with V100 GPU (Ubuntu 22.04) Linux	2	524	November 5, 2024
HMM support in driver Linux	2	1508	May 30, 2018
Enabling CC mode on passthrough H100 GPU on RHEL guest VM fails SPDM CUDA Setup and Installation	0	326	May 17, 2024
ERROR: Unable to load the kernel module 'nvidia.ko' NVIDIA Virtual GPU Drivers	3	51608	November 4, 2021
Your card is not supported by any driver version CUDA Setup and Installation	4	232	April 4, 2025
Nvidia: No devices were found Drivers - Linux, Windows, MacOS	2	93	October 23, 2024

How to enable HMM on Debian?

Related topics