I am following the latest doc (Unified Deployment Guide for SecureAI) to set up CC.
The NVIDIA driver fails to load in the VM:
apus@apus:~$ sudo dmesg | grep -i NVRM
[sudo] password for apus:
[ 16.808289] NVRM: The NVIDIA GPU 0000:01:00.0 (PCI ID: 10de:2331)
NVRM: installed in this system is not supported by the
NVRM: NVIDIA 570.172.08 driver release.
NVRM: Please see 'Appendix A - Supported NVIDIA GPU Products'
NVRM: in this release's README, available on the operating system
NVRM: specific graphics driver download page at www.nvidia.com.
[ 16.809389] NVRM: The NVIDIA probe routine failed for 1 device(s).
[ 16.809391] NVRM: None of the NVIDIA devices were initialized.
[ 17.058542] NVRM: The NVIDIA GPU 0000:01:00.0 (PCI ID: 10de:2331)
NVRM: installed in this system is not supported by the
NVRM: NVIDIA 570.172.08 driver release.
NVRM: Please see 'Appendix A - Supported NVIDIA GPU Products'
NVRM: in this release's README, available on the operating system
NVRM: specific graphics driver download page at www.nvidia.com.
[ 17.068806] NVRM: The NVIDIA probe routine failed for 1 device(s).
[ 17.068808] NVRM: None of the NVIDIA devices were initialized.
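To summarize the failure, the GPU address, PCI ID, and driver version can be pulled out of the NVRM lines in the log above with a short script. This is only a sketch: the function name `unsupported_gpus`, the `SAMPLE` string, and the regular expressions are my own, and they assume the exact message wording shown in this dmesg output.

```python
import re

# Patterns assume the NVRM message wording seen in the dmesg output above.
GPU_RE = re.compile(
    r"NVRM: The NVIDIA GPU (\S+) \(PCI ID: ([0-9a-fA-F]{4}:[0-9a-fA-F]{4})\)"
)
DRV_RE = re.compile(r"NVRM: NVIDIA (\d+(?:\.\d+)+) driver release")


def unsupported_gpus(dmesg_text):
    """Return sorted unique (pci_address, pci_id, driver_version) tuples."""
    found = set()
    current = None  # GPU seen but not yet paired with a driver version
    for line in dmesg_text.splitlines():
        gpu = GPU_RE.search(line)
        if gpu:
            current = (gpu.group(1), gpu.group(2))
            continue
        drv = DRV_RE.search(line)
        if drv and current is not None:
            found.add(current + (drv.group(1),))
            current = None
    return sorted(found)


# Excerpt copied from the log above (hypothetical variable name).
SAMPLE = """\
[ 16.808289] NVRM: The NVIDIA GPU 0000:01:00.0 (PCI ID: 10de:2331)
NVRM: installed in this system is not supported by the
NVRM: NVIDIA 570.172.08 driver release.
[ 17.058542] NVRM: The NVIDIA GPU 0000:01:00.0 (PCI ID: 10de:2331)
NVRM: installed in this system is not supported by the
NVRM: NVIDIA 570.172.08 driver release.
"""

if __name__ == "__main__":
    for address, pci_id, version in unsupported_gpus(SAMPLE):
        print(f"{address} (PCI ID {pci_id}) rejected by driver {version}")
```

The two probe attempts in the log report the same device and driver, so the set collapses them into a single entry.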
The SEV-SNP status looks good:
apus@apus:~$ uname -r
6.8.0-79-generic
apus@apus:~$ sudo dmesg | grep -i sev
[sudo] password for apus:
[ 4.610724] Memory Encryption Features active: AMD SEV SEV-ES SEV-SNP
[ 4.724495] SEV: APIC: wakeup_secondary_cpu() replaced with wakeup_cpu_via_vmgexit()
[ 4.896686] SEV: Using SNP CPUID table, 29 entries present.
[ 5.280797] SEV: SNP guest platform device initialized.
[ 9.363982] sev-guest sev-guest: Initialized SEV guest driver (using vmpck_id 0)
The GPU is an H100:
apus@apus:~$ lspci -vv -s 01:00.0
01:00.0 3D controller: NVIDIA Corporation GH100 [H100 PCIe] (rev a1)
Subsystem: NVIDIA Corporation GH100 [H100 PCIe]
Physical Slot: 0
Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
Interrupt: pin A routed to IRQ 23
Region 0: Memory at 3002000000 (64-bit, prefetchable) [disabled] [size=16M]
Region 2: Memory at 4000000000 (64-bit, prefetchable) [disabled] [size=128G]
Region 4: Memory at 3000000000 (64-bit, prefetchable) [disabled] [size=32M]
Capabilities: <access denied>
Kernel modules: nvidiafb, nouveau, nvidia_drm, nvidia
Can anyone help check what the issue is?