I have installed the nVIDIA software in Linux release 8.3.2011 with kernel 5.4.107 with T4 and V100 without problems, but when I install nvidia software in a system with A40 card I can’t create vGPUs instances.
I installed NVIDIA-GRID-Linux-KVM-460.32.04-460.32.03-461.33 ok, but when I list /sys/bus/pci/devices/0000:41:00.0 there is not directory mdev_supported_types.
In this path /sys/bus/pci/devices/0000:41:00.0 appears iommu and iommu_group directories and sriov* files that don’t appear in other installations with T4 or V100.
Some ideas? Can you help me?
nvidia-smi output is:
[root@a40 ~]# nvidia-smi
Sun Mar 21 09:15:12 2021
±----------------------------------------------------------------------------+
| NVIDIA-SMI 460.32.04 Driver Version: 460.32.04 CUDA Version: N/A |
|-------------------------------±---------------------±---------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 A40 On | 00000000:41:00.0 Off | 0 |
| 0% 29C P0 73W / 300W | 0MiB / 45634MiB | 0% Default |
| | | N/A |
±------------------------------±---------------------±---------------------+
±----------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=============================================================================|
| No running processes found |
±----------------------------------------------------------------------------+
mode compute is selected
[root@a40 ~]# ./displaymodeselector --listgpumodes
NVIDIA Display Mode Selector Utility (Version 1.48.0)
Copyright (C) 2015-2020, NVIDIA Corporation. All Rights Reserved.
Adapter: Graphics Device (10DE,2235,10DE,145A) S:00,B:41,D:00,F:00
EEPROM ID (EF,6015) : WBond W25Q16FW/JW 1.65-1.95V 16384Kx1S, page
GPU Mode: Compute
[root@a40]# ls /sys/bus/pci/devices/0000:41:00.0
aer_dev_correctable
aer_dev_fatal
aer_dev_nonfatal
ari_enabled
broken_parity_status
class
config
consistent_dma_mask_bits
current_link_speed
current_link_width
d3cold_allowed
device
dma_mask_bits
driver
driver_override
enable
i2c-5
i2c-6
iommu
iommu_group
irq
local_cpulist
local_cpus
max_link_speed
max_link_width
modalias
msi_bus
msi_irqs
numa_node
power
remove
rescan
reset
resource
resource0
resource1
resource1_wc
resource3
resource3_wc
revision
sriov_drivers_autoprobe
sriov_numvfs
sriov_offset
sriov_stride
sriov_totalvfs
sriov_vf_device
subsystem
subsystem_device
subsystem_vendor
uevent
vendor