Hello guys
I have been trying for many days now to get an NVIDIA GTX 1050 Ti up, running, and accessible to CUDA/Python/TensorFlow 1.14 in a virtual machine guest via PCI passthrough.
The guest was created with QEMU/KVM through virt-manager and runs Debian Stretch, cuda10.0-p1, and TensorFlow 1.14. These versions all match and work together on Debian Stretch (no backports activated or installed, so no newer kernel): I tested the same combination on a bare-metal machine beforehand (I know that is the one big difference).
It looks as if all necessary virtio components work, as do all necessary NVIDIA components, and the CUDA installation did not show any errors or suspicious messages such as ‘no lib32 compatible directory found’.
(The NVIDIA GPU is not the primary one in the host machine’s BIOS, if that is of any relevance.)
Here is why I think it should work:
| ~ @ StReTcH (debian)
| => lspci -Dnn | grep -i vga
0000:00:01.0 VGA compatible controller [0300]: Red Hat, Inc. QXL paravirtual graphic card [1b36:0100] (rev 04)
0000:05:00.0 VGA compatible controller [0300]: NVIDIA Corporation GP107 [GeForce GTX 1050 Ti] [10de:1c82] (rev a1)
| ~ @ StReTcH (debian)
| => lsmod | grep virtio
virtio_rng 16384 0
rng_core 16384 1 virtio_rng
virtio_balloon 16384 0
virtio_console 28672 0
virtio_blk 20480 3
virtio_net 32768 0
virtio_pci 24576 0
virtio_ring 24576 6 virtio_blk,virtio_net,virtio_rng,virtio_balloon,virtio_console,virtio_pci
virtio 16384 6 virtio_blk,virtio_net,virtio_rng,virtio_balloon,virtio_console,virtio_pci
| ~ @ StReTcH (debian)
| => lsmod | grep nvidia
nvidia_drm 45056 0
nvidia_modeset 1044480 1 nvidia_drm
nvidia 16797696 1 nvidia_modeset
ipmi_msghandler 49152 1 nvidia
drm_kms_helper 155648 2 qxl,nvidia_drm
drm 360448 7 qxl,ttm,nvidia_drm,drm_kms_helper
| ~ @ StReTcH (debian)
| => nvidia-smi
Unable to determine the device handle for GPU 0000:05:00.0: Unknown Error
| ~ @ StReTcH (debian)
| =>
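In case it helps anyone reproduce the first check, here is a minimal Python sketch that parses `lspci -Dnn` output and picks out the NVIDIA devices by PCI vendor ID (10de). The names `parse_lspci`, `nvidia_devices`, and `SAMPLE` are made up for illustration, and the regular expression only assumes the line layout shown in the output above. It confirms only that the device is visible on the guest's PCI bus, not that the driver can talk to it.

```python
import re

# One line of `lspci -Dnn` output, in the layout pasted above:
# <domain:bus:dev.fn> <class name> [<class id>]: <device name> [<vendor>:<device>] (rev ..)
LSPCI_LINE = re.compile(
    r"^(?P<addr>[0-9a-f]{4}:[0-9a-f]{2}:[0-9a-f]{2}\.[0-7])\s+"
    r"(?P<cls>.+?)\s+\[(?P<class_id>[0-9a-f]{4})\]:\s+"
    r"(?P<name>.+)\s+\[(?P<vendor>[0-9a-f]{4}):(?P<device>[0-9a-f]{4})\]"
)

NVIDIA_VENDOR_ID = "10de"  # PCI vendor ID seen for the GTX 1050 Ti above


def parse_lspci(text):
    """Return one dict per line that matches the `lspci -Dnn` layout."""
    devices = []
    for line in text.splitlines():
        match = LSPCI_LINE.match(line.strip())
        if match:
            devices.append(match.groupdict())
    return devices


def nvidia_devices(text):
    """Keep only the devices whose vendor ID is NVIDIA's."""
    return [d for d in parse_lspci(text) if d["vendor"] == NVIDIA_VENDOR_ID]


# Sample input copied from the lspci output in this post.
SAMPLE = """\
0000:00:01.0 VGA compatible controller [0300]: Red Hat, Inc. QXL paravirtual graphic card [1b36:0100] (rev 04)
0000:05:00.0 VGA compatible controller [0300]: NVIDIA Corporation GP107 [GeForce GTX 1050 Ti] [10de:1c82] (rev a1)
"""

if __name__ == "__main__":
    for dev in nvidia_devices(SAMPLE):
        print(dev["addr"], dev["vendor"] + ":" + dev["device"], dev["name"])
```

On a live guest, the sample string could be replaced with the text returned by running `lspci -Dnn` through `subprocess.run`.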
Does anyone have an idea? Any help with this is highly appreciated.
Thank you all in advance.
Kind regards,
Roger
nvidia-bug-report.log.gz (1.01 MB)