Hello
I am trying to install Cuda 10.0 on a new machine (ubuntu 18.04, RTX 2070), following those steps:
sudo apt-get purge nvidia*
sudo apt-get autoremove
sudo apt-get autoclean
sudo rm -rf /usr/local/cuda*
sudo apt-key adv --fetch-keys http://developer.download.nvidia.com/compute/cuda/repos/ubuntu1804/x86_64/7fa2af80.pub
echo "deb https://developer.download.nvidia.com/compute/cuda/repos/ubuntu1804/x86_64 /" | sudo tee /etc/apt/sources.list.d/cuda.list
sudo apt-get update
sudo apt-get -o Dpkg::Options::="--force-overwrite" install cuda-10-0 cuda-drivers
echo 'export PATH=/usr/local/cuda-10.0/bin${PATH:+:${PATH}}' >> ~/.bashrc
echo 'export LD_LIBRARY_PATH=/usr/local/cuda-10.0/lib64${LD_LIBRARY_PATH:+:${LD_LIBRARY_PATH}}' >> ~/.bashrc
Output:
nvidia-smi
±----------------------------------------------------------------------------+
| NVIDIA-SMI 418.56 Driver Version: 418.56 CUDA Version: 10.1 |
|-------------------------------±---------------------±---------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
|===============================+======================+======================|
| 0 GeForce RTX 2070 Off | 00000000:02:00.0 On | N/A |
| 0% 40C P8 14W / 175W | 334MiB / 7949MiB | 1% Default |
±------------------------------±---------------------±---------------------+
±----------------------------------------------------------------------------+
| Processes: GPU Memory |
| GPU PID Type Process name Usage |
|=============================================================================|
| 0 1693 G /usr/lib/xorg/Xorg 198MiB |
| 0 2397 G /usr/bin/gnome-shell 134MiB |
±----------------------------------------------------------------------------+
Output:
nvcc -V
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2018 NVIDIA Corporation
Built on Sat_Aug_25_21:08:01_CDT_2018
Cuda compilation tools, release 10.0, V10.0.130
It looks like everything is running as I was able to run the samples well, but when I try to run tensorflow on my GPU, it can’t recognise any GPU on my system (using tensorflow/tensorflow:latest-gpu-py3-jupyter DOCKER image that only requieres NVIDIA Drivers installed)
from tensorflow.python.client import device_lib
print(device_lib.list_local_devices())
outputs:
[name: "/device:CPU:0"
device_type: "CPU"
memory_limit: 268435456
locality {
}
incarnation: 10192142723765318570
, name: "/device:XLA_CPU:0"
device_type: "XLA_CPU"
memory_limit: 17179869184
locality {
}
incarnation: 8651583191431757218
physical_device_desc: "device: XLA_CPU device"
]
I never had this problem before and I think something is making conflict with CUDA 10.1 (tensorflow only supports CUDA 10.0)
Is it possible to avoid CUDA 10.1 installation?
418.56 Diver needed for this RTX2070 automatically installs CUDA 10.1
I think the problem that I have with tensorflow comes from here.
Regards.