./deviceQuery showing FAIL on a p2.xlarge (GPU instance) on AWS

shajansajani94 · April 17, 2019, 5:50am

I am using an AWS instance (p2.xlarge) for my GPU experiments. I downloaded the CUDA installer using
‘wget https://developer.nvidia.com/compute/cuda/9.0/Prod/local_installers/cuda_9.0.176_384.81_linux-run
chmod +x cuda_9.0.176_384.81_linux-run’
and was able to install the driver and work with it successfully. However, after I stopped the AWS instance and restarted it the next day, ./deviceQuery gives the following error:
./deviceQuery Starting...

CUDA Device Query (Runtime API) version (CUDART static linking)

cudaGetDeviceCount returned 30
-> unknown error
Result = FAIL

and I am not able to use the GPU.

Below is the output of ‘nvcc --version’:

‘nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2017 NVIDIA Corporation
Built on Fri_Sep__1_21:08:03_CDT_2017
Cuda compilation tools, release 9.0, V9.0.176’

Below is the output of ‘nvidia-smi’:

‘NVIDIA-SMI has failed because it couldn’t communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.’

Please help me solve this issue.

Robert_Crovella · April 17, 2019, 9:30am

My guess would be that a kernel update of some sort was applied when you stopped and restarted the instance, which broke the driver install.
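
One way to test this guess is to check whether an NVIDIA kernel module exists for the kernel that is currently running. The following is only a rough sketch, not a definitive diagnostic. It assumes kernel modules live under /lib/modules/<kernel release> and that the driver module file is named nvidia.ko (possibly compressed); some distribution packages use a different file name. The function name nvidia_module_present is made up for illustration.

```shell
#!/bin/sh
# Hypothetical helper (not part of any NVIDIA tool): succeeds if a file
# named nvidia.ko (or nvidia.ko.xz, etc.) exists for the given kernel
# release under the given modules root (default: /lib/modules).
nvidia_module_present() {
    release="$1"
    root="${2:-/lib/modules}"
    # No module directory for this kernel at all: nothing to find.
    [ -d "$root/$release" ] || return 1
    # Non-empty find output means at least one matching module file.
    [ -n "$(find "$root/$release" -name 'nvidia.ko*' 2>/dev/null)" ]
}

# Compare against the kernel that is running right now.
running="$(uname -r)"
if nvidia_module_present "$running"; then
    echo "nvidia module found for running kernel $running"
else
    echo "no nvidia module found for running kernel $running"
fi
```

If this reports that no module was found for the running kernel, that would be consistent with a kernel update having left the driver built only for the old kernel. In that case, re-running the driver installer on the new kernel (with the matching kernel headers installed) is the usual way to rebuild the module.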
