Hi all,
First time poster here. I hope I have all the information you need. I am debugging this problem for one of our professors.
The machine in question is a Dell Precision T3600 with the latest BIOS, A14 09/29/2014.
64GB memory
GeForce GTX TITAN X
1TB SSD
Ubuntu 14.04.3
Cuda 7.5
Driver Version: 352.39
MATLAB 2015A
Driver and CUDA installed via NVIDIA’s CUDA repository.
CUDA Toolkit 11.7 Update 1 Downloads | NVIDIA Developer
dpkg -i cuda-repo-ubuntu1404_7.5-18_amd64.deb
apt-get update
apt-get install cuda
When we run the sample MATLAB GPU code below on the ‘GeForce GTX TITAN X’, the machine reboots. Black screen reboot, no warning!
sample MATLAB code
----snip----
n = 13000;
clz = ‘single’;
A = gpuArray.rand(n, n, clz) + 100*eye(n, n, clz);
b = gpuArray.rand(n, 1, clz);
x = A\b;
----snip-----
The same code on the same machine with the same driver with a ‘Quadro 4000’ reports a out of memory error. (see below)
—snip—
Out of memory on device. To view more detail about available memory on the GPU, use ‘gpuDevice()’. If
the problem persists, reset the GPU by calling ‘gpuDevice(1)’.
Error in test (line 3)
A = gpuArray.rand(n, n, clz) + 100*eye(n, n, clz);
----snip-----
If I remove the Q4000 and re-install the TITAN X and change ‘n’ in the sample code to 7000, it works no reboot. After more testing I was able to discover the following.
n = 7000 - works
n = 8000 - works
n = 9000 - works
n = 10000 - reboot
n =13000 - reboot
So what is causing the reboot?? Should I be getting a out of memory error or some other error instead of a reboot!! Any suggestions?
nvidia-bug-report.log.gz (71.1 KB)