Hi AastaLLL,
I tried but it doesn’t work.
I also realized the similar unknown error appeared randomly with Cupy. The Cupy worked fine after I rebooted the system. However, once I failed with torch, the Cupy also failed with the same error.
$ sudo python3
[sudo] password for nvidia:
Python 3.5.2 (default, Nov 12 2018, 13:43:14)
[GCC 5.4.0 20160609] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import torch
>>> a = torch.cuda.FloatTensor(2).zero_()
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
RuntimeError: CUDA error: unknown error
>>> import cupy
>>> x_gpu = cupy.array([1, 2, 3])
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "/usr/local/lib/python3.5/dist-packages/cupy/creation/from_data.py", line 41, in array
return core.array(obj, dtype, copy, order, subok, ndmin)
File "cupy/core/core.pyx", line 2350, in cupy.core.core.array
File "cupy/core/core.pyx", line 2384, in cupy.core.core.array
File "cupy/core/core.pyx", line 151, in cupy.core.core.ndarray.__init__
File "cupy/cuda/memory.pyx", line 517, in cupy.cuda.memory.alloc
File "cupy/cuda/memory.pyx", line 1065, in cupy.cuda.memory.MemoryPool.malloc
File "cupy/cuda/memory.pyx", line 1086, in cupy.cuda.memory.MemoryPool.malloc
File "cupy/cuda/memory.pyx", line 900, in cupy.cuda.memory.SingleDeviceMemoryPool.malloc
File "cupy/cuda/memory.pyx", line 921, in cupy.cuda.memory.SingleDeviceMemoryPool._malloc
File "cupy/cuda/memory.pyx", line 680, in cupy.cuda.memory._try_malloc
File "cupy/cuda/memory.pyx", line 677, in cupy.cuda.memory._try_malloc
File "cupy/cuda/memory.pyx", line 869, in cupy.cuda.memory.SingleDeviceMemoryPool._alloc
File "cupy/cuda/memory.pyx", line 472, in cupy.cuda.memory._malloc
File "cupy/cuda/memory.pyx", line 473, in cupy.cuda.memory._malloc
File "cupy/cuda/memory.pyx", line 77, in cupy.cuda.memory.Memory.__init__
File "cupy/cuda/runtime.pyx", line 213, in cupy.cuda.runtime.malloc
File "cupy/cuda/runtime.pyx", line 136, in cupy.cuda.runtime.check_status
cupy.cuda.runtime.CUDARuntimeError: cudaErrorUnknown: unknown error