When testing gpu-burn on a Linux kernel, I keep getting segmentation faults.
XXX:~/gpu-burn$ ./gpu_burn 60 GPU 0: GeForce GTX 1070 (UUID: GPU-8c57e0f7-03ca-bd20-fe5e-b25482e4ed9b) Segmentation fault XXX:~/gpu-burn$ Initialized device 0 with 8192 MB of memory (7203 MB available, using 6482 MB of it), using FLOATS
When running Tensorflow code, I keep getting illegal memory address access errors.
See: Unexpected Events CUDA_ERROR_ILLEGAL_ADDRESS and CUDA_ERROR_LAUNCH_FAILED · Issue #46247 · tensorflow/tensorflow · GitHub
People on the Tensorflow Discord team say that this not a Tensorflow issue since I get similar errors when running non-Tensorflow code such as gpu-burn. NVIDIA support said that my GPU was not faulty. This leaves CUDA as the cause of the issue. What can be done?