Hi - is anyone else noticing that some of the torch7 tests get killed on the TX1?
Here is the cudnn.torch benchmark (in the test directory from cudnn.torch):
@tegra-ubuntu:~/code/cudnn.torch/test$ th benchmark.lua
CUDNN Version: 5005
cudnn.SpatialConvolution
Forward AutoTuned : 14 13 15 9 48 18 29 0.0085029602050781
Forward implicit gemm : 14 13 15 9 48 18 29 0.015181064605713
Forward implicit precomp gemm: 14 13 15 9 48 18 29 0.004382848739624
Forward gemm : 14 13 15 9 48 18 29 0.018018007278442
Forward FFT : 14 13 15 9 48 18 29 0.0088889598846436
Forward FFT tiling : 14 13 15 9 48 18 29 0.0045740604400635
cudnn.VolumetricConvolution
Killed
tegra-ubuntu:~/code/cudnn.torch/test$ luajit -l cutorch -e 'cutorch.test()'
seed: 1472255993
Running 157 tests
...
125/157 cdiv3 ........................................................... [PASS]
126/157 add ............................................................. [PASS]
127/157 log1 ............................................................ [PASS]
128/157 cpow ............................................................ [PASS]
129/157 sort ............................................................ [WAIT]Killed
I thought it might be because of some kind of watchdog timer in X, as described in the Stack Overflow question "CUDA Visual Profiler 'Interactive' X config option?".
I changed xorg.conf to include the line Option "Interactive" "0" and restarted the TX1, but I don't see any difference. Is the GPU on the TX1 just running out of memory? Or is something explicitly killing the processes?
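One way to tell those two cases apart might be to check the kernel log right after a run is killed, since the Linux OOM killer normally records the process it terminates there. This is only a minimal sketch: the helper name find_oom_kills is made up for illustration, and the patterns assume the usual OOM-killer wording, which can differ between kernel versions.

```shell
# Hypothetical helper: filter kernel log text for typical OOM-killer entries.
# Reads log text on stdin and prints only the matching lines.
# The patterns are an assumption about the usual kernel wording.
find_oom_kills() {
    grep -i -E 'out of memory|oom-killer|killed process'
}

# Usage on the TX1 right after a test prints "Killed"
# (reading the kernel log may require sudo on some setups):
#   dmesg | find_oom_kills
#   free -m    # the TX1's CPU and GPU share the same physical RAM
```

If nothing shows up there, memory exhaustion becomes less likely and something else is probably sending the kill signal.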
Thanks!