Hello I’m trying to install tinycuda in Jetson TX2 and having errors.
I think there are some memory errors but can’t find why and how to fix it.
(python3.6) nvidia@jetson-desktop:~/project/tiny-cuda-nn$ ./build/mlp_learning_an_image data/images/albert.jpg data/config_hash.json
Loading custom json config ‘data/config_hash.json’.
tiny-cuda-nn warning: FullyFusedMLP is not supported for the selected architecture 62. Falling back to CutlassMLP. For maximum performance, raise the target GPU architecture to 75+.
tiny-cuda-nn warning: FullyFusedMLP is not supported for the selected architecture 62. Falling back to CutlassMLP. For maximum performance, raise the target GPU architecture to 75+.
Beginning optimization with 10000000 training steps.
tiny-cuda-nn warning: GPUMemoryArena: GPU 0 does not support virtual memory. Falling back to regular allocations, which will be larger and can cause occasional stutter.
terminate called after throwing an instance of ‘std::runtime_error’
** what(): /home/nvidia/project/tiny-cuda-nn/include/tiny-cuda-nn/cuda_graph.h:99 cudaStreamEndCapture(stream, &m_graph) failed: operation failed due to a previous error during capture**
Aborted
[with cuda-memcheck]
(python3.6) nvidia@jetson-desktop:~/project/tiny-cuda-nn$ cuda-memcheck ./build/mlp_learning_an_image data/images/albert.jpg data/config_hash.json
========= Saved host backtrace up to driver entry point at error
========= Host Frame:/usr/lib/aarch64-linux-gnu/tegra/libcuda.so.1 (cuMemGetAllocationGranularity + 0x16c) [0x1ee1cc]
========= Host Frame:./build/mlp_learning_an_image [0x735bc]
========= Host Frame:./build/mlp_learning_an_image [0x409d0]
========= Host Frame:./build/mlp_learning_an_image [0x40da4]
========= Host Frame:./build/mlp_learning_an_image [0x28cc4]
========= Host Frame:/lib/aarch64-linux-gnu/libc.so.6 (__libc_start_main + 0xe0) [0x207a0]
========= Host Frame:./build/mlp_learning_an_image [0x35830]
========= Program hit cudaErrorStreamCaptureUnsupported (error 900) due to “operation not permitted when stream is capturing” on CUDA API call to cudaDeviceSynchronize.
========= Saved host backtrace up to driver entry point at error
========= Host Frame:/usr/lib/aarch64-linux-gnu/tegra/libcuda.so.1 [0x32081c]
========= Host Frame:./build/mlp_learning_an_image [0x2560ec]
========= Host Frame:./build/mlp_learning_an_image [0x3fe50]
========= Host Frame:./build/mlp_learning_an_image [0x40758]
========= Host Frame:./build/mlp_learning_an_image [0x40da4]
========= Host Frame:./build/mlp_learning_an_image [0x28cc4]
========= Host Frame:/lib/aarch64-linux-gnu/libc.so.6 (__libc_start_main + 0xe0) [0x207a0]
========= Host Frame:./build/mlp_learning_an_image [0x35830]
========= Program hit cudaErrorStreamCaptureInvalidated (error 901) due to “operation failed due to a previous error during capture” on CUDA API call to cudaStreamEndCapture.
========= Saved host backtrace up to driver entry point at error
========= Host Frame:/usr/lib/aarch64-linux-gnu/tegra/libcuda.so.1 [0x32081c]
========= Host Frame:./build/mlp_learning_an_image [0x26e374]
========= Host Frame:./build/mlp_learning_an_image [0x3d168]
========= Host Frame:./build/mlp_learning_an_image [0x2ab08]
========= Host Frame:/lib/aarch64-linux-gnu/libc.so.6 (__libc_start_main + 0xe0) [0x207a0]
========= Host Frame:./build/mlp_learning_an_image [0x35830]
terminate called after throwing an instance of ‘=========
std::runtime_error’
what(): /home/nvidia/project/tiny-cuda-nn/include/tiny-cuda-nn/cuda_graph.h:99 cudaStreamEndCapture(stream, &m_graph) failed: operation failed due to a previous error during capture
========= Error: process didn’t terminate successfully
========= No CUDA-MEMCHECK results found
- NVIDIA GPU or System : JETSON TX2
- NVIDIA Software Version : Jetpack 4.6.4, python 3.6.9, cuda10.2
*OS Ubuntu 18.04.6 - Other Details
I’m trying to run tinycuda and had error in cudaStreamEndCapture etc… with runtime error or just killed.
Could you give me some idea why cudaMalloc error should be triggered?
Thank you in advance