Failed to run TensorFlow 1.5 on TX2

liu.stover · February 7, 2018, 10:40am

Hi,

Today I compiled TensorFlow v1.5 on the TX2.
I tried to run AlexNet (npy), but it failed with the following message.

<dataset.ImageProducer object at 0x7f43bd7f90>
2018-02-07 10:29:46.851157: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:866] ARM has no NUMA node, hardcoding to return zero
2018-02-07 10:29:46.851333: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1105] Found device 0 with properties:
name: NVIDIA Tegra X2 major: 6 minor: 2 memoryClockRate(GHz): 1.3005
pciBusID: 0000:00:00.0
totalMemory: 7.66GiB freeMemory: 5.91GiB
2018-02-07 10:29:46.851394: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1195] Creating TensorFlow device (/device:GPU:0) -> (device: 0, name: NVIDIA Tegra X2, pci bus id: 0000:00:00.0, compute capability: 6.2)
start -----------------------------------------------------------------2018-02-07 10:29:50.875593
start------------run enqueue_paths_op------------
2018-02-07 10:29:51.241231: E tensorflow/stream_executor/cuda/cuda_driver.cc:967] failed to alloc 1048576 bytes on host: CUDA_ERROR_UNKNOWN
2018-02-07 10:29:51.241341: W ./tensorflow/core/common_runtime/gpu/pool_allocator.h:195] could not allocate pinned host memory of size: 1048576

Thanks.

AastaLLL · February 8, 2018, 3:05am

Hi,

Our current guess is that this error is caused by running out of memory, but we are still checking it in detail.
To help us find the root cause, could you try the following experiments?

1. Run AlexNet in TF CPU mode to check whether an out-of-resource error occurs.
2. Run an MNIST model in GPU mode to check whether an out-of-resource error occurs.

Thanks.
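
One possible way to try experiment 1 without rebuilding TensorFlow is to hide the GPU from the process before TensorFlow is imported. The sketch below is only an illustration: the helper name force_cpu_only is hypothetical, and it assumes the standard CUDA_VISIBLE_DEVICES environment variable is honored on the TX2 in the same way as on desktop GPUs.

```python
import os


def force_cpu_only():
    """Hide all CUDA devices from this process (hypothetical helper).

    Assumption: the CUDA runtime on the TX2 honors CUDA_VISIBLE_DEVICES.
    This must run before TensorFlow is imported to take effect.
    """
    os.environ["CUDA_VISIBLE_DEVICES"] = ""


force_cpu_only()

# Import TensorFlow only after the variable is set, then run the
# AlexNet script as usual:
# import tensorflow as tf
```

If the startup log no longer prints the "Found device 0 with properties" lines, the model is running on the CPU only.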

liu.stover · February 8, 2018, 8:35am

Hi,

1. AlexNet runs fine in TF CPU mode.

2. MNIST runs fine in TF GPU mode.

Thanks

AastaLLL · February 12, 2018, 6:57am

Hi,

Since MNIST runs fine in your environment, we guess that your error is caused by running out of memory.
Could you monitor the TX2 memory usage with tegrastats and share it with us?

sudo ~/tegrastats

Thanks.
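
To make the tegrastats output easier to share, the memory field can be pulled out of each line with a short script. This is a minimal sketch, assuming each line contains a field of the form "RAM <used>/<total>MB"; the function name parse_ram and the sample line are made up for illustration and are not taken from this thread.

```python
import re

# Assumed field format in a tegrastats line: "RAM <used>/<total>MB".
RAM_PATTERN = re.compile(r"RAM (\d+)/(\d+)MB")


def parse_ram(line):
    """Return (used_mb, total_mb) from one tegrastats line, or None."""
    match = RAM_PATTERN.search(line)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


# Hypothetical sample line, not real output from the poster's board.
sample = "RAM 5120/7851MB (lfb 20x4MB) cpu [0%@345,off,off,0%@345,0%@345,0%@345]"

used_mb, total_mb = parse_ram(sample)
print("used %d MB of %d MB (%.1f%%)" % (used_mb, total_mb, 100.0 * used_mb / total_mb))
```

Running this over the lines logged while AlexNet starts up would show whether used memory approaches the total just before the "failed to alloc" message appears.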
