TensorFlow wheel for JetPack 4.0!!

TensorFlow for JetPack 4.0 is updated!!

Python 2.7:
r1.10.1: Box

Python 3.6:
r1.10.1: Box

Thanks. :)

Is the DLA supported in this release?

Hi mechadeck, DLA is supported in JetPack 4.0, but I don't believe that TensorFlow has integrated DLA support yet.

Confirmed and installed last night for Python 3.6. Be patient; it will take a while to install.

AastaLL,

thanks a lot. It worked great on Xavier!

Here is a first TensorFlow test.

nvidia@jetson-0423418009922:/opt/ssd500/installation$ python3 testTensorflow.py
2018-09-22 19:23:40.420202: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:864] ARM64 does not support NUMA - returning NUMA node zero
2018-09-22 19:23:40.420432: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1405] Found device 0 with properties:
name: Xavier major: 7 minor: 2 memoryClockRate(GHz): 1.5
pciBusID: 0000:00:00.0
totalMemory: 15.46GiB freeMemory: 11.19GiB
2018-09-22 19:23:40.420552: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1484] Adding visible gpu devices: 0
2018-09-22 19:23:41.254301: I tensorflow/core/common_runtime/gpu/gpu_device.cc:965] Device interconnect StreamExecutor with strength 1 edge matrix:
2018-09-22 19:23:41.254481: I tensorflow/core/common_runtime/gpu/gpu_device.cc:971] 0
2018-09-22 19:23:41.254525: I tensorflow/core/common_runtime/gpu/gpu_device.cc:984] 0: N
2018-09-22 19:23:41.254826: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1097] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 4748 MB memory) -> physical GPU (device: 0, name: Xavier, pci bus id: 0000:00:00.0, compute capability: 7.2)

Ran 2 tests in 1.945s
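
For anyone who wants to run something similar, a minimal two-test sanity check looks like this (a sketch using the TF 1.x graph API; not necessarily identical to my testTensorflow.py):

import unittest
import tensorflow as tf  # TF 1.x graph API, as in the r1.10.1 wheel

class GpuSanityTest(unittest.TestCase):
    def test_gpu_visible(self):
        # The wheel should list the Xavier GPU as a local device.
        from tensorflow.python.client import device_lib
        types = [d.device_type for d in device_lib.list_local_devices()]
        self.assertIn('GPU', types)

    def test_matmul_on_gpu(self):
        # Run a small matrix multiply pinned to the GPU and check the result.
        with tf.device('/gpu:0'):
            a = tf.constant([[1.0, 2.0], [3.0, 4.0]])
            b = tf.constant([[1.0, 1.0], [0.0, 1.0]])
            c = tf.matmul(a, b)
        with tf.Session() as sess:
            self.assertEqual(sess.run(c).tolist(), [[1.0, 3.0], [3.0, 7.0]])

if __name__ == '__main__':
    unittest.main()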

I am noticing an error when training. I am using a VGGNet model implemented in Keras, training on the CIFAR-10 dataset for 40 epochs. It's about 50/50 whether the training completes, and I haven't yet isolated this to v1.10 of TensorFlow. The error I am getting is: tensorflow.python.framework.errors_impl.InvalidArgumentError: Input to reshape is a tensor with 'XX' values, but the requested shape has 'XX'.

The 'XX' differs each time. Sometimes the training completes fine without error, so I don't have a reproducible scenario. The complete output is:

File "SB15.03_vggnet_mini_cifar10.py", line 63, in <module>
  H = model.fit(trainX, trainY, validation_data=(testX, testY), batch_size=64, epochs=40, verbose=1)
File "/home/nvidia/.virtualenvs/dl4cv/lib/python3.6/site-packages/keras/engine/training.py", line 1037, in fit
  validation_steps=validation_steps)
File "/home/nvidia/.virtualenvs/dl4cv/lib/python3.6/site-packages/keras/engine/training_arrays.py", line 199, in fit_loop
  outs = f(ins_batch)
File "/home/nvidia/.virtualenvs/dl4cv/lib/python3.6/site-packages/keras/backend/tensorflow_backend.py", line 2666, in __call__
  return self._call(inputs)
File "/home/nvidia/.virtualenvs/dl4cv/lib/python3.6/site-packages/keras/backend/tensorflow_backend.py", line 2636, in _call
  fetched = self._callable_fn(*array_vals)
File "/home/nvidia/.virtualenvs/dl4cv/lib/python3.6/site-packages/tensorflow/python/client/session.py", line 1382, in __call__
  run_metadata_ptr)
File "/home/nvidia/.virtualenvs/dl4cv/lib/python3.6/site-packages/tensorflow/python/framework/errors_impl.py", line 519, in __exit__
  c_api.TF_GetCode(self.status.status))
tensorflow.python.framework.errors_impl.InvalidArgumentError: Input to reshape is a tensor with 64 values, but the requested shape has 978017632868082584
[[Node: training/SGD/gradients/loss/activation_6_loss/Sum_1_grad/Reshape = Reshape[T=DT_FLOAT, Tshape=DT_INT32, _class=["loc:@training/SGD/gradients/loss/activation_6_loss/Sum_1_grad/Tile"], _device="/job:localhost/replica:0/task:0/device:GPU:0"](training/SGD/gradients/loss/activation_6_loss/Neg_grad/Neg, training/SGD/gradients/loss/activation_6_loss/Sum_1_grad/DynamicStitch/_83)]]
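
For context, the failing call is essentially the standard Keras fit loop sketched below (the stand-in model here is deliberately much smaller than the MiniVGGNet-style architecture used in the actual script):

from keras.datasets import cifar10
from keras.models import Sequential
from keras.layers import Conv2D, MaxPooling2D, Flatten, Dense
from keras.optimizers import SGD
from keras.utils import to_categorical

# Load and normalize CIFAR-10.
(trainX, trainY), (testX, testY) = cifar10.load_data()
trainX = trainX.astype('float32') / 255.0
testX = testX.astype('float32') / 255.0
trainY = to_categorical(trainY, 10)
testY = to_categorical(testY, 10)

# Stand-in model; the real script uses a much deeper VGG-style network.
model = Sequential([
    Conv2D(32, (3, 3), padding='same', activation='relu', input_shape=(32, 32, 3)),
    MaxPooling2D((2, 2)),
    Flatten(),
    Dense(10, activation='softmax'),
])
model.compile(loss='categorical_crossentropy', optimizer=SGD(lr=0.01),
              metrics=['accuracy'])

# Line 63 in the traceback corresponds to this call.
H = model.fit(trainX, trainY, validation_data=(testX, testY),
              batch_size=64, epochs=40, verbose=1)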

Can you try TensorFlow 1.6?

Maybe :)

I used the following command line:

pip install --extra-index-url=https://developer.download.nvidia.com/compute/redist/jp40 tensorflow-gpu

@duclink.fetel: What version of TensorFlow does that install?

Great!

It installs the r1.10.1 wheel for Python 3.6 (the Box link above).
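
You can confirm the installed version from Python:

python3 -c "import tensorflow as tf; print(tf.__version__)"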

What CUDA version did you install, and where did you download the Xavier CUDA installation?

CUDA 10. It is part of JetPack 4.0.

https://developer.nvidia.com/embedded/jetpack-notes

Got it now. Since the Xavier comes pre-installed with an OS, I assumed we could install packages directly on the device (instead of from a host) this time, but it has to be installed from a host again, like the TX2.

It's odd that NVIDIA doesn't make the .deb packages (cuda-repo-l4t-10-0-local-10.0.117_1.0-1_arm64.deb, etc.) available for download and installation on the device itself, without a host.

Hi, AerialRoboticsGuru

It looks similar to this topic.
Could you check it first?

Thanks.

Hi, erwin.coumans

We will keep updating our OS version.
Flashing everything from JetPack frees users from dependency problems.

Thanks.

Thanks for the great work. I successfully installed TF 1.10 on the Jetson Xavier, but ran into a problem when comparing its performance with the TX2.

Both TensorFlow builds were installed via
pip install --extra-index-url=https://developer.download.nvidia.com/compute/redist/jp40 tensorflow-gpu for the Xavier
and
pip install --extra-index-url=https://developer.download.nvidia.com/compute/redist/jp33 tensorflow-gpu for the TX2.

I tested the TensorFlow MobileNet-SSD model via https://github.com/tensorflow/models/blob/master/research/object_detection/object_detection_tutorial.ipynb

The bad news is that in normal GPU mode, the average frame rate on the TX2 is around 3.5 FPS, while on the Xavier it is 1.74 FPS, only half the speed of the TX2.

I then tested VGG16 classification via 'Use pre-trained models' in

The bad news again is that in normal GPU mode, the average frame rate on the TX2 is around 0.76 FPS, while on the Xavier it is 0.15 FPS, only a fifth of the TX2's speed.

Is there anything wrong with the TensorFlow wheel, or with the system? Help needed, thanks a lot!
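
For reference, a rough way to time raw inference throughput is the sketch below; the frozen-graph path and tensor names are assumptions taken from the standard object-detection tutorial, not from my actual script:

import time
import numpy as np
import tensorflow as tf

# Assumed path and tensor names from the object-detection tutorial graphs.
GRAPH_PB = 'ssd_mobilenet_v1_coco/frozen_inference_graph.pb'

graph = tf.Graph()
with graph.as_default():
    graph_def = tf.GraphDef()
    with tf.gfile.GFile(GRAPH_PB, 'rb') as f:
        graph_def.ParseFromString(f.read())
    tf.import_graph_def(graph_def, name='')

image = np.random.randint(0, 255, size=(1, 300, 300, 3), dtype=np.uint8)
with tf.Session(graph=graph) as sess:
    inp = graph.get_tensor_by_name('image_tensor:0')
    out = graph.get_tensor_by_name('detection_boxes:0')
    sess.run(out, feed_dict={inp: image})  # warm-up; the first run includes initialization
    n = 50
    start = time.time()
    for _ in range(n):
        sess.run(out, feed_dict={inp: image})
    print('average FPS: %.2f' % (n / (time.time() - start)))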

Hi,

Have you maximized the system performance via these commands?

sudo nvpmodel -m 0
sudo ./jetson_clocks.sh
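
You can check the current power mode with:

sudo nvpmodel -q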

Also, if possible, it's recommended to convert the TensorFlow model into a TensorRT PLAN first.
TensorRT will optimize the implementation based on the GPU architecture.
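
One route is the TF-TRT integration (tensorflow.contrib.tensorrt), assuming it is built into these wheels; instead of a standalone PLAN file, it replaces supported subgraphs with TensorRT engines in place. A rough sketch, where the frozen-graph path and output node names are illustrative assumptions:

import tensorflow as tf
import tensorflow.contrib.tensorrt as trt  # TF-TRT integration

# Load a frozen TensorFlow graph (path is an assumption).
graph_def = tf.GraphDef()
with tf.gfile.GFile('frozen_inference_graph.pb', 'rb') as f:
    graph_def.ParseFromString(f.read())

# Replace supported subgraphs with TensorRT engines.
trt_graph = trt.create_inference_graph(
    input_graph_def=graph_def,
    outputs=['detection_boxes', 'detection_scores', 'num_detections'],
    max_batch_size=1,
    max_workspace_size_bytes=1 << 30,  # 1 GB workspace for TensorRT
    precision_mode='FP16')             # Xavier has fast FP16 support

with tf.gfile.GFile('trt_graph.pb', 'wb') as f:
    f.write(trt_graph.SerializeToString())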

Thanks.