Tensorflow python module takes too much time to give result on a first start

mernov · April 14, 2022, 4:10pm

Hi, i have TF 2.4.1 python3.8 module, compiled from sources. The trouble is that, when i do import tensorflow for the first time , after i open #python3 cli, and give it a simple things to calc, it takes >10m to receive the results. But after it, it does calculations pretty fast. I’ll show you on a screenshots:

As you can see the time after i post “a = tf.constant([[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]])” is 14:18 - 14:47
It’s quite a bit time.

The second pass of the same code with a different values takes no time:

And if i do it on a host system, it aslso does good:

AastaLLL · April 15, 2022, 3:20am

Hi,

Do you compile the package with Nano GPU architecture (sm_53)?
If not, some GPU files need to recompile with the correct architecture when initialization.

Thanks.

mernov · April 15, 2022, 10:57pm

I compiled it with bazel and --aarch64 only. What’s the correct way ?

bazel --host_jvm_args=-Xmx32768m build --config=opt --config=noaws --config=nogcp --config=nohdfs --config=nonccl --config=monolithic --config=cuda --config=v2 --local_cpu_resources=32 -j 32 --define=tflite_pip_with_flex=true --copt=-ftree-vectorize --copt=-funsafe-math-optimizations --copt=-ftree-loop-vectorize --copt=-fomit-frame-pointer --subcommands //tensorflow/tools/pip_package:build_pip_package

AastaLLL · April 18, 2022, 2:53am

Hi,

Please check the below repository for an example to compile TensorFlow with a specified GPU architecture:

github.com

jkjung-avt/jetson_nano/blob/master/install_tensorflow-2.3.0.sh#L64


      
          fi
          tar xzvf tensorflow-2.3.0.tar.gz
          cd tensorflow-2.3.0
          
          
patch -N -p1 < $patch_path && echo "tensorflow-2.3.0 source tree appears to be patched already.  Continue..."
          
          
echo "** Configure and build tensorflow-2.3.0"
          export TMP=/tmp
          PYTHON_BIN_PATH=$(which python3) \
          PYTHON_LIB_PATH=$(python3 -c 'import site; print(site.getsitepackages()[0])') \
          TF_CUDA_COMPUTE_CAPABILITIES=${cuda_compute} \
          TF_CUDA_VERSION=10.2 \
          TF_CUDA_CLANG=0 \
          TF_CUDNN_VERSION=8 \
          TF_TENSORRT_VERSION=${trt_version} \
          CUDA_TOOLKIT_PATH=/usr/local/cuda \
          CUDNN_INSTALL_PATH=/usr/lib/aarch64-linux-gnu \
          TENSORRT_INSTALL_PATH=/usr/lib/aarch64-linux-gnu \
          TF_NEED_IGNITE=0 \
          TF_ENABLE_XLA=0 \
          TF_NEED_OPENCL_SYCL=0 \

Thanks.

system · May 11, 2022, 3:47am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.