l4t-tensorflow:r32.4.3-tf2.2-py3 CUDA linker path problem

Hello,

In the l4t-tensorflow:r32.4.3-tf2.2-py3 container available at https://ngc.nvidia.com/catalog/containers/nvidia:l4t-tensorflow, the library path in /etc/ld.so.conf.d/nvidia.conf is specified as /usr/local/cuda-10.0/targets/aarch64-linux/lib when it should be /usr/local/cuda-10.2/targets/aarch64-linux/lib.

Updating that path and running ldconfig allows TensorFlow to properly load libcudart.so.10.2, libcublas.so.10, libcufft.so.10, libcurand.so.10, libcusolver.so.10, libcusparse.so.10, and libcudnn.so.8.

As shipped, TensorFlow uses the CPU. Fixing the path to the CUDA 10.2 libraries for dynamic loading gets it using the Nano’s GPU: for example, the Shakespeare RNN example from Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow takes about an hour on the Jetson Nano when tweaked for cuDNN usage, versus 12 hours on the CPU.
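
For anyone who wants to double-check, a quick sanity test from inside the container looks something like this (a minimal sketch, assuming TF 2.x; the library name is the CUDA 10.2 runtime the container ships):

import ctypes
import tensorflow as tf

# Raises OSError if the dynamic loader still cannot resolve the CUDA 10.2 runtime.
ctypes.CDLL("libcudart.so.10.2")

# An empty list here means TensorFlow has fallen back to the CPU.
print(tf.config.list_physical_devices("GPU"))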

HTH!

Hi,

Thanks for reporting this.

We are going to check this issue.
Will share more information with you later.

Thanks.


The GRU layers in that Jupyter notebook example need recurrent_dropout=0 to use the fast cuDNN kernel. With that change, the Jetson Nano completes the training more than twice as fast as the example output. Not sure if that’s a fair comparison though…
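
For reference, this is roughly what the cuDNN-eligible setup looks like in Keras (a sketch with illustrative layer and vocabulary sizes, not the book’s exact model):

import tensorflow as tf

# Keeping recurrent_dropout at 0 (the default) lets Keras dispatch the GRU
# layers to the fused cuDNN kernel; any non-zero value falls back to the
# much slower generic implementation.
model = tf.keras.models.Sequential([
    tf.keras.layers.GRU(128, return_sequences=True, recurrent_dropout=0,
                        input_shape=[None, 39]),  # 39 = illustrative vocab size
    tf.keras.layers.GRU(128, return_sequences=True, recurrent_dropout=0),
    tf.keras.layers.TimeDistributed(
        tf.keras.layers.Dense(39, activation="softmax")),
])
model.compile(loss="sparse_categorical_crossentropy", optimizer="adam")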

Hi,

Thanks for reporting this.

Confirmed that the path in /etc/ld.so.conf.d/nvidia.conf incorrectly points to CUDA 10.0.
We are checking this with our internal team and will keep you updated if we get any feedback.

However, we don’t see any issue when loading the CUDA libraries from TensorFlow.
libcudart.so.10.2 can be loaded without modifying the nvidia.conf file:

root@nvidia-desktop:/# python3
Python 3.6.9 (default, Apr 18 2020, 01:56:04) 
[GCC 8.4.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import tensorflow as tf
2020-08-17 04:35:26.598242: I tensorflow/stream_executor/platform/default/dso_loader.cc:48] Successfully opened dynamic library libcudart.so.10.2

Do you see any error when loading it with the default nvidia.conf?
If so, would you mind sharing the log with us?

By the way, we don’t recommend Jetson devices for training due to their limited storage and bandwidth.
Please also remember to maximize the device clocks as follows to get optimal performance:

$ sudo nvpmodel -m 0
$ sudo jetson_clocks

You can also monitor the GPU utilization with $ sudo tegrastats.
If the GR3D_FREQ ratio cannot reach 99%, you may be hitting an I/O limitation rather than a compute bound.

Thanks.


How odd, I can’t reproduce it now. It definitely happened, or else I wouldn’t have gone digging into the ld.so.conf files, but I can’t seem to make it happen again with either l4t-tensorflow or l4t-ml. Sorry to waste your time here.