Problem trying to install MXNet and GluonCV on Jetson Nano

lews_therin · June 18, 2021, 9:55am

Hi there!
I just got a Jetson Nano, and flashed it by using the jetson-nano-jp451-sd-card-image file and Etcher.
I need to install MxNet and GluonCV, so I did the following:

ATTEMPT #1
I tried installing MXNet by using the compiled package, as described here

Installed dependencies

 sudo apt-get install -y git build-essential libatlas-base-dev libopencv-dev graphviz python3-pip

and added:

export PATH=/usr/local/cuda/bin${PATH:+:${PATH}}

to my .bashrc file (before that, I couldn’t use nvcc --version)

I saw that most of the instructions suggest installing stuff by using ‘sudo pip install…’, but I prefer to keep stuff in virtual environments, so I adapted the instructions accordingly.
I created a venv with

python3 -m venv gluon
source envs/gluon/bin/activate

I downloaded the wheel file: mxnet-1.6.0-py3-none-any.whl and ran

pip3 install cython
pip3 install mxnet-1.6.0-py3-none-any.whl

got the output:

Failed to build numpy
Installing collected packages: urllib3, chardet, certifi, idna, requests, numpy, mxnet
  Running setup.py install for numpy ... done

I ran pip list, and I got:

certifi (2021.5.30)
chardet (4.0.0)
Cython (0.29.23)
graphviz (0.8.4)
idna (2.10)
mxnet (1.6.0)
numpy (1.19.5)
pip (9.0.1)
pkg-resources (0.0.0)
requests (2.25.1)
setuptools (39.0.1)
urllib3 (1.26.5)

So, mxnet is installed, but it seems it’s not version supporting the GPU (it should be something like mxnet-cu102)
When trying to import mxnet, moreover, I got the error:

	Traceback (most recent call last):
	  File "<stdin>", line 1, in <module>
	  File "/home/lews/envs/gluon/lib/python3.6/site-packages/mxnet/__init__.py", line 24, in <module>
	    from .context import Context, current_context, cpu, gpu, cpu_pinned
	  File "/home/lews/envs/gluon/lib/python3.6/site-packages/mxnet/context.py", line 24, in <module>
	    from .base import classproperty, with_metaclass, _MXClassPropertyMetaClass
	  File "/home/lews/envs/gluon/lib/python3.6/site-packages/mxnet/base.py", line 214, in <module>
	    _LIB = _load_lib()
	  File "/home/lews/envs/gluon/lib/python3.6/site-packages/mxnet/base.py", line 204, in _load_lib
	    lib_path = libinfo.find_lib_path()
	  File "/home/lews/envs/gluon/lib/python3.6/site-packages/mxnet/libinfo.py", line 74, in find_lib_path
	    'List of candidates:\n' + str('\n'.join(dll_path)))
	RuntimeError: Cannot find the MXNet library.
	List of candidates:
	/home/lews/envs/gluon/lib/python3.6/site-packages/mxnet/libmxnet.so
	/home/lews/envs/gluon/lib/python3.6/site-packages/mxnet/../../lib/libmxnet.so
	/home/lews/envs/gluon/lib/python3.6/site-packages/mxnet/../../build/libmxnet.so
	../../../libmxnet.so

Turned out that the file libmxnet.so was created, but in this other folder:

/home/lews/envs/gluon/mxnet

I added

export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/home/lews/envs/gluon/mxnet

to my to my .bashrc file, and now whenever I open a terminal session it shows the error:

bash: ::/home/lews/envs/gluon/mxnet: No such file or directory

However, in spite of the message it seems to be finding the file, because now I’m getting another error when importing mxnet:

	Traceback (most recent call last):
	  File "<stdin>", line 1, in <module>
	  File "/home/lews/envs/gluon/lib/python3.6/site-packages/mxnet/__init__.py", line 24, in <module>
	    from .context import Context, current_context, cpu, gpu, cpu_pinned
	  File "/home/lews/envs/gluon/lib/python3.6/site-packages/mxnet/context.py", line 24, in <module>
	    from .base import classproperty, with_metaclass, _MXClassPropertyMetaClass
	  File "/home/lews/envs/gluon/lib/python3.6/site-packages/mxnet/base.py", line 214, in <module>
	    _LIB = _load_lib()
	  File "/home/lews/envs/gluon/lib/python3.6/site-packages/mxnet/base.py", line 205, in _load_lib
	    lib = ctypes.CDLL(lib_path[0], ctypes.RTLD_LOCAL)
	  File "/usr/lib/python3.6/ctypes/__init__.py", line 348, in __init__
	    self._handle = _dlopen(self._name, mode)
	OSError: libcudart.so.10.0: cannot open shared object file: No such file or directory

It seems to be looking for CUDA 10.0, while I have 10.2:

	nvcc: NVIDIA (R) Cuda compiler driver
	Copyright (c) 2005-2019 NVIDIA Corporation
	Built on Wed_Oct_23_21:14:42_PDT_2019
	Cuda compilation tools, release 10.2, V10.2.89

PS: I tried all the steps described in the post even WITHOUT using a virtual environment, and I’m having the same issues.

ATTEMPT #2

This time, I tried building from source, following this procedure.
I didn’t flash the SD card clean yet (I’ll try to do that and to start Attempt #2 from scratch), but I’m having two issues with this method as well:

When building directly on the Nano, it’s painfully slow. I saw you can cross-compile it on a PC and…somehow transfer the compiled library to the Nano? But I didn’t find any tutorial for that.
I’m getting the following error:

In file included from src/io/image_aug_default.cc:31:0:
src/io/./image_augmenter.h:31:10: fatal error: opencv2/opencv.hpp: No such file or directory
 #include <opencv2/opencv.hpp>
          ^~~~~~~~~~~~~~~~~~~~
compilation terminated.
Makefile:461: recipe for target 'build/src/io/image_aug_default.o' failed
make: *** [build/src/io/image_aug_default.o] Error 1
make: *** Waiting for unfinished jobs....

AastaLLL · June 21, 2021, 2:55am

Hi,

The prebuilt package is for JetPack 4.3.
Since your environment is JetPack4.5.1, please build it from source.

To build MXNet, please refer to below comment:

And it’s recommended to the job on a clear environment.

Thanks.

lews_therin · June 23, 2021, 6:19pm

I tried following the procedure you linked, but I got the error

CMakeFiles/Makefile2:889: recipe for target 'CMakeFiles/mxnet_static.dir/all' failed
make[1]: *** [CMakeFiles/mxnet_static.dir/all] Error 2
Makefile:140: recipe for target 'all' failed
make: *** [all] Error 2
+ pushd .
~ ~
+ PYTHON_DIR=/home/lews/mxnet/python
+ BUILD_DIR=/home/lews/mxnet/build
+ cd /home/lews/mxnet/python
+ python3 setup.py bdist_wheel
Traceback (most recent call last):
  File "setup.py", line 47, in <module>
    LIB_PATH = libinfo['find_lib_path']()
  File "mxnet/libinfo.py", line 73, in find_lib_path
    'List of candidates:\n' + str('\n'.join(dll_path)))
RuntimeError: Cannot find the MXNet library.
List of candidates:
/home/lews/mxnet/python/mxnet/libmxnet.so
/home/lews/mxnet/python/mxnet/../../lib/libmxnet.so
/home/lews/mxnet/python/mxnet/../../build/libmxnet.so
../../../libmxnet.so

I tried looking for the ‘libmxnet.so’ file, but it seems it doesn’t exist

lews_therin · June 29, 2021, 9:38am

I noticed now there might be a typo at:

    -DCMAKE_CXX_FLAGS=-I/usr/local/cuda/targets/aarch64-linux/inlude  \

I assume it should be ‘include’?

lews_therin · June 30, 2021, 2:50pm

It seems the mxnet installation was successful after fixing the typo I mentioned above.
However, I still have 2 issues:

I tried importing mxnet and running a few simple tests, and the memory consumption when creating arrays using the gpu context seems too high. Specifically, I tried something very simple, like

test_cpu = mx.nd.ones((5,5))
test_gpu = mx.nd.ones((5,5), ctx=mx.gpu(0))

the creation of the first array is immediate, while the execution of the second instruction takes a few minutes, and basically saturates the device RAM.

I wanted to install GluonCV, so I tried building it manually by:
git clone GitHub - dmlc/gluon-cv: Gluon CV Toolkit
cd gluon-cv && python setup.py install — user

but I got the following error:
RuntimeError: Python version >= 3.7 required

I’ll try reinstalling a later Python version (I was assuming it should be taken care of by the autobuild script), but I don’t know how to fix the first issue

AastaLLL · July 6, 2021, 6:36am

Hi,

Since Nano’s GPU is limited, the slowness may cause by the shortage of memory.
Could you try it on a flash reboot environment to see if any difference?

Thanks.

lews_therin · July 7, 2021, 3:01pm

Hi!
Are there any instructions on how to cross-compile it? I’d be happy to flash the card and try as many times as necessary, but on the Nano the compilation using the autobuild_mxnet.sh takes something between 6-8 hours :(

EDIT:

Even better, this is the script I’m using to run object detection with GluonCV:

import time
import numpy as np
import cv2
import gluoncv as gcv
import mxnet as mx


def main():

	ctx = mx.gpu(0)


	## load a pretrained model
	net = gcv.model_zoo.get_model('ssd_512_mobilenet1.0_coco', pretrained=True, ctx=ctx)
	net.hybridize()

	## open video file
	cap = cv2.VideoCapture("test_video_files/vlc_test.avi")
	count_frame = 0

	while(True):
		print(f"Frame: {count_frame}")
		total_t_frame = 0

		## load frame from the camera
		ret, frame_np_orig = cap.read()
		if not ret:
		 	break
		frame_np_orig = cv2.resize(frame_np_orig,(683, 512))
		key = cv2.waitKey(1)
		if (key == ord('q')):
			break

		# Image pre-processing
		frame_nd_orig = mx.nd.array(cv2.cvtColor(frame_np_orig, cv2.COLOR_BGR2RGB)).astype('uint8')
		frame_nd_new, frame_np_new = gcv.data.transforms.presets.ssd.transform_test(frame_nd_orig, short=512, max_size=700)

		## measure inference time per frame
		start_t = time.time()
		frame_nd_new = frame_nd_new.as_in_context(ctx)
		class_IDs, scores, bboxes = net(frame_nd_new)
		if isinstance(class_IDs, mx.ndarray.ndarray.NDArray):
			class_IDs.wait_to_read()
		if isinstance(scores, mx.ndarray.ndarray.NDArray):
			scores.wait_to_read()
		if isinstance(bboxes, mx.ndarray.ndarray.NDArray):
			bboxes.wait_to_read()

		stop_t = time.time()
		total_t_frame += (stop_t - start_t)
		FPS = 1/(stop_t-start_t)
		print(f"\tinference time = {(stop_t-start_t)} -> FPS = {1/(stop_t-start_t)}")


		## display the result with cv
		frame_np_new = gcv.utils.viz.cv_plot_bbox(frame_np_new, bboxes[0], scores[0], class_IDs[0], thresh=0.5, class_names=net.classes)
		gcv.utils.viz.cv_plot_image(frame_np_new)
		count_frame += 1


	cv2.destroyAllWindows()

	cap.release()
	print("Done!!!")

I’m getting an average FPS=2, using SSD Mobilenet V1 (512x512).
Could you (or anyone who has MXNet/GluonCV installed on NANO) by any chance run this script, and tell me which FPS it is reasonable to expect?
I tried running the benchmarks as explained here, but I was getting an ‘Illegal instruction’ error (even after using export OPENBLAS_CORETYPE=ARMV8 ) .
Thanks a lot!

AastaLLL · July 20, 2021, 4:18am

Hi,

Have you tried to maximize the device performance first?

$ sudo nvpmodel -m 0
$ sudo jetson_clocks

We are going to test your script internally.
Will share the data in our environment with you later.

However, since OpenCV uses CPU for image IO.
It’s expected to have a slower pipeline.

Thanks.

lews_therin · July 23, 2021, 11:34am

I seem to understand I’m approaching this whole thing the wrong way.
So far, I’ve been simply trying to install the deep learning libraries I generally use (MXNet, GluonCV, Tensorflow) on the Jetson Nano, and to run the code I have as it is (for example, like you pointed out, by using OpenCV for IO) .
However, I just tried running an optimized implementation of Mobilenet V2 following HELLO AI WORLD tutorial by Dustin Franklin, and I got a FPS of around 22.

So, I’m wondering now…is there any way that I can apply the same optimization to my code, or it would be better to re-implement everything with more device-friendly libraries?

AastaLLL · August 10, 2021, 7:22am

Hi,

jetson-inference uses TensorRT as the inference engine which has optimized for the Jetson platform.
For Jetson, we always recommend converting the model into TensorRT for saving resources and better performance.

If you are using an MXNet model, you can first try to export the model into ONNX format.
And run it with TensorRT with the following command:

$ /usr/src/tensorrt/bin/trtexec --onnx=[your/model]

Thanks.

Topic		Replies	Views
I was unable to compile and install MXNET on the jetson nano，Is there an official installation tutorial？ Jetson Nano	45	12964	October 14, 2021
I was unable to compile and install Mxnet1.5 with tensorrt on the jetson nano，Is there someone have compile it, please help me. Thank you. Jetson Nano	34	6256	October 14, 2021
Speed up inference time on Nano with mxnet Jetson Nano neural-network-framework	9	1793	October 18, 2021
MXNetError: ImageRec need opencv to process Jetson Xavier NX opencv , nvbugs	19	1819	September 5, 2021
install OpenCV for python3 in Jetson Nano Jetson Nano opencv	37	62310	October 14, 2021
Opencv Face Detection Poor Performance with jetson nano Jetson Nano opencv	51	14205	October 14, 2021
Build OpenCV 4.6.4 with Cuda on Jetson Nano Jetson Nano opencv , cuda	13	3594	September 18, 2023
How to install opencv-python for python3.6 Jetson Nano opencv	26	19327	October 14, 2021
Jetson Nano - Limiting the results shown by the DetectNet example. Jetson Nano	8	2307	October 14, 2021
OpenCV error Jetson Nano camera , opencv , jetson-inference , gstreamer , python	18	9351	October 15, 2021

Problem trying to install MXNet and GluonCV on Jetson Nano

Related topics