Trt_pose model in docker: ImportError: libnvmedia_tensor.so: cannot open shared object file: No such file or directory

user156593 · April 10, 2023, 11:14pm

Hello,

I am trying to make trt_pose model (NVIDIA-AI-IOT/trt_pose: Real-time pose estimation accelerated with NVIDIA TensorRT (github.com) work inside a docker container on Jetson Nano.

I build the image as described here: nvidia / container-images / l4t-jetpack · GitLab.

Then I use this command to get into the container:

sudo docker run -it --rm --net=host --runtime nvidia -e DISPLAY=$DISPLAY -v /tmp/.X11-unix/:/tmp/.X11-unix nvcr.io/nvidia/l4t-jetpack:r35.1.0

It works, builds and runs as it should. Inside the docker container, I clone the trt_pose repo and follow the instructions provided in the description. Than I try to install dependencies using:

sudo python3 setup.py install --plugins

, but it throws me an error:

  File "setup.py", line 2, in <module>
    import tensorrt
  File "/usr/lib/python3.8/dist-packages/tensorrt/__init__.py", line 68, in <module>
    from .tensorrt import *
ImportError: libnvmedia_tensor.so: cannot open shared object file: No such file or directory

I know that I am supposed to have tensorrt also installed on host machine (jetson nano in this case) and that my problem is related to this issue: Issue with tensorrt:r8.2.1 l4t container, Import error libnvmedia.so: cannot open shared object file: - Jetson & Embedded Systems / Jetson TX1 - NVIDIA Developer Forums. However, I was unable to get it work. I obviously need to map something else to the container?

Thank you in advance. Any help will be much appreciated!

dusty_nv · April 11, 2023, 1:01am

Hi @user156593, this container is for JetPack 5, whereas you are running JetPack 4 on your Nano, so it won’t work correctly with GPU acceleration - instead, please use the l4t-base container that is compatible with the version of JetPack-L4T that you are running (you can check your L4T version with cat /etc/nv_tegra_release)

Yes, on JetPack 4 you need CUDA/cuDNN/TensorRT installed onto your device, because they get mounted into your container dynamically when --runtime nvidia is used (and hence l4t-base will have all the JetPack components present). On JetPack 5, it moved to having CUDA/cuDNN/TensorRT installed into the containers themselves (and hence a l4t-jetpack container was created for this purpose)

user156593 · April 12, 2023, 3:56pm

Hello and thank you for your quick answer!

I saw it already 2 days ago, but didn’t want to answer so the topic wouldn’t be closed. Indeed as you said the image was incorrect. I will answer below, also for others, what my steps were to run trt_pose model inside a container on Jatson nano (NOTE: still not perfect!).

I checked my l4t version using cat /etc/nv_tegra_release which printed: R32 (release), REVISION: 7.1. Then I checked on this page: NVIDIA L4T ML | NVIDIA NGC to see which version of ml image I need to use - I, therefore, needed JetPack 4.6.1 (L4T R32.7.1).

Then I created Dockerfile with name Dockerfile.jetson:

ARG TAG
FROM nvcr.io/nvidia/l4t-ml:${TAG}

# Install any utils needed for execution
RUN apt-get update \
	&& apt-get install -y --no-install-recommends sudo \
    && rm -rf /var/lib/apt/lists/* \
    && apt-get clean

RUN sudo pip3 install tqdm cython pycocotools \
	&& sudo apt-get install python3-matplotlib

WORKDIR /app


RUN if [ -d /app/torch2trt ]; then \
	echo "torch2trt already cloned."; \
	else \
	git clone --depth 1 https://github.com/NVIDIA-AI-IOT/torch2trt \
	&& cd torch2trt \
	&& sudo python3 setup.py install --plugins; \
	fi


RUN if [ -d /app/trt_pose ]; then \
	echo "trt_pose already cloned."; \
	else \
	git clone --depth 1 https://github.com/NVIDIA-AI-IOT/trt_pose \
	&& cd trt_pose \
	&& sudo python3 setup.py install; \
	fi

and Makefile (TAG adjusted based on version of l4t):

TAG     ?= r32.7.1-py3
L4T_JETPACK_REGISTRY ?= "nvcr.io/nvidia/l4t-ml"

image:
	docker build -t $(L4T_JETPACK_REGISTRY):$(TAG) \
		--build-arg "TAG=$(TAG)" \
		-f ./Dockerfile.jetson .

run:
	docker run -it --rm --runtime nvidia --network host \
	-e DISPLAY=$(DISPLAY) \
	-v /tmp/.X11-unix/:/tmp/.X11-unix $(L4T_JETPACK_REGISTRY):$(TAG)

all:
	sudo make image ; sudo make run

Then I call sudo make all which pull the base image, builds and runs the container. Everything looks fine and I am able to run the model with 8 FPS - less than the stated 22 by using appended script in the repo: trt_pose/tasks/human_pose/live_demo.ipynb.

I think something is still wrong, because during the execution of line:

model_trt = torch2trt.torch2trt(model, [data], fp16_mode=True, max_workspace_size=1<<25)

is throwing the error [E] 3: [builderConfig.cpp::canRunOnDLA::382] Error Code 3: API Usage Error (Parameter check failed at: optimizer/api/builderConfig.cpp::canRunOnDLA::382, condition: dlaEngineCount > 0 - while the original torch model is being optimized.

Also, the line where the model should be saved:
torch.Savemodel_trt.state_dict(), OPTIMIZED_MODEL)

throws the error:

Traceback (most recent call last):
  File "trt.py", line 74, in <module>
    torch.save(model_trt.state_dict(), OPTIMIZED_MODEL)
  File "/usr/local/lib/python3.6/dist-packages/torch/serialization.py", line 379, in save
    _save(obj, opened_zipfile, pickle_module, pickle_protocol)
  File "/usr/local/lib/python3.6/dist-packages/torch/serialization.py", line 484, in _save
    pickler. Dump(obj)
MemoryError

Any idea how I should tackle this? Thank you in advance!

dusty_nv · April 12, 2023, 6:39pm

I’m not familiar with the trt_pose code itself (you may want to file an issue against the trt_pose GitHub), but is your Nano running in 5W or 10W mode? (you can check this with sudo nvpmodel -q and set it to 10W mode with sudo nvpmodel -m 0)

Also, these trt_pose models are integrated with jetson-inference so you could try that as well: https://github.com/dusty-nv/jetson-inference/blob/master/docs/posenet.md

user156593 · April 12, 2023, 8:58pm

Hello,

thank you. I will definitely check jetson-inference lib. It is a pity that I even struggled with this one, since it seem obsolete.

Anyways, the output of sudo nvpmodel -q gives me:

NVPM WARN: fan mode is not set!
NV Power Mode: MAXN
0

I assume I am already over 9000 :P

dusty_nv · April 13, 2023, 1:02am

Haha okay, yes you are already in maximum power mode.

GitHub thread for cross-reference:

github.com/NVIDIA-AI-IOT/trt_pose

Code fails to run in both jupyter and console

opened 02:41AM - 16 Nov 22 UTC

marcjasner

Sorry for the less than descriptive title, but I wasn't sure how else to title i…t. I've got a 4gb Jetson Nano (SeeedStudio Jetson Recomputer J2010 carrier board) with a 128gb SSD as the root storage device. It's running JetPack 4.6 (output of 'apt-cache show nvidia-jetpack' below) `$ sudo apt-cache show nvidia-jetpack Package: nvidia-jetpack Version: 4.6-b199 Architecture: arm64 Maintainer: NVIDIA Corporation Installed-Size: 194 Depends: nvidia-cuda (= 4.6-b199), nvidia-opencv (= 4.6-b199), nvidia-cudnn8 (= 4.6-b199), nvidia-tensorrt (= 4.6-b199), nvidia-visionworks (= 4.6-b199), nvidia-container (= 4.6-b199), nvidia-vpi (= 4.6-b199), nvidia-l4t-jetson-multimedia-api (>> 32.6-0), nvidia-l4t-jetson-multimedia-api (<< 32.7-0) Homepage: http://developer.nvidia.com/jetson Priority: standard Section: metapackages Filename: pool/main/n/nvidia-jetpack/nvidia-jetpack_4.6-b199_arm64.deb Size: 29368 SHA256: 69df11e22e2c8406fe281fe6fc27c7d40a13ed668e508a592a6785d40ea71669 SHA1: 5c678b8762acc54f85b4334f92d9bb084858907a MD5sum: 1b96cd72f2a434e887f98912061d8cfb Description: NVIDIA Jetpack Meta Package Description-md5: ad1462289bdbc54909ae109d1d32c0a8 Package: nvidia-jetpack Version: 4.6-b197 Architecture: arm64 Maintainer: NVIDIA Corporation Installed-Size: 194 Depends: nvidia-cuda (= 4.6-b197), nvidia-opencv (= 4.6-b197), nvidia-cudnn8 (= 4.6-b197), nvidia-tensorrt (= 4.6-b197), nvidia-visionworks (= 4.6-b197), nvidia-container (= 4.6-b197), nvidia-vpi (= 4.6-b197), nvidia-l4t-jetson-multimedia-api (>> 32.6-0), nvidia-l4t-jetson-multimedia-api (<< 32.7-0) Homepage: http://developer.nvidia.com/jetson Priority: standard Section: metapackages Filename: pool/main/n/nvidia-jetpack/nvidia-jetpack_4.6-b197_arm64.deb Size: 29356 SHA256: 104cd0c1efefe5865753ec9b0b148a534ffdcc9bae525637c7532b309ed44aa0 SHA1: 8cca8b9ebb21feafbbd20c2984bd9b329a202624 MD5sum: 463d4303429f163b97207827965e8fe0 Description: NVIDIA Jetpack Meta Package Description-md5: ad1462289bdbc54909ae109d1d32c0a8 ` I've set up a python 3.6 virtualenv and followed the installation instructions for all required packages. There were no errors during any of the installations and I've verified all of the packages import properly from the python command line. When I run the jupyter notebook 'live_demo.ipynb' I am able to run all of the steps up until the following step: `import torch2trt model_trt = torch2trt.torch2trt(model, [data], fp16_mode=True, max_workspace_size=1<<25)` When I attempt to run that step the systems thinks about it for a bit and then a dialog pops up that says the python kernel has crashed and will be automatically restarted. I cannot get past this step. To help debug/diagnose I took all of the code from the notebook and incrementally added it to a python file to see if I could reproduce the issue. The code I have so far is: `import cv2 import json import trt_pose.coco import trt_pose.models import torch import torch2trt from torch2trt import TRTModule import time import torchvision.transforms as transforms import PIL.Image from trt_pose.draw_objects import DrawObjects from trt_pose.parse_objects import ParseObjects from jetcam.usb_camera import USBCamera from jetcam.csi_camera import CSICamera from jetcam.utils import bgr8_to_jpeg import ipywidgets from IPython.display import display with open('human_pose.json', 'r') as f: human_pose = json.load(f) topology = trt_pose.coco.coco_category_to_topology(human_pose) num_parts = len(human_pose['keypoints']) num_links = len(human_pose['skeleton']) model = trt_pose.models.resnet18_baseline_att(num_parts, 2 * num_links).cuda().eval() MODEL_WEIGHTS = 'resnet18_baseline_att_224x224_A_epoch_249.pth' model.load_state_dict(torch.load(MODEL_WEIGHTS)) WIDTH = 224 HEIGHT = 224 data = torch.zeros((1, 3, HEIGHT, WIDTH)).cuda() print("Calling torch2trt.torch2trt\n") model_trt = torch2trt.torch2trt(model, [data], fp16_mode=True, max_workspace_size=1<<25) print("Done\n") OPTIMIZED_MODEL = 'resnet18_baseline_att_224x224_A_epoch_249_trt.pth' print("Calling torch.save\n") torch.save(model_trt.state_dict(), OPTIMIZED_MODEL) print("Done\n")` When I run this code I see "Calling torch2trt.torch2trt" in the console and then, after a pause, I see the following error repeated many times on the console: `[TensorRT] ERROR: 3: [builderConfig.cpp::canRunOnDLA::341] Error Code 3: Internal Error (Parameter check failed at: optimizer/api/builderConfig.cpp::canRunOnDLA::341, condition: dlaEngineCount > 0 )` After that the system will seem to hang. The desktop will display a low memory warning (If I run 'watch -n1 free -h' in another console window I can see free memory drop from 3+GB to as little as 96MB). After some time the process just reports "Killed" and exits back to the command line. I am at a loss. Can you please provide any helpful information you have that might help me correct this issue and continue? Thanks Marc

user156593 · April 13, 2023, 8:04am

Thank you! Answer on Github.

system · May 3, 2023, 2:57am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Problems with the instalaltion of the trt_pose and torch2trt in the dcoker container Jetson Nano tensorrt , docker	3	528	November 13, 2023
Failed to run tensorrt docker image on Jetson Nano Jetson Nano tensorrt , docker	4	3638	October 18, 2021
How to use TensorRT in container with python3 application? Jetson Nano tensorrt , jetson-inference , nano	6	2253	October 11, 2021
Fail to run NVIDIA Pytorch docker container in jetson nano card (Jetpack 4.2.1) Jetson Nano tensorrt , docker , pytorch	2	757	October 18, 2021
TrT_Pose Docker Jetson Xavier NX docker	7	1065	October 18, 2021
ModuleNotFoundError: No module named 'tensorrt' Jetson Nano tensorrt	8	6288	October 15, 2021
Issue with tensorrt:r8.2.1 l4t container, Import error libnvmedia.so: cannot open shared object file: Jetson TX1 tensorrt , cuda , ubuntu , docker , python	9	2333	September 14, 2022
TF-TRT Error on Jetson Nano TensorRT tensorrt , nano	2	2169	August 26, 2021
Unable to use TensorRT inside the L4T-Tensorflow container Jetson Xavier NX tensorrt	9	2091	October 18, 2021
Failed to run Pytorch NGC docker on Jetson nano Jetson Nano docker , pytorch	7	1255	June 23, 2022

Trt_pose model in docker: ImportError: libnvmedia_tensor.so: cannot open shared object file: No such file or directory

Related topics