Deepstream_pose_estimation repository and provided libnvds_osd.so

• Hardware Platform: Jetson
• DeepStream Version: 6.2
• JetPack Version: L4T 35.2.1
• Issue Type: questions

Hi everybody,

I’m writing here because issues cannot be opened on the GitHub repository of NVIDIA-AI-IOT/deepstream_pose_estimation, and it points to this forum for questions.
So, I have a DeepStream app running on a Jetson AGX with DeepStream 5.1 that uses the pose estimation model from that repository. To make the code work, I had to replace the libnvds_osd.so library from DeepStream with the one provided in the repository.

Today I’m trying to make my app work on my Jetson Orin with DeepStream 6.2, but reusing this provided library results in a crash.
Not using it results in the following errors:

WARNING: Deserialize engine failed because file path: /home/jetson/git/blimp/jetson-agx/configs/../models/pose_estimation.onnx_b1_gpu0_fp16.engine open error
0:00:12.669327379  6593     0x70e73b90 WARN                 nvinfer gstnvinfer.cpp:677:gst_nvinfer_logger:<secondary-pose-estimation> NvDsInferContext[UID 2]: Warning from NvDsInferContextImpl::deserializeEngineAndBackend() <nvdsinfer_context_impl.cpp:1897> [UID = 2]: deserialize engine from file :/home/jetson/git/blimp/jetson-agx/configs/../models/pose_estimation.onnx_b1_gpu0_fp16.engine failed
0:00:12.867713246  6593     0x70e73b90 WARN                 nvinfer gstnvinfer.cpp:677:gst_nvinfer_logger:<secondary-pose-estimation> NvDsInferContext[UID 2]: Warning from NvDsInferContextImpl::generateBackendContext() <nvdsinfer_context_impl.cpp:2002> [UID = 2]: deserialize backend context from engine from file :/home/jetson/git/blimp/jetson-agx/configs/../models/pose_estimation.onnx_b1_gpu0_fp16.engine failed, try rebuild

I tried to reuse the pose_estimation.onnx_b1_gpu0_fp16.engine file from my Jetson AGX, but without success; the error in this case was:

ERROR: [TRT]: 1: [stdArchiveReader.cpp::StdArchiveReader::37] Error Code 1: Serialization (Serialization assertion safeVersionRead == safeSerializationVersion failed.Version tag does not match. Note: Current Version: 0, Serialized Engine Version: 97)
ERROR: [TRT]: 4: [runtime.cpp::deserializeCudaEngine::65] Error Code 4: Internal Error (Engine deserialization failed.)
ERROR: Deserialize engine failed from file: /home/jetson/git/blimp/jetson-agx/models/pose_estimation.onnx_b1_gpu0_fp16.engine
0:00:12.584017928  4502     0x5d6fe790 WARN                 nvinfer gstnvinfer.cpp:677:gst_nvinfer_logger:<secondary-pose-estimation> NvDsInferContext[UID 2]: Warning from NvDsInferContextImpl::deserializeEngineAndBackend() <nvdsinfer_context_impl.cpp:1897> [UID = 2]: deserialize engine from file :/home/jetson/git/blimp/jetson-agx/models/pose_estimation.onnx_b1_gpu0_fp16.engine failed
0:00:12.776925103  4502     0x5d6fe790 WARN                 nvinfer gstnvinfer.cpp:677:gst_nvinfer_logger:<secondary-pose-estimation> NvDsInferContext[UID 2]: Warning from NvDsInferContextImpl::generateBackendContext() <nvdsinfer_context_impl.cpp:2002> [UID = 2]: deserialize backend context from engine from file :/home/jetson/git/blimp/jetson-agx/models/pose_estimation.onnx_b1_gpu0_fp16.engine failed, try rebuild

From what I remember (though it was two years ago), the pose_estimation.onnx_b1_gpu0_fp16.engine file should be automatically generated the first time the code is executed.
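
If I read the serialization error correctly, the engine built on the Xavier (TensorRT 7.1) cannot be deserialized by TensorRT 8.5 on the Orin, so it would have to be rebuilt on the Orin anyway. For reference, a rough sketch of how I could rebuild it manually with trtexec, assuming the ONNX model sits next to the engine path above:

/usr/src/tensorrt/bin/trtexec \
  --onnx=/home/jetson/git/blimp/jetson-agx/models/pose_estimation.onnx \
  --saveEngine=/home/jetson/git/blimp/jetson-agx/models/pose_estimation.onnx_b1_gpu0_fp16.engine \
  --fp16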

So, my first question would be: is there a way for the authors of this repository to provide an updated version of the libnvds_osd.so library for DeepStream 6.2?

My second question would be: is my error really (or only) due to this missing library, given that the original one provided by DeepStream has always been available?

Thank you all for your help.

Cheers

After testing on Xavier + DS 6.2, it works fine with the default libnvds_osd.so.

You might comment out the “model-engine-file” line in the configuration file, then run the application again; it will generate a new engine.
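
For example, in the nvinfer configuration (a sketch only; I am assuming a file similar to your sgie.txt, and the property names are the standard Gst-nvinfer keys):

[property]
onnx-file=../models/pose_estimation.onnx
# comment out the cached engine so nvinfer rebuilds it for this device and TensorRT version
#model-engine-file=../models/pose_estimation.onnx_b1_gpu0_fp16.engine
batch-size=1
# network-mode: 0=FP32, 1=INT8, 2=FP16
network-mode=2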

Hello fanzh,

Thank you so much for getting back to me, I really appreciate it!
I didn’t have the opportunity to get back to my Jetson before now, but I did comment out “model-engine-file” as you suggested, and indeed that error no longer appears.
However, now I get warnings about insufficient memory; I thought the Orin had more memory than the Xavier.
Additionally, this only happened the first time I ran it; on subsequent tries it got stuck.

This is the result I had on my first attempt:

/home/jetson/.local/lib/python3.8/site-packages/pyds.so
main.py:757: PyGIDeprecationWarning: Since version 3.11, calling threads_init is no longer needed. See: https://wiki.gnome.org/PyGObject/Threading
  GObject.threads_init()
Creating Pipeline 
 
2023-04-07 13:02:31.227693: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:984] could not open file to read NUMA node: /sys/bus/pci/devices/0000:00:00.0/numa_node
Your kernel may have been built without NUMA support.
2023-04-07 13:02:31.290434: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:984] could not open file to read NUMA node: /sys/bus/pci/devices/0000:00:00.0/numa_node
Your kernel may have been built without NUMA support.
2023-04-07 13:02:31.290671: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:984] could not open file to read NUMA node: /sys/bus/pci/devices/0000:00:00.0/numa_node
Your kernel may have been built without NUMA support.
2023-04-07 13:02:31.292334: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:984] could not open file to read NUMA node: /sys/bus/pci/devices/0000:00:00.0/numa_node
Your kernel may have been built without NUMA support.
2023-04-07 13:02:31.292514: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:984] could not open file to read NUMA node: /sys/bus/pci/devices/0000:00:00.0/numa_node
Your kernel may have been built without NUMA support.
2023-04-07 13:02:31.292636: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:984] could not open file to read NUMA node: /sys/bus/pci/devices/0000:00:00.0/numa_node
Your kernel may have been built without NUMA support.
2023-04-07 13:02:33.987465: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:984] could not open file to read NUMA node: /sys/bus/pci/devices/0000:00:00.0/numa_node
Your kernel may have been built without NUMA support.
2023-04-07 13:02:33.988103: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:984] could not open file to read NUMA node: /sys/bus/pci/devices/0000:00:00.0/numa_node
Your kernel may have been built without NUMA support.
2023-04-07 13:02:33.988197: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1703] Could not identify NUMA node of platform GPU id 0, defaulting to 0.  Your kernel may not have been built with NUMA support.
2023-04-07 13:02:33.988375: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:984] could not open file to read NUMA node: /sys/bus/pci/devices/0000:00:00.0/numa_node
Your kernel may have been built without NUMA support.
2023-04-07 13:02:33.988528: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1616] Created device /job:localhost/replica:0/task:0/device:GPU:0 with 23253 MB memory:  -> device: 0, name: Orin, pci bus id: 0000:00:00.0, compute capability: 8.7
DEBUG:tensorflow:Layer lstm will use cuDNN kernels when running on GPU.
Creating streamux 
 
Creating source_bin  0  
 
Creating source bin
source-bin-00
Now playing...
rtsp://192.168.2.119:554
Starting pipeline 


Using winsys: x11 
Process PWM-proc:
Traceback (most recent call last):
  File "/usr/lib/python3.8/multiprocessing/process.py", line 315, in _bootstrap
    self.run()
  File "/usr/lib/python3.8/multiprocessing/process.py", line 108, in run
    self._target(*self._args, **self._kwargs)
TypeError: set_pwm_values() takes 4 positional arguments but 6 were given
Opening in BLOCKING MODE 
0:00:10.591125859  3323     0x5df70d90 INFO                 nvinfer gstnvinfer.cpp:680:gst_nvinfer_logger:<secondary-pose-estimation> NvDsInferContext[UID 2]: Info from NvDsInferContextImpl::buildModel() <nvdsinfer_context_impl.cpp:1923> [UID = 2]: Trying to create engine from model files
WARNING: [TRT]: onnx2trt_utils.cpp:375: Your ONNX model has been generated with INT64 weights, while TensorRT does not natively support INT64. Attempting to cast down to INT32.
WARNING: [TRT]: Tactic Device request: 547MB Available: 409MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 3 due to insufficient memory on requested size of 547 detected for tactic 0x0000000000000005.
WARNING: [TRT]: Tactic Device request: 547MB Available: 409MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 7 due to insufficient memory on requested size of 547 detected for tactic 0x000000000000003d.
WARNING: [TRT]: Tactic Device request: 547MB Available: 409MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 11 due to insufficient memory on requested size of 547 detected for tactic 0x0000000000000075.
WARNING: [TRT]: Tactic Device request: 547MB Available: 408MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 3 due to insufficient memory on requested size of 547 detected for tactic 0x0000000000000005.
WARNING: [TRT]: Tactic Device request: 547MB Available: 408MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 6 due to insufficient memory on requested size of 547 detected for tactic 0x000000000000003d.
WARNING: [TRT]: Tactic Device request: 586MB Available: 409MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 3 due to insufficient memory on requested size of 586 detected for tactic 0x0000000000000004.
WARNING: [TRT]: Tactic Device request: 1092MB Available: 409MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 4 due to insufficient memory on requested size of 1092 detected for tactic 0x0000000000000005.
WARNING: [TRT]: Tactic Device request: 586MB Available: 409MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 9 due to insufficient memory on requested size of 586 detected for tactic 0x000000000000003c.
WARNING: [TRT]: Tactic Device request: 1092MB Available: 409MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 10 due to insufficient memory on requested size of 1092 detected for tactic 0x000000000000003d.
WARNING: [TRT]: Tactic Device request: 586MB Available: 409MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 15 due to insufficient memory on requested size of 586 detected for tactic 0x0000000000000074.
WARNING: [TRT]: Tactic Device request: 1092MB Available: 409MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 16 due to insufficient memory on requested size of 1092 detected for tactic 0x0000000000000075.
WARNING: [TRT]: Tactic Device request: 586MB Available: 404MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 3 due to insufficient memory on requested size of 586 detected for tactic 0x0000000000000004.
WARNING: [TRT]: Tactic Device request: 1092MB Available: 404MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 4 due to insufficient memory on requested size of 1092 detected for tactic 0x0000000000000005.
WARNING: [TRT]: Tactic Device request: 586MB Available: 404MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 8 due to insufficient memory on requested size of 586 detected for tactic 0x000000000000003c.
WARNING: [TRT]: Tactic Device request: 1092MB Available: 404MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 9 due to insufficient memory on requested size of 1092 detected for tactic 0x000000000000003d.
WARNING: [TRT]: Tactic Device request: 586MB Available: 405MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 3 due to insufficient memory on requested size of 586 detected for tactic 0x0000000000000004.
WARNING: [TRT]: Tactic Device request: 1092MB Available: 405MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 4 due to insufficient memory on requested size of 1092 detected for tactic 0x0000000000000005.
WARNING: [TRT]: Tactic Device request: 586MB Available: 405MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 9 due to insufficient memory on requested size of 586 detected for tactic 0x000000000000003c.
WARNING: [TRT]: Tactic Device request: 1092MB Available: 405MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 10 due to insufficient memory on requested size of 1092 detected for tactic 0x000000000000003d.
WARNING: [TRT]: Tactic Device request: 586MB Available: 405MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 15 due to insufficient memory on requested size of 586 detected for tactic 0x0000000000000074.
WARNING: [TRT]: Tactic Device request: 1092MB Available: 406MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 16 due to insufficient memory on requested size of 1092 detected for tactic 0x0000000000000075.
WARNING: [TRT]: Tactic Device request: 586MB Available: 407MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 3 due to insufficient memory on requested size of 586 detected for tactic 0x0000000000000004.
WARNING: [TRT]: Tactic Device request: 1092MB Available: 407MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 4 due to insufficient memory on requested size of 1092 detected for tactic 0x0000000000000005.
WARNING: [TRT]: Tactic Device request: 586MB Available: 407MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 8 due to insufficient memory on requested size of 586 detected for tactic 0x000000000000003c.
WARNING: [TRT]: Tactic Device request: 1092MB Available: 407MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 9 due to insufficient memory on requested size of 1092 detected for tactic 0x000000000000003d.
WARNING: [TRT]: Tactic Device request: 547MB Available: 407MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 2 due to insufficient memory on requested size of 547 detected for tactic 0x0000000000000003.
WARNING: [TRT]: Tactic Device request: 547MB Available: 406MB. Device memory is insufficient to use tactic.
WARNING: [TRT]: Skipping tactic 2 due to insufficient memory on requested size of 547 detected for tactic 0x0000000000000003.
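
Looking at the log above, TensorFlow grabs almost the whole GPU (it reports creating a device with 23253 MB) before TensorRT starts building, which would explain why only ~400 MB is left for the tactics. A minimal sketch of what I might try near the top of main.py, assuming the TensorFlow part of my app can live with on-demand allocation (the tf.config calls are standard TF 2.x; everything else in my app stays as-is):

import tensorflow as tf

# Ask TensorFlow to allocate GPU memory on demand instead of reserving
# almost all of it up front, so TensorRT has headroom to build the engine.
for gpu in tf.config.list_physical_devices('GPU'):
    tf.config.experimental.set_memory_growth(gpu, True)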

This is the result of the subsequent attempts.
It gets stuck at that point and I have to kill it:

/home/jetson/.local/lib/python3.8/site-packages/pyds.so
SIOCADDRT: File exists
main.py:757: PyGIDeprecationWarning: Since version 3.11, calling threads_init is no longer needed. See: https://wiki.gnome.org/PyGObject/Threading
  GObject.threads_init()
Creating Pipeline 
 
2023-04-07 13:16:23.662445: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:984] could not open file to read NUMA node: /sys/bus/pci/devices/0000:00:00.0/numa_node
Your kernel may have been built without NUMA support.
2023-04-07 13:16:23.714250: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:984] could not open file to read NUMA node: /sys/bus/pci/devices/0000:00:00.0/numa_node
Your kernel may have been built without NUMA support.
2023-04-07 13:16:23.714517: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:984] could not open file to read NUMA node: /sys/bus/pci/devices/0000:00:00.0/numa_node
Your kernel may have been built without NUMA support.
2023-04-07 13:16:23.716006: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:984] could not open file to read NUMA node: /sys/bus/pci/devices/0000:00:00.0/numa_node
Your kernel may have been built without NUMA support.
2023-04-07 13:16:23.716189: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:984] could not open file to read NUMA node: /sys/bus/pci/devices/0000:00:00.0/numa_node
Your kernel may have been built without NUMA support.
2023-04-07 13:16:23.716348: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:984] could not open file to read NUMA node: /sys/bus/pci/devices/0000:00:00.0/numa_node
Your kernel may have been built without NUMA support.
2023-04-07 13:16:25.716863: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:984] could not open file to read NUMA node: /sys/bus/pci/devices/0000:00:00.0/numa_node
Your kernel may have been built without NUMA support.
2023-04-07 13:16:25.717240: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:984] could not open file to read NUMA node: /sys/bus/pci/devices/0000:00:00.0/numa_node
Your kernel may have been built without NUMA support.
2023-04-07 13:16:25.717341: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1703] Could not identify NUMA node of platform GPU id 0, defaulting to 0.  Your kernel may not have been built with NUMA support.
2023-04-07 13:16:25.717515: I tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc:984] could not open file to read NUMA node: /sys/bus/pci/devices/0000:00:00.0/numa_node
Your kernel may have been built without NUMA support.
2023-04-07 13:16:25.717688: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1616] Created device /job:localhost/replica:0/task:0/device:GPU:0 with 20311 MB memory:  -> device: 0, name: Orin, pci bus id: 0000:00:00.0, compute capability: 8.7
DEBUG:tensorflow:Layer lstm will use cuDNN kernels when running on GPU.
Creating streamux 
 
Creating source_bin  0  
 
Creating source bin
source-bin-00
Now playing...
rtsp://192.168.2.119:554
Starting pipeline 


Using winsys: x11 
Process PWM-proc:
Traceback (most recent call last):
  File "/usr/lib/python3.8/multiprocessing/process.py", line 315, in _bootstrap
    self.run()
  File "/usr/lib/python3.8/multiprocessing/process.py", line 108, in run
    self._target(*self._args, **self._kwargs)
TypeError: set_pwm_values() takes 4 positional arguments but 6 were given
Opening in BLOCKING MODE 
0:00:08.361644652  4318     0x5bf92b90 INFO                 nvinfer gstnvinfer.cpp:680:gst_nvinfer_logger:<secondary-pose-estimation> NvDsInferContext[UID 2]: Info from NvDsInferContextImpl::buildModel() <nvdsinfer_context_impl.cpp:1923> [UID = 2]: Trying to create engine from model files
WARNING: [TRT]: onnx2trt_utils.cpp:375: Your ONNX model has been generated with INT64 weights, while TensorRT does not natively support INT64. Attempting to cast down to INT32.
WARNING: [TRT]: TensorRT encountered issues when converting weights between types and that could affect accuracy.
WARNING: [TRT]: If this is not the desired behavior, please modify the weights or retrain with regularization to adjust the magnitude of the weights.
WARNING: [TRT]: Check verbose logs for the list of affected weights.
WARNING: [TRT]: - 35 weights are affected by this issue: Detected subnormal FP16 values.
WARNING: [TRT]: - 19 weights are affected by this issue: Detected values less than smallest positive FP16 subnormal value and converted them to the FP16 minimum subnormalized value.
0:03:18.857104291  4318     0x5bf92b90 INFO                 nvinfer gstnvinfer.cpp:680:gst_nvinfer_logger:<secondary-pose-estimation> NvDsInferContext[UID 2]: Info from NvDsInferContextImpl::buildModel() <nvdsinfer_context_impl.cpp:1955> [UID = 2]: serialize cuda engine to file: /home/jetson/git/blimp/jetson-agx/models/pose_estimation.onnx_b1_gpu0_fp16.engine successfully
WARNING: [TRT]: The getMaxBatchSize() function should not be used with an engine built from a network created with NetworkDefinitionCreationFlag::kEXPLICIT_BATCH flag. This function will always return 1.
INFO: [Implicit Engine Info]: layers num: 3
0   INPUT  kFLOAT input.1         3x224x224       
1   OUTPUT kFLOAT 262             18x56x56        
2   OUTPUT kFLOAT 264             42x56x56        

0:03:19.090411862  4318     0x5bf92b90 INFO                 nvinfer gstnvinfer_impl.cpp:328:notifyLoadModelStatus:<secondary-pose-estimation> [UID 2]: Load new model:configs/sgie.txt sucessfully
gstnvtracker: Loading low-level lib at /opt/nvidia/deepstream/deepstream/lib/libnvds_nvdcf.so
gstnvtracker: Failed to open low-level lib at /opt/nvidia/deepstream/deepstream/lib/libnvds_nvdcf.so
 dlopen error: /opt/nvidia/deepstream/deepstream/lib/libnvds_nvdcf.so: cannot open shared object file: No such file or directory
gstnvtracker: Failed to initilaize low level lib.

And out of curiosity, this is the log when I run it successfully on my Xavier:

2023-04-07 13:25:56.988898: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcudart.so.10.2
Creating Pipeline 
 
2023-04-07 13:26:06.445662: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcuda.so.1
2023-04-07 13:26:06.461388: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:1001] ARM64 does not support NUMA - returning NUMA node zero
2023-04-07 13:26:06.461721: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1734] Found device 0 with properties: 
pciBusID: 0000:00:00.0 name: Xavier computeCapability: 7.2
coreClock: 1.377GHz coreCount: 8 deviceMemorySize: 31.18GiB deviceMemoryBandwidth: 82.08GiB/s
2023-04-07 13:26:06.461883: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcudart.so.10.2
2023-04-07 13:26:06.462120: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcublas.so.10
2023-04-07 13:26:06.462282: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcublasLt.so.10
2023-04-07 13:26:06.462407: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcufft.so.10
2023-04-07 13:26:06.462546: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcurand.so.10
2023-04-07 13:26:06.462696: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcusolver.so.10
2023-04-07 13:26:06.462820: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcusparse.so.10
2023-04-07 13:26:06.462967: I tensorflow/stream_executor/platform/default/dso_loader.cc:53] Successfully opened dynamic library libcudnn.so.8
2023-04-07 13:26:06.463353: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:1001] ARM64 does not support NUMA - returning NUMA node zero
2023-04-07 13:26:06.463707: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:1001] ARM64 does not support NUMA - returning NUMA node zero
2023-04-07 13:26:06.464205: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1872] Adding visible gpu devices: 0
2023-04-07 13:26:06.470911: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:1001] ARM64 does not support NUMA - returning NUMA node zero
2023-04-07 13:26:06.471172: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1734] Found device 0 with properties: 
pciBusID: 0000:00:00.0 name: Xavier computeCapability: 7.2
coreClock: 1.377GHz coreCount: 8 deviceMemorySize: 31.18GiB deviceMemoryBandwidth: 82.08GiB/s
2023-04-07 13:26:06.471478: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:1001] ARM64 does not support NUMA - returning NUMA node zero
2023-04-07 13:26:06.471881: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:1001] ARM64 does not support NUMA - returning NUMA node zero
2023-04-07 13:26:06.472008: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1872] Adding visible gpu devices: 0
2023-04-07 13:26:16.261133: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1258] Device interconnect StreamExecutor with strength 1 edge matrix:
2023-04-07 13:26:16.261279: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1264]      0 
2023-04-07 13:26:16.261327: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1277] 0:   N 
2023-04-07 13:26:16.261983: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:1001] ARM64 does not support NUMA - returning NUMA node zero
2023-04-07 13:26:16.262496: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:1001] ARM64 does not support NUMA - returning NUMA node zero
2023-04-07 13:26:16.262859: I tensorflow/stream_executor/cuda/cuda_gpu_executor.cc:1001] ARM64 does not support NUMA - returning NUMA node zero
2023-04-07 13:26:16.263118: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1418] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 25539 MB memory) -> physical GPU (device: 0, name: Xavier, pci bus id: 0000:00:00.0, compute capability: 7.2)
DEBUG:tensorflow:Layer lstm will use cuDNN kernels when running on GPU.
Creating streamux 
 
Creating source_bin  0  
 
Creating source bin
source-bin-00
Now playing...
rtsp://192.168.2.119:554
Starting pipeline 


Using winsys: x11 
Opening in BLOCKING MODE
Opening in BLOCKING MODE 
WARNING: [TRT]: Using an engine plan file across different models of devices is not recommended and is likely to affect performance or even cause errors.
0:00:29.446871448 10458     0xa35374a0 INFO                 nvinfer gstnvinfer.cpp:619:gst_nvinfer_logger:<secondary-pose-estimation> NvDsInferContext[UID 2]: Info from NvDsInferContextImpl::deserializeEngineAndBackend() <nvdsinfer_context_impl.cpp:1702> [UID = 2]: deserialized trt engine from :/home/jetson/git/blimp/jetson-agx/models/pose_estimation.onnx_b1_gpu0_fp16.engine
INFO: [Implicit Engine Info]: layers num: 3
0   INPUT  kFLOAT input.1         3x224x224       
1   OUTPUT kFLOAT 262             18x56x56        
2   OUTPUT kFLOAT 264             42x56x56        

0:00:29.448703491 10458     0xa35374a0 INFO                 nvinfer gstnvinfer.cpp:619:gst_nvinfer_logger:<secondary-pose-estimation> NvDsInferContext[UID 2]: Info from NvDsInferContextImpl::generateBackendContext() <nvdsinfer_context_impl.cpp:1806> [UID = 2]: Use deserialized engine model: /home/jetson/git/blimp/jetson-agx/models/pose_estimation.onnx_b1_gpu0_fp16.engine
0:00:29.496151292 10458     0xa35374a0 INFO                 nvinfer gstnvinfer_impl.cpp:313:notifyLoadModelStatus:<secondary-pose-estimation> [UID 2]: Load new model:configs/sgie.txt sucessfully
gstnvtracker: Loading low-level lib at /opt/nvidia/deepstream/deepstream/lib/libnvds_nvdcf.so
gstnvtracker: Batch processing is ON
gstnvtracker: Past frame output is OFF
[NvDCF][Warning] `minTrackingConfidenceDuringInactive` is deprecated
[NvDCF] Initialized
WARNING: [TRT]: Using an engine plan file across different models of devices is not recommended and is likely to affect performance or even cause errors.
0:00:33.211706029 10458     0xa35374a0 INFO                 nvinfer gstnvinfer.cpp:619:gst_nvinfer_logger:<primary-nvinference-engine> NvDsInferContext[UID 1]: Info from NvDsInferContextImpl::deserializeEngineAndBackend() <nvdsinfer_context_impl.cpp:1702> [UID = 1]: deserialized trt engine from :/home/jetson/git/blimp/jetson-agx/models/yolov5m.engine
INFO: [Implicit Engine Info]: layers num: 2
0   INPUT  kFLOAT data            3x640x640       
1   OUTPUT kFLOAT prob            6001x1x1        

0:00:33.215408964 10458     0xa35374a0 INFO                 nvinfer gstnvinfer.cpp:619:gst_nvinfer_logger:<primary-nvinference-engine> NvDsInferContext[UID 1]: Info from NvDsInferContextImpl::generateBackendContext() <nvdsinfer_context_impl.cpp:1806> [UID = 1]: Use deserialized engine model: /home/jetson/git/blimp/jetson-agx/models/yolov5m.engine
0:00:33.280809750 10458     0xa35374a0 INFO                 nvinfer gstnvinfer_impl.cpp:313:notifyLoadModelStatus:<primary-nvinference-engine> [UID 1]: Load new model:configs/pgie.txt sucessfully
Decodebin child added: source 

I have to admit I’m wondering which direction I should take from here.
Basically, I have a working DeepStream app that I developed on the Xavier, but since we were seeing some lag we thought we would try it on the Orin. It’s turning out to be harder than I expected to simply get the app running on the Orin, with two years between the two platforms.
The Xavier has L4T 32.5.1 and DeepStream 5.1:

NVIDIA Jetson AGX Xavier [16GB]
 L4T 32.5.1 [ JetPack 4.5.1 ]
   Ubuntu 18.04.6 LTS
   Kernel Version: 4.9.201-tegra
 CUDA 10.2.89
   CUDA Architecture: 7.2
 OpenCV version: 4.4.0
   OpenCV Cuda: YES
 CUDNN: 8.0.0.180
 TensorRT: 7.1.3.0
 Vision Works: 1.6.0.501
 VPI: ii libnvvpi1 1.0.15 arm64 NVIDIA Vision Programming Interface library
 Vulcan: 1.2.70

The Orin has L4T 35.2.1 and DeepStream 6.2:

NVIDIA Jetson AGX Orin
 L4T 35.2.1 [ JetPack UNKNOWN ]
   Ubuntu 20.04.5 LTS
   Kernel Version: 5.10.104-tegra
 CUDA 11.4.315
   CUDA Architecture: 8.7
 OpenCV version: 4.5.4
   OpenCV Cuda: NO
 CUDNN: 8.6.0.166
 TensorRT: 8.5.2.2
 Vision Works: NOT_INSTALLED
 VPI: 2.2.4
 Vulcan: 1.3.204

Hi again,

Actually, there is this error now:

gstnvtracker: Failed to open low-level lib at /opt/nvidia/deepstream/deepstream/lib/libnvds_nvdcf.so
 dlopen error: /opt/nvidia/deepstream/deepstream/lib/libnvds_nvdcf.so: cannot open shared object file: No such file or directory
gstnvtracker: Failed to initilaize low level lib.

and indeed libnvds_nvdcf.so no longer exists in DeepStream 6.2. As a Hail Mary, I tried copying the one from the Xavier, which has DeepStream 5.1, but then I get a new error. That makes sense if this library is no longer meant to be part of DeepStream 6.2, so I don’t think copying libraries from DS 5.1 is the right path to get my app working properly:

gstnvtracker: Failed to open low-level lib at /opt/nvidia/deepstream/deepstream/lib/libnvds_nvdcf.so
 dlopen error: libnvvpi.so.1: cannot open shared object file: No such file or directory
gstnvtracker: Failed to initilaize low level lib.

But in that case, how am I supposed to run this app on my Orin?
Am I supposed to downgrade the Orin to exactly the same setup as the Xavier, with an older JetPack, older DeepStream, older TensorFlow, etc.?

Additionally, it’s getting difficult to install these old versions of DeepStream…

As you know, there are many NVIDIA samples; we will update this deepstream_pose_estimation code ASAP.
Currently, you can use another tracker method; please refer to dstest2_tracker_config.txt of deepstream-test2 in the DeepStream 6.2 SDK.
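
For example, the tracker section can point to the unified tracker library that ships with DeepStream 6.2 instead of the removed libnvds_nvdcf.so (a sketch, assuming the default install paths on Jetson and the sample NvDCF config):

[tracker]
tracker-width=640
tracker-height=384
gpu-id=0
ll-lib-file=/opt/nvidia/deepstream/deepstream/lib/libnvds_nvmultiobjecttracker.so
ll-config-file=/opt/nvidia/deepstream/deepstream/samples/configs/deepstream-app/config_tracker_NvDCF_perf.yml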

Hi fanzh,

Thank you so much for your help!
My knowledge of DeepStream is really shallow; I didn’t realise the issue was coming from the config file.
Anyway, that step seems to be solved for now; next I have issues with the following model that loads in my pipeline, a YOLO.
I will deal with it or open a new thread on the forum.

Thank you once again for your help, it is really appreciated.

Cheers
Laurent

Thanks for your update. We recommend using the deepstream-bodypose-3d sample, which supports both 2D and 3D body keypoints and works fine with the latest DS 6.2.
