I tried to deploy TensorRT-LLM by following the guide below.
TensorRT-LLM Deployment on Jetson Orin
I ran into problems when I reached step 1.3.
I worked through some of them, but I could not get past the errors raised when running build_wheel.py.
Here is what I did:
Open this website and choose v0.12.0-jetson.
Download the zip and copy it to the device.
unzip TensorRT-LLM-0.12.0-jetson.zip
mv TensorRT-LLM-0.12.0-jetson TensorRT-LLM
cd TensorRT-LLM
git init
git remote add origin https://github.com/NVIDIA/TensorRT-LLM.git
git fetch origin v0.12.0-jetson
git checkout FETCH_HEAD --force
git submodule update --init --recursive
sudo cp /etc/apt/sources.list.d/nvidia-l4t-apt-source.list /etc/apt/sources.list.d/nvidia-l4t-apt-source.list.bak
echo "deb https://repo.download.nvidia.com/jetson/common r36.4 main" | sudo tee /etc/apt/sources.list.d/nvidia-l4t-apt-source.list
echo "deb https://repo.download.nvidia.com/jetson/t234 r36.4 main" | sudo tee -a /etc/apt/sources.list.d/nvidia-l4t-apt-source.list
sudo apt update
sudo apt install -y tensorrt libnvinfer-dev libnvinfer-plugin-dev libcudnn9-dev
sudo apt install -y cmake build-essential ccache
echo "export PATH=/usr/local/cuda/bin:\$PATH" >> ~/.bashrc
echo "export LD_LIBRARY_PATH=/usr/local/cuda/lib64:\$LD_LIBRARY_PATH" >> ~/.bashrc
source ~/.bashrc
python3 scripts/build_wheel.py --clean --cuda_architectures 87 -DENABLE_MULTI_DEVICE=0 -DTENSORRT_ROOT=/usr --build_type Release --benchmarks --use_ccache
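Before the build step, it can be worth confirming that the two exports written to ~/.bashrc are active in the current shell. The following is a minimal sketch, not part of the original guide; the helper name path_contains is hypothetical, and the directories are the ones used in the export lines above.

```shell
# path_contains LIST ENTRY
# Succeeds if ENTRY is one of the colon-separated items in LIST.
# (Hypothetical helper, introduced here only for illustration.)
path_contains() {
    case ":$1:" in
        *":$2:"*) return 0 ;;
        *) return 1 ;;
    esac
}

# Report anything that the exports above should have added but did not.
path_contains "$PATH" /usr/local/cuda/bin \
    || echo "CUDA bin directory is not on PATH"
path_contains "${LD_LIBRARY_PATH:-}" /usr/local/cuda/lib64 \
    || echo "CUDA lib64 directory is not on LD_LIBRARY_PATH"
```

If either message is printed, the build is probably running with a different environment than intended, which is worth ruling out before looking at the linker errors.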
Error output when running build_wheel.py:
collect2: error: ld returned 1 exit status
/usr/bin/ld: /usr/lib/aarch64-linux-gnu/libnvinfer.so: undefined reference to `nvdla::IProfile::setCanCompressStructuredSparseWeights(bool)'
/usr/bin/ld: /usr/lib/aarch64-linux-gnu/libnvinfer.so: undefined reference to `nvdla::destroyNetwork(nvdla::INetwork*)'
gmake[3]: *** [benchmarks/CMakeFiles/bertBenchmark.dir/build.make:114: benchmarks/bertBenchmark] Error 1
gmake[2]: *** [CMakeFiles/Makefile2:3957: benchmarks/CMakeFiles/bertBenchmark.dir/all] Error 2
gmake[2]: *** Waiting for unfinished jobs....
/usr/bin/ld: /usr/lib/aarch64-linux-gnu/libnvinfer.so: undefined reference to `nvdla::INetwork::addReduce(nvdla::ITensor*, nvdla::PoolingType, nvdla::Weights)'
/usr/bin/ld: /usr/lib/aarch64-linux-gnu/libnvinfer.so: undefined reference to `nvdla::createWisdom()'
/usr/bin/ld: /usr/lib/aarch64-linux-gnu/libnvinfer.so: undefined reference to `nvdla::IPoolingLayer::setPoolingPaddingInclusionType(nvdla::PoolingPaddingInclusionType)'
/usr/bin/ld: /usr/lib/aarch64-linux-gnu/libnvinfer.so: undefined reference to `nvdla::IConcatenationLayer::setAxis(int)'
/usr/bin/ld: /usr/lib/aarch64-linux-gnu/libnvinfer.so: undefined reference to `nvdla::destroyWisdom(nvdla::IWisdom*)'
/usr/bin/ld: /usr/lib/aarch64-linux-gnu/libnvinfer.so: undefined reference to `nvdla::IProfile::setCanGenerateDetailedLayerwiseStats(bool)'
/usr/bin/ld: /usr/lib/aarch64-linux-gnu/libnvinfer.so: undefined reference to `nvdla::IProfile::setCanCompressStructuredSparseWeights(bool)'
/usr/bin/ld: /usr/lib/aarch64-linux-gnu/libnvinfer.so: undefined reference to `nvdla::INetwork::addTranspose(nvdla::ITensor*, nvdla::Dims4)'
/usr/bin/ld: /usr/lib/aarch64-linux-gnu/libnvinfer.so: undefined reference to `nvdla::INetwork::addReduce(nvdla::ITensor*, nvdla::PoolingType, nvdla::Weights)'
/usr/bin/ld: /usr/lib/aarch64-linux-gnu/libnvinfer.so: undefined reference to `nvdla::createNetwork()'
/usr/bin/ld: /usr/lib/aarch64-linux-gnu/libnvinfer.so: undefined reference to `nvdla::IPoolingLayer::setPoolingPaddingInclusionType(nvdla::PoolingPaddingInclusionType)'
/usr/bin/ld: /usr/lib/aarch64-linux-gnu/libnvinfer.so: undefined reference to `nvdla::IProfile::setCanGenerateLayerwiseStats(bool)'
/usr/bin/ld: /usr/lib/aarch64-linux-gnu/libnvinfer.so: undefined reference to `nvdla::destroyWisdom(nvdla::IWisdom*)'
/usr/bin/ld: /usr/lib/aarch64-linux-gnu/libnvinfer.so: undefined reference to `nvdla::IProfile::setCanGenerateDetailedLayerwiseStats(bool)'
/usr/bin/ld: /usr/lib/aarch64-linux-gnu/libnvinfer.so: undefined reference to `nvdla::ISoftMaxLayer::setAxis(int)'
/usr/bin/ld: /usr/lib/aarch64-linux-gnu/libnvinfer.so: undefined reference to `nvdla::INetwork::addResize(nvdla::ITensor*, nvdla::ResizeMode, nvdla::Weights)'
/usr/bin/ld: /usr/lib/aarch64-linux-gnu/libnvinfer.so: undefined reference to `nvdla::INetwork::addTranspose(nvdla::ITensor*, nvdla::Dims4)'
/usr/bin/ld: /usr/lib/aarch64-linux-gnu/libnvinfer.so: undefined reference to `nvdla::createNetwork()'
/usr/bin/ld: /usr/lib/aarch64-linux-gnu/libnvinfer.so: undefined reference to `nvdla::IProfile::setCanGenerateLayerwiseStats(bool)'
/usr/bin/ld: /usr/lib/aarch64-linux-gnu/libnvinfer.so: undefined reference to `nvdla::ISoftMaxLayer::setAxis(int)'
collect2: error: ld returned 1 exit status
/usr/bin/ld: /usr/lib/aarch64-linux-gnu/libnvinfer.so: undefined reference to `nvdla::INetwork::addResize(nvdla::ITensor*, nvdla::ResizeMode, nvdla::Weights)'
gmake[3]: *** [benchmarks/CMakeFiles/gptSessionBenchmark.dir/build.make:114: benchmarks/gptSessionBenchmark] Error 1
gmake[2]: *** [CMakeFiles/Makefile2:3917: benchmarks/CMakeFiles/gptSessionBenchmark.dir/all] Error 2
collect2: error: ld returned 1 exit status
gmake[3]: *** [benchmarks/CMakeFiles/gptManagerBenchmark.dir/build.make:114: benchmarks/gptManagerBenchmark] Error 1
gmake[2]: *** [CMakeFiles/Makefile2:3997: benchmarks/CMakeFiles/gptManagerBenchmark.dir/all] Error 2
gmake[1]: *** [CMakeFiles/Makefile2:3884: benchmarks/CMakeFiles/benchmarks.dir/rule] Error 2
gmake: *** [Makefile:1154: benchmarks] Error 2
Traceback (most recent call last):
  File "/home/evoc/TensorRT-LLM/scripts/build_wheel.py", line 412, in <module>
    main(**vars(args))
  File "/home/evoc/TensorRT-LLM/scripts/build_wheel.py", line 206, in main
    build_run(
  File "/usr/lib/python3.10/subprocess.py", line 526, in run
    raise CalledProcessError(retcode, process.args,
subprocess.CalledProcessError: Command 'cmake --build . --config Release --parallel 8 --target tensorrt_llm nvinfer_plugin_tensorrt_llm th_common bindings benchmarks executorWorker ' returned non-zero exit status 2.
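The linker output above repeats the same symbols several times, most likely because the parallel build interleaves messages from the three failing benchmark targets (bertBenchmark, gptSessionBenchmark, gptManagerBenchmark). A small helper like the sketch below reduces it to the unique list of unresolved symbols. The function name missing_symbols and the log file name build.log are hypothetical; the build output would have to be saved to a file first.

```shell
# missing_symbols LOGFILE
# Print each symbol the linker could not resolve, once, in sorted order.
# (Hypothetical helper; LOGFILE is a saved copy of the build output.)
missing_symbols() {
    grep -o "undefined reference to \`[^']*'" "$1" \
        | sed "s/^undefined reference to \`//; s/'\$//" \
        | LC_ALL=C sort -u
}

# Example (assumes the build output was saved as build.log):
#   missing_symbols build.log
```

In the output shown here, every unresolved symbol is in the nvdla namespace and is referenced from /usr/lib/aarch64-linux-gnu/libnvinfer.so, so the deduplicated list is short.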
Environment:
TensorRT Version: 10.3.0.30
CUDA Version: 12.6, V12.6.68
Operating System + Version: JetPack 6.2.1
Python Version (if applicable): 3.10.12
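One further check that might narrow this down is to see whether any installed shared library actually defines the nvdla symbols that libnvinfer.so is asking for. The sketch below is a diagnostic only, not a confirmed fix; the helper name find_symbol_provider is hypothetical, and it assumes GNU nm (from binutils) is installed.

```shell
# find_symbol_provider DIR PATTERN
# Print every shared library under DIR whose dynamic symbol table defines
# a symbol matching PATTERN (names are demangled before matching).
# (Hypothetical helper; requires GNU nm.)
find_symbol_provider() {
    find "$1" -type f -name '*.so*' 2>/dev/null | while IFS= read -r lib; do
        if nm -D -C --defined-only "$lib" 2>/dev/null | grep -q -- "$2"; then
            printf '%s\n' "$lib"
        fi
    done
}

# Example, using one of the symbols from the error output:
#   find_symbol_provider /usr/lib/aarch64-linux-gnu 'nvdla::createNetwork'
```

If a library is printed, its directory may simply be missing from the benchmark link line or the linker search path. On Jetson images the provider is often reported to be libnvdla_compiler.so, but that name is an assumption to verify with the check above. If nothing is printed, the installed TensorRT packages may not match the rest of the JetPack installation.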
Thanks to anyone who can help solve my problem.
