Hi,
When trying to run the enclosed docker compose for TRT LLM I end up with the following error. Any suggestions why this is the case, since as far as I can tell the docker file is aligned with the one at TRT LLM for Inference | DGX Spark ?
/magnus
docker-compose.txt (1.2 KB)
Fetching 18 files: 100%|██████████| 18/18 [09:41<00:00, 32.33s/it]
/root/.cache/huggingface/hub/models–openai–gpt-oss-20b/snapshots/6cee5e81ee83917806bbde320786a8fb61efebee
/usr/local/lib/python3.12/dist-packages/torch/cuda/init.py:63: FutureWarning: The pynvml package is deprecated. Please install nvidia-ml-py instead. If you did not install pynvml directly, please report this to the maintainers of the package that installed pynvml for you.
import pynvml # type: ignore[import]
Traceback (most recent call last):
File “/usr/local/bin/trtllm-serve”, line 3, in
from tensorrt_llm.commands.serve import main
File “/usr/local/lib/python3.12/dist-packages/tensorrt_llm/init.py”, line 70, in
import tensorrt_llm._torch.models as torch_models
File “/usr/local/lib/python3.12/dist-packages/tensorrt_llm/_torch/init.py”, line 1, in
from .llm import LLM
File “/usr/local/lib/python3.12/dist-packages/tensorrt_llm/_torch/llm.py”, line 1, in
from tensorrt_llm.llmapi.llm import _TorchLLM
File “/usr/local/lib/python3.12/dist-packages/tensorrt_llm/llmapi/init.py”, line 1, in
from .._torch.async_llm import AsyncLLM
File “/usr/local/lib/python3.12/dist-packages/tensorrt_llm/_torch/async_llm.py”, line 3, in
from ..llmapi.llm import LLM
File “/usr/local/lib/python3.12/dist-packages/tensorrt_llm/llmapi/llm.py”, line 17, in
from tensorrt_llm._utils import mpi_disabled
File “/usr/local/lib/python3.12/dist-packages/tensorrt_llm/_utils.py”, line 45, in
from tensorrt_llm.bindings import DataType, GptJsonConfig, LayerType
ImportError: libcuda.so.1: cannot open shared object file: No such file or directory