Hello,
I am testing the VIA microservice. When I pull the Docker image and run it according to the user guide (using GPT-4o as the VLM and the NVIDIA NIM API), I encounter the following error. Can you help me?
Hardware Platform:
- Ubuntu 20.04.6 LTS
- GPU: A100 40GB
- Driver Version: 535.161.07
- CUDA Version: 12.2
Fatal Python error: take_gil: PyMUTEX_LOCK(gil->switch_mutex) failed
Python runtime state: initialized
Thread 0x00007f00a1bb5480 (most recent call first):
File "", line 241 in _call_with_frames_removed
File "", line 1176 in create_module
File "", line 571 in module_from_spec
File "", line 674 in _load_unlocked
File "", line 1006 in _find_and_load_unlocked
File "", line 1027 in _find_and_load
File "", line 241 in _call_with_frames_removed
File "", line 1078 in _handle_fromlist
File "/usr/local/lib/python3.10/dist-packages/scipy/fft/_pocketfft/basic.py", line 6 in
File "", line 241 in _call_with_frames_removed
File "", line 883 in exec_module
File "", line 688 in _load_unlocked
File "", line 1006 in _find_and_load_unlocked
File "", line 1027 in _find_and_load
File "/usr/local/lib/python3.10/dist-packages/scipy/fft/_pocketfft/init
…
2024-08-10 16:26:18,863 PERF Summarization/BatchSummarization time = 3381.10 ms
special_tokens_map.json: 100%|██████████| 695/695 [00:00<00:00, 4.75MB/s]
config.json: 100%|██████████| 650/650 [00:00<00:00, 5.81MB/s]
tokenizer_config.json: 100%|██████████| 1.43k/1.43k [00:00<00:00, 9.84MB/s]
tokenizer.json: 100%|██████████| 712k/712k [00:00<00:00, 740kB/s]
model.onnx: 100%|██████████| 90.4M/90.4M [00:02<00:00, 43.7MB/s]
Fetching 5 files: 100%|██████████| 5/5 [00:04<00:00, 1.19it/s]
2024-08-10 16:26:27,024 INFO Loaded Guardrails
2024-08-10 16:26:28,025 INFO Stopping VIA pipeline
2024-08-10 16:26:28,055 INFO Stopped VIA pipeline
2024-08-10 16:26:28,055 ERROR Failed to load VIA pipeline - Failed to load Decoder on GPU 0