Need troubleshooting assistance, please, with Tutorial - text-generation-webui.
Steps performed:
- Performed RAM Optimization:
2. Disabled desktop GUI temporarily
3. Have swap file on SSD (at boot) - Ran jetson-containers as follows:
$ jetson-containers run $(autotag text-generation-webui)
Namespace(packages=['text-generation-webui'], prefer=['local', 'registry', 'build'], disable=[''], user='dustynv', output='/tmp/autotag', quiet=False, verbose=False)
-- L4T_VERSION=36.4.3 JETPACK_VERSION=6.2 CUDA_VERSION=12.6
-- Finding compatible container image for ['text-generation-webui']
text-generation-webui:r36.4.3-transformers
V4L2_DEVICES: --device /dev/video0 --device /dev/video1
+ docker run --runtime nvidia -it --rm --network host --shm-size=8g --volume /tmp/argus_socket:/tmp/argus_socket --volume /etc/enctune.conf:/etc/enctune.conf --volume /etc/nv_tegra_release:/etc/nv_tegra_release --volume /tmp/nv_jetson_model:/tmp/nv_jetson_model --volume /var/run/dbus:/var/run/dbus --volume /var/run/avahi-daemon/socket:/var/run/avahi-daemon/socket --volume /var/run/docker.sock:/var/run/docker.sock --volume /ssd/projects/jetson-containers/data:/data -v /etc/localtime:/etc/localtime:ro -v /etc/timezone:/etc/timezone:ro --device /dev/snd -e PULSE_SERVER=unix:/run/user/1000/pulse/native -v /run/user/1000/pulse:/run/user/1000/pulse --device /dev/bus/usb --device /dev/video0 --device /dev/video1 --device /dev/i2c-0 --device /dev/i2c-1 --device /dev/i2c-2 --device /dev/i2c-4 --device /dev/i2c-5 --device /dev/i2c-7 --device /dev/i2c-9 --name jetson_container_20250318_091132 text-generation-webui:r36.4.3-transformers
root@jetson2:/#
- Ran web server launch as follows:
# cd /opt/text-generation-webui && python3 server.py \
--model-dir=/data/models/text-generation-webui \
--chat \
--listen
bash: cd: /opt/text-generation-webui: No such file or directory
root@jetson2:/#
Since steps 4 & 5 were run with cut-paste operations from the referenced tutorial, I am assuming that I didn’t mess up any keystroke.
The /opt
folder has only one sub-folder: nvidia
which it self has only nsight-compute
Can get text-generation-webui up and running with the documentation at the official site but Chat is totally unresponsive even with the smaller models.
What should I change in step 5 to fix the issue with the web server using the Jetson tutorial instructions? Thanks.
Regards.