Since NVIDIA released the official vllm image and THOR benchmark results (Jetson Benchmarks | NVIDIA Developer), I started testing vLLM-compatible models on this platform.
I found two FP8 models that benchmark with vllm bench but return garbled output: