Vllama - ollama-like runner for DGX and other Blackwell GPUs

Hello everyone!
I’ve made a nice runner for DGX Spark and RTX graphic cards that uses vllm backend.
All descriptions are on GitHub. Feel free to contribute.