How to run Gemma-4-NVFP4 in vLLM Docker?

If you haven’t already, I would take a look at the messages in this topic

Regarding NVFP4, while it’s supported, we still don’t have great performance with it. Lots of work to do. FP8 or INT4 will likely perform better until the issues are sorted