If you haven’t already, I would take a look at the messages in this topic.
Regarding NVFP4: it’s supported, but we still don’t have great performance with it, and there is a lot of work left to do. FP8 or INT4 will likely perform better until those issues are sorted out.