From what I understand talking to Opus, this is the underlying issue that will resolve NVFP4 use in TensorRT-LLM, vLLM, and SGLang.
is that true? or is Opus hallucinating?