Hello,
Has anyone successfully run Llama 405B in NVFP4 on two Sparks connected over ConnectX-7 (CX7)? I’ve tried a number of approaches, and each attempt failed with memory errors or similar problems. I know there is a custom NIM container built specifically for SM 121 silicon. Any help or pointers would be greatly appreciated.