The missing functionality makes it as slow as CPU.
If you have the $ for 2x 4090, you should pick up 2x A5500 instead. The memory > the processing for inference.
< Before< After
The After pic destroys the performance of 2x4090. Not even close.
You need a completely un-hobbled GPU, including p2p to do anything. Unless you are using multiple systems and building your own p2p, 4090 is a total waste of time.
- note on the bracket: I changed cases later + bracket is no longer necessary. Old case needed it. LianLi O11XL fits everything without issue.