RuntimeError: [TensorRT-LLM][ERROR] CUDA runtime error in error: peer access is not supported between these two devices

Hi @pbi2,
Apologies for the miss.

Please try the suggestion in this post: Peer access not supported between devices - #12 by Robert_Crovella
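
It may also help to confirm which GPU pairs actually lack peer access on your machine. Here is a minimal sketch, assuming PyTorch is installed in the same environment; `blocked_peer_pairs` is a hypothetical helper name I made up for illustration, not part of TensorRT-LLM, and the only library calls it relies on are `torch.cuda.device_count()` and `torch.cuda.can_device_access_peer()`.

```python
from typing import Callable, List, Tuple


def blocked_peer_pairs(
    num_devices: int,
    can_access: Callable[[int, int], bool],
) -> List[Tuple[int, int]]:
    """Return the ordered (src, dst) device pairs that lack peer access.

    `can_access(src, dst)` should return True when device `src` can
    directly access memory on device `dst`.
    """
    return [
        (src, dst)
        for src in range(num_devices)
        for dst in range(num_devices)
        if src != dst and not can_access(src, dst)
    ]


if __name__ == "__main__":
    try:
        import torch  # assumption: PyTorch is available in this environment
    except ImportError:
        print("PyTorch is not installed; cannot query the GPUs.")
    else:
        count = torch.cuda.device_count()
        pairs = blocked_peer_pairs(count, torch.cuda.can_device_access_peer)
        if pairs:
            for src, dst in pairs:
                print(f"GPU {src} cannot access GPU {dst} as a peer")
        else:
            print(f"No blocked peer pairs among {count} visible GPU(s).")
```

If any pair is reported, that would be consistent with the error above, and the output should help when following the linked post.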

Thanks