Consistent infrastructure-level CUDA crash when calling the Boltz2 endpoint via API (/v1/biology/mit/boltz2/predict)

Every call I make to the Boltz2 endpoint via the API (/v1/biology/mit/boltz2/predict) fails with an infrastructure-level CUDA crash. The request is well-formed and authorized, but the model crashes with "CUDA error: an illegal memory access was encountered". Is this a known issue, and is there a workaround?