Originally published at: Simplify System Memory Management with the Latest NVIDIA GH200 NVL2 Enterprise RA | NVIDIA Technical Blog
NVIDIA Enterprise Reference Architectures (Enterprise RAs) can reduce the time and cost of deploying AI infrastructure solutions. They provide a streamlined approach for building flexible and cost-effective accelerated infrastructure while ensuring compatibility and interoperability. The latest Enterprise RA details an optimized cluster configuration for systems integrated with NVIDIA GH200 NVL2 and the NVIDIA Spectrum-X Ethernet…
Where can I find the version of PyTorch that is mentioned in the post?
AI developers working at the framework level will appreciate recently added Unified Virtual Memory (UVM) support for PyTorch to GH200 NVL2.
@jwitsoe following up to see if there’s a version of PyTorch with UVM support. I would like to use this with vLLM on GH200.