SC20 Demo: Maximizing Performance for Distributed Machine Learning and Deep Learning with SHARP

jwitsoe · November 15, 2021, 5:00pm

Originally published at: https://developer.nvidia.com/blog/sc20-demo-maximizing-performance-for-distributed-machine-learning-and-deep-learning-with-sharp/

Modern machine learning data centers require complex computations and fast, efficient data delivery. The NVIDIA Mellanox Scalable Hierarchical Aggregation and Reduction Protocol (SHARP) takes advantage of the in-network computing capabilities in the NVIDIA Mellanox Quantum switch, dramatically improving the performance of distributed machine learning workloads. SHARP technology improves upon the performance of MPI and…
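To make the idea of hierarchical aggregation and reduction more concrete, here is a minimal toy sketch in plain Python. It does not use or represent the SHARP API. The names `tree_allreduce`, `elementwise_sum`, and the `radix` parameter are hypothetical, introduced only for illustration, and the "switch levels" are simulated with ordinary lists. The sketch assumes each worker holds a gradient vector of the same length and shows how partial sums could be combined level by level up a tree before the final result is handed back to every worker.

```python
from typing import List

Vector = List[float]


def elementwise_sum(vectors: List[Vector]) -> Vector:
    """Sum equally sized vectors element by element."""
    return [sum(values) for values in zip(*vectors)]


def tree_allreduce(worker_grads: List[Vector], radix: int = 2) -> List[Vector]:
    """Toy simulation of a hierarchical (tree-based) sum allreduce.

    Each pass of the loop stands in for one level of aggregation nodes:
    every node sums up to `radix` inputs and forwards a single partial
    result upward. This is an illustrative assumption, not SHARP itself.
    """
    if radix < 2:
        raise ValueError("radix must be at least 2")
    if not worker_grads:
        return []

    level = worker_grads
    while len(level) > 1:
        # Group the current level into chunks of `radix` and reduce each chunk.
        level = [
            elementwise_sum(level[i:i + radix])
            for i in range(0, len(level), radix)
        ]

    total = level[0]
    # Distribute a copy of the fully reduced vector back to every worker.
    return [list(total) for _ in worker_grads]


if __name__ == "__main__":
    grads = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
    print(tree_allreduce(grads, radix=2))
```

In this toy model, the number of aggregation levels grows with the logarithm of the worker count. That is one reason a tree-shaped reduction can be attractive for large jobs. A real deployment would rely on the communication library and the switch hardware rather than host-side loops like these.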
