Efficient Federated Learning in the Era of LLMs with Message Quantization and Streaming

Originally published at: https://developer.nvidia.com/blog/efficient-federated-learning-in-the-era-of-llms-with-message-quantization-and-streaming/

Federated learning (FL) has emerged as a promising approach for training machine learning models across distributed data sources while preserving data privacy. However, FL faces significant challenges from communication overhead and local resource constraints when balancing model requirements against communication capabilities. In the current era of large language models (LLMs) in particular, FL faces computational…

Reducing communication size and memory usage is critical for federated training, especially with LLMs. This work reduces the memory required during LLM streaming and applies quantization techniques to shrink the communication message size.
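To make the quantization idea concrete, here is a minimal sketch of one common approach: absmax int8 quantization of a model-update tensor before transmission, with dequantization on the receiving side. This is an illustrative scheme under assumed parameters, not the specific implementation described in the blog post; the function names are hypothetical.

```python
import numpy as np

def quantize_int8(weights: np.ndarray):
    """Map float32 weights to int8 plus a per-tensor scale (absmax scheme)."""
    # Scale so the largest-magnitude value maps to 127.
    scale = float(np.abs(weights).max()) / 127.0 or 1.0
    q = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize_int8(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover an approximate float32 tensor from the int8 payload."""
    return q.astype(np.float32) * scale

# Simulated model update: quantize before sending, dequantize on arrival.
update = np.random.randn(1024).astype(np.float32)
payload, scale = quantize_int8(update)
restored = dequantize_int8(payload, scale)

ratio = payload.nbytes / update.nbytes   # 0.25: the int8 message is 4x smaller
max_err = float(np.abs(update - restored).max())
```

Sending int8 values plus a single float scale cuts the message to roughly a quarter of the float32 size, at the cost of a bounded per-element rounding error of at most half a quantization step.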