Understanding and Improving Model Averaging in Federated Learning on Heterogeneous Data

  • Tailin ZHOU
  • , Zehong LIN*
  • , Jun ZHANG
  • , Danny H.K. TSANG
  • *Corresponding author for this work

Research output: Journal PublicationsJournal Article (refereed)peer-review

10 Citations (Scopus)

Abstract

Model averaging is a widely adopted technique in federated learning (FL) that aggregates multiple client models to obtain a global model. Remarkably, model averaging in FL yields a superior global model, even when client models are trained with non-convex objective functions and on heterogeneous local datasets. However, the rationale behind its success remains poorly understood. To shed light on this issue, we first visualize the loss landscape of FL over client and global models to illustrate their geometric properties. The visualization shows that the client models encompass the global model within a common basin, and interestingly, the global model may deviate from the basin's center while still outperforming the client models. To gain further insights into model averaging in FL, we decompose the expected loss of the global model into five factors related to the client models. Specifically, our analysis reveals that the global model loss after early training mainly arises from i) the client model's loss on non-overlapping data between client datasets and the global dataset and ii) the maximum distance between the global and client models. Based on the findings from our loss landscape visualization and loss decomposition, we propose utilizing iterative moving averaging (IMA) on the global model at the late training phase to reduce its deviation from the expected minimum, while constraining client exploration to limit the maximum distance between the global and client models. Our experiments demonstrate that incorporating IMA into existing FL methods significantly improves their accuracy and training speed on various heterogeneous data setups of benchmark datasets.

Original languageEnglish
Pages (from-to)12131-12145
Number of pages15
JournalIEEE Transactions on Mobile Computing
Volume23
Issue number12
Early online date28 May 2024
DOIs
Publication statusPublished - Dec 2024
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2002-2012 IEEE.

Funding

This work was supported in part by Hong Kong Research Grants Council under Grant AoE/E-601/22-R, in part by the NSFC/RGC Collaborative Research Scheme under Grant CRS_HKUST603/22, in part by Guangzhou Municipal Science and Technology Project under Grant 2023A03J0011, in part by the Guangdong Provincial Key Laboratory of Integrated Communications, Sensing and Computation for Ubiquitous Internet of Things, and in part by National Foreign Expert Project under Grant G2022030026.

Keywords

  • Federated learning
  • heterogeneous data
  • loss decomposition
  • loss landscape visualization
  • model averaging

Fingerprint

Dive into the research topics of 'Understanding and Improving Model Averaging in Federated Learning on Heterogeneous Data'. Together they form a unique fingerprint.

Cite this