Abstract

The convergence speed of machine learning models trained with Federated Learning is significantly affected by non-independent and identically distributed (non-IID) data partitions, even more so in a fully decentralized setting without a central server. In this paper, we show that the impact of local class bias, an important type of data non-IIDness, can be significantly reduced by carefully designing the underlying communication topology. We present D-Cliques, a novel topology that reduces gradient bias by grouping nodes into interconnected cliques such that the local joint distribution within a clique is representative of the global class distribution. We also show how to adapt the updates of decentralized SGD to obtain unbiased gradients and implement an effective momentum with D-Cliques. Our empirical evaluation on MNIST and CIFAR10 demonstrates that our approach provides a convergence speed similar to that of a fully-connected topology with a significant reduction in the number of edges and messages. In a 1000-node topology, D-Cliques requires 98% fewer edges and 96% fewer total messages, with further possible gains using a small-world topology across cliques.
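To make the clique-construction idea concrete, the following is a minimal, illustrative sketch in Python, not the authors' exact algorithm: nodes are greedily assigned to cliques so that each clique's combined label distribution stays close (in L1 distance) to the global class distribution, and nodes within a clique are then fully connected. The function names (build_cliques, label_distribution) and the greedy heuristic are assumptions introduced for illustration.

import numpy as np

def label_distribution(labels, num_classes):
    # Normalized class histogram of a set of labels.
    counts = np.bincount(labels, minlength=num_classes)
    return counts / max(counts.sum(), 1)

def skew(clique_dist, global_dist):
    # L1 distance between a clique's joint distribution and the global one.
    return np.abs(clique_dist - global_dist).sum()

def build_cliques(node_labels, num_classes, clique_size):
    # Greedy assignment (illustrative only): each node joins the unfinished
    # clique whose combined distribution it brings closest to the global one.
    global_dist = label_distribution(np.concatenate(node_labels), num_classes)
    cliques = []  # each entry: [list of node ids, summed class counts]
    for node_id, labels in enumerate(node_labels):
        counts = np.bincount(labels, minlength=num_classes).astype(float)
        best, best_skew = None, None
        for clique in cliques:
            if len(clique[0]) >= clique_size:
                continue
            merged = clique[1] + counts
            s = skew(merged / merged.sum(), global_dist)
            if best_skew is None or s < best_skew:
                best, best_skew = clique, s
        if best is None:
            cliques.append([[node_id], counts])
        else:
            best[0].append(node_id)
            best[1] += counts
    # Intra-clique edges: every pair of nodes in a clique is connected.
    edges = [(a, b) for members, _ in cliques
             for i, a in enumerate(members) for b in members[i + 1:]]
    return cliques, edges

Inter-clique edges (e.g., a ring or small-world pattern across cliques, as mentioned in the abstract) would be added on top of the intra-clique edges returned here.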
