Refined Convergence and Topology Learning for Decentralized SGD with Heterogeneous Data

Batiste Le Bars; Aurélien Bellet; Marc Tommasi; Erick Lavoie; Anne-Marie Kermarrec

Preprints, Working Papers, ...
Year : 2022

Batiste Le Bars

Function : Author

Machine Learning in Information Networks

Aurélien Bellet

Function : Author
PersonId : 9877
IdHAL : aurelien-bellet
ORCID : 0000-0003-3440-1251
IdRef : 17653136X

Machine Learning in Information Networks

Marc Tommasi

Function : Author
PersonId : 399
IdHAL : marc-tommasi
ORCID : 0000-0003-2838-4408
IdRef : 121846385

Machine Learning in Information Networks

Erick Lavoie

Function : Author

Ecole Polytechnique Fédérale de Lausanne

Anne-Marie Kermarrec

Function : Author

Ecole Polytechnique Fédérale de Lausanne

Abstract

One of the key challenges in decentralized and federated learning is to design algorithms that efficiently deal with highly heterogeneous data distributions across agents. In this paper, we revisit the analysis of the popular Decentralized Stochastic Gradient Descent algorithm (D-SGD) under data heterogeneity. We exhibit the key role played by a new quantity, called neighborhood heterogeneity, on the convergence rate of D-SGD. By coupling the communication topology and the heterogeneity, our analysis sheds light on the poorly understood interplay between these two concepts. We then argue that neighborhood heterogeneity provides a natural criterion to learn data-dependent topologies that reduce (and can even eliminate) the otherwise detrimental effect of data heterogeneity on the convergence time of D-SGD. For the important case of classification with label skew, we formulate the problem of learning such a good topology as a tractable optimization problem that we solve with a Frank-Wolfe algorithm. As illustrated over a set of simulated and real-world experiments, our approach provides a principled way to design a sparse topology that balances the convergence speed and the per-iteration communication costs of D-SGD under data heterogeneity.
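The interplay between topology and data heterogeneity described in the abstract can be illustrated with a minimal sketch. This is not the authors' code or experimental setup. It assumes a standard form of D-SGD in which each agent takes a local gradient step and then averages with its neighbors through a doubly stochastic mixing matrix W, applied to a toy problem with scalar quadratic objectives f_i(x) = 0.5 * (x - b_i)^2. The function names (dsgd, ring, complete), the targets b_i, the step size, and the two topologies are all hypothetical choices made for illustration.

```python
import random


def dsgd(targets, W, lr=0.1, steps=200, noise=0.0, seed=0):
    """Toy D-SGD on scalar quadratics f_i(x) = 0.5 * (x - targets[i])**2.

    Assumed update (illustrative): local gradient step, then gossip averaging
    with the mixing matrix W. `noise` scales Gaussian gradient noise.
    """
    rng = random.Random(seed)
    n = len(targets)
    x = [0.0] * n
    for _ in range(steps):
        # Local (possibly noisy) gradient step at every agent.
        half = [
            x[i] - lr * ((x[i] - targets[i]) + noise * rng.gauss(0.0, 1.0))
            for i in range(n)
        ]
        # Each agent averages the updated values of its neighbors.
        x = [sum(W[i][j] * half[j] for j in range(n)) for i in range(n)]
    return x


def ring(n):
    """Doubly stochastic ring: weight 1/3 on self and on each of two neighbors."""
    W = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in (i - 1, i, i + 1):
            W[i][j % n] += 1.0 / 3.0
    return W


def complete(n):
    """Doubly stochastic complete graph: uniform weight 1/n everywhere."""
    return [[1.0 / n] * n for _ in range(n)]


# Heterogeneous data: half of the agents pull toward 0, the other half toward 4.
targets = [0.0, 0.0, 0.0, 4.0, 4.0, 4.0]
x_ring = dsgd(targets, ring(6))
x_full = dsgd(targets, complete(6))

print("ring spread:    ", max(x_ring) - min(x_ring))
print("complete spread:", max(x_full) - min(x_full))
```

In this toy setting with a constant step size and no gradient noise, both topologies bring the network average to the global minimizer (the mean of the targets), but agents on the sparse ring keep disagreeing with each other, while the complete graph removes the disagreement at the price of every agent communicating with all others at each step. The sketch shows only this trade-off; it does not compute the paper's neighborhood heterogeneity quantity or learn a topology.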

Domains

Machine Learning [cs.LG]; Machine Learning [stat.ML]

Main file

2204.04452.pdf (961.05 KB)

Origin : Files produced by the author(s)

Contributor : Aurélien Bellet

https://inria.hal.science/hal-03905091

Submitted on : Saturday, December 17, 2022, 7:21:04 PM

Last modification on : Wednesday, January 24, 2024, 9:54:24 AM

Dates and versions

hal-03905091 , version 1 (17-12-2022)

hal-03905091 , version 2 (23-12-2023)

Identifiers

HAL Id : hal-03905091 , version 1
ARXIV : 2204.04452

Cite

Batiste Le Bars, Aurélien Bellet, Marc Tommasi, Erick Lavoie, Anne-Marie Kermarrec. Refined Convergence and Topology Learning for Decentralized SGD with Heterogeneous Data. 2022. ⟨hal-03905091v1⟩
