Data Summarization for Federated Learning

Julianna Devillers; Olivier Brun; Balakrishna Prabhu

Communication Dans Un Congrès Année : 2023

Data Summarization for Federated Learning

(1, 2) , (2) , (2)

1
2

Julianna Devillers

Fonction : Auteur
PersonId : 1312095

Institut Supérieur de l'Aéronautique et de l'Espace

Équipe Services et Architectures pour Réseaux Avancés

Olivier Brun

Fonction : Auteur
PersonId : 13446
IdHAL : olivier-brun
ORCID : 0000-0003-4685-5306
IdRef : 156351994

Équipe Services et Architectures pour Réseaux Avancés

Balakrishna Prabhu

Fonction : Auteur
PersonId : 13979
IdHAL : balakrishnaprabhu
IdRef : 092720447

Équipe Services et Architectures pour Réseaux Avancés

Résumé

We explore data summarization techniques as a mean to reduce the energy footprint of Federated Learning (FL). We formulate the problem of selecting a small subset of data points that best represent the gradient of each local dataset as a submodular maximization problem and provide sufficient conditions under which the FL training is guaranteed to converge to the same global model as if the whole local datasets have been used on each client. Experimental results on IID and non-IID datasets show that this approach yields a similar accuracy as training on the full local datasets, but with a significant reduction of runtimes. There is however no clear advantage of data summarization over random sampling.

Mots clés

Data summarization FedAvg convergence

Domaines

Informatique [cs]

Fichier principal

MLNreport.pdf (1.17 Mo)

Origine	Fichiers produits par l'(les) auteur(s)

Balakrishna Prabhu : Connectez-vous pour contacter le contributeur

https://hal.science/hal-04295982

Soumis le : lundi 20 novembre 2023-15:27:18

Dernière modification le : lundi 15 janvier 2024-16:53:56

Dates et versions

hal-04295982 , version 1 (20-11-2023)

Identifiants

HAL Id : hal-04295982 , version 1

Citer

Julianna Devillers, Olivier Brun, Balakrishna Prabhu. Data Summarization for Federated Learning. Proceedings of the 6th International Conference on Machine Learning for Networking (MLN'2023), Nov 2023, Paris, France. ⟨hal-04295982⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-TLSE2 CNRS INSA-TOULOUSE LAAS LAAS-SARA UT1-CAPITOLE LAAS-RESEAUX-ET-COMMUNICATIONS INSA-GROUPE LAAS-RISC TOULOUSE-INP UNIV-UT3 UT3-TOULOUSEINP

108 Consultations

44 Téléchargements

Data Summarization for Federated Learning

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager