Learning from Few Positives: a Provably Accurate Metric Learning Algorithm to deal with Imbalanced Data - Institut d'Optique Graduate School Accéder directement au contenu
Communication Dans Un Congrès Année : 2020

Learning from Few Positives: a Provably Accurate Metric Learning Algorithm to deal with Imbalanced Data

Rémi Emonet
Amaury Habrard
Guillaume Metzler
Marc Sebban

Résumé

Learning from imbalanced data, where the positive examples are very scarce, remains a challenging task from both a theoretical and algorithmic perspective. In this paper, we address this problem using a metric learning strategy. Unlike the state-of-the-art methods, our algorithm MLFP, for Metric Learning from Few Positives, learns a new representation that is used only when a test query is compared to a minority training example. From a geometric perspective, it artificially brings positive examples closer to the query without changing the distances to the negative (majority class) data. This strategy allows us to expand the decision boundaries around the positives, yielding a better F-Measure, a criterion which is suited to deal with imbalanced scenarios. Beyond the algorithmic contribution provided by MLFP, our paper presents generalization guarantees on the false positive and false negative rates. Extensive experiments conducted on several imbalanced datasets show the effectiveness of our method.
Fichier principal
Vignette du fichier
MLFP.pdf (1.5 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02611586 , version 1 (18-05-2020)

Identifiants

  • HAL Id : hal-02611586 , version 1

Citer

Rémi Viola, Rémi Emonet, Amaury Habrard, Guillaume Metzler, Marc Sebban. Learning from Few Positives: a Provably Accurate Metric Learning Algorithm to deal with Imbalanced Data. IJCAI 2020, the 29th International Joint Conference on Artificial Intelligence, Jul 2020, Yokohama, Japan. ⟨hal-02611586⟩
153 Consultations
183 Téléchargements

Partager

Gmail Facebook X LinkedIn More