Skip to Main content Skip to Navigation
Preprints, Working Papers, ...

Unsupervised domain adaptation with non-stochastic missing data

Abstract : We consider unsupervised domain adaptation (UDA) for classification problems in the presence of missing data in the unlabelled target domain. More precisely, motivated by practical applications, we analyze situations where distribution shift exists between domains and where some components are systematically absent on the target domain without available supervision for imputing the missing target components. We propose a generative approach for imputation. Imputation is performed in a domain-invariant latent space and leverages indirect supervision from a complete source domain. We introduce a single model performing joint adaptation, imputation and classification which, under our assumptions, minimizes an upper bound of its target generalization error and performs well under various representative divergence families (H-divergence, Optimal Transport). Moreover, we compare the target error of our Adaptation-imputation framework and the "ideal" target error of a UDA classifier without missing target components. Our model is further improved with self-training, to bring the learned source and target class posterior distributions closer. We perform experiments on three families of datasets of different modalities: a classical digit classification benchmark, the Amazon product reviews dataset both commonly used in UDA and real-world digital advertising datasets. We show the benefits of jointly performing adaptation, classification and imputation on these datasets.
Document type :
Preprints, Working Papers, ...
Complete list of metadata

https://hal.archives-ouvertes.fr/hal-03338879
Contributor : Matthieu Kirchmeyer Connect in order to contact the contributor
Submitted on : Wednesday, September 15, 2021 - 9:15:05 AM
Last modification on : Tuesday, October 19, 2021 - 5:34:13 PM

Files

adapt_imput.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-03338879, version 2
  • ARXIV : 2109.09505

Citation

Matthieu Kirchmeyer, Patrick Gallinari, Alain Rakotomamonjy, Amin Mantrach. Unsupervised domain adaptation with non-stochastic missing data. 2021. ⟨hal-03338879v2⟩

Share

Metrics

Record views

36

Files downloads

32