Semi-generative modelling: Covariate-shift adaptation with cause and effect features

Julius von Kügelgen; Alexander Mey; Marco Loog

Pattern Recognition and Bioinformatics

Research output: Contribution to journal › Conference article › Scientific › peer-review

Abstract

Current methods for covariate-shift adaptation use unlabelled data to compute importance weights or domain-invariant features, while the final model is trained on labelled data only. Here, we consider a particular case of covariate shift which allows us also to learn from unlabelled data, that is, combining adaptation with semi-supervised learning. Using ideas from causality, we argue that this requires learning with both causes, X_C, and effects, X_E, of a target variable, Y, and show how this setting leads to what we call a semi-generative model, P(Y,X_E|X_C,θ). Our approach is robust to domain shifts in the distribution of causal features and leverages unlabelled data by learning a direct map from causes to effects. Experiments on synthetic data demonstrate significant improvements in classification over purely-supervised and importance-weighting baselines.
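
The semi-generative model named in the abstract can be unpacked with the chain rule. The sketch below is an illustration rather than the paper's own derivation: the split of θ into θ_Y and θ_E and the sum over a discrete Y (the classification case) are assumptions made here for clarity.

```latex
% Chain-rule factorization of the semi-generative model.
% Assumption: \theta = (\theta_Y, \theta_E), one parameter block per factor.
\begin{align*}
P(Y, X_E \mid X_C, \theta)
  &= P(Y \mid X_C, \theta_Y)\, P(X_E \mid Y, X_C, \theta_E) \\
% An unlabelled pair (x_C, x_E) contributes by marginalizing over Y
% (assumption: Y is discrete, as in classification).
P(x_E \mid x_C, \theta)
  &= \sum_{y} P(y \mid x_C, \theta_Y)\, P(x_E \mid y, x_C, \theta_E)
\end{align*}
```

Neither expression contains a model of P(X_C), which is consistent with the abstract's claim of robustness to shifts in the distribution of the causal features. The second line is the cause-to-effect map through which unlabelled data can inform θ.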

Original language	English
Number of pages	9
Journal	Proceedings of Machine Learning Research
Volume	89
Publication status	Published - 2020
Event	22nd International Conference on Artificial Intelligence and Statistics, AISTATS 2019, Naha, Japan
Duration	16 Apr 2019 → 18 Apr 2019

Cite this

@article{69fff93a08c64f8ba807b4ae8e1ffcbc,
  title = "Semi-generative modelling: Covariate-shift adaptation with cause and effect features",
  abstract = "Current methods for covariate-shift adaptation use unlabelled data to compute importance weights or domain-invariant features, while the final model is trained on labelled data only. Here, we consider a particular case of covariate shift which allows us also to learn from unlabelled data, that is, combining adaptation with semi-supervised learning. Using ideas from causality, we argue that this requires learning with both causes, $X_C$, and effects, $X_E$, of a target variable, $Y$, and show how this setting leads to what we call a semi-generative model, $P(Y, X_E \mid X_C, \theta)$. Our approach is robust to domain shifts in the distribution of causal features and leverages unlabelled data by learning a direct map from causes to effects. Experiments on synthetic data demonstrate significant improvements in classification over purely-supervised and importance-weighting baselines.",
  author = "{von K{\"u}gelgen}, Julius and Alexander Mey and Marco Loog",
  year = "2020",
  language = "English",
  volume = "89",
  journal = "Proceedings of Machine Learning Research",
  issn = "1938-7228",
  note = "22nd International Conference on Artificial Intelligence and Statistics, AISTATS 2019 ; Conference date: 16-04-2019 Through 18-04-2019",
}

TY  - JOUR
T1  - Semi-generative modelling: Covariate-shift adaptation with cause and effect features
T2  - 22nd International Conference on Artificial Intelligence and Statistics, AISTATS 2019
AU  - von Kügelgen, Julius
AU  - Mey, Alexander
AU  - Loog, Marco
PY  - 2020
Y1  - 2020
AB  - Current methods for covariate-shift adaptation use unlabelled data to compute importance weights or domain-invariant features, while the final model is trained on labelled data only. Here, we consider a particular case of covariate shift which allows us also to learn from unlabelled data, that is, combining adaptation with semi-supervised learning. Using ideas from causality, we argue that this requires learning with both causes, X_C, and effects, X_E, of a target variable, Y, and show how this setting leads to what we call a semi-generative model, P(Y,X_E|X_C,θ). Our approach is robust to domain shifts in the distribution of causal features and leverages unlabelled data by learning a direct map from causes to effects. Experiments on synthetic data demonstrate significant improvements in classification over purely-supervised and importance-weighting baselines.
UR  - http://www.scopus.com/inward/record.url?scp=85084995217&partnerID=8YFLogxK
M3  - Conference article
AN  - SCOPUS:85084995217
SN  - 1938-7228
VL  - 89
JO  - Proceedings of Machine Learning Research
JF  - Proceedings of Machine Learning Research
Y2  - 16 April 2019 through 18 April 2019
ER  -
