Single-source domain adaptation with target and conditional shift
Kun Zhang,Bernhard Schölkopf,Krikamol Muandet,Zhikun Wang,Z Zhou,Claudio Persello
2014-01-01
Abstract:The goal of supervised learning is to infer a function f from a training set Dtr={(xtr1, ytr1),...,(xtrm, ytrm)}⊆ X× Y, where X and Y denote the domains of predictors X and target Y, respectively. The estimated f is expected to generalize well on the test set Dte={(xte1, yte1),...,(xten, yten)}⊆ X× Y, where ytei are unknown. Traditionally, the training set and test set are assumed to follow the same distribution. However, in many real world problems, the training data and test data have different distributions, ie, PtrXY= PteXY, 1 and the goal is to find a learning machine that performs well on the test domain. This problem is known as domain adaptation in machine learning. If the data distribution changes arbitrarily, training data would be of no use to make predictions on the test domain. To perform domain adaptation successfully, relevant knowledge in the training (or source) domain should be transferred to the test (or target) domain. For instance, the situation where PtrXY and PteXY only differ in the marginal distribution of the covariate (ie, PtrX= PteX, while PtrY| X= PteY| X) is termed covariate shift [25, 33, 10] or sample selection bias [37], and has been well studied. For surveys on domain adaptation for classification, see, eg,[13, 17, 1]. In particular, we address the situation where both the marginal distribution PX and the conditional distribution PY| X may change across the domains. Clearly, we need to make certain assumptions for the training domain to be adaptable to the test domain. We first consider the case where PX| Y is the same on both domains. As a consequence of Bayes’ rule, the changes in PX and PY| X are caused by the change in PY, the …