Big Cities Transfer Learning

Abdoullahi Diasse,Zhiyong Li
DOI: https://doi.org/10.1145/3195106.3195121
2018-01-01
Abstract:Big data has brought many new challenges for machine learning research. In many learning tasks, we have to deal with diverse data from different domains, different representations, different distributions, scale, and density in order to achieve a good performance. With the recent advances in data storage and internet technology, data become more prominent, noisier and more complex which bring new opportunities and challenges into Transfer learning. In Urban computing when inferring knowledge for new or less developed cities we often need to deal with large-scale, multi-view, noisy and incomplete data. This calls for advanced techniques that can make practical use of massive, sparse and noisy data to efficiently transfer knowledge of multiple and diverse datasets (views) from a source domain to a target domain. Such a problem becomes much more challenging in an unsupervised learning setting where we do not dispose any label in the target domain which is not uncommon in my real-world scenarios. To tackle this challenge, in this paper we propose novel unsupervised multi-view transfer with missing data by learning a shared subspace across views from different domains through a latent low-rank transfer. Before performing knowledge transfer our approach learns an enriched representation of the source domain via a novel joint multi-view dictionary learning based on low-rank tensor. We also propose a multi-view co-classifier to predict the label in the target domain. Tailored for big data applications with EM-ADMM based optimization algorithm our method can efficiently perform knowledge transfer from a multi-view source domain to an unlabeled multi-view target domain with a high rate of missing values and noise.
What problem does this paper attempt to address?