Human-in-the-loop Active Covariance Learning for Improving Prediction in Small Data Sets

Homayun Afrabandpey,Tomi Peltola,Samuel Kaski
DOI: https://doi.org/10.48550/arXiv.1902.09834
2019-03-18
Abstract:Learning predictive models from small high-dimensional data sets is a key problem in high-dimensional statistics. Expert knowledge elicitation can help, and a strong line of work focuses on directly eliciting informative prior distributions for parameters. This either requires considerable statistical expertise or is laborious, as the emphasis has been on accuracy and not on efficiency of the process. Another line of work queries about importance of features one at a time, assuming them to be independent and hence missing covariance information. In contrast, we propose eliciting expert knowledge about pairwise feature similarities, to borrow statistical strength in the predictions, and using sequential decision making techniques to minimize the effort of the expert. Empirical results demonstrate improvement in predictive performance on both simulated and real data, in high-dimensional linear regression tasks, where we learn the covariance structure with a Gaussian process, based on sequential elicitation.
Machine Learning,Human-Computer Interaction
What problem does this paper attempt to address?