Multi-Task Learning Regression via Convex Clustering

Akira Okazaki, Shuichi Kawano
DOI: https://doi.org/10.1016/j.csda.2024.107956
IF: 2.035
2024-03-26
Computational Statistics & Data Analysis
Abstract: Multi-task learning (MTL) is a methodology that aims to improve the overall performance of estimation and prediction by sharing common information among related tasks. In MTL, several assumptions about the relationships among tasks, and methods to incorporate them, have been considered. A natural assumption in practice is that tasks fall into clusters according to their characteristics. Under this assumption, the group fused regularization approach clusters the tasks by shrinking the differences among them, which enables the transfer of common information within the same cluster. However, this approach also transfers information between different clusters, which worsens estimation and prediction. To overcome this problem, an MTL method is proposed with a centroid parameter representing the cluster center of each task. Because this model separates the parameters for regression from the parameters for clustering, estimation and prediction accuracy for the regression coefficient vectors are improved. The effectiveness of the proposed method is shown through Monte Carlo simulations and applications to real data.
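The contrast drawn in the abstract can be sketched in symbols. The following is an illustrative sketch only, under assumed notation that does not appear in the abstract: w_m is the regression coefficient vector of task m, u_m is its centroid parameter, L_m is the loss of task m, r_{m,l} >= 0 is a weight between tasks m and l, and lambda, lambda_1, lambda_2 are regularization parameters. The first objective is a generic group fused penalty placed directly on the coefficients; the second is one plausible way to move the fusion onto the centroids so that the w_m are only pulled toward their own centroid. The exact formulation in the paper may differ.

```latex
% Assumed notation (not from the abstract): w_m, u_m, L_m, r_{m,l}, \lambda, \lambda_1, \lambda_2.
% (a) Group fused regularization: differences between coefficient vectors are shrunk directly.
\min_{w_1,\dots,w_T} \;
  \sum_{m=1}^{T} L_m(w_m)
  + \lambda \sum_{m<l} r_{m,l} \, \lVert w_m - w_l \rVert_2

% (b) Sketch with centroid parameters: the fusion penalty acts on u_m, not on w_m.
\min_{\substack{w_1,\dots,w_T \\ u_1,\dots,u_T}} \;
  \sum_{m=1}^{T} L_m(w_m)
  + \frac{\lambda_1}{2} \sum_{m=1}^{T} \lVert w_m - u_m \rVert_2^2
  + \lambda_2 \sum_{m<l} r_{m,l} \, \lVert u_m - u_l \rVert_2
```

In sketch (b), tasks whose centroids fuse to the same value form a cluster, while the regression coefficients are shrunk only toward their own centroid, which is how the separation of regression and clustering parameters described in the abstract can be read.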
Statistics & Probability; Computer Science, Interdisciplinary Applications