Towards Utilizing Unlabeled Data in Federated Learning: A Survey and Prospective

Yilun Jin,Xiguang Wei,Yang Liu,Qiang Yang
DOI: https://doi.org/10.48550/arxiv.2002.11545
2020-01-01
Abstract:Federated Learning (FL) proposed in recent years has received significant attention from researchers in that it can bring separate data sources together and build machine learning models in a collaborative but private manner. Yet, in most applications of FL, such as keyboard prediction, labeling data requires virtually no additional efforts, which is not generally the case. In reality, acquiring large-scale labeled datasets can be extremely costly, which motivates research works that exploit unlabeled data to help build machine learning models. However, to the best of our knowledge, few existing works aim to utilize unlabeled data to enhance federated learning, which leaves a potentially promising research topic. In this paper, we identify the need to exploit unlabeled data in FL, and survey possible research fields that can contribute to the goal.
What problem does this paper attempt to address?