Inconsistency Ranking-based Noisy Label Detection for High-quality Data

Ruibin Yuan,Hanzhi Yin,Yi Wang,Yifan He,Yushi Ye,Lei Zhang,Zhizheng Wu
2023-06-15
Abstract:The success of deep learning requires high-quality annotated and massive data. However, the size and the quality of a dataset are usually a trade-off in practice, as data collection and cleaning are expensive and time-consuming. In real-world applications, especially those using crowdsourcing datasets, it is important to exclude noisy labels. To address this, this paper proposes an automatic noisy label detection (NLD) technique with inconsistency ranking for high-quality data. We apply this technique to the automatic speaker verification (ASV) task as a proof of concept. We investigate both inter-class and intra-class inconsistency ranking and compare several metric learning loss functions under different noise settings. Experimental results confirm that the proposed solution could increase both the efficient and effective cleaning of large-scale speaker recognition datasets.
Computation and Language,Sound,Audio and Speech Processing
What problem does this paper attempt to address?
This paper aims to address the data quality issues caused by label noise in automatic speech recognition (ASV) tasks. Specifically, the paper proposes an automatic label noise detection (NLD) technique based on inconsistency ranking to improve the quality of large-scale speech recognition datasets. The study mainly focuses on two types of label noise: closed-set noise and open-set noise, and validates the effectiveness of the proposed method under different noise settings through experiments. Additionally, the paper compares the performance of several common metric learning loss functions (such as Softmax loss, additive angular margin loss, etc.) in the presence of noise and explores their impact on training performance. The experiments reveal that open-set noise has a greater impact on model performance than closed-set noise, and the proposed inconsistency ranking method can effectively detect and filter noisy labels, thereby significantly enhancing the model's performance.