PLBR: A Semi-supervised Document Key Information Extraction Via Pseudo-labeling Bias Rectification

Pengcheng Guo,Yonghong Song,Boyu Wang,Jiaohao Liu,Qi Zhang
DOI: https://doi.org/10.1109/tkde.2024.3443928
IF: 9.235
2024-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract:Document key information extraction (DKIE) methods often require a large number of labeled samples, imposing substantial annotation costs in practical scenarios. Fortunately, pseudo-labeling based semi-supervised learning (PSSL) algorithms provide an effective paradigm to alleviate the reliance on labeled data by leveraging unlabeled data. However, the main challenges for PSSL in DKIE tasks: 1) context dependency of DKIE results in incorrect pseudo-labels. 2) high intra-class variance and low inter-class variation on DKIE. To this end, this paper proposes a similarity matrix Pseudo-Label Bias Rectification (PLBR) semi-supervised method for DKIE tasks, which improves the quality of pseudo-labels on DKIE benchmarks with rare labels. More specifically, the Similarity Matrix Bias Rectification (SMBR) module is proposed to improve the quality of pseudolabels, which utilizes the contextual information of DKIE data through the analysis of similarity between labeled and unlabeled data. Moreover, a dual branch adaptive alignment (DBAA) mechanism is designed to adaptively align intra-class variance and alleviate inter-class variation on DKIE benchmarks, which is composed of two adaptive alignment ways. One is the intra-class alignment branch, which is designed to adaptively align intraclass variance. The other one is the inter-class alignment branch, which is developed to adaptively alleviate inter-class variance changes on the representation level. Extensive experiment results on two benchmarks demonstrate that PLBR achieves state-ofthe-art performance and its performance surpasses the previous SOTA by 2.11% ∼ 2.53%, 2.09% ∼ 2.49% F1-score on FUNSD and CORD with rare labeled samples, respectively. Code will be open to the public
What problem does this paper attempt to address?