Abstract:Semi-supervised learning (SSL) has attracted enormous attention due to its vast potential of mitigating the dependence on large labeled datasets. The latest methods (e.g., FixMatch) use a combination of consistency regularization and pseudo-labeling to achieve remarkable successes. However, these methods all suffer from the waste of complicated examples since all pseudo-labels have to be selected by a high threshold to filter out noisy ones. Hence, the examples with ambiguous predictions will not contribute to the training phase. For better leveraging all unlabeled examples, we propose two novel techniques: Entropy Meaning Loss (EML) and Adaptive Negative Learning (ANL). EML incorporates the prediction distribution of non-target classes into the optimization objective to avoid competition with target class, and thus generating more high-confidence predictions for selecting pseudo-label. ANL introduces the additional negative pseudo-label for all unlabeled data to leverage low-confidence examples. It adaptively allocates this label by dynamically evaluating the top-k performance of the model. EML and ANL do not introduce any additional parameter and hyperparameter. We integrate these techniques with FixMatch, and develop a simple yet powerful framework called FullMatch. Extensive experiments on several common SSL benchmarks (CIFAR-10/100, SVHN, STL-10 and ImageNet) demonstrate that FullMatch exceeds FixMatch by a large margin. Integrated with FlexMatch (an advanced FixMatch-based framework), we achieve state-of-the-art performance. Source code is at <a class="link-external link-https" href="https://github.com/megvii-research/FullMatch" rel="external noopener nofollow">this https URL</a>.

Boosting Statistical Word Alignment Using Labeled and Unlabeled Data

Boosting Statistical Word Alignment

Improving Word Alignment by Semi-Supervised Ensemble.

Iterative Task-adaptive Pretraining for Unsupervised Word Alignment

Improving statistical word alignment with ensemble methods

Improving Statistical Word Alignment with Various Clues.

Word Alignment by Fine-tuning Embeddings on Parallel Corpora

A Supervised Word Alignment Method based on Cross-Language Span Prediction using Multilingual BERT

: Improving Domain-Specific Word Alignment with a General Bilingual Corpus

Word alignment for languages with scarce resources using bilingual corpora of other language pairs

Graph-based Boosting Algorithm to Learn Labeled and Unlabeled Data

Enhancing Chinese Word Segmentation Using Unlabeled Data

Boosting Semi-Supervised Learning by Exploiting All Unlabeled Data

Unsupervised domain adaptation with weak source domain labels via bidirectional subdomain alignment

Search for Discriminative Word Alignment via Dual Decomposition

BinaryAlign: Word Alignment as Binary Sequence Labeling

Contrastive Unsupervised Word Alignment with Non-Local Features

Third-Party Aligner for Neural Word Alignments

Unsupervised Learning helps Supervised Neural Word Segmentation

Improving domain-specific word alignment for computer assisted translation

Alignment-Aware Word Distance.