Abstract: Neural sequence labeling is widely adopted for many Natural Language Processing (NLP) tasks, such as Named Entity Recognition (NER) and slot tagging for dialog systems and semantic parsing. Large-scale pre-trained language models have shown remarkable success in these tasks when fine-tuned on large amounts of task-specific labeled data. However, obtaining such large-scale labeled training data is not only costly but may also be infeasible in many sensitive user applications due to data access and privacy constraints. The problem is exacerbated for sequence labeling tasks, which require annotations at the token level. In this work, we develop techniques to address the label scarcity challenge for neural sequence labeling models. Specifically, we propose a meta self-training framework that leverages very few manually annotated labels for training neural sequence models. Self-training serves as an effective mechanism to learn from large amounts of unlabeled data via iterative knowledge exchange, while meta-learning provides adaptive sample re-weighting to mitigate error propagation from noisy pseudo-labels. Extensive experiments on six benchmark datasets, including two for massive multilingual NER and four slot tagging datasets for task-oriented dialog systems, demonstrate the effectiveness of our method. With only 10 labeled examples for each class in each task, the proposed method achieves a 10% improvement over state-of-the-art methods, demonstrating its effectiveness in the limited-label regime.
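
The interplay of self-training and sample re-weighting described in the abstract can be illustrated with a toy sketch. This is not the paper's method. A weighted nearest-centroid classifier on 1-D points stands in for the neural sequence model, and the teacher's prediction confidence stands in for the meta-learned sample weights. All function names (`fit_centroids`, `predict_proba`, `self_train`) and the example data are hypothetical.

```python
import math

def fit_centroids(points, labels, weights):
    """Fit a toy model: the weighted mean of the 1-D points in each class."""
    sums, totals = {}, {}
    for x, y, w in zip(points, labels, weights):
        sums[y] = sums.get(y, 0.0) + w * x
        totals[y] = totals.get(y, 0.0) + w
    return {y: sums[y] / totals[y] for y in sums}

def predict_proba(centroids, x):
    """Softmax over negative squared distances to each class centroid."""
    scores = {y: -(x - c) ** 2 for y, c in centroids.items()}
    top = max(scores.values())  # subtract the max for numerical stability
    exps = {y: math.exp(s - top) for y, s in scores.items()}
    norm = sum(exps.values())
    return {y: e / norm for y, e in exps.items()}

def self_train(labeled_x, labeled_y, unlabeled_x, rounds=3):
    """Iteratively pseudo-label unlabeled points and refit with sample weights."""
    # Initial teacher: trained on the few gold labels only.
    model = fit_centroids(labeled_x, labeled_y, [1.0] * len(labeled_x))
    for _ in range(rounds):
        pseudo_y, pseudo_w = [], []
        for x in unlabeled_x:
            probs = predict_proba(model, x)
            label = max(probs, key=probs.get)
            pseudo_y.append(label)
            # Assumption: confidence is a simple stand-in for a meta-learned
            # weight; low-confidence pseudo-labels contribute less.
            pseudo_w.append(probs[label])
        # Student: gold labels keep weight 1.0, pseudo-labels are down-weighted.
        model = fit_centroids(
            labeled_x + unlabeled_x,
            labeled_y + pseudo_y,
            [1.0] * len(labeled_x) + pseudo_w,
        )
    return model

# Hypothetical data: one gold example per class plus four unlabeled points.
model = self_train([0.0, 4.0], ["O", "ENT"], [0.2, 0.4, 3.6, 3.8])
probs_low = predict_proba(model, 1.0)
probs_high = predict_proba(model, 3.0)
```

The design point the sketch tries to convey is that pseudo-labels enter training with a per-sample weight rather than being trusted equally. In the framework described above, those weights are learned adaptively instead of being read off the teacher's confidence.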

Self-training Strategies for Sentiment Analysis: An Empirical Study

Identification of Sentiment Labels Based on Self-training

Unsupervised Self-Training for Sentiment Analysis of Code-Switched Data

Sentiment Analysis in the Era of Large Language Models: A Reality Check

Self-Training: A Survey

Revisiting Sentiment Analysis for Software Engineering in the Era of Large Language Models

Bring Your Own Data! Self-Supervised Evaluation for Large Language Models

Self-Training for Sample-Efficient Active Learning for Text Classification with Pre-Trained Language Models

Fortunately, Discourse Markers Can Enhance Language Models for Sentiment Analysis

A Comparative Study of Pre-training and Self-training

Learning How to Self-Learn: Enhancing Self-Training Using Neural Reinforcement Learning

One for "All": a unified model for fine-grained sentiment analysis under three tasks

Self-training Improves Pre-training for Few-shot Learning in Task-oriented Dialog Systems

Self-Training with Pseudo-Label Scorer for Aspect Sentiment Quad Prediction

Revisiting Self-Training for Few-Shot Learning of Language Model

The Model Arena for Cross-lingual Sentiment Analysis: A Comparative Study in the Era of Large Language Models

Meta Self-training for Few-shot Neural Sequence Labeling

Sentiment-Aware Word and Sentence Level Pre-training for Sentiment Analysis

Interpretability in Sentiment Analysis: A Self-Supervised Approach to Sentiment Cue Extraction

Semi-supervised and Transfer learning approaches for low resource sentiment classification

Self-Tuning: Instructing LLMs to Effectively Acquire New Knowledge through Self-Teaching