Abstract: While neural network-based models have achieved impressive performance on a large body of NLP tasks, the generalization behavior of different models remains poorly understood: Does this excellent performance imply a perfect generalization model, or are there still some limitations? In this paper, we take the NER task as a testbed to analyze the generalization behavior of existing models from different perspectives and characterize the differences in their generalization abilities through the lens of our proposed measures, which guides us to better design models and training methods. Experiments with in-depth analyses diagnose the bottlenecks of existing neural NER models in terms of breakdown performance analysis, annotation errors, dataset bias, and category relationships, which suggest directions for improvement. We have released the datasets (ReCoNLL, PLONER) for future research at our project page (http://pfliu.com/InterpretNER/). As a by-product of this paper, we have open-sourced a project that provides a comprehensive summary of recent NER papers and classifies them into different research topics (https://github.com/pfliu-nlp/Named-Entity-Recognition-NER-Papers).

Reproducing "NER and POS When Nothing Is Capitalized"

Capitalization Normalization for Language Modeling with an Accurate and Efficient Hierarchical RNN Model

An Efficient Architecture for Predicting the Case of Characters using Sequence Models

UniCase -- Rethinking Casing in Language Models

Orthographic Case Restoration Using Supervised Learning Without Manual Annotation

Closing the Curious Case of Neural Text Degeneration

Renovating Names in Open-Vocabulary Segmentation Benchmarks

Multicultural Name Recognition For Previously Unseen Names

A Little Annotation does a Lot of Good: A Study in Bootstrapping Low-resource Named Entity Recognizers

Rethinking Generalization of Neural Models: A Named Entity Recognition Case Study

Cross-Register Projection for Headline Part of Speech Tagging

Improving NER in Social Media via Entity Type-Compatible Unknown Word Substitution

Reproducibility Beyond the Research Community: Experience from NLP Beginners

The Impact of Data Corruption on Named Entity Recognition for Low-resourced Languages

Capitalization and Punctuation Restoration: a Survey

LibriSpeech-PC: Benchmark for Evaluation of Punctuation and Capitalization Capabilities of end-to-end ASR Models

Improving Vietnamese Named Entity Recognition from Speech Using Word Capitalization and Punctuation Recovery Models

An Analysis and Mitigation of the Reversal Curse

An Investigation of Noise in Morphological Inflection

Character Eyes: Seeing Language through Character-Level Taggers

Position-Invariant Truecasing with a Word-and-Character Hierarchical Recurrent Neural Network