J2N -- Nominal Adjective Identification and its Application

Lemeng Qi,Yang Han,Zhuotong Xie
2024-10-13
Abstract:This paper explores the challenges posed by nominal adjectives (NAs) in natural language processing (NLP) tasks, particularly in part-of-speech (POS) tagging. We propose treating NAs as a distinct POS tag, "JN," and investigate its impact on POS tagging, BIO chunking, and coreference resolution. Our study shows that reclassifying NAs can improve the accuracy of syntactic analysis and structural understanding in NLP. We present experimental results using Hidden Markov Models (HMMs), Maximum Entropy (MaxEnt) models, and Spacy, demonstrating the feasibility and potential benefits of this approach. Additionally we finetuned a bert model to identify the NA in untagged text.
Computation and Language
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the inconsistent tagging of Nominal Adjectives (NAs) in natural language processing (NLP) tasks, especially in part - of - speech (POS) tagging tasks. Specifically, the paper points out that, according to the rules of the widely - used Penn Treebank corpus, common adjectives should be tagged as adjectives (JJ), not plural common nouns (NNS), even if they trigger subject - verb agreement. However, if a possible adjective cannot be modified by an adverb, it should be tagged as a common noun (NN). This rule is not entirely consistent in actual language use, especially in cases where adjectives substitute for nouns. Treating them as nouns is very important for accurately identifying key components in a sentence, such as the subject or the object. To address this challenge, the paper proposes to consider adjectives that function as nouns as an independent part - of - speech tag "JN" (Nominal Adjective) and studies the impact of this re - classification on NLP tasks such as POS tagging, BIO chunking, and coreference resolution. By identifying and tagging nominal adjectives, the author hopes to improve the performance of NLP systems and provide a more detailed understanding of English grammar in a computational environment. In addition, the author has also developed a tool to identify nominal adjectives in unannotated texts.