Abstract:Part-of-speech (POS) tagging is an indispensable method of text processing. The main aim is to assign part-of-speech to words after considering their actual contextual syntactic-cum-semantic roles in a piece of text where they occur (Siemund & Claridge 1997). This is a useful strategy in language processing, language technology, machine learning, machine translation, and computational linguistics as it generates a kind of output that enables a system to work with natural language texts with greater accuracy and success. Part-of-speech tagging is also known as ‘grammatical annotation’ and ‘word category disambiguation’ in some area of linguistics where analysis of form and function of words are important avenues for better comprehension and application of texts. Since the primary task of POS tagging involves a process of assigning a tag to each word, manually or automatically, in a piece of natural language text, it has to pay adequate attention to the contexts where words are used. This is a tough challenge for a system as it normally fails to know how word carries specific linguistic information in a text and what kind of larger syntactic frames it requires for its operation. The present paper takes up this issue into consideration and tries to critically explore how some of the well-known POS tagging systems are capable of handling this kind of challenge and if these POS tagging systems are at all successful in assigning appropriate POS tags to words without accessing information from extratextual domains. The novelty of the paper lies in its attempt for looking into some of the POS tagging schemes proposed so far to see if the systems are actually successful in dealing with the complexities involved in tagging words in texts. It also checks if the performance of these systems is better than manual POS tagging and verifies if information and insights gathered from such enterprises are at all useful for enhancing our understanding about identity and function of words used in texts. All these are addressed in this paper with reference to some of the POS taggers available to us. Moreover, the paper tries to see how a POS tagged text is useful in various applications thereby creating a sense of awareness about multifunctionality of tagged texts among language users.

Development of POS tagger for English-Bengali Code-Mixed data

Part-of-Speech Tagging for Code-mixed Indian Social Media Text at ICON 2015

SMPOST: Parts of Speech Tagger for Code-Mixed Indic Social Media Text

A POS Tagger for Code Mixed Indian Social Media Text - ICON-2016 NLP Tools Contest Entry from Surukam

Part-of-Speech Tagging for Code Mixed English-Telugu Social Media Data

Recurrent Neural Network based Part-of-Speech Tagger for Code-Mixed Social Media Text

Preparing Bengali-English Code-Mixed Corpus for Sentiment Analysis of Indian Languages

AsPOS: Assamese Part of Speech Tagger using Deep Learning Approach

Part-of-Speech Tagger for Konkani-English Code-Mixed Social Media Text

Part of speech tagging for code switched data

A hybrid approach for Bengali sentence validation

OffMix-3L: A Novel Code-Mixed Dataset in Bangla-English-Hindi for Offensive Language Identification

Bengali Slang detection using state-of-the-art supervised models from a given text

BnSentMix: A Diverse Bengali-English Code-Mixed Dataset for Sentiment Analysis

Looking into the Operational Modalities Adopted in Some of the POS Tagging Tools in Identification of Contextual Part-of-Speech of Words in Texts

Improving accuracy of Part-of-Speech (POS) tagging using hidden markov model and morphological analysis for Myanmar Language

Probing a pretrained RoBERTa on Khasi language for POS tagging

Crowdsourcing Universal Part-Of-Speech Tags for Code-Switching

Deep Learning based UPoS Tagger for Assamese Religious Text

JU_KS@SAIL_CodeMixed-2017: Sentiment Analysis for Indian Code Mixed Social Media Texts

Parts-of-Speech Tagger Errors Do Not Necessarily Degrade Accuracy in Extracting Information from Biomedical Text