Abstract:Natural language processing (NLP) has experienced rapid advancements with the rise of deep learning, significantly outperforming traditional rule-based methods. By capturing hidden patterns and underlying structures within data, deep learning has improved performance across various NLP tasks, overcoming the limitations of rule-based systems. However, most research and development in NLP has been concentrated on a select few languages, primarily those with large numbers of speakers or financial significance, leaving many others underexplored. This lack of research is often attributed to the scarcity of adequately annotated datasets essential for training deep learning models. Despite this challenge, there is potential in leveraging the linguistic similarities between unexplored and well-studied languages, particularly those in close geographic and linguistic proximity. This thesis investigates the application of transfer learning for Part-of-Speech (POS) tagging between Hindi and Nepali, two highly similar languages belonging to the Indo-Aryan language family. Specifically, the work explores whether joint training of a POS tagging model for both languages enhances performance. Additionally, we assess whether multitask learning in Hindi, with auxiliary tasks such as gender and singular/plural tagging, can contribute to improved POS tagging accuracy. The deep learning architecture employed is the BLSTM-CNN-CRF model, trained under different conditions: monolingual word embeddings, vector-mapped embeddings, and jointly trained Hindi-Nepali word embeddings. Varying dropout rates (0.25 to 0.5) and optimizers (ADAM and AdaDelta) are also evaluated. Results indicate that jointly trained Hindi-Nepali word embeddings improve performance across all models compared to monolingual and vector-mapped embeddings.

Transferring from Formal Newswire Domain with Hypernet for Twitter POS Tagging.

TranGAN: Generative Adversarial Network Based Transfer Learning for Social Tie Prediction.

Hypergraph Label Propagation Network.

Part-of-Speech Tagging for Twitter with Adversarial Neural Networks.

Cross-Register Projection for Headline Part of Speech Tagging

Incorporating External POS Tagger for Punctuation Restoration

Unsupervised Domain Adaptation using Lexical Transformations and Label Injection for Twitter Data

The finding probably will support those ... Lol , u think im being there ...

Learning to Compose over Tree Structures via POS Tags

A Personalized Cross-Platform Post Style Transfer Method Based on Transformer and Bi-Attention Mechanism

Exploring transfer learning for Deep NLP systems on rarely annotated languages

Is POS Tagging Necessary or Even Helpful for Neural Dependency Parsing?

Overview of the NLPCC 2015 Shared Task: Chinese Word Segmentation and POS Tagging for Micro-blog Texts

Joint POS Tagging and Dependency Parsing with Transition-based Neural Networks

Joint POS Tagging and Dependence Parsing With Transition-Based Neural Networks

Rapid Adaptation of POS Tagging for Domain Specific Uses

Part-of-Speech Tagging for Chinese-English Mixed Texts with Dynamic Features

SMPOST: Parts of Speech Tagger for Code-Mixed Indic Social Media Text

Towards Accurate and Efficient Chinese Part-of-Speech Tagging.

Combining Context Features by Canonical Belief Network for Chinese Part-Of-Speech Tagging.

Deep Learning for Chinese Word Segmentation and POS Tagging.