Abstract: Public communication in the contemporary world is a multifaceted phenomenon. The Internet offers unlimited possibilities of contact and public expression, locally and globally, yet it also exerts its own power: it induces the use of Internet lingo, loosens language norms, and encourages the use of a lingua franca, English in particular. This leads to linguistic choices that are liberating for some and difficult for others, whether on ideological grounds, because of the norms of the discourse community, or simply because of insufficient language skills and limited linguistic means. Such choices appear to be particularly characteristic of post-colonial states, in which the co-existence of multiple local tongues with the language once imperially imposed, and now owned by local users, makes the web of repertoires especially complex. India is undoubtedly such a case: the use of English alongside the nationally encouraged Hindi and the state languages stems not only from the country’s history but, above all, from the present position of English, which is enhanced by its local prestige, its global status, and its role as the primary language of online communication. The Internet, however, has also been recognised as a medium that encourages, and even revitalises, the use of local tongues. This may manifest itself in the choice of a given language as the main medium of communication, or only as a symbolic one, indicated by certain lexical or grammatical features that serve as identity markers. It is therefore of particular interest to investigate how members of such a multilingual community, represented here by Hindi users, convey their cultural identity when interacting with friends and the general public online, on social media sites. This study is motivated by Kachru’s (1983) classic study and, among others, by a recent discussion of the use of Hinglish (Kothari and Snell, eds., 2011).
This paper analyses posts by Hindi users on Facebook (private profiles and fanpages) and Twitter, where users’ identities are largely known, and on YouTube, where they are often hidden, in order to identify how the users mark their Indian identity. The analysis covers Hindi lexical items, grammatical features and word order, instances of code-switching, and locally coloured uses of English words and spelling conventions, with the aim of establishing the dominant linguistic patterns found online, including from the point of view of gender preferences.

All that is English may be Hindi: Enhancing language identification through automatic ranking of likeliness of word borrowing in social media

Is this word borrowed? An automatic approach to quantify the likeliness of borrowing in social media

Bengali Slang detection using state-of-the-art supervised models from a given text

SMPOST: Parts of Speech Tagger for Code-Mixed Indic Social Media Text

Language Identification of Hindi-English tweets using code-mixed BERT

Role of Artificial Intelligence in Detection of Hateful Speech for Hinglish Data on Social Media

BharatBhasaNet-A Unified Framework to Identify Indian Code Mix Languages

Improved Sentiment Detection via Label Transfer from Monolingual to Synthetic Code-Switched Text

Linguistic Analysis of Hindi-English Mixed Tweets for Depression Detection

Gender Prediction in English-Hindi Code-Mixed Social Media Content : Corpus and Baseline System

Analyzing Roles of Classifiers and Code-Mixed factors for Sentiment Identification

Leveraging Language Identification to Enhance Code-Mixed Text Classification

What is Indian in Indian English? Markers of Indianness in Hindi-Speaking Users’ Social Media Communication

Feature Selection on Noisy Twitter Short Text Messages for Language Identification

Combining multiple pre-trained models for hate speech detection in Bengali, Marathi, and Hindi

Language Modeling for Code-Switched Data: Challenges and Approaches

User-Aware Multilingual Abusive Content Detection in Social Media

Exploring transfer learning for Deep NLP systems on rarely annotated languages

Harnessing Pre-Trained Sentence Transformers for Offensive Language Detection in Indian Languages

Part-of-Speech Tagging for Code-mixed Indian Social Media Text at ICON 2015

On Detecting Borrowing