Psycholinguistic and emotion analysis of cryptocurrency discourse on X platform

Moein Shahiki Tash,Olga Kolesnikova,Zahra Ahani,Grigori Sidorov
DOI: https://doi.org/10.1038/s41598-024-58929-4
IF: 4.6
2024-04-15
Scientific Reports
Abstract:This paper provides an extensive examination of a sizable dataset of English tweets focusing on nine widely recognized cryptocurrencies, specifically Cardano, Binance, Bitcoin, Dogecoin, Ethereum, Fantom, Matic, Shiba, and Ripple. Our goal was to conduct a psycholinguistic and emotional analysis of social media content associated with these cryptocurrencies. Such analysis can enable researchers and experts dealing with cryptocurrencies to make more informed decisions. Our work involved comparing linguistic characteristics across the diverse digital coins, shedding light on the distinctive linguistic patterns emerging in each coin's community. To achieve this, we utilized advanced text analysis techniques. Additionally, this work unveiled an understanding of the interplay between these digital assets. By examining which coin pairs are mentioned together most frequently in the dataset, we established co-mentions among different cryptocurrencies. To ensure the reliability of our findings, we initially gathered a total of 832,559 tweets from X. These tweets underwent a rigorous preprocessing stage, resulting in a refined dataset of 115,899 tweets that were used for our analysis. Overall, our research offers valuable perception into the linguistic nuances of various digital coins' online communities and provides a deeper understanding of their interactions in the cryptocurrency space.
multidisciplinary sciences
What problem does this paper attempt to address?
The paper aims to address the following issues: 1. **Psycholinguistics and Sentiment Analysis**: Researchers aim to explore discussions on social media about nine major cryptocurrencies (Cardano, Binance, Bitcoin, Dogecoin, Ethereum, Fantom, Matic, Shiba, and Ripple) through psycholinguistics and sentiment analysis methods. Specifically, they hope to reveal the linguistic characteristics and emotional tendencies of different cryptocurrency communities. 2. **Readability Assessment**: In addition to psycholinguistics and sentiment analysis, the study also focuses on the readability of tweets related to these cryptocurrencies to understand the differences in expression among different communities. 3. **Co-mention Analysis**: By analyzing which cryptocurrencies are mentioned together in tweets, researchers hope to reveal the relationships and market dynamics between different cryptocurrencies. The main goal of the paper is to fill an important research gap in the field of cryptocurrencies, namely understanding discussions on social media from a psycholinguistic and emotional perspective, and to provide guidance for new investors, helping traders better utilize indicators such as the Fear and Greed Index for trading strategy formulation. Researchers aim to achieve this goal by analyzing a large amount of tweet data using natural language processing (NLP) techniques, particularly psycholinguistic tools (such as LIWC), sentiment analysis, and readability assessment methods.