Combined sentiment score and star rating analysis of travel destination prediction based on user preference using morphological linear neural network model with correlated topic modelling approach
Niranjan Kumar, Bhagyashri R. Hanji
DOI: https://doi.org/10.1007/s11042-023-17995-y
IF: 2.577
2024-01-12
Multimedia Tools and Applications
Abstract: In a globalized world where travel enthusiasts seek personalized recommendations for their favourite destinations, this study addresses sentiment analysis and travel recommendation systems. While previous research has explored various aspects of tourism destination selection, this work explores the use of star ratings to create sentiment lexicons tailored to specific domains. A notable limitation, however, is the absence of a comprehensive investigation into how effectively sentiment analysis techniques, used in conjunction with star ratings, capture review sentiment. This article addresses that limitation by introducing a novel model that combines explicit sentiment scores and star ratings to predict optimal travel destinations based on user preferences. The data are collected from TripAdvisor and contain noisy, non-informative elements such as HTML tags. To streamline categorization, preprocessing techniques such as tokenization, stemming, and stop-word removal are applied. The study uses Latent Dirichlet Allocation (LDA) topic modelling to extract user choice topics from the collected reviews, and Correlated Topic Modeling (CTM) to capture correlations between latent topics. A Morphological Linear Neural Network (MLNN) model is introduced to generate sentiment scores for textual content. These scores are then combined with the star ratings from reviews to determine the most suitable destination. The study also predicts average cumulative ratings from the projected emotion scores and star ratings using the cumulative gain model. The model is implemented in Python and evaluated on a dataset of 67,871 samples. For sentiment classification, it reports an F1-score of 89%, a precision of 86%, and a recall of 87%. For polarity classification, it reports an accuracy of approximately 95% and an RMSE of 0.287. Comparative analyses against state-of-the-art methods indicate that the proposed model performs better in terms of accuracy, precision, and recall. These evaluation results highlight the model's potential for enhancing personalized travel recommendations.
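The combination step mentioned in the abstract (a per-review sentiment score merged with the review's star rating, then averaged per destination) can be illustrated with a minimal sketch. It is not the authors' MLNN or cumulative gain model. It assumes sentiment scores are already available in the range [-1, 1], and the linear mapping onto the star scale, the equal weighting `alpha=0.5`, the function names, and the sample reviews are all hypothetical choices made for illustration.

```python
import math
from collections import defaultdict


def sentiment_to_stars(score):
    """Map a sentiment score in [-1, 1] linearly onto the 1-5 star scale (assumed mapping)."""
    return 3.0 + 2.0 * score


def combined_rating(score, stars, alpha=0.5):
    """Blend the sentiment-derived rating with the explicit star rating.

    alpha is an assumed weight, not a value taken from the paper.
    """
    return alpha * sentiment_to_stars(score) + (1.0 - alpha) * stars


def rank_destinations(reviews, alpha=0.5):
    """Average the combined rating per destination and sort best-first.

    reviews: iterable of (destination, sentiment_score, star_rating) tuples.
    """
    per_destination = defaultdict(list)
    for destination, score, stars in reviews:
        per_destination[destination].append(combined_rating(score, stars, alpha))
    averages = {d: sum(v) / len(v) for d, v in per_destination.items()}
    return sorted(averages.items(), key=lambda item: item[1], reverse=True)


def rmse(predicted, actual):
    """Root mean squared error between two equal-length sequences."""
    squared = [(p - a) ** 2 for p, a in zip(predicted, actual)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical sample data: (destination, sentiment score, star rating)
reviews = [
    ("Goa", 0.8, 5),
    ("Goa", 0.2, 4),
    ("Agra", -0.4, 3),
    ("Agra", 0.6, 4),
]
ranking = rank_destinations(reviews)
```

Here `ranking` is a list of `(destination, average_combined_rating)` pairs ordered from highest to lowest, and `rmse` shows how an error figure like the one reported could be computed from predicted and observed ratings.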
computer science, information systems, theory & methods, engineering, electrical & electronic, software engineering