Finding fake reviews in e-commerce platforms by using hybrid algorithms

Mathivanan Periasamy, Rohith Mahadevan, Bagiya Lakshmi S, Raja CSP Raman, Hasan Kumar S, Jasper Jessiman
2024-04-09
Abstract: Sentiment analysis, a vital component in natural language processing, plays a crucial role in understanding the underlying emotions and opinions expressed in textual data. In this paper, we propose an innovative ensemble approach for sentiment analysis for finding fake reviews that amalgamates the predictive capabilities of Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Decision Tree classifiers. Our ensemble architecture strategically combines these diverse models to capitalize on their strengths while mitigating inherent weaknesses, thereby achieving superior accuracy and robustness in fake review prediction. By combining all the models of our classifiers, the predictive performance is boosted and it also fosters adaptability to varied linguistic patterns and nuances present in real-world datasets. The metrics accounted for on fake reviews demonstrate the efficacy and competitiveness of the proposed ensemble method against traditional single-model approaches. Our findings underscore the potential of ensemble techniques in advancing the state-of-the-art in finding fake reviews using hybrid algorithms, with implications for various applications in different social media and e-platforms to find the best reviews and neglect the fake ones, eliminating puffery and bluffs.
Computation and Language, Machine Learning
What problem does this paper attempt to address?
The paper addresses the problem of identifying fake reviews on e-commerce platforms. The researchers propose an ensemble method that combines Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and decision tree classifiers to improve the accuracy and robustness of fake review detection. The ensemble is intended to overcome the limitations of single models when faced with complex and varied fake reviews, and so to help maintain the integrity and credibility of online platforms.

The main contributions of the paper are as follows:

1. **Integrating multiple machine-learning models**: Three different classifiers (SVM, KNN, and decision tree) are combined to exploit the strengths of each model while reducing their individual weaknesses.
2. **Feature extraction methods**: Several techniques, including Word2Vec, BERT, and TF-IDF, are used to capture semantic and sentiment information in the text.
3. **Performance evaluation**: In experiments, the ensemble reaches 80% accuracy in detecting fake reviews, which the paper reports as significantly better than traditional single-model methods.
4. **Future prospects**: The authors plan to integrate large language models (LLMs), such as GPT and later versions of BERT, to strengthen detection, support continual learning, and extend the approach to multilingual and cross-domain applications.

In conclusion, the paper offers a new approach to fake review detection on e-commerce platforms, based on an ensemble of classifiers, that is of both theoretical and practical interest.
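This summary does not say how the outputs of the three classifiers are combined. As a minimal sketch, assuming hard majority voting (one common way to build such an ensemble, not necessarily the authors' method), the combination step could look like the following. The function names, the `"fake"`/`"genuine"` labels, and the per-model predictions are all hypothetical and chosen only for illustration.

```python
from collections import Counter
from typing import List, Sequence


def majority_vote(votes: Sequence[str]) -> str:
    """Return the label predicted by most base classifiers.

    With three voters and two labels a tie cannot occur.
    """
    if not votes:
        raise ValueError("need at least one vote")
    return Counter(votes).most_common(1)[0][0]


def ensemble_predict(per_model_predictions: Sequence[Sequence[str]]) -> List[str]:
    """Combine aligned per-model label lists into one label per review."""
    n_reviews = len(per_model_predictions[0])
    if any(len(preds) != n_reviews for preds in per_model_predictions):
        raise ValueError("every model must predict the same number of reviews")
    # zip(*...) regroups the predictions so each tuple holds all votes for one review.
    return [majority_vote(votes) for votes in zip(*per_model_predictions)]


# Hypothetical outputs of three already-trained classifiers on four reviews.
svm_preds = ["fake", "genuine", "fake", "genuine"]
knn_preds = ["fake", "fake", "genuine", "genuine"]
tree_preds = ["genuine", "fake", "fake", "genuine"]

combined = ensemble_predict([svm_preds, knn_preds, tree_preds])
print(combined)  # ['fake', 'fake', 'fake', 'genuine']
```

In this toy input each of the first three reviews is misjudged by a different single model, yet the vote still returns the majority label, which is the kind of error compensation an ensemble relies on. Weighted or probability-based (soft) voting would be alternative design choices under the same assumption.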