Fake Review Detection using Supervised and Semi-Supervised Learning with Natural Language Processing Techniques in Python

K. Mahesh, K. Hari Priya, K. V. S Meghana, K. Om Sai Vinay, N. Anil Chakravarthy
DOI: https://doi.org/10.48175/ijarsct-9269
2023-04-20
Abstract: This paper explores supervised and semi-supervised learning combined with natural language processing (NLP) in Python for detecting fake reviews. It discusses why detecting fake reviews matters and how such reviews affect businesses and customers. The proposed approach extracts features from review text using several NLP techniques and trains supervised models such as logistic regression, support vector machines, and random forests. In addition, self-training, a semi-supervised technique, is used to improve model performance with unlabeled data. The approach is evaluated on a dataset of Amazon and Yelp reviews, and the reported results show that the models detect fake reviews with high accuracy. The study concludes that the approach can help businesses and customers identify and filter out fake reviews.
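The self-training idea mentioned in the abstract can be sketched as follows. This is a minimal illustration using only the Python standard library, assuming bag-of-words features and a small logistic regression trained by stochastic gradient descent. The function names, confidence threshold, hyperparameters, and toy reviews are all hypothetical and are not the authors' implementation or data.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def featurize(text, vocab):
    """Bag-of-words counts restricted to a fixed vocabulary (word -> index)."""
    counts = Counter(tokenize(text))
    return {vocab[w]: c for w, c in counts.items() if w in vocab}


def predict_proba(weights, bias, feats):
    """Probability that a review is fake (label 1) under logistic regression."""
    z = bias + sum(weights[i] * v for i, v in feats.items())
    z = max(-30.0, min(30.0, z))  # clamp to keep math.exp well-behaved
    return 1.0 / (1.0 + math.exp(-z))


def train_logreg(X, y, n_features, epochs=200, lr=0.1, l2=0.01):
    """Fit logistic regression by plain stochastic gradient descent."""
    weights, bias = [0.0] * n_features, 0.0
    for _ in range(epochs):
        for feats, label in zip(X, y):
            err = predict_proba(weights, bias, feats) - label
            for i, v in feats.items():
                weights[i] -= lr * (err * v + l2 * weights[i])
            bias -= lr * err
    return weights, bias


def self_train(labeled, unlabeled, threshold=0.8, max_rounds=5):
    """Self-training: pseudo-label confident unlabeled reviews, then retrain."""
    texts = [t for t, _ in labeled] + list(unlabeled)
    words = sorted({w for t in texts for w in tokenize(t)})
    vocab = {w: i for i, w in enumerate(words)}
    X = [featurize(t, vocab) for t, _ in labeled]
    y = [label for _, label in labeled]
    pool = [featurize(t, vocab) for t in unlabeled]
    for _ in range(max_rounds):
        weights, bias = train_logreg(X, y, len(vocab))
        remaining = []
        for feats in pool:
            p = predict_proba(weights, bias, feats)
            if p >= threshold or p <= 1 - threshold:
                # Confident prediction: adopt it as a pseudo-label.
                X.append(feats)
                y.append(1 if p >= 0.5 else 0)
            else:
                remaining.append(feats)
        if len(remaining) == len(pool):
            break  # nothing new was pseudo-labeled, so stop early
        pool = remaining
    weights, bias = train_logreg(X, y, len(vocab))
    return weights, bias, vocab


def fake_probability(model, text):
    """Score a new review with a model returned by self_train."""
    weights, bias, vocab = model
    return predict_proba(weights, bias, featurize(text, vocab))


# Hypothetical toy data: 1 = fake, 0 = genuine.
labeled = [
    ("best product ever buy now amazing amazing", 1),
    ("amazing deal buy now best ever", 1),
    ("battery lasted two days and the screen scratches easily", 0),
    ("shipping was slow but the screen works fine", 0),
]
unlabeled = [
    "buy now amazing best deal",
    "the battery works fine and the screen is sharp",
]

model = self_train(labeled, unlabeled)
print(round(fake_probability(model, "amazing best deal buy now"), 3))
print(round(fake_probability(model, "the screen works fine and the battery lasted"), 3))
```

In practice, a library such as scikit-learn would likely replace the hand-written feature extraction and classifier. The loop structure (train, pseudo-label confident predictions, retrain) is the part that corresponds to self-training.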