Multiscale cascaded domain-based approach for Arabic fake reviews detection in e-commerce platforms
Nour Qandos, Ghadir Hamad, Maitha Alharbi, Shatha Alturki, Waad Alharbi, Arwa A. Albelaihi
DOI: https://doi.org/10.1016/j.jksuci.2024.101926
IF: 9.006
2024-01-20
Journal of King Saud University - Computer and Information Sciences
Highlights:
• Constructing the first Arabic dataset for fake review detection in the hotel, restaurant, and product domains, based on a gold standard.
• Comparing the performance of four deep learning models (Bi-LSTM, Bi-GRU, CNN+Bi-LSTM, and CNN+Bi-GRU) in single-domain, multi-domain, and cross-domain experiments.
• Introducing a cascading approach that improves results by transferring knowledge between domains.
Abstract: Fake reviews in e-commerce can lead to customer deception and financial losses. Despite the importance of fake review detection, studies on Arabic are scarce because comprehensive datasets are lacking. This study addresses the gap by introducing a fully gold-standard dataset, Arabic Fake Reviews Detection (AFRD), covering the hotel, restaurant, and product domains. To identify the most effective model for each domain, the study evaluated Bi-LSTM, Bi-GRU, CNN+Bi-LSTM, and CNN+Bi-GRU models. These models were then used in a cascading approach called Multiscale Cascaded domain-based (MCDB), which transfers knowledge from one domain to enhance results in the others. Experimental results showed that the MCDB approach improved the models' accuracy by 2.09% to 7.8%. The dataset can be used to build effective models for Arabic e-commerce platforms and for other Natural Language Processing applications. The study shows that leveraging domain-specific datasets in a cascading manner can significantly improve performance, with substantial implications for future research on problems with limited-size datasets.
computer science, information systems
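The cascading idea in the abstract (carrying knowledge learned in one domain into the next) can be illustrated with a minimal sketch. This is not the authors' MCDB implementation: a tiny logistic-regression classifier stands in for the Bi-LSTM/Bi-GRU models, the feature vectors and labels are made-up toy data, the domain order is an assumption, and the names `train`, `cascade`, and `accuracy` are hypothetical. It only shows the warm-start pattern, where each stage begins from the weights of the previous stage.

```python
import math

def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def predict(w, x):
    """Probability that review features x belong to the 'fake' class."""
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)))

def train(start, data, epochs=50, lr=0.1):
    """Plain SGD for logistic regression, starting from the weights in `start`."""
    w = list(start)
    for _ in range(epochs):
        for x, y in data:
            err = predict(w, x) - y
            for i, xi in enumerate(x):
                w[i] -= lr * err * xi
    return w

def accuracy(w, data):
    """Fraction of examples classified correctly at a 0.5 threshold."""
    hits = sum((predict(w, x) >= 0.5) == (y == 1) for x, y in data)
    return hits / len(data)

def cascade(domains, n_features):
    """Train domain by domain, warm-starting each stage from the previous one."""
    w = [0.0] * n_features
    stages = {}
    for name, data in domains:
        w = train(w, data)        # knowledge carried over via the weights
        stages[name] = list(w)    # snapshot of the model after this stage
    return stages

# Toy data (assumption): features are [bias, f1, f2]; label 1 = fake, 0 = genuine.
hotel = [([1.0, 0.9, 0.1], 1), ([1.0, 0.2, 0.8], 0),
         ([1.0, 0.7, 0.3], 1), ([1.0, 0.1, 0.6], 0)]
restaurant = [([1.0, 0.8, 0.2], 1), ([1.0, 0.3, 0.9], 0),
              ([1.0, 0.6, 0.1], 1), ([1.0, 0.2, 0.7], 0)]
product = [([1.0, 0.9, 0.3], 1), ([1.0, 0.1, 0.8], 0),
           ([1.0, 0.7, 0.2], 1), ([1.0, 0.3, 0.6], 0)]

# The order hotel -> restaurant -> product is chosen only for illustration.
domains = [("hotel", hotel), ("restaurant", restaurant), ("product", product)]
stages = cascade(domains, n_features=3)
print({name: accuracy(stages[name], data) for name, data in domains})
```

In a setup closer to the paper, the weights passed between stages would be those of a neural sequence model rather than a linear classifier, and each stage would be evaluated on held-out reviews from its own domain.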