Research on Multifeature Fusion False Review Detection Based on DynDistilBERT-BiLSTM-CNN
Jing Zhang,Ke Huang,Ding Lang,Yuguang Xu,Hong-An Li,Xuewen Li
DOI: https://doi.org/10.1109/jiot.2024.3410015
IF: 10.6
2024-09-03
IEEE Internet of Things Journal
Abstract:With the rapid expansion of e-commerce and social media platforms, the prevalence of fake reviews has become increasingly problematic, misleading consumers and harming both the reputation of businesses and fair market competition. This article aims to develop a more effective technological solution to accurately identify and filter deceptive reviews, ensuring a truthful shopping and communication environment for consumers. Initially, a multifeature fusion strategy is introduced, integrating text characteristics of reviews, reviewer behavior, and product information. Through a parameterized attention mechanism, the model meticulously assigns weights to various influential features, thereby enhancing the detection of deceptive reviews. Furthermore, a composite architecture, DynDistilBERT-BiLSTM-CNN, is proposed. DynDistilBERT employs a control gate to assess the complexity of the input text and the required processing power in real-time during model forward propagation, dynamically selecting active layers within DistilBERT. This selection process is optimized with a hierarchical training strategy to minimize additional computational overhead. BiLSTM excels in processing sequential data, capturing temporal text features, while convolutional neural network focuses on identifying local text features. This approach reduces computational resource consumption for simpler tasks while maintaining high accuracy for complex tasks, recognizing both local features and contextual relationships in text. Extensive testing on the Amazon data set, compared to models like ALBERT, SpanBERT, DistilBERT, and RoBERTa, demonstrates that our model achieves accuracy improvements of approximately 5.7%, 5.2%, 5.0%, and 3.9%, with a peak accuracy of 92.6%. These findings underscore the effectiveness of the multifeature fusion strategy and the superior performance of the DynDistilBERT-BiLSTM-CNN architecture in handling complex textual data.
computer science, information systems,telecommunications,engineering, electrical & electronic