Abstract:Nowadays, the dissemination of news information has become more rapid, liberal, and open to the public. People can find what they want to know more and more easily from a variety of sources, including traditional news outlets and new social media platforms. However, at a time when our lives are glutted with all kinds of news, we cannot help but doubt the veracity and legitimacy of these news sources; meanwhile, we also need to guard against the possible impact of various forms of fake news. To combat the spread of misinformation, more and more researchers have turned to natural language processing (NLP) approaches for effective fake news detection. However, in the face of increasingly serious fake news events, existing detection methods still need to be continuously improved. This study proposes a modified proof-of-concept model named NER-SA, which integrates natural language processing (NLP) and named entity recognition (NER) to conduct the in-domain and cross-domain analysis of fake news detection with the existing three datasets simultaneously. The named entities associated with any particular news event exist in a finite and available evidence pool. Therefore, entities must be mentioned and recognized in this entity bank in any authentic news articles. A piece of fake news inevitably includes only some entitlements in the entity bank. The false information is deliberately fabricated with fictitious, imaginary, and even unreasonable sentences and content. As a result, there must be differences in statements, writing logic, and style between legitimate news and fake news, meaning that it is possible to successfully detect fake news. We developed a mathematical model and used the simulated annealing algorithm to find the optimal legitimate area. Comparing the detection performance of the NER-SA model with current state-of-the-art models proposed in other studies, we found that the NER-SA model indeed has superior performance in detecting fake news. For in-domain analysis, the accuracy increased by an average of 8.94% on the LIAR dataset and 19.36% on the fake or real news dataset, while the F1-score increased by an average of 24.04% on the LIAR dataset and 19.36% on the fake or real news dataset. In cross-domain analysis, the accuracy and F1-score for the NER-SA model increased by an average of 28.51% and 24.54%, respectively, across six domains in the FakeNews AMT dataset. The findings and implications of this study are further discussed with regard to their significance for improving accuracy, understanding context, and addressing adversarial attacks. The development of stylometric detection based on NLP approaches using NER techniques can improve the effectiveness and applicability of fake news detection.

Similarity Detection Pipeline for Crawling a Topic Related Fake News Corpus

SGG: Spinbot, Grammarly and GloVe based Fake News Detection

Fake News Spreader Detection on Twitter using Character N-Grams. Notebook for PAN at CLEF 2020

Towards better representation learning using hybrid deep learning model for fake news detection

Fake News Detection Using NLP

Unveiling the Potential of BERTopic for Multilingual Fake News Analysis -- Use Case: Covid-19

CSI: A Hybrid Deep Model for Fake News Detection

FakeNewsNet: A Data Repository with News Content, Social Context and Spatialtemporal Information for Studying Fake News on Social Media

Stylometric Fake News Detection Based on Natural Language Processing Using Named Entity Recognition: In-Domain and Cross-Domain Analysis

Mul-FaD: attention based detection of multiLingual fake news

Multilingual deep learning framework for fake news detection using capsule neural network

TieFake: Title-Text Similarity and Emotion-Aware Fake News Detection

Fake news detection using a deep learning transformer based encoder-decoder architecture

Fake news detection: A hybrid CNN-RNN based deep learning approach

A Self-Learning Multimodal Approach for Fake News Detection

An Implementation of Machine Learning Algorithm for Fake News Detection

DeepFND: an ensemble-based deep learning approach for the optimization and improvement of fake news detection in digital platform

Ensemble Techniques for Robust Fake News Detection: Integrating Transformers, Natural Language Processing, and Machine Learning

Divide-and-Conquer: Post-User Interaction Network for Fake News Detection on Social Media

Similarity-Aware Multimodal Prompt Learning for Fake News Detection

Fake News Detection Based on Knowledge-Guided Semantic Analysis