Improving Generalizability of Fake News Detection Methods using Propensity Score Matching

Bo Ni, Zhichun Guo, Jianing Li, Meng Jiang
DOI: https://doi.org/10.48550/arXiv.2002.00838
2020-01-28
Abstract: Recently, due to the booming influence of online social networks, detecting fake news has drawn significant attention from both academic communities and the general public. In this paper, we consider the existence of confounding variables in the features of fake news and use Propensity Score Matching (PSM) to select generalizable features in order to reduce the effects of the confounding variables. Experimental results show that fake news detection methods generalize significantly better when features are selected with PSM than when they are selected by raw frequency. We investigate multiple types of fake news detection methods (classifiers), such as logistic regression, random forests, and support vector machines, and observe consistent performance improvements.
Social and Information Networks, Computation and Language, Machine Learning