Finding Needles in a Haystack: Using Data Analytics to Improve Fraud Prediction

Johan Perols, Robert M. Bowen, Carsten Zimmermann, Basamba Samba
DOI: https://doi.org/10.2139/ssrn.2590588
2015-01-01
SSRN Electronic Journal
Abstract: Developing models to detect financial statement fraud involves challenges related to (i) the rarity of fraud observations, (ii) the relative abundance of explanatory variables identified in the prior literature, and (iii) the broad underlying definition of fraud. Following the emerging data analytics literature, we introduce and systematically evaluate three methods to address these challenges. Results from evaluating actual cases of financial statement fraud suggest that two of these methods improve fraud prediction performance by approximately ten percent relative to the best current techniques. Improved fraud prediction can result in meaningful benefits, such as improving the ability of the SEC to detect fraudulent filings and improving audit firms’ client portfolio decisions.