Software vulnerability prediction using text analysis techniques

Aram Hovsepyan,Riccardo Scandariato,Wouter Joosen,James Walden
DOI: https://doi.org/10.1145/2372225.2372230
2012-01-01
Abstract:Early identification of software vulnerabilities is essential in software engineering and can help reduce not only costs, but also prevent loss of reputation and damaging litigations for a software firm. Techniques and tools for software vulnerability prediction are thus invaluable. Most of the existing techniques rely on using component characteristic(s) (like code complexity, code churn) for the vulnerability prediction. In this position paper, we present a novel approach for vulnerability prediction that leverages on the analysis of raw source code as text, instead of using "cooked" features. Our initial results seem to be very promising as the prediction model achieves an average accuracy of 0.87, precision of 0.85 and recall of 0.88 on 18 versions of a large mobile application.
What problem does this paper attempt to address?