Abstract: Black hat hackers use malicious exploits to circumvent security controls and take advantage of system vulnerabilities worldwide, costing the global economy over $450 billion annually. While many organizations are increasingly turning to cyber threat intelligence (CTI) to help prioritize their vulnerabilities, extant CTI processes are often criticized as being reactive to known exploits. One promising data source that can help develop proactive CTI is the vast and ever-evolving Dark Web. In this study, we adopted the computational design science paradigm to design a novel deep learning (DL)-based exploit-vulnerability attention deep structured semantic model (EVA-DSSM) that includes bidirectional processing and attention mechanisms to automatically link exploits from the Dark Web to vulnerabilities. We also devised a novel device vulnerability severity metric (DVSM) that incorporates the exploit post date and vulnerability severity to help cybersecurity professionals with their device prioritization and risk management efforts. We rigorously evaluated the EVA-DSSM against state-of-the-art non-DL and DL-based methods for short text matching on 52,590 exploit-vulnerability linkages across four testbeds: web application, remote, local, and denial of service. Results of these evaluations indicate that the proposed EVA-DSSM achieves precision at 1 scores 20%-41% higher than non-DL approaches and 4%-10% higher than DL-based approaches. We demonstrated the EVA-DSSM's and DVSM's practical utility with two CTI case studies: openly accessible systems in the top eight U.S. hospitals and over 20,000 Supervisory Control and Data Acquisition (SCADA) systems worldwide. A complementary user evaluation of the case study results indicated that 45 cybersecurity professionals found the EVA-DSSM and DVSM results more useful for exploit-vulnerability linking and risk prioritization activities than those produced by prevailing approaches. Given the rising cost of cyberattacks, the EVA-DSSM and DVSM have important implications for analysts in security operations centers, incident response teams, and cybersecurity vendors.
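
For readers unfamiliar with the precision at 1 metric reported above, the following is a minimal sketch of how it is commonly computed for a ranking task such as exploit-vulnerability linking. It is not the authors' evaluation code: the function name `precision_at_1`, the exploit and vulnerability identifiers, and the toy rankings are all hypothetical and serve only to illustrate the metric.

```python
from typing import Dict, List, Set


def precision_at_1(
    ranked: Dict[str, List[str]],
    relevant: Dict[str, Set[str]],
) -> float:
    """Fraction of queries whose top-ranked candidate is a correct match.

    ranked:   exploit id -> candidate vulnerability ids, best match first
    relevant: exploit id -> set of vulnerability ids that are true links
    """
    if not ranked:
        raise ValueError("at least one ranked query is required")
    hits = 0
    for exploit_id, candidates in ranked.items():
        # Only the first (highest-scoring) candidate counts for precision at 1.
        if candidates and candidates[0] in relevant.get(exploit_id, set()):
            hits += 1
    return hits / len(ranked)


# Hypothetical toy data: four exploits, each with a model-produced ranking.
ranked = {
    "exploit-a": ["vuln-1", "vuln-7"],
    "exploit-b": ["vuln-3", "vuln-2"],
    "exploit-c": ["vuln-9", "vuln-4"],
    "exploit-d": ["vuln-5", "vuln-6"],
}
gold = {
    "exploit-a": {"vuln-1"},
    "exploit-b": {"vuln-2"},  # correct link is ranked second, so this is a miss
    "exploit-c": {"vuln-9"},
    "exploit-d": {"vuln-5"},
}

score = precision_at_1(ranked, gold)
print(f"precision at 1: {score:.2f}")
```

Under this definition, a model is rewarded only when its single best guess is correct, which matches the setting where an analyst acts on the top suggested vulnerability for each exploit.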

Inferring Discussion Topics about Exploitation of Vulnerabilities from Underground Hacking Forums

Cream Skimming the Underground: Identifying Relevant Information Points from Online Forums

Discovering Emerging Threats in the Hacker Community: A Nonparametric Emerging Topic Detection Framework

Mining user interaction patterns in the darkweb to predict enterprise cyber incidents

Under the Underground: Predicting Private Interactions in Underground Forums

Conceptualizing Discussions on the Dark Web: An Empirical Topic Modeling Approach

Understanding and Predicting Private Interactions in Underground Forums

Deep Learning Algorithm for Threat Detection in Hackers Forum (Deep Web)

Classifying Web Exploits with Topic Modeling

EUREKHA: Enhancing User Representation for Key Hackers Identification in Underground Forums

Predicting enterprise cyber incidents using social network analysis on the darkweb hacker forums

Explaining the Contributing Factors for Vulnerability Detection in Machine Learning

Exploring key hackers and cybersecurity threats in Chinese hacker communities

Predicting Exploitation of Disclosed Software Vulnerabilities Using Open-source Data

What Do Programmers Discuss about Deep Learning Frameworks

pyDNetTopic: A Framework for Uncovering What Darknet Market Users Talking About

Latent Text Mining for Cybercrime Forensics

Predicting Organizational Cybersecurity Risk: A Deep Learning Approach

Nested Dirichlet models for unsupervised attack pattern detection in honeypot data

Linking Exploits from the Dark Web to Known Vulnerabilities for Proactive Cyber Threat Intelligence: An Attention-Based Deep Structured Semantic Model

Learning-based Models for Vulnerability Detection: An Extensive Study