Behaviour-diverse automatic penetration testing: a coverage-based deep reinforcement learning approach

Yizhou Yang,Longde Chen,Sha Liu,Lanning Wang,Haohuan Fu,Xin Liu,Zuoning Chen
DOI: https://doi.org/10.1007/s11704-024-3380-1
IF: 2.6688
2024-11-27
Frontiers of Computer Science
Abstract:Reinforcement Learning (RL) is gaining importance in automating penetration testing as it reduces human effort and increases reliability. Nonetheless, given the rapidly expanding scale of modern network infrastructure, the limited testing scale and monotonous strategies of existing RL-based automated penetration testing methods make them less effective in practical application. In this paper, we present CLAP (Coverage-Based Reinforcement Learning to Automate Penetration Testing), an RL penetration testing agent that provides comprehensive network security assessments with diverse adversary testing behaviours on a massive scale. CLAP employs a novel neural network, namely the coverage mechanism, to address the enormous and growing action spaces in large networks. It also utilizes a Chebyshev decomposition critic to identify various adversary strategies and strike a balance between them. Experimental results across various scenarios demonstrate that CLAP outperforms state-of-the-art methods, by further reducing attack operations by nearly 35%. CLAP also provides enhanced training efficiency and stability and can effectively perform pen-testing over large-scale networks with up to 500 hosts. Additionally, the proposed agent is also able to discover pareto-dominant strategies that are both diverse and effective in achieving multiple objectives.
computer science, information systems, theory & methods, software engineering
What problem does this paper attempt to address?