A PHP and JSP Web Shell Detection System with Text Processing Based on Machine Learning

Han Zhang,Ming Liu,Zihan Yue,Zhi Xue,Yong Shi,Xiangjian He
DOI: https://doi.org/10.1109/trustcom50675.2020.00219
2020-01-01
Abstract:Web shell is one of the most common network attack methods, and traditional detection methods may not detect complex and flexible variants of web shell attacks. In this paper, we present a comprehensive detection system that can detect both PHP and JSP web shells. After file classification, we use different feature extraction methods, i.e. AST for PHP files and bytecode for JSP files. We present a detection model based on text processing methods including TF-IDF and Word2vec algorithms. We combine different kinds of machine learning algorithms and perform a comprehensively controlled experiment. After the experiment and evaluation, we choose the detection machine learning model of the best performance, which can achieve a high detection accuracy above 98%.
What problem does this paper attempt to address?