PromptSAM+: Malware Detection based on Prompt Segment Anything Model

Xingyuan Wei,Yichen Liu,Ce Li,Ning Li,Degang Sun,Yan Wang
2024-08-04
Abstract:Machine learning and deep learning (ML/DL) have been extensively applied in malware detection, and some existing methods demonstrate robust performance. However, several issues persist in the field of malware detection: (1) Existing work often overemphasizes accuracy at the expense of practicality, rarely considering false positive and false negative rates as important metrics. (2) Considering the evolution of malware, the performance of classifiers significantly declines over time, greatly reducing the practicality of malware detectors. (3) Prior ML/DL-based efforts heavily rely on ample labeled data for model training, largely dependent on feature engineering or domain knowledge to build feature databases, making them vulnerable if correct labels are scarce. With the development of computer vision, vision-based malware detection technology has also rapidly evolved. In this paper, we propose a visual malware general enhancement classification framework, `PromptSAM+', based on a large visual network segmentation model, the Prompt Segment Anything Model(named PromptSAM+). Our experimental results indicate that 'PromptSAM+' is effective and efficient in malware detection and classification, achieving high accuracy and low rates of false positives and negatives. The proposed method outperforms the most advanced image-based malware detection technologies on several datasets. 'PromptSAM+' can mitigate aging in existing image-based malware classifiers, reducing the considerable manpower needed for labeling new malware samples through active learning. We conducted experiments on datasets for both Windows and Android platforms, achieving favorable outcomes. Additionally, our ablation experiments on several datasets demonstrate that our model identifies effective modules within the large visual network.
Cryptography and Security
What problem does this paper attempt to address?
This paper attempts to solve the following three main problems: 1. **Existing methods over - emphasize accuracy while ignoring practicality, false positive rate and false negative rate**: - The paper points out that existing malware detection methods often focus too much on accuracy while ignoring the false positive rate (FPR) and false negative rate (FNR). A high false negative rate may lead to malware not being detected and causing serious damage; a high false positive rate may wrongly block a large number of benign applications, affecting user experience and system availability. Therefore, when developing and evaluating malware detection systems, multiple performance indicators must be considered comprehensively to ensure the practicality of the system. 2. **The evolution of malware causes the performance of classifiers to decline over time**: - With the continuous evolution of malware, the performance of classifiers based on machine learning / deep learning (ML/DL) algorithms will decline significantly. This phenomenon is known as "model aging" or "concept drift". The paper mentions that although these classifiers perform well in the initial stage, their performance gradually deteriorates over time, thus reducing the practical application value of the detection system. 3. **Relying on a large amount of labeled data for model training, it is difficult to deal with the situation of label scarcity**: - Existing ML/DL methods usually require a large amount of labeled data for feature engineering or building a feature database, which makes them vulnerable in the absence of correct labels. In addition, malware detection methods on different platforms need to be customized separately, lacking universality and reusability. To solve the above problems, the paper proposes an enhanced classification framework based on the large - scale visual network segmentation model (Prompt Segment Anything Model, PromptSAM+) - **PromptSAM+**. This framework aims to improve the performance of malware detection and classification tasks by using semantic information in large - scale visual models, specifically: - **Reducing false positive rate and false negative rate**: By introducing effective modules, the model can more accurately distinguish between malware and benign software. - **Alleviating the problem of model aging**: By active learning, reducing the human resources required for labeling new malware samples and reducing the aging speed of image - based malware classifiers. - **Cross - platform universality**: Applicable to malware detection on Windows and Android platforms, with strong generalization ability. In summary, the main goal of the paper is to solve the problems of accuracy, practicality and model aging in current malware detection by proposing a new visual enhancement framework, thereby providing a more efficient and reliable malware detection method.