Semi-Supervised Learning for Anomaly Detection in Blockchain-based Supply Chains

Do Hai Son, Bui Duc Manh, Tran Viet Khoa, Nguyen Linh Trung, Dinh Thai Hoang, Hoang Trong Minh, Yibeltal Alem, Le Quang Minh
2024-07-22
Abstract: Blockchain-based supply chain (BSC) systems have developed rapidly in recent years and may play an important role in society in the future. In this study, we develop an anomaly detection model for BSC systems. Our proposed model can detect cyber-attacks at various levels, including the network layer, the consensus layer, and beyond, by analyzing only the traffic data at the network layer. To do this, we first build a BSC system in our laboratory to perform experiments and collect datasets. We then propose a novel semi-supervised DAE-MLP (Deep AutoEncoder-Multilayer Perceptron) model that combines the advantages of supervised and unsupervised learning to detect anomalies in BSC systems. The experimental results demonstrate the effectiveness of our model for anomaly detection within BSCs, achieving a detection accuracy of 96.5%. Moreover, DAE-MLP can effectively detect new attacks, improving the F1-score by up to 33.1% after the MLP component is updated.
Cryptography and Security
What problem does this paper attempt to address?
The paper addresses anomaly detection in Blockchain-based Supply Chain (BSC) systems, targeting cyber-attacks at the network layer, the consensus layer, and higher layers. Specifically, it tackles two problems:

1. **Lack of dedicated BSC anomaly datasets**: The available datasets either do not fit BSC scenarios or do not cover attack types at the consensus layer and above.
2. **Detecting anomalous transactions (TXs) without labeled data**: For example, malicious transactions that exploit Overflow and Underflow (OaU) vulnerabilities at the consensus layer and above are difficult to distinguish using only network-layer information.

To address these challenges, the research team first built a BSC system in a laboratory environment and collected a dataset containing various types of attacks. They then proposed a semi-supervised learning model, DAE-MLP, which combines the unsupervised learning of a Deep AutoEncoder (DAE) with the supervised learning of a Multilayer Perceptron (MLP). Using only traffic data captured at the network layer, DAE-MLP detects anomalies at the consensus layer and above, and it can be updated with new attack types to improve detection performance.

Experimental results show that DAE-MLP detects anomalies in BSC systems with an accuracy of up to 96.5%. The model can also detect attack types it was not trained on, and updating the MLP component improves the F1-score on new attacks by up to 33.1%.
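
The semi-supervised idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the MLP is trained on the autoencoder's latent codes, uses a shallow one-hidden-layer network in place of a deep autoencoder, and runs on synthetic four-feature "traffic" rather than the paper's dataset. The names `TinyNet`, `make_flows`, `train_mlp`, and `is_anomaly`, along with all sizes and learning rates, are hypothetical and chosen only for illustration.

```python
import math
import random


class TinyNet:
    """One-hidden-layer network trained with per-sample SGD (illustration only)."""

    def __init__(self, n_in, n_hidden, n_out, sigmoid_out, seed):
        rng = random.Random(seed)
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_hidden)]
        self.b1 = [0.0] * n_hidden
        self.w2 = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden)] for _ in range(n_out)]
        self.b2 = [0.0] * n_out
        self.sigmoid_out = sigmoid_out

    def hidden(self, x):
        # tanh hidden layer; for the autoencoder this is the latent code
        return [math.tanh(sum(w * v for w, v in zip(row, x)) + b)
                for row, b in zip(self.w1, self.b1)]

    def forward(self, x):
        h = self.hidden(x)
        out = [sum(w * v for w, v in zip(row, h)) + b for row, b in zip(self.w2, self.b2)]
        if self.sigmoid_out:
            # clamp the logit so math.exp cannot overflow
            out = [1.0 / (1.0 + math.exp(-max(-60.0, min(60.0, v)))) for v in out]
        return h, out

    def fit(self, xs, ys, epochs, lr):
        for _ in range(epochs):
            for x, y in zip(xs, ys):
                h, out = self.forward(x)
                # (prediction - target) is the output error for squared error with a
                # linear output and for cross-entropy with a sigmoid output
                d_out = [o - t for o, t in zip(out, y)]
                d_hid = [(1.0 - hj * hj) * sum(d * row[j] for d, row in zip(d_out, self.w2))
                         for j, hj in enumerate(h)]
                for k, d in enumerate(d_out):
                    self.w2[k] = [w - lr * d * hj for w, hj in zip(self.w2[k], h)]
                    self.b2[k] -= lr * d
                for j, d in enumerate(d_hid):
                    self.w1[j] = [w - lr * d * xi for w, xi in zip(self.w1[j], x)]
                    self.b1[j] -= lr * d


def make_flows(n, center, rng):
    """Synthetic stand-in for network-layer traffic features (not the paper's data)."""
    return [[rng.gauss(c, 0.3) for c in center] for _ in range(n)]


def recon_error(net, xs):
    """Mean squared reconstruction error of an autoencoder over a set of samples."""
    return sum(sum((o - v) ** 2 for o, v in zip(net.forward(x)[1], x)) for x in xs) / len(xs)


rng = random.Random(42)
NORMAL, ATTACK = [0.0, 0.0, 0.0, 0.0], [2.0, -2.0, 2.0, -2.0]

# Step 1 (unsupervised): the autoencoder learns to reconstruct unlabeled traffic.
unlabeled = make_flows(80, NORMAL, rng) + make_flows(80, ATTACK, rng)
rng.shuffle(unlabeled)
dae = TinyNet(4, 2, 4, sigmoid_out=False, seed=1)
err_before = recon_error(dae, unlabeled)
dae.fit(unlabeled, unlabeled, epochs=100, lr=0.01)
err_after = recon_error(dae, unlabeled)


# Step 2 (supervised): an MLP is trained on latent codes of a small labeled set.
def train_mlp(labeled):
    labeled = labeled[:]
    random.Random(7).shuffle(labeled)
    codes = [dae.hidden(x) for x, _ in labeled]
    targets = [[float(y)] for _, y in labeled]
    mlp = TinyNet(2, 4, 1, sigmoid_out=True, seed=2)
    mlp.fit(codes, targets, epochs=200, lr=0.1)
    return mlp


def is_anomaly(mlp, x):
    """True when the MLP scores the latent code of x as an attack."""
    return mlp.forward(dae.hidden(x))[1][0] >= 0.5


labeled = ([(x, 0) for x in make_flows(20, NORMAL, rng)]
           + [(x, 1) for x in make_flows(20, ATTACK, rng)])
mlp = train_mlp(labeled)

# Step 3 (update): when labeled samples of a new attack arrive, only the MLP is
# retrained; the autoencoder weights stay frozen.
NEW_ATTACK = [-2.0, 2.0, -2.0, 2.0]
labeled += [(x, 1) for x in make_flows(20, NEW_ATTACK, rng)]
updated_mlp = train_mlp(labeled)
```

Calling `is_anomaly(mlp, x)` or `is_anomaly(updated_mlp, x)` on a feature vector `x` returns a boolean flag. Retraining only the small classifier in step 3 keeps the update cheap, which is one plausible reason to separate the two components; a real system would likely use a deep-learning library and far richer traffic features.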