A Multi-feature Fusion Method for Smart Contract Classification

Xiaojin Wang, Hongjuan Yang, Gang Tian, Rui Wang
DOI: https://doi.org/10.1145/3584376.3584558
2022-12-16
Abstract: Classifying smart contracts can effectively reduce the search space and improve retrieval efficiency. Existing classification methods are based on natural language processing techniques. Because these techniques process source code without the extraction and processing methods developed in the software engineering field, their feature extraction leaves considerable room for improvement. Therefore, this paper proposes a multi-feature fusion method for smart contract classification (MFF-SC) based on code processing technology. From the source code perspective, a source code processing method and an attention mechanism are used to extract local code features, and a structure-based traversal method is used to extract global code features from the abstract syntax tree. The local and global code features are then combined through an attention mechanism to generate code semantic features. From the account transaction perspective, account transaction features are extracted using TransR. Next, the code semantic features and the account transaction features are combined through an attention mechanism to generate smart contract semantic features. Finally, the smart contract semantic features are fed into a stacked denoising autoencoder and a softmax classifier for classification. In experiments on a real-world dataset against other baselines and variants, MFF-SC achieves an accuracy of 83.9%.
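The two building blocks named in the abstract, attention-based fusion and TransR, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the form of the attention mechanism, so the dot-product scoring against a `query` vector below is an assumption, as are the function names and the toy values. The TransR score follows the standard definition, in which head and tail entities are projected into the relation space by a relation-specific matrix before the translation error is measured.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_fuse(features, query):
    """Fuse equal-length feature vectors into a single vector.

    Assumption: each vector is scored by its dot product with `query`
    (a hypothetical learned parameter); the scores are normalized with
    softmax and the output is the weighted sum of the inputs.
    """
    scores = [sum(f_k * q_k for f_k, q_k in zip(f, query)) for f in features]
    weights = softmax(scores)
    dim = len(features[0])
    fused = [sum(w * f[k] for w, f in zip(weights, features)) for k in range(dim)]
    return fused, weights


def transr_score(head, relation, tail, proj):
    """TransR score ||M_r h + r - M_r t||^2; lower means more plausible."""
    def project(vec):
        # Multiply the relation-specific matrix (list of rows) by the vector.
        return [sum(m * v for m, v in zip(row, vec)) for row in proj]

    h_r, t_r = project(head), project(tail)
    return sum((h + r - t) ** 2 for h, r, t in zip(h_r, relation, t_r))


# Toy values, purely illustrative.
local_feat = [1.0, 0.0]    # stands in for local code features
global_feat = [0.0, 1.0]   # stands in for global (AST-based) code features
code_semantic, weights = attention_fuse([local_feat, global_feat], query=[0.0, 0.0])

identity = [[1.0, 0.0], [0.0, 1.0]]
score = transr_score([1.0, 0.0], [-1.0, 1.0], [0.0, 1.0], identity)
```

With a zero query both inputs receive equal weight, so the fused vector is their average; a trained query would shift the weights toward the more informative feature. The same `attention_fuse` call could be applied a second time to combine code semantic features with account transaction features, assuming both are embedded in the same dimension.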
Computer Science