Multithread software plagiarism detection method based on thread slice birthmark

郑庆华,刘烃,刘均,范铭,田振洲
2014-01-01
Abstract:Proposed is a multithread software plagiarism detection method based on a thread slice birthmark, comprising: 1) based on a dynamic instrumentation technology, monitoring a target program to be analyzed during operation, identifying system calls in real time, recording related information related to the target program to be analyzed, such as thread IDs, system call numbers and return values, and then preprocessing same to obtain a valid system call sequence Trace; 2) based on the thread IDs, conducting thread slicing on Trace, so as to generate a series of thread slices Slice which are identified by the thread IDs; 3) on this basis, slicing all the threads of two pieces of software to be detected, so as to respectively generate thread slice birthmarks Birth thereof; 4) based on all the thread slice birthmarks of the two multithread programs to be detected, respectively generating software birthmarks PB 1 and PB 2 thereof; 5) based on the maximum bilateral graph matching, calculating the maximum similarity of the software birthmarks PB 1 and PB 2 ; and 6) deciding whether plagiarism has taken place or not according to the mean value of the similarity of the birthmarks and a given threshold value. The method directly acts on a binary code and does not require the existence of source codes.
What problem does this paper attempt to address?