Comprehensive Similarity Measurement Model Based on Three Algorithms

Nan Liu,Bao Lian Long,Xiao Kun Zheng,Li Fang Han,Tong Qu,Bao Jiang Cui
DOI: https://doi.org/10.4028/www.scientific.net/amr.989-994.1680
2014-01-01
Advanced Materials Research
Abstract:Software source code homologous detection is also called software copy or software clone. It is used to detect the homologous in the source code, by which we can easily find the plagiarism in the code. In this paper, it will discuss the homology detection results based on Text, Token and Abstract Syntax Tree. And will compare the three techniques and raise a model to calculate similarity by synthesizing the results. This model is based on the analysis of a large number of experimental results. Comprehensive similarity calculation model can calculate the respective contribution of the three algorithms and realize integrated computation of similarity according to this respective contribution. Finally, we can get a comprehensive similarity by this integrated similarity calculation model to make the homology detection results more accurate and closer to the actual similarity.
What problem does this paper attempt to address?