Program similarity detection approach based on static lexical tree

XIONG Hao, YAN Hai-hua, HE Jian-ying, ZHAO Chang-hai
DOI: https://doi.org/10.3969/j.issn.1001-3695.2009.04.034
2009-01-01
Abstract: Traditional program similarity detection tools cannot effectively detect some modification strategies, such as advanced lexical transformations and semantic rewriting. This paper first summarizes three plagiarism methods that students often use, then describes an approach to program similarity detection based on a lexical tree. Taking the C programming language as an example, the paper defines the structure units used to generate the lexical tree, derives a structure unit dependence graph by analyzing the main data sequence, the structure control sequence, and the time sequence, and proposes a formalized graph similarity method to carry out code similarity detection. It also introduces a clustering algorithm to find all groups of similar programs. In an evaluation that uses a set of plagiarized programs as test cases, the method proves more effective than JPlag and BuaaSim.
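The last two steps of the abstract (graph similarity, then clustering into groups of similar programs) can be illustrated with a minimal sketch. The abstract does not give the paper's similarity formula or its clustering algorithm, so everything here is an assumption made for illustration: the dependence graph is reduced to a set of labelled edges, similarity is a plain Jaccard overlap of those edge sets, grouping is single-linkage clustering through union-find, and the names `edge_jaccard`, `cluster_similar`, the 0.8 threshold, and the toy data are all hypothetical.

```python
from itertools import combinations


def edge_jaccard(edges_a, edges_b):
    """Jaccard overlap of two sets of labelled dependence edges (an assumed measure)."""
    a, b = set(edges_a), set(edges_b)
    if not a and not b:
        return 1.0  # two empty graphs are treated as identical
    return len(a & b) / len(a | b)


def cluster_similar(graphs, threshold=0.8):
    """Group programs whose pairwise similarity reaches the threshold.

    Uses single-linkage grouping via union-find; this stands in for the
    paper's clustering algorithm, which the abstract does not specify.
    """
    parent = {name: name for name in graphs}

    def find(x):
        # Follow parent links to the representative, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for p, q in combinations(graphs, 2):
        if edge_jaccard(graphs[p], graphs[q]) >= threshold:
            parent[find(p)] = find(q)

    groups = {}
    for name in graphs:
        groups.setdefault(find(name), []).append(name)
    # Report only groups with at least two members.
    return [sorted(members) for members in groups.values() if len(members) > 1]


# Hypothetical toy data: each edge is (source unit, dependence kind, target unit).
programs = {
    "a.c": {("decl", "data", "loop"), ("loop", "control", "assign"), ("assign", "data", "return")},
    "b.c": {("decl", "data", "loop"), ("loop", "control", "assign"), ("assign", "data", "return")},
    "c.c": {("decl", "data", "branch"), ("branch", "control", "call")},
}
groups = cluster_similar(programs)
print(groups)
```

Because the edges are keyed on structure unit kinds and dependence kinds rather than on identifiers, renaming variables would leave such a graph unchanged, which is the kind of robustness to lexical transformation the abstract aims for. A real implementation would need a parser that produces the structure units from C source.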