The Method to Eliminate False Positives of Clone Code Detection with Hash Value and Identifier Conflict Ratio

Yixin BIAN,Tiantian WANG,Xiaohong SU,Peijun MA
2013-01-01
Abstract:There are some disadvantages when detecting syntax similar clone code with clone code detection method based on token method.To solve these problems,this paper proposes a method to eliminate part false detection of clone code detection with statement hash value and identifier conflict ratio.At first,statement hash value is compared to determine the statement structure similarity.Then the identifier conflict ratio is computed to decide the direction of false detection elimination and results.Finally,the statement relative line number is modified to eliminate false detection.The experimental results show that the proposed method can eliminate clone code false detection caused by inserting the same structure statement and the reverse order statements of same structure.It improves the accuracy of clone code detection and clone code related defects,as well as can benefit the study of clone code refactoring.
What problem does this paper attempt to address?