Software Plagiarism Detection:A Survey
TIAN Zhenzhou,LIU Ting,ZHENG Qinghua,TONG Feifei,WU Dinghao,ZHU Sencun,CHEN Kai
DOI: https://doi.org/10.19363/j.cnki.cn10-1380/tn.2016.03.005
2016-01-01
Abstract:With the burst of free and open source software projects, software plagiarism has become a serious threat to the healthy development of the software ecosystem. Researchers, educators, open source developers, and software company managers are paying more and more attention to the problem. Software plagiarism detection is critical to the protection of software intellectual property. This paper provides a review of the state-of-the-art software plagiarism detection techniques. First, the significance and threat models of plagiarism detection are presented, followed by the description and comparison of existing techniques on plagiarism detection. We classify the existing methods into three major categories, including source-code plagiarism detection, software watermark based plagiarism detection and software birthmark based plagiarism detection, according to the scenarios they are designed for and applicable to as well as different principles adopted. Finally, through analyzing the limitations of the existing plagiarism detection techniques, the emerging challenges and practical requirements, we discuss several possible future research directions.