NLP based Deep Learning Approach for Plagiarism Detection

Razvan Rosu,Alexandru Stefan Stoica,Paul Stefan Popescu,Marian Cristian Mihaescu
DOI: https://doi.org/10.37789/ijusi.2020.13.1.4
2020-01-01
Abstract:Plagiarism detection represents an application domain for the NLP research area, which has not been investigated too much by researchers in the context of lately developed attention mechanism and sentence transformers. In this paper, we present a plagiarism detection approach which uses state-of-the-art deep learning techniques in order to provide more accurate results than classical plagiarism detection techniques. This approach goes beyond classical word searching and matching, which is time-consuming and can be easily cheated because it uses attention mechanisms and aims for text encoding and contextualization. In order to get proper insight regarding the system, we investigate three approaches in order to be sure that the results are relevant and well-validated. The experimental results show that the systems that use BERT pre-trained model offers the best results and outperforms GloVe and RoBERTa
What problem does this paper attempt to address?