Similarity Analysis of Law Documents Based on Word2vec

Chunyu Xia, Tieke He, Wenlong Li, Zemin Qin, Zhipeng Zou
DOI: https://doi.org/10.1109/qrs-c.2019.00072
2019-01-01
Abstract: With the growing demand for computer-assisted intelligence in the justice system, deep learning has gradually become an effective tool for intelligent justice. Similarity analysis of law documents is the basis of intelligent justice, but law documents for different types of cases vary widely in format and length, which makes similarity analysis difficult. We therefore propose an approach tailored to law documents that combines Word2vec with a corpus of legal documents. To evaluate the proposed method, we designed two sets of control experiments. The experimental results show that the Word2vec model improves accuracy by 0.20 compared with the bag-of-words (BOW) model, and that adding the law documents corpus increases accuracy by a further 0.05-0.10 over the Word2vec model alone. The combination of Word2vec and the law documents corpus is thus better suited to simple and efficient similarity analysis of law documents.
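The contrast between BOW and Word2vec-style similarity can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how document vectors are built, so averaging word vectors is an assumption here, and `TOY_VECTORS`, `bow_similarity`, and `embedding_similarity` are hypothetical names with hand-picked values standing in for vectors that a real Word2vec model would learn from a legal corpus.

```python
import math
from collections import Counter

# Hypothetical 3-dimensional word vectors, hand-picked for illustration only.
# A trained Word2vec model would learn such vectors from a (legal) corpus.
TOY_VECTORS = {
    "defendant": [0.90, 0.10, 0.00],
    "accused":   [0.85, 0.15, 0.00],
    "stole":     [0.10, 0.90, 0.00],
    "theft":     [0.15, 0.85, 0.10],
    "contract":  [0.00, 0.10, 0.90],
    "breach":    [0.10, 0.00, 0.90],
}


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def bow_similarity(tokens_a, tokens_b):
    """Cosine similarity of raw term-count vectors (bag of words)."""
    counts_a, counts_b = Counter(tokens_a), Counter(tokens_b)
    vocab = sorted(set(counts_a) | set(counts_b))
    return cosine([counts_a[w] for w in vocab], [counts_b[w] for w in vocab])


def embedding_similarity(tokens_a, tokens_b, vectors=TOY_VECTORS):
    """Cosine similarity of averaged word vectors (assumed aggregation)."""
    dim = len(next(iter(vectors.values())))

    def mean_vector(tokens):
        # Out-of-vocabulary tokens are skipped.
        known = [vectors[t] for t in tokens if t in vectors]
        if not known:
            return [0.0] * dim
        return [sum(col) / len(known) for col in zip(*known)]

    return cosine(mean_vector(tokens_a), mean_vector(tokens_b))


# Two documents about the same kind of case with no words in common,
# and one document about an unrelated kind of case.
doc_a = ["defendant", "stole"]
doc_b = ["accused", "theft"]
doc_c = ["contract", "breach"]

print("BOW       a~b:", bow_similarity(doc_a, doc_b))
print("Embedding a~b:", round(embedding_similarity(doc_a, doc_b), 3))
print("Embedding a~c:", round(embedding_similarity(doc_a, doc_c), 3))
```

In this toy setting BOW scores the two related documents as entirely dissimilar because they share no tokens, while the averaged vectors place them close together and keep the unrelated document far away. The effect of training on a domain-specific corpus, as the abstract describes, would show up in the quality of the learned vectors, which this sketch does not model.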