Chinese Judicial Summarising Based On Short Sentence Extraction And Gpt-2

Jie Liu,Jiaye Wu,Xudong Luo
DOI: https://doi.org/10.1007/978-3-030-82147-0_31
2021-01-01
Abstract:This paper studies the compilation of judicial case summarisation in China. Judicial case summaries are made through the abridgement, generalisation, and summarisation of court verdicts. It is a time-consuming, inefficient manual process done by legal professionals. The automatic generation of such summaries could save much time of legal professionals. Court verdicts are generally lengthy, exceeding the maximum word limit for inputs into pre-trained models. Through the observation and analysis of existing data sets, this paper conducts further treatment of these datasets. The dataset of one court verdict is split into five via phrase extraction to obtain the extracts of five key components of a court verdict and the corresponding manual summaries. In this way, we convert one text summarisation problem into five text compression and integration problems for sentences of five different categories. We adopt the GPT-2 pre-trained model, which excels in text generation, to conduct text compression and integration. From that, key points for compression of various parts of the verdict are obtained, which are eventually put together to obtain the summary of the court verdict. This paper divides datasets using extractive algorithms and compresses and integrates them using abstractive algorithms. Our experiments show that our approach proposed by this paper performs well.
What problem does this paper attempt to address?