Research Review On Key Techniques Of Topic-Based News Elements Extraction
Song Qing,Zhang Ying,Zhang Pengzhou
DOI: https://doi.org/10.1109/ICIS.2017.7960060
2017-01-01
Abstract:With the development of computer and network techniques, and the digital Chinese news texts explosion, facing a massive unstructured news data, a better way for knowledge extraction and storage, on the one hand, can help readers understand the core content of news, on the other hand, completed news knowledge accumulation will support the reportage. In recent years, information extraction technology of Chinese text has developed rapidly, and has big progress on Named Entity recognition, Entity Relation Extraction and Event Extraction. In this paper, we propose a topic-based Elements Extraction and storage of news method that based on thematic event frame, and the relationship between the event elements is stored in the form of element expressions to organize the knowledge of news. Expressions can be used to discover and extract event elements, relational instances in the same thematic news text, realize topic-based knowledge of news extraction and storage. This paper uses a variety of Natural Language Processing technologies, including document filtering, classify, cluster, dependency parsing, etc. Based on these theories we designed and realized the topic-based Chinese news texts Event Elements Automatic Extraction and Expressions Automatic Generation System.