Abstract:In this paper, we construct an academic literature knowledge graph based on the relationship between documents to facilitate the storage and research of academic literature data. Keywords are an important type of node in the knowledge graph. To solve the problem that there are no keywords in some documents for several reasons in the process of knowledge graph construction, an improved keyword extraction algorithm called TP-CoGlo-TextRank is proposed by using word frequency, position, word co-occurrence frequency, and a word embedding model. By combining the word frequency and position in the document, the importance of words is distinguished. By introducing the GloVe word-embedding model, which brings the external knowledge of documents into the TextRank algorithm, and combining the internal word co-occurrence frequency in the documents, the word-adjacency relationship is transferred non-uniformly. Finally, the words with the highest scores are combined into phrases if they are adjacent in the original text. The validity of the TP-CoGlo-TextRank algorithm is verified by experiments. On this basis, the Neo4j graph database is used to store and display the academic literature knowledge graph, to provide data support for research tasks such as text clustering, automatic summarization, and question-answering systems.

Keyphrase Extraction Using Knowledge Graphs

Extracting Keyphrases Using Heterogeneous Word Relations.

Exploring simultaneous keyword and key sentence extraction: improve graph-based ranking using wikipedia.

Exploring Simultaneous Keyword and Key Sentence Extraction

Keyphrase Extraction Using Semantic Networks Structure Analysis

Automatic Keyphrase Extraction Via Topic Decomposition.

Can Prior Knowledge Help Graph-Based Methods for Keyword Extraction?

GRAPH: A domain ontology-driven semantic graph auto extraction system

Clustering to Find Exemplar Terms for Keyphrase Extraction.

Automatic Keyphrase Extraction by Bridging Vocabulary Gap.

A Way to Improve Graph-Based Keyword Extraction

Keyphrase extraction based on topic relevance and term association

A semi-supervised key phrase extraction approach: learning from title phrases through a document semantic network

A Graph-Based Keyword Extraction Method for Academic Literature Knowledge Graph Construction

Single Document Keyphrase Extraction Using Neighborhood Knowledge

WordTopic-MultiRank: A New Method for Automatic Keyphrase Extraction.

WikiRank: Improving Keyphrase Extraction Based on Background Knowledge

Semantic Health Knowledge Graph: Semantic Integration of Heterogeneous Medical Knowledge and Services

Heterogeneous Graph Neural Networks for Keyphrase Generation

A Review of Keyphrase Extraction

DivGraphPointer: A Graph Pointer Network for Extracting Diverse Keyphrases