Abstract:Knowledge Graph Construction (KGC) from text unlocks information held within unstructured text and is critical to a wide range of downstream applications. General approaches to KGC from text are heavily reliant on the existence of knowledge bases, yet most domains do not even have an external knowledge base readily available. In many situations this results in information loss as a wealth of key information is held within "non-entities". Domain-specific approaches to KGC typically adopt unsupervised pipelines, using carefully crafted linguistic and statistical patterns to extract co-occurred noun phrases as triples, essentially constructing text graphs rather than true knowledge graphs. In this research, for the first time, in the same flavour as Collobert et al.'s seminal work of "Natural language processing (almost) from scratch" in 2011, we propose a Seq2KG model attempting to achieve "Knowledge graph construction (almost) from scratch". An end-to-end Sequence to Knowledge Graph (Seq2KG) neural model jointly learns to generate triples and resolves entity types as a multi-label classification task through deep learning neural networks. In addition, a novel evaluation metric that takes both semantic and structural closeness into account is developed for measuring the performance of triple extraction. We show that our end-to-end Seq2KG model performs on par with a state of the art rule-based system which outperformed other neural models and won the first prize of the first Knowledge Graph Contest in 2019. A new annotation scheme and three high-quality manually annotated datasets are available to help promote this direction of research.

A Novel End-to-End Multiple Tagging Model for Knowledge Extraction

Knowledge Representation Learning with Entity Descriptions, Hierarchical Types, and Textual Relations

Joint Extraction of Entities and Relations Based on a Novel Tagging Scheme

Joint Extraction of Triple Knowledge Based on Relation Priority.

Bridging Text and Knowledge with Multi-Prototype Embedding for Few-Shot Relational Triple Extraction.

Joint extraction of entities and relations using multi-label tagging and relational alignment

An Entity-Relation Joint Extraction Method Based on Two Independent Sub-Modules From Unstructured Text

A novel joint extraction model based on cross-attention mechanism and global pointer using context shield window

Entity relation joint extraction model combining pointer network and attention mechanism based on relative position embedding

Position-Aware Tagging for Aspect Sentiment Triplet Extraction

A Novel Table-to-Graph Generation Approach for Document-Level Joint Entity and Relation Extraction

Neural Entity Summarization with Joint Encoding and Weak Supervision

Joint Extraction of Entities and Overlapping Relations using Position-Attentive Sequence Labeling

A Cascade Dual-Decoder Model for Joint Entity and Relation Extraction

Span-based joint entity and relation extraction augmented with sequence tagging mechanism

Neural Relation Extraction for Knowledge Base Enrichment

A novel knowledge extraction method based on deep learning in fruit domain

A Novel Chinese Overlapping Entity Relation Extraction Model Using Word-Label Based on Cascade Binary Tagging

A Novel Chinese Entity Relationship Extraction Method Based on the Bidirectional Maximum Entropy Markov Model

Joint Extraction of Entities and Relations Based on a Novel Decomposition Strategy

Seq2KG: An End-to-End Neural Model for Domain Agnostic Knowledge Graph (not Text Graph) Construction from Text