Constructing Information-Lossless Biological Knowledge Graphs from Conditional Statements

Tianwen Jiang, Tong Zhao, Bing Qin, Ting Liu, Nitesh V. Chawla, Meng Jiang
DOI: https://doi.org/10.48550/arXiv.1907.00720
2019-06-26
Abstract: Conditions are essential in the statements of biological literature. Without the conditions (e.g., environment, equipment) that were precisely specified, the facts (e.g., observations) in the statements may no longer be valid. One biological statement has one or multiple fact(s) and/or condition(s). Their subject and object can be either a concept or a concept's attribute. Existing information extraction methods consider neither the role of conditions in a biological statement nor the role of attributes in the subject/object. In this work, we design a new tag schema and propose a deep sequence tagging framework to structure conditional statements into fact and condition tuples from biological text. Experiments demonstrate that our method yields an information-lossless structure of the literature.
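The structure described in the abstract (a statement holding fact and condition tuples, each with a subject and object that is either a concept or a concept's attribute) can be pictured with a small data model. The following is only an illustrative sketch: the class names `Argument`, `RelationTuple`, and `Statement`, the field names, and the example sentence are all hypothetical and are not taken from the paper, which defines its own tag schema.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass(frozen=True)
class Argument:
    """A subject or object: a concept, optionally narrowed to one attribute."""
    concept: str
    attribute: Optional[str] = None  # None means the concept itself

    def __str__(self) -> str:
        if self.attribute:
            return f"{self.concept}.{self.attribute}"
        return self.concept


@dataclass(frozen=True)
class RelationTuple:
    """One (subject, relation, object) tuple; used for facts and conditions."""
    subject: Argument
    relation: str
    obj: Argument


@dataclass
class Statement:
    """A sentence with the fact and condition tuples extracted from it."""
    text: str
    facts: List[RelationTuple] = field(default_factory=list)
    conditions: List[RelationTuple] = field(default_factory=list)

    def is_conditional(self) -> bool:
        # A statement is conditional if at least one condition tuple exists.
        return bool(self.conditions)


# Made-up sentence for illustration only (not from the paper's data).
example = Statement(
    text="gene_A expression was increased in cells treated with drug_B.",
    facts=[
        RelationTuple(Argument("gene_A", "expression"),
                      "increased in",
                      Argument("cells")),
    ],
    conditions=[
        RelationTuple(Argument("cells"), "treated with", Argument("drug_B")),
    ],
)
```

Keeping conditions in their own list, rather than discarding them or folding them into facts, is what lets a downstream knowledge graph retain the circumstances under which each fact holds.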
Computation and Language, Artificial Intelligence