Extracting Chemical-protein interactions via bi-directional long short-term memory network

Wei Wang,Xi Yang,Yuting Xing,Chengkun Wu,Zhuo Song
2017-01-01
Abstract:Understanding chemical-protein interactions (CPI) has been of great importance to drug discovery, precision medicine and basic biomedical research. It is a time-consuming and laborious task to annotate CPIs from numerous unstructured texts. We can employ automated methods to improve the efficiency of this task. In this work, we propose a CPI extraction method based on the bi-directional long shortterm memory network (a specific type of deep neural network), which does not require a complicated feature engineering procedure. Our key strategy is to break each sentence into fragments according to the position of the targeted entity pair and recombine them into chunks, which can help capture the structural knowledge hidden in the sentence. More specifically, our model consists of four network layers, including a feature layer, a Bi-LSTM layer, a pooling layer and a Softmax layer. Our results demonstrate that such a structure is beneficial for effective relation information. Keywords—Chemical-protein interaction; bi-directional long short-term memory network; structural knowledge;
What problem does this paper attempt to address?