A New Method For Automatic Pattern Acquisition To Extract Information From Biomedical Texts

Ml Huang, Xy Zhu, U Ming
2004-01-01
Abstract: Many recent information extraction tasks target the biomedical domain, and some of them aim to extract protein-protein interactions from biomedical texts. This paper presents a new method for automatic pattern acquisition to extract protein interactions. The system automatically generates patterns by aligning the tag sequences of sentences from an unlabeled corpus. To obtain high tagging accuracy, we propose a morphology-based tagging method with a pre-tagging strategy for Brill's tagger. Our method differs from previous pattern acquisition algorithms in three ways: first, it does not require any seed word or pattern before the algorithm runs; second, it does not apply any parsing algorithm; last, because it is based on dynamic programming, it is fast.
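The idea of generating a pattern by aligning two tag sequences with dynamic programming can be illustrated with a minimal sketch. The abstract does not give the scoring scheme or the tag set, so this example substitutes a plain longest-common-subsequence alignment, and the function name `align_tags` and the tags (`PTN` for a protein name, `VB`, `IN`, `DT`) are assumptions made for illustration only.

```python
def align_tags(a, b):
    """Return the longest common subsequence of two tag sequences.

    Illustrative stand-in for the alignment step; not the paper's
    actual scoring scheme, which the abstract does not specify.
    """
    n, m = len(a), len(b)
    # dp[i][j] holds the LCS length of the suffixes a[i:] and b[j:].
    dp = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n - 1, -1, -1):
        for j in range(m - 1, -1, -1):
            if a[i] == b[j]:
                dp[i][j] = dp[i + 1][j + 1] + 1
            else:
                dp[i][j] = max(dp[i + 1][j], dp[i][j + 1])

    # Walk the table from the start to recover the shared tags in order.
    pattern = []
    i = j = 0
    while i < n and j < m:
        if a[i] == b[j]:
            pattern.append(a[i])
            i += 1
            j += 1
        elif dp[i + 1][j] >= dp[i][j + 1]:
            i += 1
        else:
            j += 1
    return pattern


# Hypothetical tag sequences for two sentences (tags are assumed).
s1 = ["PTN", "VB", "IN", "PTN"]
s2 = ["DT", "PTN", "VB", "IN", "DT", "PTN"]
candidate_pattern = align_tags(s1, s2)
```

The table fill takes time proportional to the product of the two sequence lengths, which is consistent with the abstract's point that a dynamic-programming approach is fast and needs neither seed patterns nor a parser.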