Combining Machine Learning with Dictionary Lookup for Chemical Compound and Drug Name Recognition Task

Lishuang Li, Rui Guo, Shanshan Liu, Panpan Zhang, Tianfu Zheng, Degen Huang, Huiwei Zhou
2013-01-01
Abstract: Following the interest in Named Entity Recognition in academic literature generated by the Gene Mention recognition tasks of BioCreative I and II, BioCreative IV aims to promote systems that detect mentions of chemical compounds and drugs. Machine learning methods have been highly successful at identifying gene and protein names, and dictionary lookup can handle the variable naming conventions of chemical and drug names, so we combine the two approaches by using dictionary lookup results as features for the machine learning model. Our system is based on Conditional Random Fields (CRF).
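The idea of treating dictionary results as features can be illustrated with a minimal sketch. It is not the authors' implementation: the toy dictionary, the greedy longest-match strategy, the B/I/O encoding of matches, and all names (`CHEMICAL_DICTIONARY`, `dictionary_tags`, `token_features`, `sentence_features`) are assumptions made here for illustration. The sketch only builds per-token feature mappings and does not train a CRF.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical toy dictionary; a real system would load a large
# chemical and drug lexicon instead.
CHEMICAL_DICTIONARY: Set[Tuple[str, ...]] = {
    ("aspirin",),
    ("sodium", "chloride"),
    ("acetic", "acid"),
}
MAX_ENTRY_LENGTH = max(len(entry) for entry in CHEMICAL_DICTIONARY)


def dictionary_tags(tokens: List[str]) -> List[str]:
    """Greedy longest-match lookup that returns one B/I/O tag per token."""
    lowered = [token.lower() for token in tokens]
    tags = ["O"] * len(tokens)
    i = 0
    while i < len(tokens):
        matched = 0
        # Try the longest candidate span first, then shrink it.
        for length in range(min(MAX_ENTRY_LENGTH, len(tokens) - i), 0, -1):
            if tuple(lowered[i:i + length]) in CHEMICAL_DICTIONARY:
                matched = length
                break
        if matched:
            tags[i] = "B"
            for j in range(i + 1, i + matched):
                tags[j] = "I"
            i += matched
        else:
            i += 1
    return tags


def token_features(tokens: List[str], index: int,
                   dict_tags: List[str]) -> Dict[str, object]:
    """Combine simple surface features with dictionary-match features."""
    token = tokens[index]
    last = len(tokens) - 1
    return {
        "lower": token.lower(),
        "suffix3": token[-3:].lower(),
        "has_digit": any(ch.isdigit() for ch in token),
        "is_capitalized": token[:1].isupper(),
        # Dictionary results enter the model as ordinary features.
        "dict_tag": dict_tags[index],
        "prev_dict_tag": dict_tags[index - 1] if index > 0 else "BOS",
        "next_dict_tag": dict_tags[index + 1] if index < last else "EOS",
    }


def sentence_features(tokens: List[str]) -> List[Dict[str, object]]:
    """Build the feature sequence for one tokenized sentence."""
    dict_tags = dictionary_tags(tokens)
    return [token_features(tokens, i, dict_tags) for i in range(len(tokens))]


if __name__ == "__main__":
    sentence = ["Patients", "received", "sodium", "chloride", "and", "Aspirin", "."]
    for token, features in zip(sentence, sentence_features(sentence)):
        print(token, features["dict_tag"])
```

Because the dictionary match is only one feature among several, a sequence model trained on such mappings can learn to override a wrong or missing dictionary hit from context, which is the motivation for combining the two approaches rather than relying on lookup alone.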