Basic Grammar Rule and Maximum Entropy Based Hybrid Model for Named Entity Recognition

LU Ming, KANG Yu-jie, YU Neng-hai
DOI: https://doi.org/10.3969/j.issn.1000-1220.2012.03.018
2012-01-01
Abstract: Recent years have witnessed the explosive growth of the World Wide Web. The massive volume of unstructured data on the WWW requires information extraction technology to relieve information overload. As a key technology of information extraction, named entity recognition has attracted considerable research interest in recent years. Existing named entity recognition algorithms can be categorized into statistical methods, rule-based methods, and combinations of the two. However, these methods either fail to consider global information or are inefficient because of the statistical model. This paper proposes a hybrid named entity recognition system that combines a basic grammar rule model with a maximum entropy model. The system first recognizes named entities with the basic grammar rule model, which consumes less time than a complex grammar rule model, and then uses partial matching to improve the recall of the basic grammar rule model. After partial matching, the result generated by the grammar rules is refined with the maximum entropy model to obtain the final recognition result. Experiments on the MUC-7 dataset show that the method achieves 94% precision, 91% recall, and a 92.48% F-measure, which is a large improvement over existing systems.
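The reported F-measure can be checked against the reported precision and recall. The sketch below assumes the balanced F-measure (beta = 1, the harmonic mean of precision and recall); the abstract does not state which weighting was used, and the function name `f_measure` is purely illustrative.

```python
def f_measure(precision: float, recall: float, beta: float = 1.0) -> float:
    """Weighted harmonic mean of precision and recall (F-beta).

    beta = 1.0 is an assumption here: the abstract does not name the weighting.
    """
    if precision == 0.0 and recall == 0.0:
        return 0.0  # avoid division by zero when nothing was recognized correctly
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Precision and recall as reported in the abstract for MUC-7.
f1 = f_measure(0.94, 0.91)
print(f"{100 * f1:.2f}%")  # prints 92.48%
```

Under that assumption, 94% precision and 91% recall give 92.48%, which agrees with the figure in the abstract.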