Recognition of Protein/gene Names from Text Using an Ensemble of Classifiers

GuoDong Zhou, Dan Shen, Jie Zhang, Jian Su, SoonHeng Tan
DOI: https://doi.org/10.1186/1471-2105-6-s1-s7
IF: 3.307
2005-01-01
BMC Bioinformatics
Abstract: This paper proposes an ensemble of classifiers for biomedical name recognition in which three classifiers, one Support Vector Machine and two discriminative Hidden Markov Models, are combined effectively using a simple majority voting strategy. In addition, we incorporate three post-processing modules, including an abbreviation resolution module, a protein/gene name refinement module and a simple dictionary matching module, into the system to further improve the performance. Evaluation shows that our system achieves the best performance among 10 systems, with a balanced F-measure of 82.58 on the closed evaluation of the BioCreative protein/gene name recognition task (Task 1A).
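The majority voting step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `majority_vote`, the B/I/O tag set, and the rule of falling back to one designated classifier when no label wins a strict majority are all assumptions made for illustration, since the abstract does not describe how ties are handled.

```python
from collections import Counter
from typing import List, Sequence


def majority_vote(predictions: Sequence[Sequence[str]], fallback: int = 0) -> List[str]:
    """Combine per-token labels from several classifiers by simple majority.

    predictions: one label sequence per classifier, all over the same tokens.
    fallback: index of the classifier to trust when no label has a strict
              majority (an assumed tie-breaking rule, not from the paper).
    """
    if not predictions:
        return []
    length = len(predictions[0])
    if any(len(p) != length for p in predictions):
        raise ValueError("all classifiers must label the same tokens")

    combined = []
    for labels in zip(*predictions):  # labels for one token, one per classifier
        label, count = Counter(labels).most_common(1)[0]
        if count * 2 > len(labels):   # strict majority of the votes
            combined.append(label)
        else:                         # no majority: use the fallback classifier
            combined.append(labels[fallback])
    return combined


# Hypothetical outputs of one SVM and two HMM taggers over four tokens.
svm_tags = ["B", "I", "O", "O"]
hmm1_tags = ["B", "O", "O", "B"]
hmm2_tags = ["B", "I", "O", "I"]

print(majority_vote([svm_tags, hmm1_tags, hmm2_tags]))
```

With three voters, a label needs at least two votes to win; on the last token the three hypothetical taggers all disagree, so the sketch keeps the label of the first classifier.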