Combining Multi-Models For Gene Mention Tagging

Lishuang Li,Degen Huang,Jing Sun
2011-01-01
Abstract:Gene mention tagging is one of the basic tasks in automatic information extraction from biomedical texts. It is still a challenge because of the irregularity of naming and the frequent appearing of new genes. In this paper, six divergent models are implemented with different machine learning algorithms and dissimilar feature sets. The recognition results from the six models are then combined using the simple set operation method (union and intersection) and the voting method to further improve tagging performance. Experiments conducted on the corpus of BioCreative II GM task show that our best performing integration model achieves an F-score of 88.10%, which outperforms most of the state-of-the-art systems.
What problem does this paper attempt to address?