Overview of BioCreative II gene mention recognition
Larry Smith,Lorraine K Tanabe,Rie Johnson nee Ando,Cheng-Ju Kuo,I-Fang Chung,Chun-Nan Hsu,Yu-Shi Lin,Roman Klinger,Christoph M Friedrich,Kuzman Ganchev,Manabu Torii,Hongfang Liu,Barry Haddow,Craig A Struble,Richard J Povinelli,Andreas Vlachos,William A Baumgartner Jr,Lawrence Hunter,Bob Carpenter,Richard Tzong-Han Tsai,Hong-Jie Dai,Feng Liu,Yifei Chen,Chengjie Sun,Sophia Katrenko,Pieter Adriaans,Christian Blaschke,Rafael Torres,Mariana Neves,Preslav Nakov,Anna Divoli,Manuel Maña-López,Jacinto Mata,W John Wilbur
DOI: https://doi.org/10.1186/gb-2008-9-s2-s2
IF: 17.906
2008-01-01
Genome Biology
Abstract:Nineteen teams presented results for the Gene Mention Task at the BioCreative II Workshop. In this task participants designed systems to identify substrings in sentences corresponding to gene name mentions. A variety of different methods were used and the results varied with a highest achieved F 1 score of 0.8721. Here we present brief descriptions of all the methods used and a statistical analysis of the results. We also demonstrate that, by combining the results from all submissions, an F score of 0.9066 is feasible, and furthermore that the best result makes use of the lowest scoring submissions.