BADREX: In situ expansion and coreference of biomedical abbreviations using dynamic regular expressions

Phil Gooch
DOI: https://doi.org/10.48550/arXiv.1206.4522
2012-06-20
Abstract:BADREX uses dynamically generated regular expressions to annotate term definition-term abbreviation pairs, and corefers unpaired acronyms and abbreviations back to their initial definition in the text. Against the Medstract corpus BADREX achieves precision and recall of 98% and 97%, and against a much larger corpus, 90% and 85%, respectively. BADREX yields improved performance over previous approaches, requires no training data and allows runtime customisation of its input parameters. BADREX is freely available from <a class="link-external link-https" href="https://github.com/philgooch/BADREX-Biomedical-Abbreviation-Expander" rel="external noopener nofollow">this https URL</a> as a plugin for the General Architecture for Text Engineering (GATE) framework and is licensed under the GPLv3.
Computation and Language
What problem does this paper attempt to address?