DRTE:A Term Extraction Method for K12 Education

Siliang LI,Bin XU,Yuji YANG
DOI: https://doi.org/10.3969/j.issn.1003-0077.2018.03.014
2018-01-01
Abstract:Term extraction is an essential task where terms are extracted automatically from unstructured text based on a specific domain.Previous methods largely rely on terms'statistic information.However,terms in k12 educa-tion area have serious long-tail effect,which makes it hard to extract terms at the tail part in methods based on sta-tistics.In this paper,we propose DRTE,a method which focus on extracting terms from their definitions and rela-tions.Our method also utilizes term-formation rules and boundary detection strategies.Experiments on math text-books for middle school and high school reveal 82.7% on F1 performance of our method,which significantly outperforms the current method by 40.8%.
What problem does this paper attempt to address?