Diagnosing and Minimizing Semantic Drift in Iterative Bootstrapping Extraction.

Zhixu Li,Ying He,Binbin Gu,An Liu,Hongsong Li,Haixun Wang,Xiaofang Zhou
DOI: https://doi.org/10.1109/TKDE.2017.2782697
IF: 9.235
2018-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract:Semantic drift is a common problem in iterative information extraction. Previous approaches for minimizing semantic drift may incur substantial loss in recall. We observe that most semantic drifts are introduced by a small number of questionable extractions in the earlier rounds of iterations. These extractions subsequently introduce a large number of questionable results, which lead to the semant...
What problem does this paper attempt to address?