Mining Uncertain Sentences With Multiple Instance Learning

Feng Ji,Xipeng Qiu,Xuanjing Huang
DOI: https://doi.org/10.1007/978-3-642-17316-5_49
2010-01-01
Abstract:Distinguishing uncertain information from factual ones in online texts is of essential importance in information extraction, because uncertain information would mislead systems to find useless even fault information. In this paper, we propose a method for detecting uncertain sentences with multiple instance learning (MIL). Based on the basic assumption, we derive two new constraints for estimating the weight vector by defining a probability margin, which is used in an online learning algorithm known as Passive-Aggressive algorithm. To demonstrate the effectiveness of our method, we experiment on the biomedical corpus. Compared with an intuitive method with conventional single instance learning (SIL), our method provide higher performance by raising the performance from 79.07% up to 82.55%, over 3% improvement.
What problem does this paper attempt to address?