THU QUANTA at TAC 2009 KBP and RTE Track.

Fangtao Li, Zhicheng Zheng, Fan Bu, Yang Tang, Xiaoyan Zhu, Minlie Huang
2009-01-01
Abstract: This paper describes the systems of THU QUANTA in the Text Analysis Conference (TAC) 2008. We participated in the Question Answering (QA) track and the Recognizing Textual Entailment (RTE) track. For the QA track, we enhanced a traditional question answering system with sentiment-lexicon-based opinion analysis. The rigid list questions are divided into two categories based on their answer types, and two snippet extraction approaches are proposed to answer squishy list questions. For the RTE track, we design different strategies to recognize true entailment and false entailment: the similarity between hypothesis and text is measured to recognize true entailment, and exact entity and relation mismatches are detected to recognize false entailment. The evaluation results show that the proposed approaches are very effective for the QA and RTE tasks.

In this year's Text Analysis Conference, we participated in two tracks: the Question Answering (QA) track and the Recognizing Textual Entailment (RTE) track. This paper reports on the two systems we developed for these tracks. The TAC 2008 QA track differs from previous TREC QA tracks in that it focuses on finding answers to opinion questions. We build our opinion question answering system by enhancing our earlier system QUANTA(3) with lexicon-based sentiment analysis. We consider not only topic relevance but also the sentiment match between the question and its answers. After analyzing the rigid list questions, we divide them into two categories: Opinion Holder Rigid List questions and Other Types Rigid List questions. Opinion Holder Rigid List questions are those whose answers are blog nicknames or blog holders. Other Types Rigid List questions are those whose answers are the same as for traditional factoid questions, such as person names or cities. Different strategies are designed for these two types of rigid list questions.
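The idea of scoring a candidate answer on both topic relevance and sentiment match can be illustrated with a minimal sketch. This is not the QUANTA implementation: the toy lexicon, the word-overlap relevance measure, the `sentiment_weight` parameter, and all function names are assumptions made for illustration only.

```python
import re

# Hypothetical toy lexicon; the sentiment lexicon used by the authors is not given here.
POSITIVE = {"good", "great", "love", "like", "excellent"}
NEGATIVE = {"bad", "poor", "hate", "dislike", "terrible"}


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def polarity(text):
    """Return +1, -1, or 0 from lexicon hit counts (assumed scheme)."""
    tokens = tokenize(text)
    score = sum(t in POSITIVE for t in tokens) - sum(t in NEGATIVE for t in tokens)
    return (score > 0) - (score < 0)


def topic_relevance(question, candidate):
    """Fraction of question words that also occur in the candidate."""
    q_words = set(tokenize(question))
    c_words = set(tokenize(candidate))
    if not q_words:
        return 0.0
    return len(q_words & c_words) / len(q_words)


def score_candidate(question, candidate, sentiment_weight=0.5):
    """Combine topic relevance with a binary sentiment-match term."""
    relevance = topic_relevance(question, candidate)
    match = 1.0 if polarity(question) == polarity(candidate) else 0.0
    return (1.0 - sentiment_weight) * relevance + sentiment_weight * match


def rank(question, candidates, sentiment_weight=0.5):
    """Sort candidates from highest to lowest combined score."""
    return sorted(
        candidates,
        key=lambda c: score_candidate(question, c, sentiment_weight),
        reverse=True,
    )
```

For a question such as "Why do people like the new phone?", a candidate that shares the topic words but expresses the opposite polarity is ranked below one that matches both topic and sentiment. A real system would replace the word-overlap measure with a proper retrieval score and the toy lexicon with a full sentiment resource.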
We also propose two snippet extraction approaches to answer