Survey on Text Information Extraction from Web Page

Jiajun Bu
2009-01-01
Abstract:This paper supplied a comprehensive survey of the text information extraction from Web page.By presenting and analyzing the development of three kinds of extraction modules and four types of the learning algorithms used in this area,it comprehensively surveyed the relative technologies of the text information extraction from Web page,and analyzed the application scenarios of different technologies.Finally,discussed the difficulties and the trend of the development of this area.
What problem does this paper attempt to address?