Sham Battle Information Extraction System Based on Internet

Li Yuejin,Zhao Jing,Lin Hongfei
DOI: https://doi.org/10.3321/j.issn:1002-8331.2006.14.064
2006-01-01
Abstract:Information Extraction plays an important role in knowledge acquisition and information service.This paper discusses briefly the key techniques for information extraction,and it designs and implements a Sham Battle Information Extraction System(SBIES).It constructs automatically wrappers by machine learning algorithms,applies Maximum Entropy model to conduct Chinese chunk parsing and makes use of a sets of extraction patterns to extract specific information and relationships from relevant HTML documents.Moreover,it also combines the XML expression with the organization of database,so it realizes the presentation and query of information extracted based on Web.It shows higher recall and precision by testing SBIES.
What problem does this paper attempt to address?