Extracting Web table information in cooperative learning activities based on abstract semantic model

Ning Gu, Guowen Wu, Xiaoyuan Wu, Baile Shi
DOI: https://doi.org/10.1109/CSCWD.2001.942309
2001-01-01
Abstract: A great deal of Web table information exists in cooperative learning activities. This paper presents a new method for extracting information from tables in Web documents. The method uses a tabled abstract semantic model to describe complicated tables and to interpret them semantically, which reduces the extraction process's dependence on differences in how tables are constructed. It also uses the characteristics of HTML and natural language processing techniques to design heuristic rules that aid the identification of table items. On this basis, we designed a prototype, "EXTable", and experiments show that it achieves better results.
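The idea of an HTML-based heuristic rule for identifying table items can be illustrated with a minimal Python sketch. It is not the EXTable prototype and does not implement the abstract semantic model; the class `TableGrid`, the function `extract_records`, the sample table, and the rule itself (treat the first row made up entirely of `<th>` cells as the attribute row, falling back to the first row) are all assumptions made for illustration.

```python
from html.parser import HTMLParser


class TableGrid(HTMLParser):
    """Collect table rows as lists of (cell_tag, text) pairs (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.rows = []          # each row is a list of (tag, text)
        self._cell_tag = None   # "th" or "td" while inside a cell, else None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.rows.append([])
        elif tag in ("th", "td"):
            self._cell_tag = tag
            self._text = []

    def handle_data(self, data):
        # Only keep text that appears inside a cell.
        if self._cell_tag is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag in ("th", "td") and self._cell_tag is not None:
            if not self.rows:   # tolerate a cell that appears without <tr>
                self.rows.append([])
            text = " ".join("".join(self._text).split())
            self.rows[-1].append((self._cell_tag, text))
            self._cell_tag = None


def extract_records(html):
    """Map each data row to {attribute name: value} using one assumed heuristic."""
    parser = TableGrid()
    parser.feed(html)
    rows = [row for row in parser.rows if row]
    if not rows:
        return []
    # Assumed heuristic: the attribute row is the first row whose cells are
    # all <th>; if no such row exists, fall back to the first row.
    header_idx = next(
        (i for i, row in enumerate(rows) if all(tag == "th" for tag, _ in row)),
        0,
    )
    header = [text for _, text in rows[header_idx]]
    return [
        dict(zip(header, (text for _, text in row)))
        for row in rows[header_idx + 1:]
    ]


# Illustrative input only; not data from the paper.
SAMPLE = """
<table>
  <tr><th>Course</th><th>Room</th></tr>
  <tr><td>Databases</td><td>101</td></tr>
  <tr><td>Networks</td><td>102</td></tr>
</table>
"""
records = extract_records(SAMPLE)
```

A markup-only rule like this breaks on tables that use `<td>` for headers, spanning cells, or nested tables, which is the kind of layout dependence the abstract says a semantic description of the table is meant to reduce.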