Location Technology of Non-standardized Table Based on DOM Tree

张兴兰,刘岩
DOI: https://doi.org/10.11907/rjdk.161193
2016-01-01
Abstract: Extracting information from web tables has become an important task in ontology construction. It automatically extracts attribute names and values for the ontology, saving a large amount of manual work. However, few studies, in China or abroad, address information extraction from non-standardized tables, and this gap leads to missing information during ontology construction. This paper proposes a heuristic, rule-based algorithm for locating non-standardized tables, which locates such tables with much higher accuracy.