COMMIX: towards effective web information extraction, integration and query answering.

Tengjiao Wang, Shiwei Tang, Dongqing Yang, Jun Gao, Yuqing Wu, Jian Pei
DOI: https://doi.org/10.1145/564691.564774
2002-01-01
Abstract: As the WWW becomes increasingly popular and powerful, how to search information on the Web in a database-like way has become an important research topic. COMMIX, developed by the DB group at Peking University (China), is a system aimed at building a very large database from Web data for information extraction, integration, and query answering. COMMIX has several innovative features, including ontology-based wrapper generation, XML-based information integration, view-based query answering, and a QBE-style XML query interface.