Information Extraction Technology for Web Forums

奚伟鹏,李昕,蒋凯,武港山
DOI: https://doi.org/10.3969/j.issn.1000-3428.2005.04.024
2005-01-01
Abstract:Based on exhausting investigation for link mode and page format pattern in forum sites, the paper proposes an extraction framework to traw semantic topic threads from Web forums and describes the detailed system implementation for the extraction system. It defines a specification f information extraction, and provides a visualization tool to help users generate rules together with the background engine, which automatically download and extracts semantic information units inside Web forums.
What problem does this paper attempt to address?