Study on the Classification and Identification of Blog Pages

ZHENG De-quan, ZHANG Di, ZHAO Tie-jun, YU Hao
DOI: https://doi.org/10.3321/j.issn:1000-436x.2007.12.027
2007-01-01
Abstract: To support content extraction from Blog pages and related research, an automatic way of distinguishing Blog pages from other Web pages is needed. Based on the characteristics of Blog pages, some basic concepts and ideas in the Blog domain are described, and a novel method for identifying Blog pages is proposed that draws on the structure of Blog pages and on keywords. The experimental results show that the method achieves high precision.
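The abstract gives no implementation details, so the following is only a minimal sketch of how structural cues and keywords might be combined to flag a Blog page. The keyword list, the date pattern used as a structural cue, the threshold, and the function names `blog_score` and `looks_like_blog` are all illustrative assumptions, not the authors' method.

```python
import re

# Hypothetical keyword cues; the paper's actual keyword set is not given in the abstract.
BLOG_KEYWORDS = ("blog", "posted by", "comments", "trackback", "permalink", "archive")

# Hypothetical structural cue: dated entries such as 2007-01-01 or 2007/1/1.
DATE_PATTERN = re.compile(r"\b\d{4}[-/]\d{1,2}[-/]\d{1,2}\b")


def blog_score(html: str) -> int:
    """Count keyword cues and add one point if the page holds several dated entries."""
    text = html.lower()
    keyword_hits = sum(1 for kw in BLOG_KEYWORDS if kw in text)
    # Two or more dates on one page is treated as a sign of a post list (assumption).
    structure_hits = 1 if len(DATE_PATTERN.findall(text)) >= 2 else 0
    return keyword_hits + structure_hits


def looks_like_blog(html: str, threshold: int = 3) -> bool:
    """Flag the page as a Blog page when the score reaches the (assumed) threshold."""
    return blog_score(html) >= threshold
```

A real classifier would tune the cues and the threshold on labelled pages and report precision on a held-out set, which this sketch does not attempt.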