Discover web forums via user browsing behavior detection

Jingtian Jiang,Nenghai Yu
DOI: https://doi.org/10.1109/ICCSNT.2011.6182453
2011-01-01
Abstract:Web forums are important services where users can request and exchange information with others.Recently, there are more and more research works on mining knowledge from web forums due to the richness of information. In contrast, there is little work about discovering web forums. However, automatic web forum discovery is crucial for large-scale applications, e.g. a forum search engine. In this paper, we study how to discover web forums from browse log automatically. Although web forums have different layouts or styles, they always have similar implicit navigation paths leading users from their entry pages to thread pages. The implicit navigation paths make the user browsing behavior in web forums different from that in general sites. Thus we propose an efficient approach to discover web forums from browse log via detecting the specific user browsing behavior. We first build a browse map by clustering the browsed URLs, and then detect the browse behavior from the browse map. Next we adopt a few features to determine whether a site is a web forum or not. Experiment results on a large data set show that our approach is very effective and efficient.
What problem does this paper attempt to address?