Detection of Web Communities from Community Cores

Xianchao Zhang, Liang Wang, Yueting Li, Wenxin Liang
DOI: https://doi.org/10.1007/978-3-642-24396-7_28
2011-01-01
Abstract: A Web community, a significant pattern of the Web, is formed by a group of pages that focus on a common topic. Web communities can be located by complete bipartite graphs (CBGs, also known as community cores). Recent studies have tried to determine the community structure of the Web by extracting CBGs. However, the extracted CBGs are far from real communities. Focusing on the problem of automatically determining the ideal sizes of Web communities, we take community cores as the starting point for retrieving complete community structures. With all CBGs available, we propose a two-step heuristic algorithm to identify Web communities. First, the outlines of communities are drawn by gradually merging overlapping community cores. Then, the communities are completed by extending them to include highly referenced members. Experiments on large real-world data collections demonstrate that the proposed algorithm effectively identifies communities that satisfy two conditions: (1) the members within a community are closely related; (2) the connections across community boundaries are sparse.
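The two-step heuristic described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract gives no merging criterion or extension rule, so the overlap measure, the thresholds `min_overlap` and `min_refs`, and the function names `merge_overlapping_cores` and `extend_community` are all assumptions chosen for illustration.

```python
from typing import Dict, List, Set, Tuple

# A community core (CBG) is modelled as (fans, centers): every fan links to every center.
Core = Tuple[Set[str], Set[str]]


def merge_overlapping_cores(cores: List[Core], min_overlap: float = 0.5) -> List[Set[str]]:
    """Step 1 (assumed rule): repeatedly merge two page sets whose shared pages
    make up at least `min_overlap` of the smaller set."""
    communities = [set(f) | set(c) for f, c in cores if f or c]
    merged = True
    while merged:
        merged = False
        for i in range(len(communities)):
            for j in range(i + 1, len(communities)):
                a, b = communities[i], communities[j]
                overlap = len(a & b) / min(len(a), len(b))
                if overlap >= min_overlap:
                    communities[i] = a | b
                    del communities[j]
                    merged = True
                    break
            if merged:
                break  # restart the scan, since the list changed
    return communities


def extend_community(community: Set[str], out_links: Dict[str, Set[str]],
                     min_refs: int = 2) -> Set[str]:
    """Step 2 (assumed rule): add outside pages that are linked to by at least
    `min_refs` current members."""
    counts: Dict[str, int] = {}
    for page in community:
        for target in out_links.get(page, ()):
            if target not in community:
                counts[target] = counts.get(target, 0) + 1
    return community | {p for p, n in counts.items() if n >= min_refs}


# Toy data (hypothetical): two overlapping cores and one separate core.
cores = [({"a", "b"}, {"x", "y"}), ({"b", "c"}, {"y", "z"}), ({"p", "q"}, {"r", "s"})]
out_links = {
    "a": {"x", "y", "w"}, "b": {"x", "y", "w"}, "c": {"y", "z", "v"},
    "p": {"r", "s"}, "q": {"r", "s"},
}
sketches = merge_overlapping_cores(cores)
communities = [extend_community(c, out_links) for c in sketches]
```

On this toy input the first two cores share the pages `b` and `y` and are merged, while the third stays separate; the page `w` is then pulled into the merged community because two members link to it, whereas `v` is left out with only one reference.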