Measuring Structural Correlations in Graphs
Ziyu Guan,Xifeng Yan
2015-01-01
Abstract:The rise of the Web, social networks, and bioinformatics has presented the scientists with numerous graphs, each consisting of millions of nodes and edges. Hidden in these large datasets are the answers to important questions in networking, sociology, business, and biology. These graphs not only have topological structures but also contain events/activities that occurred on their nodes. For example, an eBay customer could sell or bid on a product; a Facebook user could play a Zynga game with friends; a computer in the Internet could suffer various attacks from hackers. This complex combination raises new research problems in the graph data analysis (Ester et al., 2006; Moser et al., 2009; Silva, Meira, and Zaki, 2010).Among different events taking place in a network, some exhibit strong correlations with the network structure, while others do not. Such structural correlations might shed light on viral influence existing in the corresponding network, which is the key to many research problems in product marketing (Chen, Wang, and Wang, 2010), online advertisement (Bao and Chang, 2010), and recommendation (Kautz, Selman, and Shah, 1997). Figure 2.1 shows the distribution of three different events over the same graph. We can easily instantiate Figure 2.1 into different application scenarios. They could be three different products bought by members in a social network, or three different intrusion alerts raised by computers in a computer network. In terms of the products, dark nodes in Figure 2.1 a–c represent the members who purchased the products A, Band C, respectively. Intuitively, Figure 2.1 shows that in this network, people who bought …