Distribution and dependence of extremes in network sampling processes

Konstantin Avrachenkov,Natalia M. Markovich,Jithin K. Sreedharan
DOI: https://doi.org/10.1186/s40649-015-0018-3
2015-07-22
Computational Social Networks
Abstract:Abstract We explore the dependence structure in the sampled sequence of complex networks. We consider randomized algorithms to sample the nodes and study extremal properties in any associated stationary sequence of characteristics of interest like node degrees, number of followers, or income of the nodes in online social networks, which satisfy two mixing conditions. Several useful extremes of the sampled sequence like the k th largest value, clusters of exceedances over a threshold, and first hitting time of a large value are investigated. We abstract the dependence and the statistics of extremes into a single parameter that appears in extreme value theory called extremal index (EI). In this work, we derive this parameter analytically and also estimate it empirically. We propose the use of EI as a parameter to compare different sampling procedures. As a specific example, degree correlations between neighboring nodes are studied in detail with three prominent random walks as sampling techniques.
What problem does this paper attempt to address?