Abstract: In this work, we study the properties of sampling sets on families of large graphs by leveraging the theory of graphons and graph limits. To this end, we extend to graphon signals the notion of removable and uniqueness sets, which was developed originally for the analysis of signals on graphs. We state the formal definition of a $\Lambda$-removable set and conditions under which a bandlimited graphon signal can be represented in a unique way when its samples are obtained from the complement of a given $\Lambda$-removable set in the graphon. By leveraging such results we show that graphon representations of graphs and graph signals can be used as a common framework to compare sampling sets between graphs with different numbers of nodes and edges, and different node labelings. Additionally, given a sequence of graphs that converges to a graphon, we show that the sequences of sampling sets whose graphon representation is identical in $[0,1]$ are convergent as well. We exploit the convergence results to provide an algorithm that obtains approximately close to optimal sampling sets. Performing a set of numerical experiments, we evaluate the quality of these sampling sets. Our results open the door for the efficient computation of optimal sampling sets in graphs of large size.
What problem does this paper attempt to address?
The paper addresses the following problem: in large-scale graphs, how can a set of sampling nodes be selected so that bandlimited graph signals are uniquely determined by, and can be reconstructed from, their samples? Specifically, the authors use graphons from graph limit theory to extend the notions of removable sets and uniqueness sets from graph signal processing. This lets them compare sampling sets between graphs with different numbers of nodes and edges, and it provides a general framework for finding approximately optimal sampling sets in large-scale graphs.
The following are the core contributions of the paper and the problems they solve:
1. **Extension of the concepts of uniqueness sets and removable sets**:
- The authors extend uniqueness sets and removable sets from graph signal processing to graphon signal processing, so that both notions apply to bandlimited graphon signals.
- The definition reads as follows. For an open set $S \subset [0,1]$, if there exists $\Lambda > 0$ such that
\[
\|T_W x\|_2 < \Lambda \|x\|_2, \quad \forall x \in L^2(S),
\]
then $S$ is called a $\Lambda$-removable set. Here $T_W$ is the graphon shift operator and $L^2(S)$ is the space of square-integrable functions supported in $S$.
2. **Comparison of uniqueness sets of different graphs**:
- Using graphon representations, the authors prove that uniqueness sets of arbitrary graphs can be compared, and that the comparison can be measured by the difference between the corresponding graphon shift operators.
- This gives a quantitative relationship between the uniqueness sets of different graphs, which helps explain how uniqueness sets are inherited between structurally similar graphs.
3. **Uniqueness sets in convergent graph sequences**:
- The authors prove that, for a graph sequence that converges to a graphon in the $L^2$ norm, the corresponding sequence of uniqueness sets also converges to a limiting uniqueness set.
- This result shows that the uniqueness set remains structurally invariant along the convergent graph sequence.
4. **An algorithm for approximately optimal sampling sets**:
- Based on the theoretical results above, the authors introduce a general algorithm for finding approximately optimal sampling sets in sequences of large-scale graphs that converge to a graphon.
- The algorithm addresses the fact that existing methods do not scale to large graphs: it computes optimal sampling sets on small graphs and transfers them to larger graphs in the sequence.
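
The transfer step in contribution 4 can be illustrated with a minimal Python sketch. It is not the authors' algorithm. It assumes that node `i` of an `n`-node graph is identified with the interval $[i/n, (i+1)/n)$ of $[0,1]$, that both graphs are labeled consistently with the limiting graphon, and that a node of the large graph is selected when the midpoint of its interval falls inside the interval of a sampled node of the small graph. The function name `transfer_sampling_set` and the midpoint rule are hypothetical choices made for illustration only.

```python
def transfer_sampling_set(small_set, n_small, n_large):
    """Map a sampling set from an n_small-node graph to an n_large-node graph.

    Assumption (illustrative, not taken from the paper): node i of an n-node
    graph corresponds to the interval [i/n, (i+1)/n) in [0, 1]. A node j of
    the large graph is selected when the midpoint of its interval lies in the
    interval of a sampled node of the small graph.
    """
    if n_small <= 0 or n_large <= 0:
        raise ValueError("graph sizes must be positive")
    chosen = set(small_set)
    if any(i < 0 or i >= n_small for i in chosen):
        raise ValueError("sampling set contains an invalid node index")

    selected = []
    for j in range(n_large):
        # Midpoint of node j's interval is (2j + 1) / (2 * n_large).
        # Integer arithmetic gives the index of the small-graph interval
        # containing that midpoint without floating-point rounding.
        i = ((2 * j + 1) * n_small) // (2 * n_large)
        if i in chosen:
            selected.append(j)
    return selected


if __name__ == "__main__":
    # Sampling set {0, 2} on a 4-node graph, transferred to an 8-node graph.
    print(transfer_sampling_set([0, 2], n_small=4, n_large=8))
```

Because the mapping only depends on positions in $[0,1]$, the same sampled region can be reused for any larger graph in the sequence without rerunning the selection on the large graph.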
Together, these contributions address the problem of selecting sampling nodes in large-scale graphs, especially when the graphs are structurally similar but differ in their numbers of nodes. In addition, the numerical experiments indicate that the algorithm significantly outperforms random sampling.
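
To make the removable-set condition from contribution 1 concrete, the following Python sketch estimates the smallest constant for which a bound of the form $\|T_W x\|_2 \le \Lambda \|x\|_2$ can hold on a node subset. It relies on an assumption not stated in the summary above: $W$ is the step graphon induced by an $n$-node graph with nonnegative adjacency matrix $A$. In that case the supremum of $\|T_W x\|_2 / \|x\|_2$ over signals supported on the intervals of a node subset equals the largest singular value of the corresponding columns of $A/n$. The function name `removable_constant` is hypothetical, and power iteration is used only to keep the example free of external dependencies.

```python
import math


def removable_constant(adj, subset, iters=200):
    """Estimate sup ||T_W x|| / ||x|| over signals supported on `subset`.

    Assumptions (illustrative): `adj` is a square list-of-lists adjacency
    matrix with nonnegative entries, and W is its induced step graphon.
    The result is the largest singular value of adj[:, subset] / n,
    computed by power iteration on B^T B.
    """
    n = len(adj)
    cols = sorted(set(subset))
    k = len(cols)
    if k == 0:
        return 0.0

    # B holds the columns of adj indexed by `subset`, scaled by 1/n.
    B = [[adj[i][j] / n for j in cols] for i in range(n)]

    # An all-ones start has nonzero overlap with the leading singular
    # vector when B is nonnegative.
    v = [1.0 / math.sqrt(k)] * k
    sigma = 0.0
    for _ in range(iters):
        u = [sum(B[i][c] * v[c] for c in range(k)) for i in range(n)]  # u = B v
        w = [sum(B[i][c] * u[i] for i in range(n)) for c in range(k)]  # w = B^T u
        norm_w = math.sqrt(sum(x * x for x in w))
        if norm_w == 0.0:
            return 0.0  # the selected columns are all zero
        sigma = math.sqrt(norm_w)  # ||B^T B v|| approaches sigma_max^2
        v = [x / norm_w for x in w]
    return sigma


if __name__ == "__main__":
    # Complete graph on three nodes, candidate subset {0, 1}.
    k3 = [[0, 1, 1], [1, 0, 1], [1, 1, 0]]
    print(removable_constant(k3, [0, 1]))
```

Under the strict inequality used in the definition above, any $\Lambda$ larger than the returned value satisfies the bound for nonzero signals on that subset, so smaller returned values correspond to subsets that are removable for a wider range of $\Lambda$.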