Abstract:There is increasing appetite for analysing populations of network data due to the fast-growing body of applications demanding such methods. While methods exist to provide readily interpretable summaries of heterogeneous network populations, these are often descriptive or ad hoc, lacking any formal justification. In contrast, principled analysis methods often provide results difficult to relate back to the applied problem of interest. Motivated by two complementary applied examples, we develop a Bayesian framework to appropriately model complex heterogeneous network populations, whilst also allowing analysts to gain insights from the data, and make inferences most relevant to their needs. The first application involves a study in Computer Science measuring human movements across a University. The second analyses data from Neuroscience investigating relationships between different regions of the brain. While both applications entail analysis of a heterogeneous population of networks, network sizes vary considerably. We focus on the problem of clustering the elements of a network population, where each cluster is characterised by a network representative. We take advantage of the Bayesian machinery to simultaneously infer the cluster membership, the representatives, and the community structure of the representatives, thus allowing intuitive inferences to be made. The implementation of our method on the human movement study reveals interesting movement patterns of individuals in clusters, readily characterised by their network representative. For the brain networks application, our model reveals a cluster of individuals with different network properties of particular interest in Neuroscience. The performance of our method is additionally validated in extensive simulation studies.

Bayesian Bi-clustering Methods with Applications in Computational Biology

Bayesian Generalized Biclustering Analysis Via Adaptive Structured Shrinkage.

Bayesian Clustering with Variable and Transformation Selections

Bayesian model-based clustering for populations of network data

Bayesian Nonparametric Graph Clustering

Sparse Bayesian Hierarchical Modeling of High-dimensional Clustering Problems

A Bayesian hierarchical hidden Markov model for clustering and gene selection: Application to kidney cancer gene expression data

Bayesian network-driven clustering analysis with feature selection for high-dimensional multi-modal molecular data

Bayesian Nonparametric Clustering with Feature Selection for Spatially Resolved Transcriptomics Data

Bayesian mixtures of common factor analyzers: Model, variational inference, and applications

Generalized Bayesian nonparametric clustering framework for high-dimensional spatial omics data

Robust knowledge-guided biclustering for multi-omics data

A Clustering Approach to Integrative Analysis of Multiomic Cancer Data

An interpretable Bayesian clustering approach with feature selection for analyzing spatially resolved transcriptomics data

A probabilistic model-based bi-clustering method for single-cell transcriptomic data analysis

A New Strategy of Cooperativity of Biclustering and Hierarchical Clustering: a Case of Analyzing Yeast Genomic Microarray Datasets.

Integrated Simultaneous Analysis of Different Biomedical Data Types with Exact Weighted Bi-Cluster Editing.

Bayesian temporal biclustering with applications to multi-subject neuroscience studies

Hidden Markov Models on Variable Blocks with a Modal Clustering Algorithm and Applications

A clustering approach to integrative analyses of multiomic cancer data

Bayesian Variable Selection in Multinomial Probit Model for Classifying High-Dimensional Data