Abstract:There is increasing appetite for analysing populations of network data due to the fast-growing body of applications demanding such methods. While methods exist to provide readily interpretable summaries of heterogeneous network populations, these are often descriptive or ad hoc, lacking any formal justification. In contrast, principled analysis methods often provide results difficult to relate back to the applied problem of interest. Motivated by two complementary applied examples, we develop a Bayesian framework to appropriately model complex heterogeneous network populations, whilst also allowing analysts to gain insights from the data, and make inferences most relevant to their needs. The first application involves a study in Computer Science measuring human movements across a University. The second analyses data from Neuroscience investigating relationships between different regions of the brain. While both applications entail analysis of a heterogeneous population of networks, network sizes vary considerably. We focus on the problem of clustering the elements of a network population, where each cluster is characterised by a network representative. We take advantage of the Bayesian machinery to simultaneously infer the cluster membership, the representatives, and the community structure of the representatives, thus allowing intuitive inferences to be made. The implementation of our method on the human movement study reveals interesting movement patterns of individuals in clusters, readily characterised by their network representative. For the brain networks application, our model reveals a cluster of individuals with different network properties of particular interest in Neuroscience. The performance of our method is additionally validated in extensive simulation studies.

Optimal Bayesian estimators for latent variable cluster models

Bayesian model selection for the latent position cluster model for Social Networks

Bayesian approach to clustering real value, categorical and network data: solution via variational methods

A Bayesian approach for clustering and exact finite-sample model selection in longitudinal data mixtures

The parsimonious Gaussian mixture models with partitioned parameters and their application in clustering

Bayesian mixtures of common factor analyzers: Model, variational inference, and applications

A Bayesian Approach to Restricted Latent Class Models for Scientifically-Structured Clustering of Multivariate Binary Outcomes

Simultaneous Bayesian Clustering and Model Selection with Mixture of Robust Factor Analyzers

Bayesian estimation of cluster covariance matrices of unknown form

Bayesian clustering of high-dimensional data via latent repulsive mixtures

A Probabilistic Approach to Latent Cluster Analysis

Bayesian Clustering with Variable and Transformation Selections

Search Algorithms and Loss Functions for Bayesian Clustering

Optimal Clustering of Discrete Mixtures: Binomial, Poisson, Block Models, and Multi-layer Networks

Model-based Clustering with Sparse Covariance Matrices

Optimal Clustering under Uncertainty

Bayesian Clustering for Ordinal Data Based on Finite Mixture Models of Latent Variables

Bayesian Decision Process for Budget-efficient Crowdsourced Clustering

Bayesian model-based clustering for populations of network data

Mixture of Latent Trait Analyzers for Model-Based Clustering of Categorical Data

Model-based clustering based on sparse finite Gaussian mixtures