Collaborative graphical lasso

Alessio Albanese,Wouter Kohlen,Pariya Behrouzi
2024-03-27
Abstract:In recent years, the availability of multi-omics data has increased substantially. Multi-omics data integration methods mainly aim to leverage different molecular data sets to gain a complete molecular description of biological processes. An attractive integration approach is the reconstruction of multi-omics networks. However, the development of effective multi-omics network reconstruction strategies lags behind. This hinders maximizing the potential of multi-omics data sets. With this study, we advance the frontier of multi-omics network reconstruction by introducing "collaborative graphical lasso" as a novel strategy. Our proposed algorithm synergizes "graphical lasso" with the concept of "collaboration", effectively harmonizing multi-omics data sets integration, thereby enhancing the accuracy of network inference. Besides, to tackle model selection in this framework, we designed an ad hoc procedure based on network stability. We assess the performance of collaborative graphical lasso and the corresponding model selection procedure through simulations, and we apply them to publicly available multi-omics data. This demonstrated collaborative graphical lasso is able to reconstruct known biological connections and suggest previously unknown and biologically coherent interactions, enabling the generation of novel hypotheses. We implemented collaborative graphical lasso as an R package, available on CRAN as coglasso.
Methodology,Molecular Networks
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the lagging development of network reconstruction strategies in multi - omics data integration. Specifically: - **Challenges in multi - omics data integration**: In recent years, the availability of multi - omics data has increased significantly. However, the development of effective multi - omics network reconstruction strategies has been relatively slow, which hinders maximizing the potential of multi - omics data sets. - **Limitations of existing methods**: Existing Gaussian graphical model (GGMs) estimation strategies are mainly designed for a single data set, and multi - omics data has inherent multi - detection characteristics, making it difficult to directly apply these strategies. - **The proposed new method**: To solve the above problems, this paper introduces a new algorithm - collaborative graphical lasso (coglasso). By combining the graphical lasso with the concept of collaboration, this algorithm effectively coordinates the integration of multi - omics data sets, thereby improving the accuracy of network inference. - **Model selection problem**: To perform model selection in this framework, the author designs a special procedure based on network stability. In summary, this research aims to advance the frontier of multi - omics network reconstruction by proposing the coglasso algorithm, in order to better integrate different molecular data sets, provide a complete molecular description of biological processes, and generate new biological hypotheses.