Multi-class boosting for the analysis of multiple incomplete views on microbiome data

Andrea Simeon,Miloš Radovanović,Tatjana Lončar-Turukalo,Michelangelo Ceci,Sanja Brdar and Gianvito Pio
DOI: https://doi.org/10.1186/s12859-024-05767-w
IF: 3.307
2024-05-15
BMC Bioinformatics
Abstract:Microbiome dysbiosis has recently been associated with different diseases and disorders. In this context, machine learning (ML) approaches can be useful either to identify new patterns or learn predictive models. However, data to be fed to ML methods can be subject to different sampling, sequencing and preprocessing techniques. Each different choice in the pipeline can lead to a different view (i.e., feature set) of the same individuals, that classical (single-view) ML approaches may fail to simultaneously consider. Moreover, some views may be incomplete, i.e., some individuals may be missing in some views, possibly due to the absence of some measurements or to the fact that some features are not available/applicable for all the individuals. Multi-view learning methods can represent a possible solution to consider multiple feature sets for the same individuals, but most existing multi-view learning methods are limited to binary classification tasks or cannot work with incomplete views.
biochemical research methods,biotechnology & applied microbiology,mathematical & computational biology
What problem does this paper attempt to address?