Abstract:Accurate decision making in precision oncology depends on integration of multimodal molecular information, such as the genetic data, gene expression, protein abundance, and epigenetic measurements. Deep learning methods facilitate integration of heterogeneous datasets. However, almost all published deep learning-based bulk multi-omics integration methods have constrained usability. They suffer from lack of transparency, modularity, deployability, and are applicable exclusively to narrow tasks. To address these limitations, we introduce Flexynesis, a versatile tool designed with usability, and adaptability in mind. Flexynesis streamlines data processing, enforces structured data splitting, and ensures rigorous model evaluation. It offers unsupervised feature selection, different omics layer fusion options, and hyperparameter tuning. Users can choose from distinct architectures: fully connected networks, variational autoencoders, multi-triplet networks, graph neural networks, and cross-modality encoding networks. Each model is complemented with a straightforward input interface and standardized training, evaluation, and feature importance quantification methods, enabling easy incorporation into data integration pipelines. For improved user experience, Flexynesis supports features such as on-the-fly task determination and compatibility with regression, classification, and survival modeling. It accommodates multi-task prediction of a mixture of numerical/categorical outcome variables with a tolerance for missing labels. We also developed an extensive benchmarking pipeline, showcasing the tool's capability across diverse real-life datasets. This toolset should make deep-learning based bulk multi-omics data integration in the context of clinical/pre-clinical data analysis and marker discovery more accessible to a wider audience with or without experience in deep-learning development. Flexynesis is available at https://github.com/BIMSBbioinfo/flexynesis and can be installed from https://pypi.org/project/flexynesis/.

Flexiplex: a versatile demultiplexer and search tool for omics data

Flexiplex: A versatile demultiplexer and search tool for omics data

deMULTIplex2: robust sample demultiplexing for scRNA-seq

More cells, more doublets in highly multiplexed single-cell data

Enriched Methylomes of Low-input and Fragmented DNA Using Fragment Ligation EXclusive Methylation Sequencing (FLEXseq)

Bayexer: an Accurate and Fast Bayesian Demultiplexer for Illumina Sequences

coherent genetic demultiplexing in single-cell and single-nuclei experiments

Demuxalot: scaled up genetic demultiplexing for single-cell sequencing

Ensemblex: an accuracy-weighted ensemble genetic demultiplexing framework for population-scale scRNAseq sample pooling

Fleximer: Accurate Quantification of RNA-Seq via Variable-Length k-mers

Demuxafy: improvement in droplet assignment by integrating multiple single-cell demultiplexing and doublet detection methods

Sample-multiplexing approaches for single-cell sequencing

GMM-Demux: sample demultiplexing, multiplet detection, experiment planning, and novel cell-type verification in single cell sequencing

demuxSNP: supervised demultiplexing single-cell RNA sequencing using cell hashing and SNPs

SlopMap: a software application tool for quick and flexible identification of similar sequences using exact k-mer matching

Flexynesis: A deep learning framework for bulk multi-omics data integration for precision oncology and beyond

A hybrid demultiplexing strategy that improves performance and robustness of cell hashing

LongGF: Computational Algorithm and Software Tool for Fast and Accurate Detection of Gene Fusions by Long-Read Transcriptome Sequencing

BiomiX, a User-Friendly Bioinformatic Tool for Automatized Multiomics Data Analysis and Integration

Overloading And unpacKing (OAK) - droplet-based combinatorial indexing for ultra-high throughput single-cell multiomic profiling

MeFiT: merging and filtering tool for illumina paired-end reads for 16S rRNA amplicon sequencing