bioScience: A new python science library for high-performance computing bioinformatics analytics

Aurelio López-Fernández,Francisco A. Gómez-Vela,Jorge Gonzalez-Dominguez,Parameshachari Bidare-Divakarachari
DOI: https://doi.org/10.1016/j.softx.2024.101666
IF: 2.868
2024-02-21
SoftwareX
Abstract:BioScience is an advanced Python library designed to satisfy the growing data analysis needs in the field of bioinformatics by leveraging High-Performance Computing (HPC). This library encompasses a vast multitude of functionalities, from loading specialized gene expression datasets (microarrays, RNA-Seq, etc.) to preprocessing techniques and data mining algorithms suitable for this type of datasets. BioScience is distinguished by its capacity to manage large amounts of biological data, providing users with efficient and scalable tools for the analysis of genomic and transcriptomic data through the use of parallel architectures for clusters composed of CPUs and GPUs.
computer science, software engineering
What problem does this paper attempt to address?