Design and implementation of a scalable high-performance computing (HPC) cluster for omics data analysis: achievements, challenges and recommendations in LMICs

Kais Ghedira, Oussema Khamessi, Chaima Hkimi, Selim Kamoun, Nader Dhamer, Kamel Daassi, Wassim Ben Salah, Houcemeddine Othman, Wahbi Belhadj, Youssef Ghorbal
DOI: https://doi.org/10.1093/gigascience/giae060
IF: 7.658
2024-01-02
GigaScience
Abstract: Background: The advent of high-throughput technologies, including cutting-edge sequencing devices, has revolutionized biomedical data generation and processing. Nevertheless, big data applications require novel hardware and software for parallel computing and management to handle the ever-growing data size and analysis complexity. On-premise high-performance computing (HPC) is increasingly used in biomedical research for big data stewardship. Findings: In this work, we present Tunisia's first high-performance computational infrastructure for omics research. Method: We highlight measurements and recommendations that may help institutions in other low- and middle-income countries that are eager to implement local HPC facilities for bioinformatics research and omics data analyses.