Enabling data and compute intensive workflows in bioinformatics

Gaurang Mehta, Ewa Deelman, James A. Knowles, Ting Chen, Ying Wang, Jens Vöckler, Steven Buyske, Tara C. Matise
DOI: https://doi.org/10.1007/978-3-642-29740-3_4
2011-01-01
Abstract: Accelerated growth in the field of bioinformatics has resulted in large data sets being produced and analyzed. With this rapid growth has come the need to analyze these data in a quick, easy, scalable, and reliable manner on a variety of computing infrastructures, including desktops, clusters, grids, and clouds. This paper presents the application of workflow technologies, specifically Pegasus WMS, a robust scientific workflow management system, to a variety of bioinformatics projects spanning RNA sequencing, proteomics, and data quality control in population studies using GWAS data.