StreamFlow: cross-breeding cloud with HPC

Iacopo Colonnelli, Barbara Cantalupo, Ivan Merelli, Marco Aldinucci
DOI: https://doi.org/10.1109/TETC.2020.3019202
2020-08-31
Abstract: Workflows are among the most commonly used tools in a variety of execution environments. Many of them target a specific environment; few make it possible to execute an entire workflow across different environments, e.g. Kubernetes and batch clusters. We present a novel approach to workflow execution, called StreamFlow, that complements the workflow graph with a declarative description of potentially complex execution environments and that enables execution across multiple sites that do not share a common data space. StreamFlow is then exemplified on a novel bioinformatics pipeline for single-cell transcriptomic data analysis.
Distributed, Parallel, and Cluster Computing