Workflow scheduling algorithm considering intermediate data storage for hybrid amazon instances

Zitai MA,Jian CAO,Yan YAO
DOI: https://doi.org/10.13196/j.cims.2017.05.008
2017-01-01
Abstract:To select storage datasets under limited storage space or storage cost,a heuristic storage intermediate datasets considered selectivity was proposed with cloud characteristics,and a workflow scheduling algorithm combined Spot Instance (SI) with On-demand Instance (OI) was put forward.According to the global weight,the tasks were grouped and scheduled,and a trade-off between generation dataset cost and storage cost was considered to manage the storage the intermediate datasets.Through simulating the cloud environment,performing experiments and comparing with other scheduling and storage strategies,the results demonstrated that the proposed algorithm was effective in reducing cost while satisfying the deadline constraints of workflows.
What problem does this paper attempt to address?