Cost-effective Data Analytics Across Multiple Cloud Regions

Junyi Shu, Xin Jin, Yun Ma, Xuanzhe Liu, Gang Huang
DOI: https://doi.org/10.1145/3472716.3472842
2024-01-01
Abstract: We propose a cloud-native data analytics engine that reduces the cost of processing data stored across geographically distributed cloud regions. A job is split into subtasks, which are placed across regions based on factors including the prices of compute resources and data transmission. We present its architecture, which leverages existing cloud infrastructure, and discuss the major challenges of its system design. Preliminary experiments show that the cost is reduced by 15.1% for a decision support query on a four-region public cloud setup.
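
The placement idea in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not describe one, so this uses an exhaustive search over placements as a stand-in. The task names, region names, prices, and data sizes are all hypothetical values chosen for illustration, and the cost model (compute hours times a per-region price, plus gigabytes moved times the source region's egress price) is an assumption.

```python
from itertools import product


def placement_cost(placement, tasks, edges, compute_price, egress_price):
    """Cost of one placement (task name -> region) under the assumed model."""
    cost = 0.0
    # Compute cost: each task pays its region's hourly price.
    for task, hours in tasks.items():
        cost += hours * compute_price[placement[task]]
    # Transfer cost: charged only when data crosses a region boundary,
    # at the egress price of the region that sends the data.
    for (src, dst), gigabytes in edges.items():
        if placement[src] != placement[dst]:
            cost += gigabytes * egress_price[placement[src]]
    return cost


def cheapest_placement(tasks, edges, compute_price, egress_price, pinned):
    """Try every region for each unpinned task and keep the cheapest result."""
    regions = sorted(compute_price)
    free = [t for t in tasks if t not in pinned]
    best, best_cost = None, float("inf")
    for choice in product(regions, repeat=len(free)):
        placement = dict(pinned)
        placement.update(zip(free, choice))
        cost = placement_cost(placement, tasks, edges, compute_price, egress_price)
        if cost < best_cost:
            best, best_cost = placement, cost
    return best, best_cost


# Hypothetical job: two scans pinned where their data lives, one movable join.
tasks = {"scan_a": 1.0, "scan_b": 1.0, "join": 2.0}          # compute hours
edges = {("scan_a", "join"): 10.0, ("scan_b", "join"): 50.0}  # GB transferred
compute_price = {"us": 1.0, "eu": 1.2}                        # price per hour
egress_price = {"us": 0.02, "eu": 0.05}                       # price per GB
pinned = {"scan_a": "us", "scan_b": "eu"}

best, best_cost = cheapest_placement(tasks, edges, compute_price, egress_price, pinned)
print(best, round(best_cost, 2))
```

With these made-up numbers the join lands in the region with the more expensive compute, because moving the larger input out of that region would cost more than the compute savings. Exhaustive search grows exponentially with the number of movable subtasks, so it only serves to show the trade-off, not how a real engine would search.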