Abstract: The heterogeneous edge-cloud computing paradigm can offer a better setting for deploying scientific workflows than traditional distributed computing or cloud computing environments. Because scientific datasets differ in size and some of them must be kept private, finding a data placement strategy that minimizes both data transmission and placement cost remains a difficult problem. To address this issue, this paper combines the advantages of edge and cloud computing to construct a data placement model that balances data transfer time and data placement cost using intelligent computation. The most difficult research challenge the model addresses is accounting for the many constraints of this hybrid computing environment, including datasets shared within individual workflows and among multiple workflows across various geographical regions. Based on the constructed model, the study proposes a new data placement strategy named DE-DPSO-DPS, which uses a discrete particle swarm optimization algorithm with differential evolution (DE-DPSO-DPA) to distribute these scientific datasets. The strategy not only considers characteristics such as the number and storage capacity of edge micro-datacenters, the bandwidth between different datacenters, and the proportion of private datasets, but also analyzes the performance of the algorithm during workflow execution. Comprehensive experiments in simulated heterogeneous edge-cloud computing environments demonstrate that the data placement strategy can effectively reduce data transmission time and placement cost compared with traditional strategies for data-sharing scientific workflows.

A Cost-Effective Strategy for Storing Scientific Datasets with Multiple Service Providers in the Cloud

Cost-Efficient VM Configuration Algorithm in the Cloud Using Mix Scaling Strategy

Cost-Efficient Provisioning Strategy for Multiple Services in Distributed Clouds

An Energy-Efficient Scheme for Cloud Resource Provisioning Based on CloudSim

To store or not: Online cost optimization for running big data jobs on the cloud

DCStore: A Deduplication-Based Cloud-of-Clouds Storage Service

A Novel Data Placement Strategy for Data-Sharing Scientific Workflows in Heterogeneous Edge-Cloud Computing Environments

Towards Optimizing Storage Costs on the Cloud

Security-aware intermediate data placement strategy in scientific cloud workflows

Data Placement for Multi-Tenant Data Federation on the Cloud

Optimal Data Placement for Data-Sharing Scientific Workflows in Heterogeneous Edge-Cloud Computing Environments

CHARM: A Cost-Efficient Multi-Cloud Data Hosting Scheme with High Availability

Saving Money for Analytical Workloads in the Cloud

Moving Big Data to the Cloud: An Online Cost-Minimizing Approach

Cost-minimizing Dynamic Migration of Content Distribution Services into Hybrid Clouds

Cutting Your Cloud Computing Cost for Deadline-Constrained Batch Jobs

Security-Aware Efficient Mass Distributed Storage Approach for Cloud Systems in Big Data

Dynamic Pricing for Resource Consumption in Cloud Service

Efficient Resource Scheduling for Big Data Processing in Cloud Platform