Abstract: Low-latency data service is an increasingly critical challenge for data center applications. In modern distributed storage systems, proper data placement reduces data movement delay, which can contribute substantially to lower service latency. Existing data placement solutions often assume a prior distribution of data requests or discover it via trace analysis. However, data placement is a difficult online decision-making problem under dynamic network conditions and time-varying user request patterns, and conventional static model-based solutions handle such dynamic systems poorly. Considering data movement and analytical latency together, we develop DataBot+, a reinforcement learning-based framework that automatically learns the optimal placement policies. DataBot+ adopts neural networks, trained with a variant of Q-learning, whose input is the real-time data flow measurements and whose output is a value function estimating the near-future latency. For instantaneous decision making, DataBot+ is decoupled into two asynchronous components, production and training, so that the training delay does not introduce extra overhead when handling data flows. Evaluation results driven by real-world traces demonstrate the effectiveness of our design.
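The decoupling of production and training described in the abstract can be illustrated with a minimal sketch. It is not the DataBot+ implementation: a lookup table stands in for the neural network, latency is treated as a cost to minimize, and the names `PlacementAgent`, `place`, `record`, and `train` are hypothetical and chosen only for illustration.

```python
import random
from collections import deque


class PlacementAgent:
    """Toy tabular stand-in (assumption) for a neural value function."""

    def __init__(self, n_nodes, lr=0.1, gamma=0.9, epsilon=0.1, seed=0):
        self.n_nodes = n_nodes
        self.lr = lr              # learning rate
        self.gamma = gamma        # discount on future latency
        self.epsilon = epsilon    # exploration probability
        self.q = {}               # (state, node) -> estimated latency cost
        self.replay = deque(maxlen=10_000)  # transitions logged by production
        self.rng = random.Random(seed)

    def value(self, state, node):
        """Estimated near-future latency of placing data on `node`."""
        return self.q.get((state, node), 0.0)

    def place(self, state):
        """Production path: choose a node; no learning happens here."""
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n_nodes)
        return min(range(self.n_nodes), key=lambda n: self.value(state, n))

    def record(self, state, node, latency, next_state):
        """Production path: log the observed transition for later training."""
        self.replay.append((state, node, latency, next_state))

    def train(self, batch_size=32):
        """Training path: Q-learning updates on sampled past transitions.

        Intended to run separately from place(), so that update cost
        does not delay placement decisions.
        """
        if not self.replay:
            return
        k = min(batch_size, len(self.replay))
        for state, node, latency, next_state in self.rng.sample(list(self.replay), k):
            # Cost-minimizing target: observed latency plus the discounted
            # best (lowest) estimated latency from the next state.
            best_next = min(self.value(next_state, n) for n in range(self.n_nodes))
            target = latency + self.gamma * best_next
            old = self.value(state, node)
            self.q[(state, node)] = old + self.lr * (target - old)
```

In this sketch the production side calls only `place` and `record`, while `train` can be scheduled on a separate thread or process. The state encoding, the cost signal, and the use of a replay buffer are all assumptions, since the abstract does not specify them.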
