Optimizing task allocation in multi-query edge analytics

Anna-Valentini Michailidou,Christos Bellas,Anastasios Gounaris
DOI: https://doi.org/10.1007/s10586-024-04427-1
2024-04-12
Cluster Computing
Abstract:Edge analytics receives an ever-increasing interest since processing streaming data closer to where they are produced, rather than transferring them to the cloud, ensures lower latency while also addresses data privacy issues. In this work, we deal with the placement of analytic tasks to heterogeneous geo-distributed edge devices while targeting three objectives, namely latency, quality of results, and resource utilization. In addition, we investigate this multi-objective problem in a multi-query setting, where we jointly optimize multiple analytic jobs while dynamically adjusting task placement decisions. We explore multiple solutions that we thoroughly evaluate; interestingly, in a multi-query setting, all three objectives can be improved simultaneously by our proposals in many cases. Furthermore, we develop a proof-of-concept prototype using Apache Storm. Our solutions are thoroughly evaluated and shown to yield improvements by more than 50% compared to advanced baselines targeting only latency. Moreover, our software prototype managed to achieve speedups of up to 6 over the Resource Aware Apache Storm scheduler, with an average speedup of 2.76 , when deployed over a small-scale infrastructure.
computer science, information systems, theory & methods
What problem does this paper attempt to address?