Importance-Driven Data Collection for Efficient Online Learning over the Wireless Edge

Nan Wang, Yinglei Teng, Gang Hu, F. Richard Yu
DOI: https://doi.org/10.1109/icc45041.2023.10278679
2023-01-01
Abstract: Online learning is widely used in real-time artificial intelligence (AI) applications to learn new classes from a dynamic environment. Although training AI models at the edge speeds up the processing of real-time data, learning efficiency is limited by the capacity of distributed data acquisition. Not all data samples are equally important, and random data selection does little to accelerate training because it spends resources on redundant data. In this paper, we present an importance-driven data collection framework that prioritizes important data to improve learning efficiency over the wireless edge. Specifically, we construct a novel model convergence metric (MCM) to evaluate data importance dynamically during model learning. Moreover, because limited network resources constrain learning efficiency, we formulate an MCM maximization problem over joint data collection, scheduling, and feeding in an edge computing system. We design a two-timescale hierarchical reinforcement learning (TTHRL) algorithm that decouples the original problem into two-level subproblems on two timescales: the top-level agent learns the data feeding strategy over the long term, and the low-level agent learns the data scheduling and collection strategy over the short term. Simulation results show that the proposed scheme outperforms the baseline schemes.
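The contrast between importance-driven and random data selection can be illustrated with a minimal sketch. It is not the paper's method: the abstract does not define the MCM, so the per-sample scores below are hypothetical stand-ins (for example, a current training loss), and the function names and the `budget` parameter are assumptions introduced only for illustration.

```python
import random


def select_by_importance(scores, budget):
    """Return the indices of the `budget` highest-scoring samples."""
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return ranked[:budget]


def select_at_random(scores, budget, rng):
    """Baseline: pick `budget` sample indices uniformly at random."""
    return rng.sample(range(len(scores)), budget)


# Hypothetical importance scores, one per candidate sample.
# These are NOT the paper's MCM values; they are placeholders.
scores = [0.05, 1.20, 0.10, 0.90, 0.02, 0.75]

# Assumed number of samples the edge link can carry in one round.
budget = 3

important = select_by_importance(scores, budget)
baseline = select_at_random(scores, budget, random.Random(0))

# Total importance collected under each strategy.
total_important = sum(scores[i] for i in important)
total_baseline = sum(scores[i] for i in baseline)
```

Under a fixed budget, the top-scoring selection never collects less total importance than a random draw of the same size, which is the intuition behind preferring important samples when acquisition capacity is limited.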