Abstract:The recent breakthroughs in machine learning (ML) and deep learning (DL) have catalyzed the design and development of various intelligent systems over wide application domains. While most existing machine learning models require large memory and computing power, efforts have been made to deploy some models on resource-constrained devices as well. A majority of the early application systems focused on exploiting the inference capabilities of ML and DL models, where data captured from different mobile and embedded sensing components are processed through these models for application goals such as classification and segmentation. More recently, the concept of exploiting the mobile and embedded computing resources for ML/DL model training has gained attention, as such capabilities allow (i) the training of models via local data without the need to share data over wireless links, thus enabling privacy-preserving computation by design, (ii) model personalization and environment adaptation, and (ii) deployment of accurate models in remote and hardly accessible locations without stable internet connectivity. This work targets to summarize and analyze state-of-the-art systems research that allows such on-device model training capabilities and provide a survey of on-device training from a systems perspective.

What problem does this paper attempt to address?

### Problems the Paper Attempts to Solve The paper "On-device Training: A First Overview on Existing Systems" aims to address the challenges of training machine learning (ML) and deep learning (DL) models on resource-constrained devices. Specifically, the paper focuses on the following aspects: 1. **Privacy Protection**: By training models on local devices, it avoids uploading data to the cloud, thereby protecting user data privacy. 2. **Personalization and Environmental Adaptation**: Local training allows devices to adjust models based on specific user usage or environmental changes, improving model accuracy and applicability. 3. **Applications in Remote and Low Connectivity Scenarios**: In the absence of stable internet connections, local training ensures model updates and optimizations, suitable for applications in remote areas such as healthcare and environmental monitoring. ### Main Challenges The paper points out the main technical challenges in achieving on-device training: 1. **Hardware Resource Mismatch**: IoT devices typically have limited hardware resources, such as memory capacity ranging from a few KB to a few MB, while model training requires substantial computational and memory resources. 2. **High Heterogeneity**: IoT devices vary widely, from microcontrollers to single-board computers, with significant differences in capabilities, making it difficult to propose a universal solution. 3. **Limitations of Existing Work**: Most existing model training optimization algorithms focus on improving accuracy or accelerating convergence, neglecting the resource constraints of edge devices, such as memory and battery life. ### Research Scope The paper summarizes and analyzes the current state-of-the-art on-device training systems, covering various resource-constrained devices from modern mobile platforms to microcontrollers. Additionally, the paper discusses system design on smartphones but excludes the NVIDIA Jetson platform. The paper primarily focuses on single-device training, excluding collaborative model training (e.g., federated learning). ### Summary By reviewing existing systems, the paper aims to provide researchers and developers with a comprehensive perspective on the current state and future directions of on-device training, thereby promoting further research and application in this field.

On-device Training: A First Overview on Existing Systems

Deep Learning on Mobile and Embedded Devices: State-of-the-art, Challenges, and Future Directions

Close the Gap Between Deep Learning and Mobile Intelligence by Incorporating Training in the Loop

Explore Training of Deep Convolutional Neural Networks on Battery-powered Mobile Devices: Design and Application

On-Device Machine Learning: An Algorithms and Learning Theory Perspective

On-Device Deep Learning for Mobile and Wearable Sensing Applications: A Review

Towards Ubiquitous Learning

Deep Learning on Mobile and Embedded Devices

A Survey of Deep Learning on Mobile Devices: Applications, Optimizations, Challenges, and Research Opportunities

A New Frontier of AI: On-Device AI Training and Personalization

Breaking On-device Training Memory Wall: A Systematic Survey

Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications

Deep Learning on Mobile Devices - A Review

On-Device Training Under 256KB Memory

Understanding Sensor Data Using Deep Learning Methods on Resource-Constrained Edge Devices.

On-device Online Learning and Semantic Management of TinyML Systems

An Overview of Machine Learning within Embedded and Mobile Devices–Optimizations and Applications

Performance Analysis and Characterization of Training Deep Learning Models on Mobile Devices

A survey on deploying mobile deep learning applications: A systemic and technical perspective

A Review on On Device Privacy and Machine Learning Training