Abstract:Mobile applications can leverage high-quality deep learning models such as convolutional neural networks and deep neural networks to provide high-performance cognitive services. Prior work on deep learning models-based mobile applications in a cloud-edge computing environment focuses on performing lightweight data pre-processing tasks on edge servers for cloud-hosted cognitive servers. These approaches have two major limitations. First, it is uneasy for the mobile applications to assure satisfactory user experience in terms of network communication delay, because the intermediary edge servers are used only to pre-process data (e.g., images and videos) and the cloud servers are used to complete the tasks. Second, these approaches assume the pre-trained deep learning models deployed on cloud servers are static, and will not attempt to automatically upgrade in a context-aware manner. In this article, we propose a cloud-edge collaboration framework that facilitates delivering cognitive services with long-lasting, fast response, and high accuracy properties. We fist deploy a shallow model (i.e., EdgeCNN) on the edge server and a deep model (i.e., CloudCNN) on the cloud server. EdgeCNN can provide durable and rapid response cognitive services, because edge servers not only provide computing resources for mobile applications, but also close to users. Then, we enable CloudCNN to assist in training EdgeCNN to improve the performance of the latter. Thus, EdgeCNN also provides high-accuracy cognitive services. Furthermore, because users may continue to upload data to edge servers in real-world scenarios, we propose to use the ongoing assistance of CloudCNN to further improve the accuracy of the shallow model. Experimental results show that EdgeCNN can reduce the average response time of cognitive services by up to 55.08 percent and improve accuracy by up to 26.70 percent.

Research on Cloud Side Collaboration Architecture and Lightweight Model of Distribution Network

Extendable Multi-Device Collaborative Pipeline Parallel Inference in the Edge-Cloud Scenario

Research on Big Data Processing Model of Edge-Cloud Collaboration in Cyber Physical Systems

A Lightweight Collaborative Deep Neural Network for the Mobile Web in Edge Cloud

Context-Aware Deep Model Compression for Edge Cloud Computing

Collaborative DNNs Inference with Joint Model Partition and Compression in Mobile Edge-Cloud Computing Networks

A Distributed Hierarchical Deep Computation Model for Federated Learning in Edge Computing

Efficient Federated Learning for Cloud-Based AIoT Applications

Distributed hierarchical deep optimization for federated learning in mobile edge computing

FLEE: A Hierarchical Federated Learning Framework for Distributed Deep Neural Network over Cloud, Edge, and End Device

End-Edge-Cloud Collaborative Computing for Deep Learning: A Comprehensive Survey

A Cloud-Edge Collaboration Framework for Cognitive Service.

Multi-Compression Scale DNN Inference Acceleration based on Cloud-Edge-End Collaboration

DECC: Delay-Aware Edge-Cloud Collaboration for Accelerating DNN Inference

Edge-Cloud Cooperation for DNN Inference Via Reinforcement Learning and Supervised Learning

DDPQN: An Efficient DNN Offloading Strategy in Local-Edge-Cloud Collaborative Environments

Accelerating DNN Inference by Edge-Cloud Collaboration

Adaptive Deep Inference Framework for Cloud-Edge Collaboration

Lightweight Network Research Based on Deep Learning

Memory- and Communication-Aware Model Compression for Distributed Deep Learning Inference on IoT

Online Learning for Orchestration of Inference in Multi-User End-Edge-Cloud Networks