Review of Inference Time Prediction Approaches of DNN: Emphasis on Service Robots with Cloud-Edge-device Architecture

Tian Xiang, Qiwei Meng, Ji Zhang, Beibei Zhang, Wei Song, Anhuan Xie, Jason Gu
DOI: https://doi.org/10.1109/robio58561.2023.10354654
2023-01-01
Abstract: In recent years, the global robot market has grown substantially, particularly in the domain of service robots. Despite their expanding presence, service robots remain limited when operating autonomously in unstructured environments, primarily because of their constrained computational capacity. Combining cloud and edge computing resources is therefore needed to speed up task inference and improve scenario perception. Cloud-edge-device architectures hold significant promise for improving the operational efficiency of robots: complex robotic tasks are dynamically partitioned and executed collaboratively across cloud, edge, and device resources. Deep neural network (DNN) models play a pivotal role in a wide range of these robotic tasks. The inference time of each layer of a DNN model in actual deployment is a critical determinant of the model’s partitioning strategy. It is also an important metric of the model’s suitability for a specific hardware platform. This article presents an overview of recent advances in predicting the inference and training time of DNN models, summarizes the related methods, and discusses the open challenges and directions for future research in this field.