Abstract:Dark silicon is the phenomenon that a fraction of many-core chip has to be turned off or run in a low-power state in order to maintain the safe chip temperature. System-level thermal management techniques normally map application on non-adjacent cores, while communication efficiency among these cores will be oppositely affected over conventional network-on-chip (NoC). Recently, SMART NoC architecture is proposed, enabling single-cycle multi-hop bypass channels to be built between distant cores at runtime, to reduce communication latency. However, communication efficiency of SMART NoC will be diminished by communication contention, which will in turn decrease system performance. In this paper, we first propose an Integer-Linear Programming (ILP) model to properly address communication problem, which generates the optimal solutions with the consideration of inter-processor communication. We further present a novel heuristic algorithm for task mapping in dark silicon many-core systems, called TopoMap, on top of SMART architecture, which can effectively solve communication contention problem in polynomial time. With fine-grained consideration of chip thermal reliability and inter-processor communication, presented approaches are able to control the reconfigurability of NoC communication topology in task mapping and scheduling. Thermal-safe system is guaranteed by physically decentralized active cores, and communication overhead is reduced by the minimized communication contention and maximized bypass routing. Performance evaluation on PARSEC shows the applicability and effectiveness of the proposed techniques, which achieve on average 42.5 and 32.4 percent improvement in communication and application performance, and 32.3 percent reduction in system energy consumption, compared with state-of-the-art techniques. TopoMap only introduces 1.8 percent performance difference compared to ILP model and is more scalable to large-size NoCs.

Global-view based Task Migration for Deep Learning Processor

A Deterministic Optimal Task Migration Algorithm Design in NoC-based Multi-Core System.

Performance Optimization of Many-Core Systems by Exploiting Task Migration and Dark Core Allocation

Adaptive Partitioning and Efficient Scheduling for Distributed DNN Training in Heterogeneous IoT Environment

Load-aware task migration algorithm toward adaptive load balancing in Edge Computing

Dependent Task Scheduling and Offloading for Minimizing Deadline Violation Ratio in Mobile Edge Computing Networks

A Bandwidth-Fair Migration-Enabled Task Offloading for Vehicular Edge Computing: a Deep Reinforcement Learning Approach

A Computational Resources Scheduling Algorithm in Edge Cloud Computing: from the Energy Efficiency of Users’ Perspective

Dependent Task Offloading in Edge Computing Using GNN and Deep Reinforcement Learning

Interference-aware parallelization for deep learning workload in GPU cluster

Joint Job Offloading and Resource Allocation for Distributed Deep Learning in Edge Computing.

Energy-Aware Non-Preemptive Task Scheduling with Deadline Constraint in DVFS-Enabled Heterogeneous Clusters

Task Migration for Energy Saving in Real-Time Multiprocessor Systems

Multi-mobile vehicles task offloading for vehicle-edge-cloud collaboration: A dependency-aware and deep reinforcement learning approach

Joint Optimization of Task Caching and Computation Offloading for Multiuser Multitasking in Mobile Edge Computing

Collaborative Optimization Strategy for Dependent Task Offloading in Vehicular Edge Computing

Optimizing Memory Access Traffic Via Runtime Thread Migration for On-Chip Distributed Memory Systems

Dependent Task Scheduling Using Parallel Deep Neural Networks in Mobile Edge Computing

Packet Triggered Prediction Based Task Migration for Network-on-Chip

Thermal-Aware Task Mapping on Dynamically Reconfigurable Network-on-Chip Based Multiprocessor System-on-Chip

DLTAP: A Network-efficient Scheduling Method for Distributed Deep Learning Workload in Containerized Cluster Environment