Abstract: Fault tolerance, energy management, and quality of service (QoS) are essential aspects of the design of real-time embedded systems. In this work, we explore methods that address all three issues simultaneously under standby-sparing. The standby-sparing mechanism adopts a dual-processor architecture in which each processor dynamically acts as the backup for the other. In this way it can tolerate both permanent and transient faults. Because it executes real-time jobs/tasks in duplicate, the energy consumption of a standby-sparing system can be quite high. To reduce energy under standby-sparing, we propose three novel scheduling schemes: the first is for (1, 1)-constrained tasks, and the second and third (which can be combined into an integrated approach to maximize the overall energy reduction) are for general (m, k)-constrained tasks, which require that among any k consecutive jobs of a task no more than (k − m) miss their deadlines. Through extensive evaluations and performance analysis, our results demonstrate that, compared with existing research, the proposed techniques can reduce energy by up to 11% for (1, 1)-constrained tasks and up to 25% for general (m, k)-constrained tasks while assuring (m, k)-constraints and fault tolerance and providing better user-perceived QoS levels under standby-sparing.
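
The (m, k)-constraint defined in the abstract can be illustrated with a small checker. This is a minimal sketch, not code from the paper: the function name `satisfies_mk` and the representation of job outcomes as a list of booleans are assumptions made for illustration. It slides a window of k consecutive jobs over the outcome sequence and verifies that no window contains more than (k − m) deadline misses.

```python
from typing import Sequence


def satisfies_mk(outcomes: Sequence[bool], m: int, k: int) -> bool:
    """Check an (m, k)-constraint over a sequence of job outcomes.

    outcomes[i] is True when job i met its deadline (hypothetical encoding).
    Returns True if every window of k consecutive jobs has at most
    (k - m) deadline misses.
    """
    if not 0 < m <= k:
        raise ValueError("require 0 < m <= k")

    allowed_misses = k - m
    n = len(outcomes)

    # Fewer than k jobs observed: treat the whole sequence as one partial window.
    if n < k:
        return sum(not ok for ok in outcomes) <= allowed_misses

    # Count misses in the first full window, then slide one job at a time.
    misses = sum(not ok for ok in outcomes[:k])
    if misses > allowed_misses:
        return False
    for i in range(k, n):
        misses += (not outcomes[i]) - (not outcomes[i - k])
        if misses > allowed_misses:
            return False
    return True
```

With m = k = 1 the check reduces to "no job may miss its deadline", which is the (1, 1)-constrained case mentioned above.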

What problem does this paper attempt to address?

The paper addresses fault tolerance, energy management, and Quality of Service (QoS) in real-time embedded systems that use the standby-sparing mechanism. Specifically, the research objectives include:

1. **Reducing energy consumption**: Proposing new scheduling strategies that lower the energy consumption of standby-sparing systems while preserving fault tolerance and QoS.
2. **Meeting (m,k)-constraints**: Ensuring that among any k consecutive jobs of a task, no more than (k − m) miss their deadlines, which maintains the required quality of service.
3. **Compatibility with fault tolerance**: The system must handle both permanent and transient faults and continue operating in their presence.

To achieve these objectives, the authors make the following contributions:

- For (1,1)-constrained tasks, an efficient scheduling scheme that reduces energy consumption.
- For general (m,k)-constrained tasks, two flexible and adaptive standby-sparing techniques that can be combined to maximize overall energy savings.

The main innovations in the paper include:

- Proposing new scheduling strategies that reduce energy consumption while ensuring fault tolerance and meeting (m,k)-constraints.
- Separating the execution of primary and backup tasks in time (executing primary tasks as early as possible and backup tasks as late as possible) so that once a primary task completes successfully, its backup can be canceled, thereby saving energy.
- Validating the proposed scheduling strategies through performance analysis and extensive evaluations on different task sets, with higher energy savings than existing methods (up to 11% for (1,1)-constrained tasks and up to 25% for general (m,k)-constrained tasks).

In summary, the paper addresses the key issue of energy management in real-time embedded systems by proposing new scheduling strategies while ensuring system fault tolerance and quality of service.
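
The energy benefit of running the primary early and the backup late can be sketched with a toy single-job model. This is an illustration under stated assumptions, not the paper's scheduling schemes: the function name `backup_energy_units` is hypothetical, power is assumed constant (one energy unit per time unit, no voltage scaling), the primary starts at time 0 and runs for its full worst-case execution time, the backup starts at the latest instant that still meets the deadline, and no other tasks interfere.

```python
def backup_energy_units(wcet: int, deadline: int, primary_ok: bool = True) -> int:
    """Energy (in time units at unit power) spent by the backup copy of one job.

    Assumptions (illustrative only): the primary runs in [0, wcet] on one
    processor; the backup is delayed to start at (deadline - wcet) on the
    other processor and is canceled as soon as the primary succeeds.
    """
    if wcet <= 0 or wcet > deadline:
        raise ValueError("require 0 < wcet <= deadline")

    # Latest start time at which the backup can still finish by the deadline.
    backup_start = deadline - wcet

    if not primary_ok:
        # Primary failed: the backup has to run to completion.
        return wcet

    # Primary succeeded at time wcet: the backup only consumed energy for
    # the part of its window that overlapped with the primary's execution.
    return max(0, wcet - backup_start)
```

In this model a job with `wcet=2` and `deadline=10` never starts its backup when the primary succeeds, while a job with `wcet=6` and `deadline=10` still spends 2 units on the overlapping part of the backup. The larger the slack between execution time and deadline, the more of the duplicate execution can be avoided.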

Energy Management for Fault-Tolerant (m,k)-Constrained Real-Time Systems that Use Standby-Sparing

An Efficient Fault-Tolerant Scheduling Algorithm for Periodic Real-Time Tasks in Heterogeneous Platforms

Research on Joint Optimal Scheduling of Task Energy Efficiency and Reliability in Heterogeneous Multiprocessor Real-time System

Energy-Efficient Fault-Tolerant Mapping and Scheduling on Heterogeneous Multiprocessor Real-Time Systems

Fault-tolerant real-time tasks scheduling with dynamic fault handling

Dynamic management of real time QoS according to (m, k)-firm

Scheduling Co-Design for Reliability and Energy in Cyber-Physical Systems

Efficient Scheduling Algorithm for Hard Real-Time Tasks in Primary-Backup Based Multiprocessor Systems

Feedback Scheduling of Real-Time Control Tasks in Power-Aware Embedded Systems

Expected Energy Optimization for Real-Time Multiprocessor SoCs Running Periodic Tasks with Uncertain Execution Time

Energy-Efficient Scheduling Algorithms for Real-Time Parallel Applications on Heterogeneous Distributed Embedded Systems

Online Energy-Aware Scheduling for Deadline-Constrained Applications in Distributed Heterogeneous Systems

Energy-efficient scheduling for parallel applications with reliability and time constraints on heterogeneous distributed systems

Energy-Efficient Scheduling of Periodic Applications on Safety-Critical Time-Triggered Multiprocessor Systems

METSM: Multiobjective energy-efficient task scheduling model and method for an edge heterogeneous multiprocessor system

Energy efficient dynamic scheduling of dependent tasks for multi‐core real‐time systems using delay techniques

Energy-aware scheduling under reliability and makespan constraints

Energy efficient EDF-VD-based mixed-criticality scheduling with shared resources

Fault Tolerance in Real Time Multiprocessors - Embedded Systems

Energy-Minimized Scheduling of Real-Time Parallel Workflows on Heterogeneous Distributed Computing Systems

Energy Efficient Real-Time Task Scheduling for Embedded Systems with Hybrid Main Memory