Linwei Niu,Danda B. Rawat,Dakai Zhu,Jonathan Musselwhite,Zonghua Gu,Qingxu Deng
Abstract:Fault tolerance, energy management, and quality of service (QoS) are essential aspects for the design of real-time embedded systems. In this work, we focus on exploring methods that can simultaneously address the above three critical issues under standby-sparing. The standby-sparing mechanism adopts a dual-processor architecture in which each processor plays the role of the backup for the other one dynamically. In this way it can provide fault tolerance subject to both permanent and transient faults. Due to its duplicate executions of the real-time jobs/tasks, the energy consumption of a standby-sparing system could be quite high. With the purpose of reducing energy under standby-sparing, we proposed three novel scheduling schemes: the first one is for (1, 1)-constrained tasks, and the second one and the third one (which can be combined into an integrated approach to maximize the overall energy reduction) are for general ( m , k )-constrained tasks which require that among any k consecutive jobs of a task no more than ( k − m ) out of them could miss their deadlines. Through extensive evaluations and performance analysis, our results demonstrate that compared with the existing research, the proposed techniques can reduce energy by up to 11% for (1, 1)-constrained tasks and 25% for general ( m , k )-constrained tasks while assuring ( m , k )-constraints and fault tolerance as well as providing better user perceived QoS levels under standby-sparing.
What problem does this paper attempt to address?
The paper primarily focuses on addressing the issues of fault tolerance, energy management, and Quality of Service (QoS) in real-time embedded systems, particularly in the context of using standby-sparing mechanisms. Specifically, the research objectives include:
1. **Reducing energy consumption**: Proposing new scheduling strategies to lower the energy consumption of systems employing standby-sparing mechanisms while ensuring fault tolerance and QoS.
2. **Meeting (m,k)-constraints**: Ensuring that in any consecutive m task instances, no more than k task instances miss their deadlines, which helps in maintaining the quality of service.
3. **Compatibility with fault tolerance**: The system design needs to handle both permanent and transient faults and continue operating in the presence of these faults.
To achieve the above objectives, the authors propose the following contributions:
- For (1,1)-constraint tasks, an efficient scheduling scheme is proposed to reduce energy consumption.
- For general (m,k)-constraint tasks, two flexible and adaptive standby-sparing techniques are proposed, which can be combined to maximize overall energy savings.
The main innovations in the paper include:
- Proposing new scheduling strategies to effectively reduce energy consumption while ensuring fault tolerance and meeting (m,k)-constraints.
- Optimizing the execution times of primary and backup tasks (i.e., executing primary tasks as early as possible and backup tasks as late as possible) so that once a primary task completes successfully, the backup task can be preemptively canceled, thereby saving energy.
- The effectiveness of the proposed scheduling strategies is validated through theoretical analysis and experimental verification for different task sets, achieving higher energy savings compared to existing methods for different types of (m,k)-constraint tasks (e.g., saving up to 11% for (1,1)-constraint tasks and up to 25% for general (m,k)-constraint tasks).
In summary, the paper addresses the key issue of energy management in real-time embedded systems by proposing new scheduling strategies while ensuring system fault tolerance and quality of service.