Rui Wang, Vincent K. N. Lau, Ying Cui
Abstract: In this paper, we consider a queue-aware distributive resource control algorithm for two-hop MIMO cooperative systems. We shall illustrate that relay buffering is an effective way to reduce the intrinsic half-duplex penalty in cooperative systems. The complex interactions of the queues at the source node and the relays are modeled as an average-cost infinite horizon Markov Decision Process (MDP). The traditional approach to solving this MDP problem involves centralized control with huge complexity. To obtain a distributive and low complexity solution, we introduce a linear structure which approximates the value function of the associated Bellman equation by the sum of per-node value functions. We derive a distributive two-stage two-winner auction-based control policy which is a function of the local CSI and local QSI only. Furthermore, to estimate the best fit approximation parameter, we propose a distributive online stochastic learning algorithm using stochastic approximation theory. Finally, we establish technical conditions for almost-sure convergence and show that under heavy traffic, the proposed low complexity distributive control is globally optimal.
What problem does this paper attempt to address?
The main problem this paper addresses is **how to design a queue-aware distributed resource control algorithm for a two-hop MIMO cooperative system that optimizes latency performance and reduces the inherent penalty of half-duplex relay nodes**. Specifically, the researchers seek to minimize end-to-end latency while preserving throughput performance, using a scheme that remains distributed and low-complexity in practical applications.
### Specific Background and Challenges of the Problem
1. **Half-duplex Penalty Problem**: In a cooperative communication system, relay nodes are usually half-duplex: they cannot send and receive signals at the same time, which reduces transmission efficiency. Traditional solutions mainly focus on physical-layer signal processing (such as echo cancellation techniques), but these methods are difficult to extend to general relay channels.
2. **Complexity of Delay-Optimal Resource Control**: Optimizing latency requires accounting for the dynamic interaction between the buffers at the source node and at multiple relay nodes. This can be modeled as a Markov decision process (MDP), but the state space is so large that solving the MDP directly is very difficult. Traditional methods usually assume infinite data backlog and focus only on throughput optimization.
3. **Challenge of Distributed Implementation**: Distributed control must avoid relying on global channel state information (CSI) and global queue state information (QSI). However, traditional MDP solution methods usually rely on global information, which makes distributed implementation in practical systems difficult.
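To give a rough sense of why the joint state space is a problem, the following is a minimal sketch of the arithmetic. It assumes one source plus `num_relays` relays, each with a finite buffer holding between 0 and `buffer_size` packets. The function names and the finite-buffer model are illustrative assumptions, not taken from the paper, and the count covers queue states only; including channel states would enlarge it further.

```python
def joint_queue_states(num_relays: int, buffer_size: int) -> int:
    """Number of joint queue states a centralized controller would track.

    Assumption: one source plus num_relays relays, each buffer holding
    0..buffer_size packets, so every node has (buffer_size + 1) states.
    """
    nodes = num_relays + 1
    return (buffer_size + 1) ** nodes


def per_node_queue_states(num_relays: int, buffer_size: int) -> int:
    """Total table entries if each node keeps its own per-node value table."""
    nodes = num_relays + 1
    return nodes * (buffer_size + 1)


if __name__ == "__main__":
    # Exponential growth in the joint case versus linear growth per node.
    for num_relays in (2, 4, 8):
        joint = joint_queue_states(num_relays, buffer_size=10)
        local = per_node_queue_states(num_relays, buffer_size=10)
        print(f"relays={num_relays}: joint={joint}, per-node total={local}")
```

The joint count grows exponentially with the number of nodes, while the per-node total grows only linearly, which is the motivation for approximating a joint value function by a sum of per-node terms.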
### Solutions Proposed in the Paper
To solve the above problems, the paper proposes the following innovations:
1. **MDP-based Delay-Optimal Resource Control Model**: The delay-optimal resource control problem is modeled as an average-cost infinite-horizon MDP. To reduce complexity, the value function of the associated Bellman equation is approximated by the sum of per-node value functions.
2. **Distributed Two-stage Auction Mechanism**: A distributed two-stage, two-winner auction mechanism, driven only by local CSI and local QSI, selects relay nodes and allocates data streams. Each node acts on its own local information, without the need for global information.
3. **Online Stochastic Learning Algorithm**: A distributed online stochastic learning algorithm, based on stochastic approximation theory, estimates the best-fit approximation parameters. The algorithm improves the control policy iteratively without relying on global information.
4. **Technical Conditions for Almost-Sure Convergence**: The paper establishes technical conditions for almost-sure convergence and shows that, under heavy traffic, the proposed low-complexity distributed control is globally optimal.
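Two generic building blocks behind the ideas above can be sketched as follows. This is not the paper's algorithm. `top_two_bidders` only shows what "two-winner" selection means once each node has submitted a scalar bid; the bids are passed in as plain numbers because the paper's bid formula is not given in this summary. `stochastic_approximation` is a textbook Robbins-Monro iteration with step size 1/n, shown here estimating a mean from noisy samples, whereas the paper applies stochastic approximation to per-node value-function parameters. All names are hypothetical.

```python
import heapq


def top_two_bidders(bids: dict[str, float]) -> list[str]:
    """Return the two nodes with the highest bids, highest first.

    Assumption: each node has already turned its local CSI and QSI into a
    single scalar bid; how that bid is computed is not modeled here.
    """
    ranked = heapq.nlargest(2, bids.items(), key=lambda item: item[1])
    return [node for node, _ in ranked]


def stochastic_approximation(samples, initial: float = 0.0) -> float:
    """Robbins-Monro style update: estimate += step * (sample - estimate).

    With step = 1/n the step sizes sum to infinity while their squares sum
    to a finite value, the usual condition for almost-sure convergence.
    For this particular step size the estimate equals the running mean.
    """
    estimate = initial
    for n, sample in enumerate(samples, start=1):
        step = 1.0 / n
        estimate += step * (sample - estimate)
    return estimate


if __name__ == "__main__":
    # Hypothetical bids from three relays.
    print(top_two_bidders({"relay1": 0.4, "relay2": 1.3, "relay3": 0.9}))
    # Noisy observations of an unknown quantity, processed one at a time.
    print(stochastic_approximation([2.0, 4.0, 6.0]))
```

The online update needs only the current estimate and the newest sample, which is why this family of methods suits a node that learns from its own local observations.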
### Summary
With these methods, the paper addresses the latency and complexity problems associated with half-duplex relay nodes and provides a distributed, low-complexity resource control scheme suited to practical two-hop MIMO cooperative systems.