Dynamic Spectrum Access in Time-varying Environment: Distributed Learning Beyond Expectation Optimization

Yuhua Xu,Jinlong Wang,Qihui Wu,Jianchao Zheng,Liang Shen,Alagan Anpalagan
DOI: https://doi.org/10.48550/arXiv.1502.06672
2017-07-28
Abstract:This article investigates the problem of dynamic spectrum access for canonical wireless networks, in which the channel states are time-varying. In the most existing work, the commonly used optimization objective is to maximize the expectation of a certain metric (e.g., throughput or achievable rate). However, it is realized that expectation alone is not enough since some applications are sensitive to fluctuations. Effective capacity is a promising metric for time-varying service process since it characterizes the packet delay violating probability (regarded as an important statistical QoS index), by taking into account not only the expectation but also other high-order statistic. Therefore, we formulate the interactions among the users in the time-varying environment as a non-cooperative game, in which the utility function is defined as the achieved effective capacity. We prove that it is an ordinal potential game which has at least one pure strategy Nash equilibrium. Based on an approximated utility function, we propose a multi-agent learning algorithm which is proved to achieve stable solutions with dynamic and incomplete information constraints. The convergence of the proposed learning algorithm is verified by simulation results. Also, it is shown that the proposed multi-agent learning algorithm achieves satisfactory performance.
Information Theory,Computer Science and Game Theory
What problem does this paper attempt to address?