Lightweight Neural Network with Knowledge Distillation for CSI Feedback

Yiming Cui,Jiajia Guo,Zheng Cao,Huaze Tang,Chao-Kai Wen,Shi Jin,Xin Wang,Xiaolin Hou
2024-03-03
Abstract:Deep learning has shown promise in enhancing channel state information (CSI) feedback. However, many studies indicate that better feedback performance often accompanies higher computational complexity. Pursuing better performance-complexity tradeoffs is crucial to facilitate practical deployment, especially on computation-limited devices, which may have to use lightweight autoencoder with unfavorable performance. To achieve this goal, this paper introduces knowledge distillation (KD) to achieve better tradeoffs, where knowledge from a complicated teacher autoencoder is transferred to a lightweight student autoencoder for performance improvement. Specifically, two methods are proposed for implementation. Firstly, an autoencoder KD-based method is introduced by training a student autoencoder to mimic the reconstructed CSI of a pretrained teacher autoencoder. Secondly, an encoder KD-based method is proposed to reduce training overhead by performing KD only on the student encoder. Additionally, a variant of encoder KD is introduced to protect user equipment and base station vendor intellectual property. Numerical simulations demonstrate that the proposed methods can significantly improve the student autoencoder's performance, while reducing the number of floating point operations and inference time to 3.05%-5.28% and 13.80%-14.76% of the teacher network, respectively. Furthermore, the variant encoder KD method effectively enhances the student autoencoder's generalization capability across different scenarios, environments, and bandwidths.
Signal Processing
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to improve the performance of CSI (Channel State Information) feedback while maintaining low computational complexity in large - scale MIMO systems. Specifically, the paper focuses on implementing an efficient and high - performance CSI feedback mechanism on devices with limited computational resources. Traditional CSI feedback methods, such as codebook - based methods and compressed - sensing - based methods, are effective to a certain extent, but have problems such as high computational complexity or strong dependence on channel sparsity. Although deep - learning - based methods can significantly improve CSI feedback performance, they are usually accompanied by high computational complexity, which limits their application on devices with limited computational resources. To solve this problem, the paper introduces the Knowledge Distillation (KD) technique to achieve a better performance - complexity trade - off by transferring the knowledge of a complex teacher auto - encoder to a lightweight student auto - encoder. The paper proposes two KD - based methods: 1. **Auto - encoder KD method**: Train a lightweight student auto - encoder to imitate the CSI reconstructed by a pre - trained complex teacher auto - encoder. 2. **Encoder KD method**: Perform knowledge distillation only on the student encoder to reduce training overhead and directly use the teacher decoder to avoid training the decoder from scratch. In addition, a variant of the encoder KD method is also proposed to protect the intellectual property rights of user devices and base station suppliers and avoid transmitting the encoder architecture between the base station and user devices. Through these methods, the paper aims to significantly improve the performance of lightweight auto - encoders while greatly reducing the number of floating - point operations and inference time, making it more suitable for deployment on devices with limited computational resources.