Abstract:Pre-trained Transformers, through in-context learning (ICL), have demonstrated exceptional capabilities to adapt to new tasks using example prompts \textit{without model update}. Transformer-based wireless receivers, where prompts consist of the pilot data in the form of transmitted and received signal pairs, have shown high estimation accuracy when pilot data are abundant. However, pilot information is often costly and limited in practice. In this work, we propose the \underline{DE}cision \underline{F}eedback \underline{IN}-Cont\underline{E}xt \underline{D}etection (DEFINED) solution as a new wireless receiver design, which bypasses channel estimation and directly performs symbol detection using the (sometimes extremely) limited pilot data. The key innovation in DEFINED is the proposed decision feedback mechanism in ICL, where we sequentially incorporate the detected symbols into the prompts to improve the detections for subsequent symbols. Extensive experiments across a broad range of wireless communication settings demonstrate that DEFINED achieves significant performance improvements, in some cases only needing a single pilot pair.
What problem does this paper attempt to address?
### What problem does this paper attempt to solve?
This paper aims to solve the problem of symbol detection in wireless communication systems, especially in the case of limited pilot data. Specifically, traditional symbol detection methods are usually divided into two steps: first, estimate the channel through pilot signals, and then perform symbol detection based on the estimated channel. However, this method faces the following challenges in practical applications:
1. **High computational complexity**: Traditional methods such as the minimum mean - square - error (MMSE) estimator require complex matrix operations, resulting in a large computational burden.
2. **Dependence on high - quality channel estimation**: The quality of channel estimation directly affects the accuracy of symbol detection, and in the case of high noise or limited pilot data, the accuracy of channel estimation will decline.
3. **High cost and limited pilot data**: In practical applications, the cost of obtaining a large amount of pilot data is high, and in some scenarios, it may not be possible to obtain sufficient pilot data.
To solve these problems, the author proposes a new method named **DEcisiON FEEDback IN - ContE xt Detection (DEFINED)**. The main innovations of this method include:
- **Direct symbol detection**: Bypass the channel estimation step and directly use limited pilot data for symbol detection.
- **Decision - feedback mechanism**: By gradually adding the detected symbols to the prompt, the detection effect of subsequent symbols is improved.
- **Applicable to low - data - volume scenarios**: Even in the case of only one pilot pair, the detection performance can be significantly improved.
Through these improvements, the DEFINED method not only performs well in the case of limited pilot data, but also can maintain high accuracy when there is sufficient pilot data. Experimental results show that this method has good robustness and flexibility under various modulation schemes and signal - to - noise ratio conditions.
### Formula summary
1. **Received signal model**:
\[
y_t = H_t x_t + z_t
\]
where \( y_t \) is the received signal, \( H_t \) is the channel matrix, \( x_t \) is the transmitted symbol, and \( z_t \) is the additive white Gaussian noise (AWGN).
2. **Channel estimation** (traditional method):
\[
\hat{H}=(X^H X + \sigma^2 I)^{-1} X^H Y
\]
where \( X \) is the pilot matrix, \( Y \) is the received signal matrix, and \( \sigma^2 \) is the noise variance.
3. **Symbol detection** (traditional method):
\[
\hat{x}_t=\arg \min_{x \in X}\|\hat{H} x - y_t\|^2
\]
4. **Expected symbol error rate (SER)**:
\[
\text{SER}_k(\theta)=E_\tau E_{x,y|\tau}[f_\theta(D_k^\tau, y_t)\neq x_t]
\]
5. **Loss function** (ICL training):
\[
L_{\text{ICL}}(\theta)=\frac{1}{NT}\sum_{i = 1}^N\sum_{t = 1}^T\text{loss}(f_\theta(S_{\text{ICL}, t, i}), x_{t, i})
\]
6. **Loss function** (DF training):
\[
L_{\text{DF}}(\theta)=\frac{1}{NT}\sum_{i = 1}^N\sum_{t = k + 1}^T\text{loss}(f_\theta(S_{\text{DF}, t, i}), x_{t, i})
\]
7. **Fine - tuning loss function**:
\[
L_{\text{