How Much Data is Needed for Channel Knowledge Map Construction?

Xiaoli Xu, Yong Zeng
2023-12-12
Abstract: Channel knowledge map (CKM) has been recently proposed to enable environment-aware communications by utilizing historical or simulation-generated wireless channel data. This paper studies the construction of one particular type of CKM, namely channel gain map (CGM), by using a finite number of measurements or simulation-generated data, with model-based spatial channel prediction. We try to answer the following question: How much data is sufficient for CKM construction? To this end, we first derive the average mean square error (AMSE) of the channel gain prediction as a function of the sample density of data collection for offline CGM construction, as well as the number of data points used for online spatial channel gain prediction. To model the spatial variation of the wireless environment even within each cell, we divide the CGM into subregions and estimate the channel parameters from the local data within each subregion. The parameter estimation error and the channel prediction error based on estimated channel parameters are derived as functions of the number of data points within the subregion. The analytical results provide useful guidance for CGM construction and utilization by determining the required spatial sample density for offline data collection and the number of data points to be used for online channel prediction, so that the desired level of channel prediction accuracy is guaranteed.
Information Theory, Signal Processing
What problem does this paper attempt to address?
This paper studies how to construct a channel gain map (CGM), one type of channel knowledge map (CKM), for environment-aware communication, predicting wireless channel gains from a finite number of measurements or simulation-generated data. The core question is how much data is needed to build a CGM that meets a target channel prediction accuracy. The paper first derives the average mean square error (AMSE) of channel gain prediction as a function of the spatial sample density used for offline data collection and of the number of data points used for online prediction. To account for spatial variation of the environment, the CGM is divided into subregions, and the channel parameters are estimated from the local data within each subregion. The analytical results guide CGM construction and utilization by determining the sample density required for offline data collection and the number of data points required for online prediction, so that the desired prediction accuracy is achieved. The paper also investigates how two data collection patterns, random and grid sample placement, affect prediction performance.
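The dependence of parameter estimation error on the number of data points in a subregion can be illustrated with a small simulation. The sketch below is not the paper's derivation. It assumes a log-distance path loss model with independent log-normal shadowing, ignoring the spatial correlation that a full CGM model would include. The parameter values (`beta_true`, `alpha_true`, `sigma_db`), the distance range, and the function names are all hypothetical choices made for illustration.

```python
import numpy as np


def fit_path_loss(distances_m, gains_db):
    """Least-squares fit of the assumed model gain_db = beta - 10*alpha*log10(d)."""
    design = np.column_stack(
        [np.ones_like(distances_m), -10.0 * np.log10(distances_m)]
    )
    coef, *_ = np.linalg.lstsq(design, gains_db, rcond=None)
    return coef[0], coef[1]  # (beta, alpha)


def mean_exponent_error(num_points, trials=200, seed=0):
    """Mean squared error of the estimated path loss exponent over random trials."""
    rng = np.random.default_rng(seed)
    # Hypothetical ground-truth parameters for one subregion (assumptions).
    beta_true, alpha_true, sigma_db = -30.0, 3.0, 6.0
    squared_errors = []
    for _ in range(trials):
        # Random sample locations, represented by their link distances in metres.
        d = rng.uniform(10.0, 500.0, size=num_points)
        # Independent shadowing in dB (spatial correlation is ignored here).
        shadowing = rng.normal(0.0, sigma_db, size=num_points)
        gains = beta_true - 10.0 * alpha_true * np.log10(d) + shadowing
        _, alpha_hat = fit_path_loss(d, gains)
        squared_errors.append((alpha_hat - alpha_true) ** 2)
    return float(np.mean(squared_errors))


if __name__ == "__main__":
    for n in (10, 40, 160):
        print(n, mean_exponent_error(n))
```

Under these assumptions, the estimation error shrinks as more data points are available in the subregion, which is the qualitative trade-off that the paper quantifies analytically.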