Coded Downlink Massive Random Access and a Finite de Finetti Theorem

Ryan Song,Kareem M. Attiah,Wei Yu
2024-05-14
Abstract:This paper considers a massive connectivity setting in which a base-station (BS) aims to communicate sources $(X_1,\cdots,X_k)$ to a randomly activated subset of $k$ users, among a large pool of $n$ users, via a common downlink message. Although the identities of the $k$ active users are assumed to be known at the BS, each active user only knows whether itself is active and does not know the identities of the other active users. A naive coding strategy is to transmit the sources alongside the identities of the users for which the source information is intended, which would require $H(X_1,\cdots,X_k) + k\log(n)$ bits, because the cost of specifying the identity of a user is $\log(n)$ bits. For large $n$, this overhead can be significant. This paper shows that it is possible to develop coding techniques that eliminate the dependency of the overhead on $n$, if the source distribution follows certain symmetry. Specifically, if the source distribution is independent and identically distributed (i.i.d.) then the overhead can be reduced to at most $O(\log(k))$ bits, and in case of uniform i.i.d. sources, the overhead can be further reduced to $O(1)$ bits. For sources that follow a more general exchangeable distribution, the overhead is at most $O(k)$ bits, and in case of finite-alphabet exchangeable sources, the overhead can be further reduced to $O(\log(k))$ bits. The downlink massive random access problem is closely connected to the study of finite exchangeable sequences. The proposed coding strategy allows bounds on the relative entropy distance between finite exchangeable distributions and i.i.d. mixture distributions to be developed, and gives a new relative entropy version of the finite de Finetti theorem which is scaling optimal.
Information Theory
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how a base station (BS) can effectively broadcast messages to a group of randomly activated users in a large - scale random access scenario while minimizing the bit rate of the broadcast messages. Specifically, when there is a large pool of potential users (with a total number of \(n\)), the base station needs to send information sources \((X_1,\ldots,X_k)\) to a small group of randomly activated users among them (with a total number of \(k\), and \(k \ll n\)). Each activated user only knows its own activation state and does not know the identities of other activated users. ### Problem Background In traditional communication protocols, it is usually divided into two stages: 1. **Activity Detection Stage**: Activated users send pilot signals to the base station, and the base station determines which users are activated through activity detection. 2. **Information Transmission Stage**: The base station broadcasts information sources to these activated users, and each user needs to recover its corresponding information from the broadcast message. In the information transmission stage, a simple coding strategy is that the base station not only sends the information sources but also sends the user identities corresponding to each information source. The total cost of this strategy is \(H(X_1,\ldots,X_k)+k\log(n)\) bits, where \(H(X_1,\ldots,X_k)\) is the entropy of the information sources, and \(k\log(n)\) is the cost required to identify each user. When \(n\) is very large, this part of the cost will become very significant. ### Core Problem of the Paper The main contribution of the paper is to propose a new coding strategy that can eliminate the dependence on \(n\) when the information sources meet certain symmetry conditions (such as independent and identically distributed or exchangeable distributions), thereby significantly reducing the cost. Specifically, the paper shows that: - If the information sources are independent and identically distributed (i.i.d.), the cost can be reduced to at most \(O(\log(k))\) bits. - If the information sources are uniformly i.i.d., the cost can be further reduced to \(O(1)\) bits. - For more general exchangeable distributions, the cost is at most \(O(k)\) bits. - For exchangeable distributions of a finite alphabet, the cost can be further reduced to \(O(\log(k))\) bits. ### Main Technical Contributions 1. **Coding Strategy**: The paper proposes a method of encoding using codebooks generated based on i.i.d. mixed distributions. Through this method, the rate of the common message can be controlled close to \(H(X)\) bits, and the cost does not depend on \(n\). 2. **Finite de Finetti Theorem**: The paper uses the relative entropy version of the finite de Finetti theorem to prove that under exchangeable distributions, an i.i.d. mixed distribution can be found such that the relative entropy distance between it and the original distribution is very small. This provides a theoretical basis for the coding strategy. ### Conclusion Through the introduction of new coding techniques and theoretical analysis, the paper solves the problem of efficient information broadcasting by the base station to randomly activated users in a large - scale random access scenario. Especially when the information sources are symmetric, it can significantly reduce the cost of the broadcast message. This result is of great significance for application scenarios such as machine - type communication (MTC) and the Internet of Things (IoT).