Optimal quantization for a probability measure on a nonuniform stretched Sierpiński triangle

Megha Pandey,Mrinal Kanti Roychowdhury
2024-02-14
Abstract:Quantization for a Borel probability measure refers to the idea of estimating a given probability by a discrete probability with support containing a finite number of elements. In this paper, we have considered a Borel probability measure $P$ on $\mathbb R^2$, which has support a nonuniform stretched Sierpiński triangle generated by a set of three contractive similarity mappings on $\mathbb R^2$. For this probability measure, we investigate the optimal sets of $n$-means and the $n$th quantization errors for all positive integers $n$.
Information Theory
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the optimal quantization problem of a Borel probability measure on a non - uniformly stretched Sierpinski triangle. Specifically, the author considers a Borel probability measure \(P\) in two - dimensional space \(\mathbb{R}^2\), whose support set is a non - uniformly stretched Sierpinski triangle generated by three contractive similarity mappings. For this probability measure, the author studies the optimal \(n\)-mean set and the \(n\)th quantization error, where \(n\) is any positive integer. ### Background and Definitions **Quantization** refers to approximating a given probability measure with a discrete probability distribution, where the support set of the discrete probability distribution contains a finite number of elements. For a Borel probability measure \(P\), the \(n\)th quantization error \(V_n\) is defined as: \[ V_n := V_n(P)=\inf_{\alpha\subset\mathbb{R}^2,\text{card}(\alpha)\leq n}\int\min_{a\in\alpha}\|x - a\|^2dP(x) \] where \(\text{card}(\alpha)\) represents the cardinality of the set \(\alpha\). The set \(\alpha\) that makes the above infimum hold is called the optimal \(n\)-mean set or the optimal \(n\)-quantizer set, denoted as \(C_n := C_n(P)\). ### Research Objectives The author's goal is to determine the optimal \(n\)-mean set and the \(n\)th quantization error of the probability measure \(P\) on the non - uniformly stretched Sierpinski triangle. Specifically, the author introduces a set of three contractive similarity mappings \(S_1, S_2, S_3\): - \(S_1(x_1,x_2)=\frac{1}{4}(x_1,x_2)\) - \(S_2(x_1,x_2)=\frac{1}{4}(x_1,x_2)+\frac{3}{4}(1,0)\) - \(S_3(x_1,x_2)=\frac{1}{2}(x_1,x_2)+\frac{1}{2}\left(\frac{1}{2},\frac{\sqrt{3}}{2}\right)\) These mappings generate a non - uniformly stretched Sierpinski triangle. The probability measure \(P\) is defined as: \[ P=\frac{1}{5}P\circ S_1^{-1}+\frac{1}{5}P\circ S_2^{-1}+\frac{3}{5}P\circ S_3^{-1} \] ### Main Results The author gives a formula for determining the optimal \(n\)-mean set by induction and shows how to derive the optimal \((n + 1)\)-mean set from the known optimal \(n\)-mean set. In addition, the author also provides some diagrams to illustrate the positions of the elements in the optimal set. ### Conclusions Through these studies, the author not only determines the optimal \(n\)-mean set and quantization error of the probability measure on the non - uniformly stretched Sierpinski triangle, but also provides a theoretical basis for the further development of quantization theory in practical applications.