Statistical visualisation for tidy and geospatial data in R via kernel smoothing methods in the eks package

Tarn Duong
DOI: https://doi.org/10.48550/arXiv.2203.01686
2022-03-03
Computation
Abstract:Kernel smoothers are essential tools for data analysis due to their ability to convey complex statistical information with concise graphical visualisations. Their inclusion in the base distribution and in the many user-contributed add-on packages of the R statistical analysis environment caters well to many practitioners. Though there remain some important gaps for specialised data types, most notably for tibbles (tidy data) within the tidyverse, and for simple features (geospatial data) within geospatial analysis. The proposed eks package fills in these gaps. In addition to kernel density estimation, this package also caters for more complex data analysis situations, such as density derivative estimation, density-based classification (supervised learning) and mean shift clustering (unsupervised learning). We illustrate with experimental data how to obtain and to interpret the statistical visualisations for these kernel smoothing methods.
What problem does this paper attempt to address?