Cellwise Outlier Detection with False Discovery Rate Control

Yanhong Liu,Haojie Ren,Xu Guo,Qin Zhou,Changliang Zou
DOI: https://doi.org/10.1002/cjs.11649
2021-01-01
Abstract:This article is concerned with detecting cellwise outliers in large data matrices. We introduce a novel method that is able to fully exploit dependence structures among variables while controlling the false discovery rate (FDR). We reframe cellwise outlier identification into a high‐dimensional variable selection paradigm and construct “binate references” for data screening, estimation and information pooling. With the binate references, the proposed procedure forms a series of statistics that incorporate covariance information and utilizes a global symmetry property of these statistics to approximate the false discovery proportion. We show that the proposed method can control the asymptotic FDR under some mild conditions. Extensive numerical studies demonstrate that our method has reasonable FDR control and satisfactory power in comparison to existing methods.
What problem does this paper attempt to address?