KiDS-Legacy: angular galaxy clustering from deep surveys with complex selection effects

Ziang Yan,Angus H. Wright,Nora Elisa Chisari,Christos Georgiou,Shahab Joudaki,Arthur Loureiro,Robert Reischke,Marika Asgari,Maciej Bilicki,Andrej Dvornik,Catherine Heymans,Hendrik Hildebrandt,Priyanka Jalan,Benjamin Joachimi,Giorgio Francesco Lesci,Shun-Sheng Li,Laila Linke,Constance Mahony,Lauro Moscardini,Nicola R. Napolitano,Benjamin Stoelzner,Maximilian Von Wietersheim-Kramsta,Mijin Yoon
2024-10-30
Abstract:Photometric galaxy surveys, despite their limited resolution along the line of sight, encode rich information about the large-scale structure (LSS) of the Universe thanks to the large number density and extensive depth of the data. However, the complicated selection effects in wide and deep surveys will potentially cause significant bias in the angular two-point correlation function (2PCF) measured from those surveys. In this paper, we measure the 2PCF from the newly published KiDS-Legacy sample. Given an $r$-band $5\sigma$ magnitude limit of $24.8$ and survey footprint of $1347$ deg$^2$, it achieves an excellent combination of sky coverage and depth for such a measurement. We find that complex selection effects, primarily induced by varying seeing, introduce over-estimation of the 2PCF by approximately an order of magnitude. To correct for such effects, we apply a machine learning-based method to recover an ``organised random'' (OR) that presents the same selection pattern as the galaxy sample. The basic idea is to find the selection-induced clustering of galaxies using a combination of self-organising maps (SOM) and hierarchical clustering (HC). This unsupervised machine learning method is able to recover complicated selection effects without specifying their functional forms. We validate this ``SOM+HC'' method on mock deep galaxy samples with realistic systematics and selections derived from the KiDS-Legacy catalogue. Using mock data, we demonstrate that the OR delivers unbiased 2PCF cosmological parameter constraints, removing the $27\sigma$ offset in the galaxy bias parameter that is recovered when adopting uniform randoms. Blinded measurements on the real KiDS-Legacy data show that the corrected 2PCF is robust to the SOM+HC configuration near the optimal setup suggested by the mock tests. Our software is open-source for future usage.
Cosmology and Nongalactic Astrophysics,Instrumentation and Methods for Astrophysics
What problem does this paper attempt to address?