Photometric Redshifts Probability Density Estimation from Recurrent Neural Networks in the DECam Local Volume Exploration Survey Data Release 2

G. Teixeira,C. R. Bom,L. Santana-Silva,B.M.O. Fraga,P. Darc,R. Teixeira,J. F. Wu,P. S. Ferguson,C. E. Martínez-Vázquez,A. H. Riley,A. Drlica-Wagner,Y. Choi,B. Mutlu-Pakdil,A. B. Pace,J. D. Sakowska,G. S. Stringfellow
2024-08-28
Abstract:Photometric wide-field surveys are imaging the sky in unprecedented detail. These surveys face a significant challenge in efficiently estimating galactic photometric redshifts while accurately quantifying associated uncertainties. In this work, we address this challenge by exploring the estimation of Probability Density Functions (PDFs) for the photometric redshifts of galaxies across a vast area of 17,000 square degrees, encompassing objects with a median 5$\sigma$ point-source depth of $g$ = 24.3, $r$ = 23.9, $i$ = 23.5, and $z$ = 22.8 mag. Our approach uses deep learning, specifically integrating a Recurrent Neural Network architecture with a Mixture Density Network, to leverage magnitudes and colors as input features for constructing photometric redshift PDFs across the whole DECam Local Volume Exploration (DELVE) survey sky footprint. Subsequently, we rigorously evaluate the reliability and robustness of our estimation methodology, gauging its performance against other well-established machine learning methods to ensure the quality of our redshift estimations. Our best results constrain photometric redshifts with the bias of $-0.0013$, a scatter of $0.0293$, and an outlier fraction of $5.1\%$. These point estimates are accompanied by well-calibrated PDFs evaluated using diagnostic tools such as Probability Integral Transform and Odds distribution. We also address the problem of the accessibility of PDFs in terms of disk space storage and the time demand required to generate their corresponding parameters. We present a novel Autoencoder model that reduces the size of PDF parameter arrays to one-sixth of their original length, significantly decreasing the time required for PDF generation to one-eighth of the time needed when generating PDFs directly from the magnitudes.
Instrumentation and Methods for Astrophysics
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: **In large - scale astronomical photometric surveys, how to estimate the photometric redshift of galaxies efficiently and accurately, and quantify its uncertainty**. Specifically, the authors carried out research on the following issues: 1. **Efficient estimation of photometric redshift**: As the amount of data provided by photometric surveys (such as DECam Local Volume Exploration Survey, DELVE) continues to increase, traditional methods are difficult to process these data efficiently. Therefore, a more effective method is needed to estimate the photometric redshifts of a large number of galaxies. 2. **Accurate quantification of uncertainty**: Traditional photometric redshift estimation methods usually only provide point estimates and ignore the uncertainties in redshift estimation. In order to better understand the physical properties of galaxies and cosmological parameters, these uncertainties must be accurately quantified. To solve these problems, the authors adopted deep - learning techniques, especially the method of **recurrent neural network (RNN) combined with mixture density network (MDN)**, using the brightness and color of galaxies as input features to construct the probability density function (PDF) of photometric redshift. This method can not only provide point estimates of redshift, but also comprehensively describe the uncertainty of redshift estimation through PDF. In addition, the authors also explored the use of **auto - encoder (AE)** to compress the PDF parameter array, thereby reducing the storage space requirements and increasing the speed of generating PDF. This method is of great significance for future large - scale survey projects (such as Vera Rubin Legacy Survey of Space and Time, LSST), because these projects will generate a huge amount of data, and it will become infeasible to directly store and process complete PDF information. ### Specific problem summary - **Problem 1**: How to efficiently estimate photometric redshift? - Solution: Adopt a deep - learning model of RNN combined with MDN, using the brightness and color of galaxies as input features. - **Problem 2**: How to accurately quantify the uncertainty of redshift estimation? - Solution: By constructing the probability density function (PDF) of photometric redshift, comprehensively describe the uncertainty of redshift estimation. - **Problem 3**: How to meet the requirements of PDF storage and generation time? - Solution: Use an auto - encoder to compress the PDF parameter array, reducing the storage space requirements and increasing the generation speed. Through these methods, the authors have successfully improved the accuracy and reliability of photometric redshift estimation, while solving the storage and computational efficiency problems in large - scale data processing.