The miniJPAS survey quasar selection IV: Classification and redshift estimation with SQUEzE

Ignasi Pérez-Ràfols, L. Raul Abramo, Ginés Martínez-Solaeche, Matthew M. Pieri, Carolina Queiroz, Natália V.N. Rodrigues, Silvia Bonoli, Jonás Chaves-Montero, Sean S. Morrison, Jailson Alcaniz, Narciso Benitez, Saulo Carneiro, Javier Cenarro, David Cristóbal-Hornillos, Renato Dupke, Alessandro Ederoclite, Rosa M. González Delgado, Antonio Hernán-Caballero, Carlos López-Sanjuan, Antonio Marín-Franch, Valerio Marra, Claudia Mendes de Oliveira, Mariano Moles, Laerte Sodré Jr., Keith Taylor, Jesús Varela, Héctor Vázquez Ramió
DOI: https://doi.org/10.1051/0004-6361/202347488
2023-09-01
Abstract: We present a list of quasar candidates including photometric redshift estimates from the miniJPAS Data Release constructed using SQUEzE. This work is based on machine-learning classification of photometric data of quasar candidates using SQUEzE. It has the advantage that its classification procedure can be explained to some extent, making it less of a 'black box' when compared with other classifiers. Another key advantage is that using user-defined metrics means the user has more control over the classification. While SQUEzE was designed for spectroscopic data, here we adapt it for multi-band photometric data, i.e. we treat multiple narrow-band filters as very low-resolution spectra. We train our models using specialized mocks from Queiroz et al. (2022). We estimate our redshift precision using the normalized median absolute deviation, $\sigma_{\rm NMAD}$, applied to our test sample. Our test sample returns an $f_1$ score (effectively the purity and completeness) of 0.49 for quasars down to magnitude $r=24.3$ with $z\geq2.1$ and 0.24 for quasars with $z<2.1$. For high-z quasars, this goes up to 0.9 for $r<21.0$. We present two catalogues of quasar candidates including redshift estimates: 301 from point-like sources and 1049 when also including extended sources. We discuss the impact of including extended sources in our predictions (they are not included in the mocks), as well as the impact of changing the noise model of the mocks. We also give an explanation of SQUEzE reasoning. Our estimates for the redshift precision using the test sample indicate a $\sigma_{\rm NMAD}=0.92\%$ for the entire sample, reduced to 0.81\% for $r<22.5$ and 0.74\% for $r<21.3$. Spectroscopic follow-up of the candidates is required in order to confirm the validity of our findings.
Instrumentation and Methods for Astrophysics, Astrophysics of Galaxies
What problem does this paper attempt to address?
The main problem this paper addresses is how to construct a high-quality catalogue of quasar candidates and provide photometric redshift estimates for them. Specifically, the research team used multi-band photometric data from the miniJPAS data release and applied a machine-learning code named SQUEzE to perform quasar classification and redshift estimation.

### Overview of the Main Problems

1. **Selection of Quasar Candidates**:
   - Quasars are among the most luminous objects in the universe and are important for studying supermassive black holes, cosmological parameters (for example through baryon acoustic oscillations, BAO), and non-Gaussianity.
   - Selecting quasar candidates from photometric data prepares for subsequent spectroscopic observations, which are needed to confirm the true nature of the candidates and their redshifts.
2. **Redshift Estimation**:
   - Redshift is a key measure of the distance to astronomical objects and is crucial for understanding the large-scale structure and evolution of the universe.
   - This research focuses not only on identifying quasars but also on providing precise photometric redshift estimates, which are critical for subsequent cosmological analysis.

### Methods and Challenges

- **Data Source**: miniJPAS is a small-scale proof-of-concept survey covering 1 square degree of the sky, using 54 narrow-band filters and 2 broadband filters that span the entire optical wavelength range.
- **Algorithm Selection**: SQUEzE is a machine-learning code originally designed for spectroscopic data. In this study it is adapted to multi-band photometric data by treating the narrow-band filters as very low-resolution spectra. It performs quasar identification and redshift estimation simultaneously by identifying multiple emission lines and their relative wavelengths and intensities.
- **Performance Evaluation**: The research team trained and validated the model on synthetic data (mocks) and evaluated its performance on a test sample. Down to magnitude \( r = 24.3 \), the F1-score is 0.49 for high-redshift quasars (\( z \geq 2.1 \)) and 0.24 for low-redshift quasars (\( z < 2.1 \)); for high-redshift quasars it rises to 0.9 at \( r < 21.0 \). The redshift precision, measured by the normalized median absolute deviation (\( \sigma_{\rm NMAD} \)), is 0.92% for the overall sample, 0.81% for \( r < 22.5 \), and 0.74% for \( r < 21.3 \).

### Key Contributions

- **Improving Classification Transparency**: The classification procedure of SQUEzE can be explained to some extent, which makes it less of a "black box" than other classifiers.
- **User-Defined Metrics**: Users gain more control over the classification by defining their own metrics.
- **Adapting to Multi-band Photometric Data**: Applying SQUEzE to multi-band photometric data enables quasar identification and redshift estimation even without high-resolution spectra.

In conclusion, this research aims to improve the accuracy of quasar candidate selection and redshift estimation through machine-learning methods and high-quality photometric data, in support of future cosmological research.
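
The two performance metrics quoted above, the F1-score and the normalized median absolute deviation, can be illustrated with a minimal sketch. It is not taken from SQUEzE or from the paper: the function names `f1_score` and `sigma_nmad` are hypothetical, the F1-score is assumed to be the harmonic mean of purity and completeness, and \( \sigma_{\rm NMAD} \) is assumed to follow the common convention \( 1.48 \times \mathrm{median}(|\Delta z - \mathrm{median}(\Delta z)| / (1 + z_{\rm true})) \), which may differ in detail from the definition used by the authors.

```python
from statistics import median


def f1_score(is_quasar_true, is_quasar_pred):
    """Harmonic mean of purity and completeness (assumed F1 definition).

    Both arguments are equal-length sequences of booleans:
    the true class and the predicted class of each object.
    """
    pairs = list(zip(is_quasar_true, is_quasar_pred))
    true_pos = sum(1 for t, p in pairs if t and p)
    false_pos = sum(1 for t, p in pairs if not t and p)
    false_neg = sum(1 for t, p in pairs if t and not p)

    # Purity: fraction of selected candidates that are real quasars.
    purity = true_pos / (true_pos + false_pos) if true_pos + false_pos else 0.0
    # Completeness: fraction of real quasars that were selected.
    completeness = true_pos / (true_pos + false_neg) if true_pos + false_neg else 0.0

    if purity + completeness == 0.0:
        return 0.0
    return 2.0 * purity * completeness / (purity + completeness)


def sigma_nmad(z_est, z_true):
    """Normalized median absolute deviation of redshift errors.

    Assumed convention (not confirmed against the paper):
    1.48 * median(|dz - median(dz)| / (1 + z_true)), with dz = z_est - z_true.
    """
    dz = [ze - zt for ze, zt in zip(z_est, z_true)]
    dz_median = median(dz)
    scaled = [abs(d - dz_median) / (1.0 + zt) for d, zt in zip(dz, z_true)]
    return 1.48 * median(scaled)
```

Under these assumptions, a quoted value such as \( \sigma_{\rm NMAD} = 0.92\% \) corresponds to `sigma_nmad(...)` returning roughly 0.0092 on the test sample. Because the statistic is built from medians, it is insensitive to a small fraction of catastrophic redshift failures.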