The Survey of Surveys: machine learning for stellar parametrization

A. Turchi,E. Pancino,F. Rossi,A. Avdeeva,P. Marrese,S. Marinoni,N. Sanna,M. Tsantaki,G. Fanari

DOI: https://doi.org/10.1117/12.3018967

2024-12-06

Abstract:We present a machine learning method to assign stellar parameters (temperature, surface gravity, metallicity) to the photometric data of large photometric surveys such as SDSS and SKYMAPPER. The method makes use of our previous effort in homogenizing and recalibrating spectroscopic data from surveys like APOGEE, GALAH, or LAMOST into a single catalog, which is used to inform a neural network. We obtain spectroscopic-quality parameters for millions of stars that have only been observed photometrically. The typical uncertainties are of the order of 100K in temperature, 0.1 dex in surface gravity, and 0.1 dex in metallicity and the method performs well down to low metallicity, were obtaining reliable results is known to be difficult.

Instrumentation and Methods for Astrophysics,Astrophysics of Galaxies,Solar and Stellar Astrophysics

What problem does this paper attempt to address?

The problem that this paper attempts to solve is: How to use machine - learning methods to derive the key parameters (effective temperature \(T_{\text{eff}}\), surface gravity \(\log g\) and metallicity \([Fe/H]\)) of stars from the data of large - scale photometric surveys (such as SDSS and SkyMapper), so as to provide parameter estimates of spectral - like quality for millions of stars observed only by photometric measurements. ### Specific problems: 1. **Large amount of data and difficulty in high - quality spectral measurement**: Although large - scale spectral surveys (such as APOGEE, GALAH, LAMOST, etc.) provide high - quality spectral data for a large number of stars, obtaining these data requires a large amount of telescope time, so it is difficult to cover all star samples. And although photometric data is huge in quantity, its precision is low. 2. **Difficulties in parameter estimation for stars with low metallicity**: In the case of low metallicity, it is especially difficult to obtain reliable star parameters. ### Solutions: The paper proposes a machine - learning - based method. Using previous work on homogenizing and recalibrating spectral data, a neural network model is constructed. This model can derive high - precision star parameters from photometric data. Specific steps include: - **Data integration**: Cross - match photometric data such as Gaia and SkyMapper with spectral data such as APOGEE and GALAH to form a high - quality input data set containing tens of millions of stars. - **Neural network training**: Use the spectral parameters in the SoS (Survey of Surveys) catalog as "true" values to train the neural network to predict \(T_{\text{eff}}\), \(\log g\) and \([Fe/H]\) of stars. - **Performance evaluation**: Verify the accuracy of the model through the test set. The results show that the errors of the model on the test set are: the standard deviation of \(T_{\text{eff}}\) is 85 K, the standard deviation of \(\log g\) is 0.14 dex, and the standard deviation of \([Fe/H]\) is 0.12 dex. ### Significance: This method enables researchers to use the existing large - scale photometric data to provide high - quality parameter estimates for hundreds of millions or even billions of stars, greatly expanding the available high - quality star parameter samples, and helping astronomers to study the structure and evolution of the Milky Way more in - depth.

The Survey of Surveys: machine learning for stellar parametrization

Stellar atmospheric parameters and chemical abundances of about 5 million stars from S-PLUS multi-band photometry

Automated Stellar Spectral Classification and Parameterization for the Masses

A machine learning approach to photometric metallicities of giant stars

A Machine-Learning Photometric Classifier for Massive Stars in Nearby Galaxies I. the Method

SPar: estimating stellar parameters from multi-band photometries with empirical stellar libraries

Machine-guided Exploration and Calibration of Astrophysical Simulations

deep-REMAP: Parameterization of Stellar Spectra Using Regularized Multi-Task Learning

Distance and stellar parameter estimations of solar-like stars from the LAMOST spectroscopic survey

SpectroTranslator: Deep-neural network algorithm for homogenising spectroscopic parameters

Automatic Survey-Invariant Variable Star Classification

Transferring spectroscopic stellar labels to 217 million Gaia DR3 XP stars with SHBoost

SpectroTranslator: a deep-neural network algorithm to homogenize spectroscopic parameters

Inferring stellar parameters and their uncertainties from high-resolution spectroscopy using invertible neural networks

Estimation of Physical Stellar Parameters from Spectral Models using Deep Learning Techniques

A Self-consistent Data-driven Model for Determining Stellar Parameters from Optical and Near-infrared Spectra

Beyond spectroscopy. II. Stellar parameters for over twenty million stars in the northern sky from SAGES DR1 and Gaia DR3

Automating Discovery and Classification of Transients and Variable Stars in the Synoptic Survey Era

The Stellar parametrization using Artificial Neural Network

The Gaia-ESO Survey: Chemical evolution of Mg and Al in the Milky Way with Machine-Learning

Estimating the Atmospheric Parameters of Early-type Stars from the Chinese Space Station Telescope (CSST) Slitless Spectra Survey