A machine learning approach to infer the accreted stellar mass fractions of galaxies

Rui Shi,Wenting Wang,Zhaozhou Li,Jiaxin Han,Jingjing Shi,Vicente Rodriguez-Gomez,Yingjie Peng
2021-01-01
Abstract:We propose a random forest (RF) machine learning approach to determine the accreted stellar mass fractions (facc) of central galaxies, based on various dark matter halo and galaxy features. The RF is trained and tested using 2,710 galaxies with stellar mass log10 M∗/M > 10.16 from the TNG100 simulation. For galaxies with log10 M∗/M > 10.6, global features such as halo mass, size and stellar mass are more important in determining facc, whereas for galaxies with log10 M∗/M 6 10.6, features related to merger histories have higher predictive power. Galaxy size is the most important when calculated in 3-dimensions, which becomes less important after accounting for observational effects. In contrast, the stellar age, galaxy colour and star formation rate carry very limited information about facc. When an entire set of halo and galaxy features are used, the prediction is almost unbiased, with rootmean-square error (RMSE) of ∼0.068. If only using observable features, the RMSE increases to ∼0.104. Nevertheless, compared with the case when only stellar mass is used, the inclusion of other observable features does help to decrease the RMSE by ∼20%. Lastly, when using galaxy density, velocity and velocity dispersion profiles as features, which represent approximately the maximum amount of information one can extract from galaxy images and velocity maps, the prediction is only slightly improved. Hence, with observable features, the limiting precision of predicting facc is ∼0.1, and the multi-component decomposition of galaxy images should have similar or larger uncertainties. If the central black hole mass and the spin parameter of galaxies can be accurately measured in future observations, the RMSE is promising to be further decreased by ∼20%.
What problem does this paper attempt to address?