Searching for chemo-kinematic structures in the Milky Way halo with deep clustering algorithms

Leda Berni
2024-09-14
Abstract:According to the lambda CDM scenario, galaxies are formed through the hierarchical accretion of building blocks. Our Galaxy is a privileged place to look for the remnants of accretion events through the study of the chemical and kinematic properties of its halo stellar populations. Due to its low density, the stellar halo holds the most favorable conditions for chemical tagging. However, chemical tagging alone often yields weak results due to both uncertainties in chemical abundances and to overlapping chemical properties among different populations. To overcome this problem, the use of chemical and kinematic properties can be combined. In this Thesis, we developed a machine learning algorithm, named the CREEK, which combines orbital and chemical properties of halo stars observed by two large public spectroscopic surveys, Gaia-ESO and APOGEE. The CREEK operates as follows: 1)Data selection: We selected halo stars from the APOGEE and Gaia-ESO surveys based both on their velocity and metallicity and we computed their orbital parameters. 2)Using kinematics: The selected data were passed to a Siamese Neural Network that established links between stars based on their kinematic similarities. 3)Using chemistry: The graph was passed through a Graph Neural Network (GNN) auto-encoder that took as input the selected abundances. The abundances were chosen to maximize homogeneity within stars from the same cluster while ensuring distinctiveness between stars from different clusters. Additionally, we prioritised elements with smallest errors. The GNN auto-encoder computed a mean of the abundances of all connected stars, weighted on the number of links of each star and mapped the chemical space into a more efficient representation in the latent space. 4)Recovering structures: Finally, OPTICS was applied to the latent space, providing groups based on the chemical similarities of the stars.
Astrophysics of Galaxies
What problem does this paper attempt to address?
This paper aims to explore the chemo-dynamic structure in the Milky Way halo using deep clustering algorithms. Specifically: 1. **Research Background**: The Milky Way is a complex system composed of stars, gas, dust, and dark matter. Since we are located within the Milky Way, we can resolve stars of different masses and infer their chemical compositions and ages. Therefore, the Milky Way serves as an important benchmark for studying disk galaxies. 2. **Main Objectives**: This paper analyzes stellar datasets in the Milky Way halo using deep learning techniques (such as graph neural networks and Siamese neural networks) to identify different chemical abundances and kinematic features. The focus is on discovering specific structures that may exist in the Milky Way halo, such as globular clusters and stellar streams, and exploring the formation mechanisms of these structures. 3. **Data Sources**: The study utilizes data from several large spectroscopic survey projects, including APOGEE (Apache Point Observatory Galactic Evolution Experiment) and the Gaia-ESO survey. These data provide rich information on chemical abundances and kinematics. 4. **Methodology**: By using deep clustering algorithms (such as OPTICS combined with Siamese neural networks), the study classifies and clusters a large sample of stars to reveal the associations and differences between different stellar populations in the Milky Way halo. In summary, the main purpose of this paper is to use advanced machine learning methods to mine the chemo-dynamic properties of stars in the Milky Way halo from observational data, thereby better understanding the formation and evolution processes of the Milky Way halo.