Structural Descriptors and Information Extraction from X-ray Emission Spectra: Aqueous Sulfuric Acid

E. A. Eronen,A. Vladyka,Ch. J. Sahle,J. Niskanen
DOI: https://doi.org/10.1039/D4CP02454K
2024-08-23
Abstract:Machine learning can reveal new insights into X-ray spectroscopy of liquids when the local atomistic environment is presented to the model in a suitable way. Many unique structural descriptor families have been developed for this purpose. We benchmark the performance of six different descriptor families using a computational data set of 24200 sulfur K$\beta$ X-ray emission spectra of aqueous sulfuric acid simulated at six different concentrations. We train a feed-forward neural network to predict the spectra from the corresponding descriptor vectors and find that the local many-body tensor representation, smooth overlap of atomic positions and atom-centered symmetry functions excel in this comparison. We found a similar hierarchy when applying the emulator-based component analysis to identify and separate the spectrally relevant structural characteristics from the irrelevant ones. In this case, the spectra were dominantly dependent on the concentration of the system, whereas adding the second most significant degree of freedom in the decomposition allowed for distinction of the protonation state of the acid molecule.
Chemical Physics,Data Analysis, Statistics and Probability
What problem does this paper attempt to address?