Learnable wavelet neural networks for cosmological inference

Christian Pedersen,Michael Eickenberg,Shirley Ho
2023-07-25
Abstract:Convolutional neural networks (CNNs) have been shown to both extract more information than the traditional two-point statistics from cosmological fields, and marginalise over astrophysical effects extremely well. However, CNNs require large amounts of training data, which is potentially problematic in the domain of expensive cosmological simulations, and it is difficult to interpret the network. In this work we apply the learnable scattering transform, a kind of convolutional neural network that uses trainable wavelets as filters, to the problem of cosmological inference and marginalisation over astrophysical effects. We present two models based on the scattering transform, one constructed for performance, and one constructed for interpretability, and perform a comparison with a CNN. We find that scattering architectures are able to outperform a CNN, significantly in the case of small training data samples. Additionally we present a lightweight scattering network that is highly interpretable.
Instrumentation and Methods for Astrophysics,Cosmology and Nongalactic Astrophysics,Machine Learning
What problem does this paper attempt to address?
This paper discusses how to solve the marginalization problem in cosmological inference and astrophysical effect using trainable scattering transforms, which is a convolutional neural network (CNN) that uses trainable wavelets as filters. Traditionally, CNNs have shown excellent performance in extracting information from cosmological fields and eliminating astrophysical effects, but they require a large amount of training data, which may be problematic for expensive cosmological simulations, and CNNs have poor interpretability. The researchers propose two models based on scattering transforms, one focusing on performance and the other on interpretability, and compare them with CNNs. The results show that the scattering architecture significantly outperforms CNNs in situations with small training data. Additionally, they introduce a lightweight scattering network that has high interpretability. The paper experiments with the CAMELs simulation dataset and compares the performance of different models on datasets of different sizes. The results show that the scattering network performs significantly better on small datasets and even slightly better on large datasets. Furthermore, the interpretable network (IN) achieves accurate predictions of key cosmological parameters with a lower number of parameters even when only using the cold dark matter mass field. In conclusion, the paper aims to solve the issues of high data demands and poor interpretability of CNNs in cosmological inference through scattering transforms. The proposed new models strike a balance between performance and interpretability, particularly in dealing with limited training data, providing a better solution.