Bayesian and Convolutional Networks for Hierarchical Morphological Classification of Galaxies

Jonathan Serrano-Pérez,Raquel Díaz Hernández,L. Enrique Sucar
2024-05-03
Abstract:This work is focused on the morphological classification of galaxies following the Hubble sequence in which the different classes are arranged in a hierarchy. The proposed method, BCNN, is composed of two main modules. First, a convolutional neural network (CNN) is trained with images of the different classes of galaxies (image augmentation is carried out to balance some classes); the CNN outputs the probability for each class of the hierarchy, and its outputs/predictions feed the second module. The second module consists of a Bayesian network that represents the hierarchy and helps to improve the prediction accuracy by combining the predictions of the first phase while maintaining the hierarchical constraint (in a hierarchy, an instance associated with a node must be associated to all its ancestors), through probabilistic inference over the Bayesian network so that a consistent prediction is obtained. Different images from the Hubble telescope have been collected and labeled by experts, which are used to perform the experiments. The results show that BCNN performed better than several CNNs in multiple evaluation measures, reaching the next scores: 67% in exact match, 78% in accuracy, and 83% in hierarchical F-measure.
Instrumentation and Methods for Astrophysics,Astrophysics of Galaxies,Machine Learning
What problem does this paper attempt to address?
The main problems that this paper attempts to solve are several key challenges in galactic morphology classification, especially: 1. **Subjectivity and Inconsistency**: Traditional classification methods based on visual inspection are easily influenced by the subjective decisions of experts, leading to inconsistent and discrepant classification results. 2. **Morphological Complexity**: Some galaxies have complex morphological features, making it difficult to define clear boundaries between different categories, thus affecting the classification accuracy. 3. **Dataset Bias**: The datasets used for training and evaluating models may be biased, affecting the generalization ability and performance of the models. 4. **Image Quality Problems**: Due to factors such as observation techniques, sky region selection, and exposure time, the quality of the obtained galaxy images varies greatly, increasing the difficulty of classification. To solve these problems, the author proposes a new method - **Bayesian and Convolutional Neural Networks (BCNN)** for hierarchical galactic morphology classification. Specifically, this method combines Convolutional Neural Networks (CNN) and Bayesian Networks (Bayesian Network) and aims to: - Use CNN to automatically extract features from images and output the probability of each category. - Represent the hierarchical structure through Bayesian networks, combine the prediction results of CNN, and at the same time maintain hierarchical constraints (that is, in a hierarchical structure, when an instance is associated with a certain node, it must also be associated with all ancestor nodes of that node) to improve the prediction accuracy. - Solve the class imbalance problem and balance the dataset through image enhancement techniques (such as geometric transformation). The experimental results show that BCNN outperforms the method using CNN alone in multiple evaluation metrics. For example, it reaches 67% in the Exact Match, 78% in Accuracy, and 83% in the hierarchical F - measure. Through this method, the author hopes to achieve more accurate and efficient galactic morphology classification in larger - scale and more complex datasets.