An Interpretable LSTM Network for Solar Flare Prediction

Gautam Varma Datla, J. T. Wang, Haodi Jiang
DOI: https://doi.org/10.1109/ICTAI59109.2023.00084
2023-11-06
Abstract: Deep learning models are often considered black-box models because their internal workings tend to be opaque to the user. This lack of transparency makes it challenging to understand the reasoning behind a model’s predictions. Here, we present an approach to making a solar flare prediction model interpretable. The model, built on a long short-term memory (LSTM) network with an attention mechanism, aims to predict whether an active region (AR) on the Sun’s surface will produce a large flare, namely an M- or X-class flare, within 24 hours. The flare events used in this study are collected from the Geostationary Operational Environmental Satellite X-ray flare catalogs provided by the National Centers for Environmental Information. The crux of our approach is to model data samples in an AR as time series and use the LSTM network to capture the temporal dynamics of the data samples. Each data sample has 22 features, including magnetic parameters and flare history parameters. To make the model’s predictions accountable and reliable, we leverage post hoc model-agnostic techniques, which help elucidate the factors contributing to the predicted output for an input sequence and provide insights into the model’s behavior across multiple sequences within an AR. To our knowledge, this is the first time that interpretability has been added to an LSTM-based flare prediction model.
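The abstract does not name the specific post hoc model-agnostic techniques used, so the following is only a minimal sketch of one such technique, occlusion-style feature attribution, and not the authors' method. It treats the classifier as a black box that maps a sequence of 22-feature samples to a flare probability, then measures how much that probability changes when one feature is replaced by a baseline value at every time step. The function `toy_model`, the sequence length of 10, the baseline of 0.0, and the synthetic input values are all assumptions made for illustration.

```python
import math
from typing import Callable, List

Sequence = List[List[float]]  # time steps x features


def toy_model(seq: Sequence) -> float:
    """Hypothetical stand-in for a trained flare classifier (an assumption).

    Returns a probability in (0, 1) that depends only on features 0 and 2.
    """
    score = sum(0.8 * step[0] - 0.5 * step[2] for step in seq) / len(seq)
    return 1.0 / (1.0 + math.exp(-score))


def occlusion_attribution(
    predict: Callable[[Sequence], float],
    seq: Sequence,
    baseline: float = 0.0,
) -> List[float]:
    """Score each feature by the change in predicted probability when that
    feature is replaced by `baseline` at every time step.

    Only `predict` is called, so the model's internals are never inspected.
    """
    reference = predict(seq)
    n_features = len(seq[0])
    scores = []
    for j in range(n_features):
        # Copy the sequence with feature j overwritten at each time step.
        occluded = [step[:j] + [baseline] + step[j + 1:] for step in seq]
        scores.append(reference - predict(occluded))
    return scores


if __name__ == "__main__":
    # 22 features per sample as in the abstract; 10 time steps and the
    # constant values are synthetic, chosen only for illustration.
    seq = [[1.0] * 22 for _ in range(10)]
    scores = occlusion_attribution(toy_model, seq)
    ranked = sorted(range(22), key=lambda j: abs(scores[j]), reverse=True)
    print("most influential feature indices:", ranked[:3])
```

A positive score means the feature pushed the prediction toward a flare, and a negative score means it pushed away. Because the procedure queries the model only through its predictions, the same loop could in principle be applied to any sequence classifier, and repeating it over several sequences from one AR gives a rough view of behavior across that region.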
Computer Science, Physics