TY - GEN
T1 - An Interpretable LSTM Network for Solar Flare Prediction
AU - Datla, Gautam Varma
AU - Jiang, Haodi
AU - Wang, Jason T.L.
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Deep learning models are often considered black box models as their internal workings tend to be opaque to the user. Because of this lack of transparency, it is challenging to understand the reasoning behind the model's predictions. Here, we present an approach to making a solar flare prediction model interpretable. This model, built based on a long short-term memory (LSTM) network with an attention mechanism, aims to predict whether an active region (AR) on the Sun's surface would produce a large flare, namely an M- or X-class flare, within 24 hours. The flare events used in this study are collected from the Geostationary Operational Environmental Satellite X-ray flare catalogs provided by the National Centers for Environmental Information. The crux of our approach is to model data samples in an AR as time series and use the LSTM network to capture the temporal dynamics of the data samples. Each data sample has 22 features including magnetic parameters and flare history parameters. To make the model's predictions accountable and reliable, we leverage post hoc model-agnostic techniques, which help elucidate the factors contributing to the predicted output for an input sequence and provide insights into the model's behavior across multiple sequences within an AR. To our knowledge, this is the first time that interpretability has been added to an LSTM-based flare prediction model.
AB - Deep learning models are often considered black box models as their internal workings tend to be opaque to the user. Because of this lack of transparency, it is challenging to understand the reasoning behind the model's predictions. Here, we present an approach to making a solar flare prediction model interpretable. This model, built based on a long short-term memory (LSTM) network with an attention mechanism, aims to predict whether an active region (AR) on the Sun's surface would produce a large flare, namely an M- or X-class flare, within 24 hours. The flare events used in this study are collected from the Geostationary Operational Environmental Satellite X-ray flare catalogs provided by the National Centers for Environmental Information. The crux of our approach is to model data samples in an AR as time series and use the LSTM network to capture the temporal dynamics of the data samples. Each data sample has 22 features including magnetic parameters and flare history parameters. To make the model's predictions accountable and reliable, we leverage post hoc model-agnostic techniques, which help elucidate the factors contributing to the predicted output for an input sequence and provide insights into the model's behavior across multiple sequences within an AR. To our knowledge, this is the first time that interpretability has been added to an LSTM-based flare prediction model.
KW - Interpretable deep learning
KW - LSTM
KW - Solar flares
UR - http://www.scopus.com/inward/record.url?scp=85182394786&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85182394786&partnerID=8YFLogxK
U2 - 10.1109/ICTAI59109.2023.00084
DO - 10.1109/ICTAI59109.2023.00084
M3 - Conference contribution
AN - SCOPUS:85182394786
T3 - Proceedings - International Conference on Tools with Artificial Intelligence, ICTAI
SP - 526
EP - 531
BT - Proceedings - 2023 IEEE 35th International Conference on Tools with Artificial Intelligence, ICTAI 2023
PB - IEEE Computer Society
T2 - 35th IEEE International Conference on Tools with Artificial Intelligence, ICTAI 2023
Y2 - 6 November 2023 through 8 November 2023
ER -