Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

Solar Flare Prediction from Extremely Imbalanced Multivariate Time Series Data using Minimally Random Convolutional Kernel Transform

Version 1 : Received: 4 March 2024 / Approved: 5 March 2024 / Online: 5 March 2024 (10:55:34 CET)

How to cite: Saini, K.; Alshammari, K.; Hamdi, S.M.; Filali Boubrahimi, S. Solar Flare Prediction from Extremely Imbalanced Multivariate Time Series Data using Minimally Random Convolutional Kernel Transform. Preprints 2024, 2024030210. https://doi.org/10.20944/preprints202403.0210.v1 Saini, K.; Alshammari, K.; Hamdi, S.M.; Filali Boubrahimi, S. Solar Flare Prediction from Extremely Imbalanced Multivariate Time Series Data using Minimally Random Convolutional Kernel Transform. Preprints 2024, 2024030210. https://doi.org/10.20944/preprints202403.0210.v1

Abstract

Solar flares are characterized by sudden bursts of electromagnetic radiation from the Sun’s surface, and caused by the changes in magnetic field states in solar active regions. Earth and its surrounding space environment can suffer from various negative impacts caused by solar flares ranging from electronic communication disruption to radiation exposure-based health risks to the astronauts. In this paper, we address the solar flare prediction problem from magnetic field parameter-based multivariate time series (MVTS) data using multiple state-of-the-art machine learning classifiers that include MINImally RandOm Convolutional KErnel Transform (MINIROCKET), Support Vector Machine (SVM), Canonical Interval Forest (CIF), Multiple Representations SEQuence Learner (Mr-SEQL), and Long Short-Term Memory (LSTM)-based deep learning model. Our experiment is conducted on the on the Space Weather ANalytics for Solar Flares (SWAN-SF) benchmark data set, which is a partitioned collection of MVTS data of active region magnetic field parameters spanning over 9 years of operation of the Solar Dynamics Observatory (SDO). The MVTS instances of the SWAN-SF dataset are labeled by GOES X-ray flux-based flare class labels, and attributed to extreme class imbalance because of the rarity of the major flaring events (e.g., X and M). As a performance validation metric in this class-imbalanced dataset, we used the true skill statistic (TSS) score. Finally, we demonstrate the advantages of the MVTS learning algorithm MINIROCKET, which outperformed the aforementioned classifiers without the need for essential data preprocessing steps such as normalization, statistical summarization, and class imbalance handling heuristics.

Keywords

Multivariate Time Series; Solar Flare; Space Weather; Imbalanced Data

Subject

Physical Sciences, Space Science

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.