Submitted:
13 August 2024
Posted:
13 August 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Data Sources and Preprocessing
2.1. Data Source
2.2. Data Preprocessing
3. Methodology
3.1. Design of Neural Networks
3.1.1. ConvLSTM
- 1.
- The input layer: The matrix dimension of the input layer is denoted as (B, S, C, H, W), where B represents the batch size, indicating the number of samples processed concurrently in a single training iteration; S stands for sequence length, denoting the temporal length of a sample; C refers to the number of channels, indicating the quantity of channels in the data; H is the height, representing the vertical dimension of the data; and W is the width, denoting the horizontal extent of the data.
- 2.
-
The encoder is comprised of Convolutional Layers (Conv) and Convolutional LSTM (ConLSTM) units:
- a)
- Conv is one of the commonly used layers in deep learning models, primarily utilized for extracting features from input data. Each convolutional layer is represented by an ordered dictionary, which contains the parameters required for the convolution operation. These parameters include: Input channel number, which specifies the number of channels in the input data; Output channel number, which determines the number of convolutional kernels and consequently the depth of the output feature map; Kernel size, which specifies the dimensions of the convolutional kernel; Stride, which determines the step length of the convolutional kernel as it slides over the input data, affecting the size of the output feature map; Padding, which involves adding zeros to the edges of the input data to control the size of the output feature map.
- b)
- The ConLSTM unit, a specialized type of Recurrent Neural Network, integrates convolutional operations with the LSTM architecture. Each ConLSTM unit consists of a set of convolutional kernels and gating units (forget gate, input gate, and output gate), designed to capture spatio-temporal dependencies in sequential data. Within the encoder, the ConLSTM units receive feature representations from convolutional layers and utilize the convolutional kernels to slide along the sequence dimension, effectively capturing the spatio-temporal information of the input sequence.
- 3.
-
The decoder consists of Deconvolutional Layers (DeConv), Convolutional LSTM (ConLSTM), and Feature Mapping Recovery (Conv) processes: Deconvolutional layers are a primary component of the decoding layer, tasked with transforming the abstract representations from the encoded data back into the original data space.
- a)
- The deconvolution process is the inverse of the convolution process, gradually restoring the resolution from a lower resolution feature map to that of the original input. Each deconvolution layer contains a set of parameters, including the number of input channels, output channels, kernel size, stride, and padding.
- b)
- Within the decoder, Convolutional LSTM units are also employed to handle spatio-temporal sequence data, progressively restoring the original data space.
- c)
- The final phase of the decoding layer includes a feature mapping recovery step, which systematically transforms the abstract representations back to the form of the original data through convolution operations.
- 4.
- The output layer: The matrix dimension of the output layer is identical to that of the input layer.
3.1.2. Conger
3.2. Neural Network Parameter Configuration
4. Discussion
4.1. Predictive Accuracy



4.2. Models Comparison
4.2.1. Temperature
4.2.2. Eastward Wind
4.2.3. Northward Wind
4.2.4. Performance at different isobaric surface
4.2.5. Performance at different latitudes
5. Conclusions
- When predicting for eight different times in a day, as the time span increases, the model's prediction accuracy tends to decrease, and the ConvGRU model outperforms the ConvLSTM model in forecasting results.
- In a comparison experiment using randomly selected data from June 2, 2022, to predict the environmental parameters for June 3, 2022, the ConvGRU model performed better than the ConvLSTM model in forecasting temperature, eastward wind, and northward wind, with smaller root-mean-square errors (RMSEs). Specifically, at the 1.65 Pa pressure level, the ConvGRU model showed a significant improvement in prediction accuracy over the ConvLSTM model, with a 13.6% improvement in temperature, a 6.6% improvement in eastward wind, and an 8.8% improvement in northward wind.
- By evaluating the correlation coefficients, mean errors, and root-mean-square errors between the predicted and actual values across different pressure levels (72 levels in total) for the entire year of 2022 and various latitudes and longitudes, it was found that the models' forecasting capability decreases with increasing altitude. Furthermore, both models exhibited significant decreases in forecasting accuracy for temperature, eastward wind, and northward wind at the interfaces between the Troposphere and Stratosphere, and between the Stratosphere and Mesosphere, which is related to the complex atmospheric changes at these interfaces.
- By evaluating the correlation coefficients, mean errors, and root-mean-square errors between the predicted and actual values across different latitudes (96 latitudes in total) for the entire year of 2022 and various altitudes and longitudes, it was found that the models' prediction accuracy in the polar regions was not significantly lower than other latitudes, indicating that the MERRA2 dataset has relatively high reliability in the polar regions.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Chen, H. An overview of the space-based observations for upper atmospheric research[J]. Advances in Earth Science, 2009, 24, 229. [Google Scholar]
- Tomme, E.B.; Phil, D. The paradigm shift to effects-based space: Near-space as a combat space effects enabler[M]. Airpower Research Institute, College of Aerospace Doctrine, Research and Education, Air University, 2005.
- Goncharenko, L.P.; Chau, J.L.; Liu, H.L. , et al. Unexpected connections between the stratosphere and ionosphere[J]. Geophysical Research Letters, 2010, 37, 10. [Google Scholar]
- Drob, D.P.; Emmert, J.T.; Meriwether, J.W. , et al. An update to the Horizontal Wind Model (HWM): The quiet time thermosphere[J]. Earth and Space Science, 2015, 2, 301–319. [Google Scholar]
- Picone, J.M.; Hedin, A.E.; Drob, D.P. , et al. NRLMSISE-00 empirical model of the atmosphere: Statistical comparisons and scientific issues[J]. Journal of Geophysical Research: Space Physics, 2002, 107, SIA 15–1. [Google Scholar]
- Liu, H.L.; Bardeen, C.G.; Foster, B.T. , et al. Development and validation of the whole atmosphere community climate model with thermosphere and ionosphere extension (WACCM-X 2.0)[J]. Journal of Advances in Modeling Earth Systems, 2018, 10, 381–402. [Google Scholar]
- Allen, D.R.; Eckermann, S.D.; Coy, L., et al. Stratospheric Forecasting with NOGAPS-ALPHA[J]. 2004.
- Greenberg, D.A.; Dupont, F.; Lyard, F.H. , et al. Resolution issues in numerical models of oceanic and coastal circulation[J]. Continental Shelf Research, 2007, 27, 1317–1343. [Google Scholar]
- Roney, J.A. Statistical wind analysis for near-space applications[J]. Journal of Atmospheric and Solar-Terrestrial Physics, 2007, 69, 1485–1501. [Google Scholar] [CrossRef]
- Liu, T. Research on statistical forecasting method of atmospheric wind field in adjacent space [D]. Beijing: National Space Science Center of Chinese Academy of Sciences, 2017.
- Bi, K.; Xie, L.; Zhang, H. , et al. Accurate medium-range global weather forecasting with 3D neural networks[J]. Nature, 2023, 619, 533–538. [Google Scholar] [PubMed]
- Kaparakis, C.; Mehrkanoon, S. Wf-unet: Weather fusion unet for precipitation nowcasting[J]. arXiv:2302.04102, 2023.
- Nguyen, T.; Brandstetter, J.; Kapoor, A., et al. Climax: A foundation model for weather and climate[J]. arXiv:2301.10343, 2023.
- Lam, R.; Sanchez-Gonzalez, A.; Willson, M.; et al. GraphCast: Learning skillful medium-range global weather forecasting[J]. arXiv:2212.12794, 2022.
- Bi, K.; Xie, L.; Zhang, H., et al. Pangu-weather: A 3d high-resolution model for fast and accurate global weather forecast[J]. arXiv:2211.02556, 2022.
- Chen, B.; Sheng, Z.; Cui, F. Refined short-term forecasting atmospheric temperature profiles in the stratosphere based on operators learning of neural networks[J]. Earth and Space Science, 2024, 11, e2024EA003509. [Google Scholar] [CrossRef]
- Shi, X.; Chen, Z.; Wang, H., et al. Convolutional LSTM network: A machine learning approach for precipitation nowcasting[J]. Advances in neural information processing systems, 2015, 28.
- Palmer, T.N.; Shutts, G.J.; Hagedorn, R.; et al. Representing model uncertainty in weather and climate prediction[J]. Annu. Rev. Earth Planet. Sci., 2005, 33, 163–193. [Google Scholar] [CrossRef]
- Kim, S.; Hong, S.; Joh, M.; Song, S.K. Deeprain: Convlstm network for precipitation prediction using multichannel radar data. arXiv arXiv:1711.02316, 2017.
- Gelaro, R.; McCarty, W.; Suárez, M.J. , et al. The modern-era retrospective analysis for research and applications, version 2 (MERRA-2)[J]. Journal of climate, 2017, 30, 5419–5454. [Google Scholar]














| Dataset Type | Date Year | Amount of data |
|---|---|---|
| Training set | 2011-2012 | 3653*8*144*96*72*3 |
| Validation set Test set |
2021 | 365*8*144*96*72*3 |
| 2022 | 365*8*144*96*72*3 |
| Block | Module | Channel Number | Kernel size | Stride | Padding | |
|---|---|---|---|---|---|---|
| Input | Output | |||||
| Encoder | Stage1(Sequential) | 72 | 16 | (3, 3) | (1, 1) | (1, 1) |
| RNN1(ConvLSTM) | 80 | 256 | (5, 5) | (1, 1) | (2, 2) | |
| Stage2(Sequential) | 64 | 64 | (3, 3) | (2, 2) | (1, 1) | |
| RNN2(ConvLSTM) | 160 | 384 | (5, 5) | (1, 1) | (2, 2) | |
| Stage3(Sequential) | 96 | 96 | (3, 3) | (2, 2) | (1, 1) | |
| RNN3(ConvLSTM) | 192 | 384 | (5, 5) | (1, 1) | (2, 2) | |
| Decoder | RNN3(ConvLSTM) | 192 | 384 | (5, 5) | (1, 1) | (2, 2) |
| Stage3(Sequential) | 96 | 96 | (4, 4) | (2, 2) | (1, 1) | |
| RNN2(ConvLSTM) | 192 | 384 | (5, 5) | (1, 1) | (2, 2) | |
| Stage2(Sequential) | 96 | 96 | (4, 4) | (2, 2) | (1, 1) | |
| RNN1(ConvLSTM) | 160 | 256 | (5, 5) | (1, 1) | (2, 2) | |
| Stage1(Sequential) | 64 | 16 | (3, 3) | (1, 1) | (1, 1) | |
| Stage0(Conv) | 16 | 72 | (3, 3) | (1, 1) | (0, 0) | |
| Block | Module | Channel Number | Kernel size | Stride | Padding | |
|---|---|---|---|---|---|---|
| Input | Output | |||||
| Encoder | Stage1(Sequential) | 72 | 16 | (3, 3) | (1, 1) | (1, 1) |
| RNN1(ConvGRU) | 80 | 256 | (5, 5) | (1, 1) | (2, 2) | |
| Stage2(Sequential) | 64 | 64 | (3, 3) | (2, 2) | (1, 1) | |
| RNN2(ConvGRU) | 160 | 384 | (5, 5) | (1, 1) | (2, 2) | |
| Stage3(Sequential) | 96 | 96 | (3, 3) | (2, 2) | (1, 1) | |
| RNN3(ConvGRU) | 192 | 384 | (5, 5) | (1, 1) | (2, 2) | |
| Decoder | RNN3(ConvGRU) | 192 | 384 | (5, 5) | (1, 1) | (2, 2) |
| Stage3(Sequential) | 96 | 96 | (4, 4) | (2, 2) | (1, 1) | |
| RNN2(ConvGRU) | 192 | 384 | (5, 5) | (1, 1) | (2, 2) | |
| Stage2(Sequential) | 96 | 96 | (4, 4) | (2, 2) | (1, 1) | |
| RNN1(ConvGRU) | 160 | 256 | (5, 5) | (1, 1) | (2, 2) | |
| Stage1(Sequential) | 64 | 16 | (3, 3) | (1, 1) | (1, 1) | |
| Stage0(Conv) | 16 | 72 | (3, 3) | (1, 1) | (0, 0) | |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).