Submitted:
24 June 2025
Posted:
26 June 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Input Data Collection and Preprocessing
2.1. Observational Meteorological and Air Quality Data
2.2. Numerical Models
2.3. Data Preprocessing and Data Set

| CMAQ v5.0.2 Module | Option |
|---|---|
| Horizontal advection (ModHadv) | Hyamo |
| Vertical advection (ModVadv) | YAMO |
| Horizontal diffusion (ModHdiff) | Multiscale |
| Vertical diffusion (ModVdiff) | Eddy |
| Aerosol module (ModAero) | AERO5 |
| Gas-phase chemistry solver (ModChem) | EBI |
| Deposition velocity calculation (ModDepv) | M3dry |
| Cloud module (ModCloud) | RADM |
| Gas-phase chemistry mechanism (Mechanism) | SAPRC99 |
| Integrated Model Name | T3V19 | T5V21 | T6V22 |
|---|---|---|---|
| Target Region |
Seoul | ||
| Training Period | 2016 ~ 2018 (3 years) | 2016 ~ 2020 (5 years) | 2016 ~ 2021 (6 years) |
| Evaluation Period | 2019 (1 year) | 2021 (1 year) | 2022 (1 year) |
| Training Data number |
450 million | 760 million | 910 million |
| Forecasting period | January 2023 ~ March 2023 (3 months) | ||
3. Description of LSTM Model
3.1. LSTM Model for Forecasting PM2.5 Concentration
3.2. Model Evaluation
4. Results
5. Concluding Remarks
Acknowledgments
Conflicts of Interest
References
- Bakels, L.; Tatsii, D.; Tipka, A.; Thompson, R.; Dütsch, M.; Blaschek, M.; Seibert, P.; Baier, K.; Bucci, S.; Cassiani, M.; et al. FLEXPART version 11: improved accuracy, efficiency, and flexibility. Geosci. Model Dev. 2024, 17, 7595–7627. [Google Scholar] [CrossRef]
- Baklanov, A.; Zhang, Y. Advances in air quality modeling and forecasting. Glob. Transitions 2020, 2, 261–270. [Google Scholar] [CrossRef]
- Baklanov, A.; Schlünzen, K.; Suppan, P.; Baldasano, J.; Brunner, D.; Aksoyoglu, S.; Carmichael, G.; Douros, J.; Flemming, J.; Forkel, R.; et al. Online coupled regional meteorology chemistry models in Europe: current status and prospects. Atmospheric Meas. Tech. 2014, 14, 317–398. [Google Scholar] [CrossRef]
- Chang, A. 2020. Cleaning and disinfectant chemical exposures and temporal associations with COVID-19—National poison data system, United States, January 1, 2020–March 31, 2020. MMWR. Morbidity and Mortality Weekly Report, 69.
- Chen, G.; Chen, S.; Li, D.; Chen, C. A hybrid deep learning air pollution prediction approach based on neighborhood selection and spatio-temporal attention. Sci. Rep. 2025, 15, 1–20. [Google Scholar] [CrossRef]
- Chen, Y.; Huang, L.; Xie, X.; Liu, Z.; Hu, J. Improved prediction of hourly PM2.5 concentrations with a long short-term memory and spatio-temporal causal convolutional network deep learning model. Sci. Total. Environ. 2023, 912, 168672. [Google Scholar] [CrossRef] [PubMed]
- Cheng, F.-Y.; Feng, C.-Y.; Yang, Z.-M.; Hsu, C.-H.; Chan, K.-W.; Lee, C.-Y.; Chang, S.-C. Evaluation of real-time PM2.5 forecasts with the WRF-CMAQ modeling system and weather-pattern-dependent bias-adjusted PM2.5 forecasts in Taiwan. Atmospheric Environ. 2021, 244. [Google Scholar] [CrossRef]
- Albuquerque, T.T.d.A.; Andrade, M.d.F.; Ynoue, R.Y.; Moreira, D.M.; Andreão, W.L.; dos Santos, F.S.; Nascimento, E.G.S. WRF-SMOKE-CMAQ modeling system for air quality evaluation in São Paulo megacity with a 2008 experimental campaign data. Environ. Sci. Pollut. Res. 2018, 25, 36555–36569. [Google Scholar] [CrossRef]
- Do, K.; Mahish, M.; Yeganeh, A.K.; Gao, Z.; Blanchard, C.L.; Ivey, C.E. Emerging investigator series: a machine learning approach to quantify the impact of meteorology on tropospheric ozone in the inland southern California. Environ. Sci. Atmos. 2023, 3, 1159–1173. [Google Scholar] [CrossRef]
- Emery, C.; Liu, Z.; Russell, A.G.; Odman, M.T.; Yarwood, G.; Kumar, N. Recommendations on statistics and benchmarks to assess photochemical model performance. J. Air Waste Manag. Assoc. 2017, 67, 582–598. [Google Scholar] [CrossRef]
- Ferreira, J.; Lopes, D.; Rafael, S.; Relvas, H.; Almeida, S.M.; Miranda, A.I. Modelling air quality levels of regulated metals: limitations and challenges. Environ. Sci. Pollut. Res. 2020, 27, 33916–33928. [Google Scholar] [CrossRef]
- Flemming, J.; Benedetti, A.; Inness, A.; Engelen, R.J.; Jones, L.; Huijnen, V.; Remy, S.; Parrington, M.; Suttie, M.; Bozzo, A.; et al. The CAMS interim Reanalysis of Carbon Monoxide, Ozone and Aerosol for 2003–2015. Atmos. Chem. Phys. 2017, 17, 1945–1983. [Google Scholar] [CrossRef]
- FLEXPART version 11 : improved accuracy, efficiency, and flexibility (2024).
- Fujino, R. and Miyamoto, Y., 2022. PM2.5 decrease with precipitation as revealed by single-point ground-based observation. Atmospheric Science Letters, 23(7), p.e1088.
- Gao, X. and Li, W., 2021. A graph-based LSTM model for PM2. 5 forecasting. Atmospheric Pollution Research, 12(9), p.101150. Yang, J. and Zhao, Y., 2023. Performance and application of air quality models on ozone simulation in China–A review. Atmospheric Environment, 293, p.119446.
- Gao, Z.; Zhou, X. A review of the CAMx, CMAQ, WRF-Chem and NAQPMS models: Application, evaluation and uncertainty factors. Environ. Pollut. 2023, 343, 123183. [Google Scholar] [CrossRef] [PubMed]
- Grell, G.A. , Peckham, S.E., Schmitz, R., McKeen, S.A., Frost, G., Skamarock, W.C. and Eder, B., 2005. Fully coupled “online” chemistry within the WRF model. Atmospheric environment, 39(37), pp.6957-6975.
- Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997).
- Huang, L. , Zhu, Y., Zhai, H., Xue, S., Zhu, T., Shao, Y., Liu, Z., Emery, C., Yarwood, G., Wang, Y. and Fu, J., 2021. Recommendations on benchmarks for numerical air quality model applications in China–Part 1: PM 2.5 and chemical species. Atmospheric Chemistry and Physics, 21(4), pp.2725-2743.
- Jeong, D.; Yoo, C.; Yeh, S.-W.; Yoon, J.-H.; Lee, D.; Lee, J.-B.; Choi, J.-Y. Statistical Seasonal Forecasting of Winter and Spring PM2.5 Concentrations Over the Korean Peninsula. Asia-Pacific J. Atmospheric Sci. 2022, 58, 549–561. [Google Scholar] [CrossRef]
- Karimian, H.; Li, Q.; Wu, C.; Qi, Y.; Mo, Y.; Chen, G.; Zhang, X.; Sachdeva, S. Evaluation of Different Machine Learning Approaches to Forecasting PM2.5 Mass Concentrations. Aerosol Air Qual. Res. 2019, 19, 1400–1410. [Google Scholar] [CrossRef]
- Kumar, N.; Park, R.J.; Jeong, J.I.; Woo, J.-H.; Kim, Y.; Johnson, J.; Yarwood, G.; Kang, S.; Chun, S.; Knipping, E. Contributions of international sources to PM2.5 in South Korea. Atmospheric Environ. 2021, 261. [Google Scholar] [CrossRef]
- Li, N.; Zhang, H.; Zhu, S.; Liao, H.; Hu, J.; Tang, K.; Feng, W.; Zhang, R.; Shi, C.; Xu, H.; et al. Secondary PM2.5 dominates aerosol pollution in the Yangtze River Delta region: Environmental and health effects of the Clean air Plan. Environ. Int. 2022, 171, 107725. [Google Scholar] [CrossRef]
- Li, X.; Peng, L.; Yao, X.; Cui, S.; Hu, Y.; You, C.; Chi, T. Long short-term memory neural network for air pollutant concentration predictions: Method development and evaluation. Environ. Pollut. 2017, 231, 997–1004. [Google Scholar] [CrossRef] [PubMed]
- Li, Z. , Zhang, X., Liu, X. and Yu, B., 2022. PM2.5 pollution in six major Chinese urban agglomerations: spatiotemporal variations, health impacts, and the relationships with meteorological conditions. Atmosphere, 13(10), p.1696.
- Li, Z. , Zhang, X., Liu, X. and Yu, B., 2022. PM2.5 pollution in six major Chinese urban agglomerations: spatiotemporal variations, health impacts, and the relationships with meteorological conditions. Atmosphere, 13(10), p.1696.
- Ma, X.; Sha, T.; Wang, J.; Jia, H.; Tian, R. Investigating impact of emission inventories on PM2.5 simulations over North China Plain by WRF-Chem. Atmospheric Environ. 2018, 195, 125–140. [Google Scholar] [CrossRef]
- Park et al., Variation of PM2.5 Chemical Compositions and their Contributions to light extinction in Seoul, Aerosol and Air Quality Research, 18, 2220-2229, 2018.
- Park, E.H. , Heo, J., Hirakura, S., Hashizume, M., Deng, F., Kim, H. and Yi, S.M., 2018. Characteristics of PM 2.5 and its chemical constituents in Beijing, Seoul, and Nagasaki. Air Quality, Atmosphere & Health, 11, pp.1167-1178.
- Qadeer, K. , Rehman, W.U., Sheri, A.M., Park, I., Kim, H.K. and Jeon, M., 2020. A long short-term memory (LSTM) network for hourly estimation of PM2.5 concentration in two cities of South Korea. Applied Sciences, 10(11), p.3984.
- Retrieving ground-level PM2.5 concentrations in China (2013-2021) with a numerical-model-informed testbed to mitigate sample-imbalance-induced biases.
- Reyes-Villegas, E.; Lowe, D.; Johnson, J.S.; Carslaw, K.S.; Darbyshire, E.; Flynn, M.; Allan, J.D.; Coe, H.; Chen, Y.; Wild, O.; et al. Simulating organic aerosol in Delhi with WRF-Chem using the volatility-basis-set approach: exploring model uncertainty with a Gaussian process emulator. Atmospheric Meas. Tech. 2023, 23, 5763–5782. [Google Scholar] [CrossRef]
- Rybarczyk and Zalkeviciute, Machine Learning Approaches for Outdoor Air Quality Modelling: A Systematic Review (2018).
- Skamarock, W.C. , Klemp, J.B., Dudhia, J., Gill, D.O., Barker, D.M., Duda, M.G., Huang, X.Y., Wang, W. and Powers, J.G., 2008. A description of the advanced research WRF version 3. NCAR technical note, 475(125), pp.10-5065.
- Skamarock, W.C. , Klemp, J.B., Dudhia, J., Gill, D.O., Liu, Z., Berner, J., Wang, W., Powers, J.G., Duda, M.G., Barker, D.M. and Huang, X.Y., 2019. A description of the advanced research WRF model version 4. National Center for Atmospheric Research: Boulder, CO, USA, 145(145), p.550.
- Sokhi, R.S.; Moussiopoulos, N.; Baklanov, A.; Bartzis, J.; Coll, I.; Finardi, S.; Friedrich, R.; Geels, C.; Grönholm, T.; Halenka, T.; et al. Advances in air quality research – current and emerging challenges. Atmospheric Meas. Tech. 2022, 22, 4615–4703. [Google Scholar] [CrossRef]
- Statistical Seasonal Forecasting of Winter and Spring PM2.5 Concentrations over the Korean Peninsula, 2022, 58.
- Tao, H.; Xing, J.; Zhou, H.; Pleim, J.; Ran, L.; Chang, X.; Wang, S.; Chen, F.; Zheng, H.; Li, J. Impacts of improved modeling resolution on the simulation of meteorology, air quality, and human exposure to PM2.5, O3 in Beijing, China. J. Clean. Prod. 2020, 243. [Google Scholar] [CrossRef]
- Travis, K.R. , Crawford, J.H., Chen, G., Jordan, C.E., Nault, B.A., Kim, H., Jimenez, J.L., Campuzano-Jost, P., Dibb, J.E., Woo, J.H. and Kim, Y., 2022. Limitations in representation of physical processes prevent successful simulation of PM 2.5 during KORUS-AQ. Atmospheric Chemistry and Physics, 22(12), pp.7933-7958.
- Vignesh, P.P. , Jiang, J.H. and Kishore, P., 2023. Predicting PM2. 5 concentrations across USA using machine learning. Earth and Space Science, 10(10), p.e2023EA002911.
- Wang, P.; Qiao, X.; Zhang, H. Modeling PM2.5 and O3 with aerosol feedbacks using WRF/Chem over the Sichuan Basin, southwestern China. Chemosphere 2020, 254, 126735. [Google Scholar] [CrossRef] [PubMed]
- WHO Ambient Air Pollution: A Global Assessment of Exposure and Burden of Disease, 9789241511353 (2016) https://www.who.int/publications/i/item/9789241511353.
- Y. S. Kim, G. H. Y. S. Kim, G. H. Kwak, K. D. Lee, S. I. Na, C. W. Park, & N. W. Park: Performance Evaluation of Machine Learning and Deep Learning Algorithms in Crop Classification: Impact of Hyper-parameters and Training Sample Size. Korean Journal of Remote Sensing, 2018, 34(5), 811~827pp.
- Yu, M.; Cai, X.; Song, Y.; Wang, X. A fast forecasting method for PM2.5 concentrations based on footprint modeling and emission optimization. Atmospheric Environ. 2019, 219. [Google Scholar] [CrossRef]
- Zhang, H. , Wang, J., García, L.C., Zhou, M., Ge, C., Plessel, T., Szykman, J., Levy, R.C., Murphy, B. and Spero, T.L., 2022. Improving surface PM2. 5 forecasts in the United States using an ensemble of chemical transport model outputs: 2. Bias correction with satellite data for rural areas. Journal of Geophysical Research: Atmospheres, 127(1), p.e2021JD035563.
- Zhou, L.; Sun, L.; Luo, Y.; Xia, X.; Huang, L.; Liao, Z.; Yan, X. Air pollutant concentration trends in China: correlations between solar radiation, PM2.5, and O3. Air Qual. Atmosphere Heal. 2023, 16, 1721–1735. [Google Scholar] [CrossRef]





| Data Type | Variable | |
|---|---|---|
| Observation | Meteorological Variables (surface) |
Temperature (K), Dew point (K), Wind component (U, V; m/s), Sea level pressure (hPa), Precipitation (mm), Solar radiation (W/m2) |
| Air Quality variables | PM10 (μg/m3), PM2.5 (μg/m3), O3 (ppm), NO2 (ppm), CO (ppm), SO2 (ppm) | |
| Numerical data | Meteorological Variables (surface) |
Temperature (K), Dew point (K), Wind component (U, V; m/s), Sea level pressure (hPa), Precipitation (mm), Solar radiation (W/m2) |
| Meteorological Variables (925 hPa, 850 hPa, 700 hPa, 500 hPa) |
Temperature (K), , Relative humidity (%), Wind component (U, V; m/s), Geopotential height (m) | |
| Air Quality variables | PM10 (μg/m3), PM2.5 (μg/m3), O3 (ppm), NO2 (ppm), CO (ppm), SO2 (ppm) | |
| Back trajectory pattern | Local wind group, Northwestern wind group, Western wind group, Northern wind group, Short flow | |
| • Cosine Similarity (1000 hPa, 925 hPa, 850 hPa, 700 hPa, 500 hPa, 300 hPa) |
Temperature (K), Relative humidity (%), Wind components (U, V; m/s), Geopotential height (m) | |
| Forecast Lead Time | Models | R | NMB (%) | NME (%) | RMSE | IOA |
|---|---|---|---|---|---|---|
| D+0 | CMAQ | 0.69 | 21.0 | 45.3 | 17.5 | 0.78 |
| T3V19 | 0.92 | 26.4 | 30.5 | 10.1 | 0.91 | |
| T5V21 | 0.94 | 8.6 | 16.7 | 6.3 | 0.96 | |
| T6V22 | 0.94 | 3.7 | 14.8 | 5.7 | 0.97 | |
| D+1 | CMAQ | 0.79 | 31.9 | 43.2 | 16.1 | 0.82 |
| T3V19 | 0.86 | 23.8 | 32.1 | 11.1 | 0.88 | |
| T5V21 | 0.83 | 2.9 | 24.1 | 9.3 | 0.90 | |
| T6V22 | 0.85 | 1.3 | 22.6 | 8.8 | 0.91 | |
| D+2 | CMAQ | 0.84 | 31.0 | 39.9 | 15.7 | 0.84 |
| T3V19 | 0.86 | 24.4 | 31.3 | 11.1 | 0.88 | |
| T5V21 | 0.89 | 4.4 | 20.6 | 7.8 | 0.93 | |
| T6V22 | 0.89 | -0.2 | 19.2 | 7.6 | 0.93 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).