Submitted:
27 October 2024
Posted:
28 October 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Data and Method
2.1. Study Area and Monitoring Sites
2.2. Reference Data - Air Quality Monitoring Station PM2.5 Measuremets
2.3. PurpleAir Sensor Data – PAS Measurements
2.4. Modelling Framework
2.4.1. Data Preprocessing
2.4.2. Development of the Bayesian Optimized Surrogate Model for Data Correction (BaySurcls)
2.4.3. Performance analysis
3. Results and Discussion
3.1. PAS2.5 Data Correction Where Collocated Reference Monitoring Exists – Applying BaySurcls to the Collocation Monitoring Scenario
3.1.1. Overall Correction Efficiency
3.1.2. Comparing BaySurcls vs the US-EPA Method
3.2 PAS2.5 Data Correction Where No Collocated Reference Monitoring Exists – Applying BaySurcls to the Non-Collocation Monitoring Scenario
3.2.1. Overall Correction Efficiency
3.2.2. Further Discussion on Performance of BaySurfs vs the US-EPA Method


4. Summary and Conclusion
- The US-EPA method showed limited skills for correcting PAS data under both collocation and non-collocation monitoring scenarios, thus deemed inappropriate for common application in NSW.
- In contrast, the ML/DL methods tested in this study generally showed much superior performance over the US-EPA method at the tested locations.
- Among the tested algorithms, the hybrid model BaySurcls, featuring the application of deep learning (DL) algorithms in conjunction with a Bayesian-optimized surrogate model, showed good promise as a tool for improving PAS PM2.5 measurement accuracy.
- The BaySurcls and other ML/DL-based methods demonstrated high-level skills for correcting PAS data under the collocation scenario, and generally moderate-to high-level skills under the non-collocation scenario.
- The Performance of both ML/DL based methods and the US-EPA method could vary significantly across different air quality regions, indicating a need for further research in developing rigorous methods for correcting PAS data in NSW.
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Anenberg, S.C.; Belova, A.; Brandt, J.; Fann, N.; Greco, S.; Guttikunda, S.; Heroux, M.E.; Hurley, F.; Krzyzanowski, M.; Medina, S. Survey of ambient air pollution health risk assessment tools. Risk analysis 2016, 36, 1718–1736. [Google Scholar] [CrossRef]
- De Marco, A.; Proietti, C.; Anav, A.; Ciancarella, L.; D'Elia, I.; Fares, S.; Fornasier, M.F.; Fusaro, L.; Gualtieri, M.; Manes, F. Impacts of air pollution on human and ecosystem health, and implications for the National Emission Ceilings Directive: Insights from Italy. Environment International 2019, 125, 320–333. [Google Scholar] [CrossRef] [PubMed]
- Robinson, D.L. Accurate, low cost PM2. 5 measurements demonstrate the large spatial variation in wood smoke pollution in regional Australia and improve modeling and estimates of health costs. Atmosphere 2020, 11, 856. [Google Scholar] [CrossRef]
- Aini, Q.; Febriani, W.; Lukita, C.; Kosasi, S.; Rahardja, U. New normal regulation with face recognition technology using attendx for student attendance algorithm. In Proceedings of the 2022 International Conference on Science and Technology (ICOSTECH), 2022; pp. 1-7.
- Dominici, F.; Peng, R.D.; Zeger, S.L.; White, R.H.; Samet, J.M. Particulate air pollution and mortality in the United States: did the risks change from 1987 to 2000? American Journal of Epidemiology 2007, 166, 880–888. [Google Scholar] [CrossRef]
- Franklin, M.; Zeka, A.; Schwartz, J. Association between PM2. 5 and all-cause and specific-cause mortality in 27 US communities. Journal of exposure science & environmental epidemiology 2007, 17, 279–287. [Google Scholar]
- Di, Q.; Dai, L.; Wang, Y.; Zanobetti, A.; Choirat, C.; Schwartz, J.D.; Dominici, F. Association of short-term exposure to air pollution with mortality in older adults. Jama 2017, 318, 2446–2456. [Google Scholar] [CrossRef] [PubMed]
- Bell, M.L.; Ebisu, K.; Belanger, K. Ambient air pollution and low birth weight in Connecticut and Massachusetts. Environmental health perspectives 2007, 115, 1118–1124. [Google Scholar] [CrossRef]
- Grande, G.; Ljungman, P.L.; Eneroth, K.; Bellander, T.; Rizzuto, D. Association between cardiovascular disease and long-term exposure to air pollution with the risk of dementia. JAMA neurology 2020, 77, 801–809. [Google Scholar] [CrossRef]
- Barkjohn, K.K.; Gantt, B.; Clements, A.L. Development and application of a United States-wide correction for PM 2.5 data collected with the PurpleAir sensor. Atmospheric Measurement Techniques 2021, 14, 4617–4637. [Google Scholar] [CrossRef]
- Ahmed, A.A.M.; Jui, S.J.J.; Sharma, E.; Ahmed, M.H.; Raj, N.; Bose, A. An advanced deep learning predictive model for air quality index forecasting with remote satellite-derived hydro-climatological variables. Sci Total Environ 2024, 906, 167234. [Google Scholar] [CrossRef]
- Marshall, J.D.; Nethery, E.; Brauer, M. Within-urban variability in ambient air pollution: comparison of estimation methods. Atmospheric Environment 2008, 42, 1359–1369. [Google Scholar] [CrossRef]
- Tan, Y.; Lipsky, E.M.; Saleh, R.; Robinson, A.L.; Presto, A.A. Characterizing the spatial variation of air pollutants and the contributions of high emitting vehicles in Pittsburgh, PA. Environmental science & technology 2014, 48, 14186–14194. [Google Scholar]
- Zimmerman, N.; Presto, A.A.; Kumar, S.P.; Gu, J.; Hauryliuk, A.; Robinson, E.S.; Robinson, A.L.; Subramanian, R. Closing the gap on lower cost air quality monitoring: Machine learning calibration models to improve low-cost sensor performance. Atmos. Meas. Tech. Discuss 2017, 2017, 1–36. [Google Scholar]
- Rahardja, U.; Aini, Q.; Manongga, D.; Sembiring, I.; Sanjaya, Y.P.A. Enhancing machine learning with low-cost p m2. 5 air quality sensor calibration using image processing. APTISI Transactions on Management 2023, 7, 201–209. [Google Scholar]
- Lewis, A.; Edwards, P. Validate personal air-pollution sensors. Nature 2016, 535, 29–31. [Google Scholar] [CrossRef]
- McKercher, G.R.; Salmond, J.A.; Vanos, J.K. Characteristics and applications of small, portable gaseous air pollution monitors. Environmental Pollution 2017, 223, 102–110. [Google Scholar] [CrossRef] [PubMed]
- Moltchanov, S.; Levy, I.; Etzion, Y.; Lerner, U.; Broday, D.M.; Fishbain, B. On the feasibility of measuring urban air pollution by wireless distributed sensor networks. Science of The Total Environment 2015, 502, 537–547. [Google Scholar] [CrossRef]
- Snyder, E.G.; Watkins, T.H.; Solomon, P.A.; Thoma, E.D.; Williams, R.W.; Hagler, G.S.; Shelow, D.; Hindin, D.A.; Kilaru, V.J.; Preuss, P.W. The changing paradigm of air pollution monitoring. Environmental science & technology 2013, 47, 11369–11377. [Google Scholar]
- Jaffe, D.A.; Thompson, K.; Finley, B.; Nelson, M.; Ouimette, J.; Andrews, E. An evaluation of the US EPA's correction equation for PurpleAir sensor data in smoke, dust, and wintertime urban pollution events. Atmospheric Measurement Techniques 2023, 16, 1311–1322. [Google Scholar] [CrossRef]
- Jayaratne, R.; Liu, X.; Thai, P.; Dunbabin, M.; Morawska, L. The influence of humidity on the performance of a low-cost air particle mass sensor and the effect of atmospheric fog. Atmospheric Measurement Techniques 2018, 11, 4883–4890. [Google Scholar] [CrossRef]
- Zheng, T.; Bergin, M.H.; Johnson, K.K.; Tripathi, S.N.; Shirodkar, S.; Landis, M.S.; Sutaria, R.; Carlson, D.E. Field evaluation of low-cost particulate matter sensors in high-and low-concentration environments. Atmospheric Measurement Techniques 2018, 11, 4823–4846. [Google Scholar] [CrossRef]
- Masson, N.; Piedrahita, R.; Hannigan, M. Quantification method for electrolytic sensors in long-term monitoring of ambient air quality. Sensors 2015, 15, 27283–27302. [Google Scholar] [CrossRef] [PubMed]
- Pang, X.; Shaw, M.D.; Lewis, A.C.; Carpenter, L.J.; Batchellier, T. Electrochemical ozone sensors: A miniaturised alternative for ozone measurements in laboratory experiments and air-quality monitoring. Sensors and Actuators B: Chemical 2017, 240, 829–837. [Google Scholar] [CrossRef]
- Williams, D.E.; Henshaw, G.S.; Bart, M.; Laing, G.; Wagner, J.; Naisbitt, S.; Salmond, J.A. Validation of low-cost ozone measurement instruments suitable for use in an air-quality monitoring network. Measurement Science and Technology 2013, 24, 065803. [Google Scholar] [CrossRef]
- Tryner, J.; Mehaffy, J.; Miller-Lionberg, D.; Volckens, J. Effects of aerosol type and simulated aging on performance of low-cost PM sensors. Journal of Aerosol Science 2020, 150, 105654. [Google Scholar] [CrossRef]
- Ardon-Dryer, K.; Dryer, Y.; Williams, J.N.; Moghimi, N. Measurements of PM 2.5 with PurpleAir under atmospheric conditions. Atmospheric Measurement Techniques 2020, 13, 5441–5458. [Google Scholar] [CrossRef]
- Kelly, K.; Whitaker, J.; Petty, A.; Widmer, C.; Dybwad, A.; Sleeth, D.; Martin, R.; Butterfield, A. Ambient and laboratory evaluation of a low-cost particulate matter sensor. Environmental pollution 2017, 221, 491–500. [Google Scholar] [CrossRef]
- Malings, C.; Tanzer, R.; Hauryliuk, A.; Saha, P.K.; Robinson, A.L.; Presto, A.A.; Subramanian, R. Fine particle mass monitoring with low-cost sensors: Corrections and long-term performance evaluation. Aerosol Science and Technology 2020, 54, 160–174. [Google Scholar] [CrossRef]
- Magi, B.I.; Cupini, C.; Francis, J.; Green, M.; Hauser, C. Evaluation of PM2. 5 measured in an urban setting using a low-cost optical particle counter and a Federal Equivalent Method Beta Attenuation Monitor. Aerosol Science and Technology 2020, 54, 147–159. [Google Scholar] [CrossRef]
- Bi, J.; Wildani, A.; Chang, H.H.; Liu, Y. Incorporating low-cost sensor measurements into high-resolution PM2. 5 modeling at a large spatial scale. Environmental Science & Technology 2020, 54, 2152–2162. [Google Scholar]
- Feenstra, B.; Papapostolou, V.; Hasheminassab, S.; Zhang, H.; Der Boghossian, B.; Cocker, D.; Polidori, A. Performance evaluation of twelve low-cost PM2. 5 sensors at an ambient air monitoring site. Atmospheric Environment 2019, 216, 116946. [Google Scholar] [CrossRef]
- Mehadi, A.; Moosmüller, H.; Campbell, D.E.; Ham, W.; Schweizer, D.; Tarnay, L.; Hunter, J. Laboratory and field evaluation of real-time and near real-time PM2. 5 smoke monitors. Journal of the Air & Waste Management Association 2020, 70, 158–179. [Google Scholar]
- Schulte, N.; Li, X.; Ghosh, J.K.; Fine, P.M.; Epstein, S.A. Responsive high-resolution air quality index mapping using model, regulatory monitor, and sensor data in real-time. Environmental Research Letters 2020, 15, 1040a1047. [Google Scholar] [CrossRef]
- Lu, Y.; Giuliano, G.; Habre, R. Estimating hourly PM2. 5 concentrations at the neighborhood scale using a low-cost air sensor network: A Los Angeles case study. Environmental Research 2021, 195, 110653. [Google Scholar] [CrossRef] [PubMed]
- Kim, S.; Park, S.; Lee, J. Evaluation of performance of inexpensive laser based PM2. 5 sensor monitors for typical indoor and outdoor hotspots of South Korea. Applied Sciences 2019, 9, 1947. [Google Scholar] [CrossRef]
- Stavroulas, I.; Grivas, G.; Michalopoulos, P.; Liakakou, E.; Bougiatioti, A.; Kalkavouras, P.; Fameli, K.M.; Hatzianastassiou, N.; Mihalopoulos, N.; Gerasopoulos, E. Field evaluation of low-cost PM sensors (Purple Air PA-II) under variable urban air quality conditions, in Greece. Atmosphere 2020, 11, 926. [Google Scholar] [CrossRef]
- Dhammapala, R.; Basnayake, A.; Premasiri, S.; Chathuranga, L.; Mera, K. PM2. 5 in Sri Lanka: Trend analysis, low-cost sensor correlations and spatial distribution. Aerosol and Air Quality Research 2022, 22, 210266. [Google Scholar] [CrossRef]
- McFarlane, C.; Isevulambire, P.K.; Lumbuenamo, R.S.; Ndinga, A.M.E.; Dhammapala, R.; Jin, X.; McNeill, V.F.; Malings, C.; Subramanian, R.; Westervelt, D.M. First measurements of ambient PM2. 5 in Kinshasa, Democratic Republic of Congo and Brazzaville, Republic of Congo using field-calibrated low-cost sensors. Aerosol and Air Quality Research 2021, 21, 200619. [Google Scholar] [CrossRef]
- Chojer, H.; Branco, P.; Martins, F.; Alvim-Ferraz, M.; Sousa, S. Can data reliability of low-cost sensor devices for indoor air particulate matter monitoring be improved?–An approach using machine learning. Atmospheric Environment 2022, 286, 119251. [Google Scholar] [CrossRef]
- Yu, M.; Zhang, S.; Zhang, K.; Yin, J.; Varela, M.; Miao, J. Developing high-resolution PM2. 5 exposure models by integrating low-cost sensors, automated machine learning, and big human mobility data. Frontiers in Environmental Science 2023, 11, 1223160. [Google Scholar] [CrossRef]
- Kar, A.; Ahmed, M.; May, A.A.; Le, H.T. High spatio-temporal resolution predictions of PM2. 5 using low-cost sensor data. Atmospheric Environment 2024, 326, 120486. [Google Scholar] [CrossRef]
- Paton-Walsh, C.; Rayner, P.; Simmons, J.; Fiddes, S.L.; Schofield, R.; Bridgman, H.; Beaupark, S.; Broome, R.; Chambers, S.D.; Chang, L.T.-C. A clean air plan for Sydney: An overview of the special issue on air quality in New South Wales. Atmosphere 2019, 10, 774. [Google Scholar] [CrossRef]
- Barnett, A.G.; Williams, G.M.; Schwartz, J.; Best, T.L.; Neller, A.H.; Petroeschevsky, A.L.; Simpson, R.W. The effects of air pollution on hospitalizations for cardiovascular disease in elderly people in Australian and New Zealand cities. Environmental health perspectives 2006, 114, 1018–1023. [Google Scholar] [CrossRef]
- Cohen, D.D.; Stelcer, E.; Garton, D.; Crawford, J. Fine particle characterisation, source apportionment and long-range dust transport into the Sydney Basin: a long term study between 1998 and 2009. Atmospheric Pollution Research 2011, 2, 182–189. [Google Scholar] [CrossRef]
- Cope, M.; Keywood, M.; Emmerson, K.; Galbally, I.; Boast, K.; Chambers, S.; Cheng, M.; Crumeyrolle, S.; Dunne, E.; Fedele, R. Sydney particle study-stage-II; CSIRO Marine and Atmospheric Research: 2014.
- Barkjohn, K.K.; Gantt, B.; Clements, A.L. Development and Application of a United States wide correction for PM(2.5) data collected with the PurpleAir sensor. Atmos Meas Tech 2021, 4. [Google Scholar] [CrossRef]
- Ardon-Dryer, K.; Dryer, Y.; Williams, J.N.; Moghimi, N. Measurements of PM2.5 with PurpleAir under atmospheric conditions. Atmospheric Measurement Techniques 2020, 13, 5441–5458. [Google Scholar] [CrossRef]
- Magi, B.I.; Cupini, C.; Francis, J.; Green, M.; Hauser, C. Evaluation of PM2.5 measured in an urban setting using a low-cost optical particle counter and a Federal Equivalent Method Beta Attenuation Monitor. Aerosol Science and Technology 2019, 54, 147–159. [Google Scholar] [CrossRef]
- Si, M.; Xiong, Y.; Du, S.; Du, K. Evaluation and calibration of a low-cost particle sensor in ambient conditions using machine-learning methods. Atmospheric Measurement Techniques 2020, 13, 1693–1707. [Google Scholar] [CrossRef]
- Masrur Ahmed, A.A.; Akther, S.; Nguyen-Huy, T.; Raj, N.; Janifer Jabin Jui, S.; Farzana, S.Z. Real-time prediction of the week-ahead flood index using hybrid deep learning algorithms with synoptic climate mode indices. Journal of Hydro-environment Research 2024, 57, 12–26. [Google Scholar] [CrossRef]
- Akbari Asanjan, A.; Yang, T.; Hsu, K.; Sorooshian, S.; Lin, J.; Peng, Q. Short-Term Precipitation Forecast Based on the PERSIANN System and LSTM Recurrent Neural Networks. Journal of Geophysical Research: Atmospheres 2018, 123. [Google Scholar] [CrossRef]
- Aungiers, J. Time Series Prediction Using LSTM Deep Neural Networks. 2018.
- Huang, C.J.; Kuo, P.H. A Deep CNN-LSTM Model for Particulate Matter (PM2.5) Forecasting in Smart Cities. Sensors (Basel) 2018, 18. [Google Scholar] [CrossRef] [PubMed]
- Wang, J.; Li, X.; Jin, L.; Li, J.; Sun, Q.; Wang, H. An air quality index prediction model based on CNN-ILSTM. Sci Rep 2022, 12, 8373. [Google Scholar] [CrossRef] [PubMed]
- Jui, S.J.J.; Ahmed, A.A.M.; Bose, A.; Raj, N.; Sharma, E.; Soar, J.; Chowdhury, M.W.I. Spatiotemporal Hybrid Random Forest Model for Tea Yield Prediction Using Satellite-Derived Variables. Remote Sensing 2022, 14. [Google Scholar] [CrossRef]
- Hochreiter, S. Long Short-term Memory. Neural Computation MIT-Press 1997.
- Beretta, L.; Santaniello, A. Nearest neighbor imputation algorithms: a critical evaluation. BMC Med Inform Decis Mak 2016, 16 Suppl 3, 74. [Google Scholar] [CrossRef]
- Cabello-Solorzano, K.; Ortigosa de Araujo, I.; Peña, M.; Correia, L.; J. Tallón-Ballesteros, A. The impact of data normalization on the accuracy of machine learning algorithms: a comparative analysis. In Proceedings of the International Conference on Soft Computing Models in Industrial and Environmental Applications, 2023; pp. 344-353.
- Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, 2016; pp. 785-794.
- Snoek, J.; Larochelle, H.; Adams, R.P. Practical bayesian optimization of machine learning algorithms. Advances in neural information processing systems 2012, 25. [Google Scholar]
- Snoek, J.; Rippel, O.; Swersky, K.; Kiros, R.; Satish, N.; Sundaram, N.; Patwary, M.; Prabhat, M.; Adams, R. Scalable bayesian optimization using deep neural networks. In Proceedings of the International conference on machine learning, 2015; pp. 2171-2180.
- Frazier, P.I. Bayesian optimization. In Recent advances in optimization and modeling of contemporary problems; Informs: 2018; pp. 255-278.









| Region Name | AQMS Name | Latitude | Longitude | PAS ID (online) |
PAS Name | Data Period |
|---|---|---|---|---|---|---|
| Riverina-Murray | Wagga Wagga North | 29959 | WAG_S1 | 01/03/2020 – 16/07/2024 | ||
| -35.10411 | 147.36037 | 29945 | WAG2 | 01/03/2020 – 16/07/2024 | ||
| 30007 | WAG3 | 01/03/2020 – 16/07/2024 | ||||
| Northern Tablelands | Armidale | -30.50851 | 151.66173 | 29949 | ARM1 | 01/03/2020 – 16/07/2024 |
| Sydney | Lidcombe | 91721 | LID1 | 08/02/2021 - 16/07/2024 | ||
| -33.88143 | 151.04676 | 92367 | LID2 | 15/02/2021 - 16/07/2024 | ||
| 91355 | LID3 | 15/02/2021 - 16/07/2024 | ||||
| Central Tablelands | Bathurst | -33.40178 | 149.57456 | 98435 | BAT1 | 28/05/2021 – 16/07/2024 |
| Millthorpe | -33.444339 | 149.185325 | 182853 | MIL1 | 26/07/2023 – 30/07/2024 | |
| Lower Hunter | Newcastle | -32.9312 | 151.75965 | 191067 | NEW1 | 15/11/2023 – 16/07/2024 |
| 191081 | NEW2 | 15/11/2023 – 16/07/2024 | ||||
| Upper Hunter | Merriwa | -32.12665 | 150.45824 | 92479 | MER1 | 09/02/2021 – 30/07/2024 |
| Ratio Name | Calculation | Significance |
|---|---|---|
| Coarse Aerosol Fraction (CAF) | CAF = (PM10 −PM2.5)/PM10 | High CAF (> 0.5): coarse particles, likely dust. Low CAF (< 0.5): fine particles, typical of smoke or urban pollution. |
| Mass Ratio (MR) | MR = PM1/PM10 | High MR (> 0.5): More ultrafine particles, likely from smoke or combustion. Low MR (< 0.5): More large particles, typical of dust. |
| Particle Count Ratio (PCR) | PCR = 0.3 µm /5µm | High PCR (> 500): Dominance of fine particles, indicating smoke. Low PCR (< 500): More coarse particles, suggesting dust. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).