Submitted:
18 February 2025
Posted:
19 February 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
- A comparative evaluation of one-class classification algorithms and two-class models is conducted to determine their suitability for predicting wildfire risk using two fire incidence datasets.
- Shapley values [11] are used to interpret feature importance within one-class ML models, providing explainability and insights into the factors influencing wildfire predictions.
- A novel architecture for a web-based wildfire prediction tool is proposed, operationalizing the best-performing one-class ML model via a REST API.
2. Background
3. Data set
4. Methodology
5. Results
5.1. One-class Machine Learning Model Results
5.2. Two-Class Machine Learning Outcomes for the Same Ground-Truth Data
5.3. Feature Importance Derived Using Shapley Values
6. Deployment of Machine Learning Models
- The publicly available free Open Topo Data elevation REST API which gives elevation data of any location when latitude and longitude are given.
- The OpenWeather REST API provides historical, current and forecasted weather details through REST APIs for any point on the world.
- USGS Earth Explorer Website hosts LFMC data as different vegetation indexes carry a file format of all locations of California based on a MODIS grid.
- Choose some historical wildfire events to train the ML models and validate the model output. Users can also alter the input parameters and analyse and explore the most important features of the ML models.
- Select any location in California and Western Australia using the map, manually enter the input features, and use a probability to predict wildfire susceptibility.
- Search all input features for the next seven days.
- View historical yearly wildfire heat-maps based on ML model training and testing data.
7. Web-Based Prototype Evaluation
8. Threats to Validity and Limitations
9. Conclusion and Future Work
Abbreviations
| ML | Machine Learning |
| API | Application Programming Interface |
| AI | Artificial Intelligence |
| SVM | Support Vector Machine |
| IF | Isolation Forest |
| AE | AutoEncoder |
| VAE | Variational AutoEncoder |
| DeepSVD | Deep Support Vector Data Description |
| ALAD | Adversarially Learned Anomaly Detection |
| CV | Cross-Validation |
| OCSVM | One Class Support Vector Machine |
| ANN | Artificial Neural Networks |
| NOAA | National Oceanic and Atmospheric Administration |
| MODIS | Moderate Resolution Imaging Spectroradiometer |
| LFMC | Live Fuel Moisture Content |
| GCP | Google Cloud Platform |
| NZD | New Zealand Dollars |
| LLM | Large Language Model |
References
- Tyukavina, A.; Potapov, P.; Hansen, M.C.; Pickens, A.H.; Stehman, S.V.; Turubanova, S.; Parker, D.; Zalles, V.; Lima, A.; Kommareddy, I.; et al. Global Trends of Forest Loss Due to Fire From 2001 to 2019. Frontiers in Remote Sensing 2022, 3. [Google Scholar] [CrossRef]
- Bowman, D.M.J.S.; Williamson, G.J.; Price, O.F.; Ndalila, M.N.; Bradstock, R.A. Australian forests, megafires and the risk of dwindling carbon stocks. Plant Cell Environ. 2021, 44, 347–355. [Google Scholar] [PubMed]
- Reisen, F.; Duran, S.M.; Flannigan, M.; Elliott, C.; Rideout, K. Wildfire smoke and public health risk. International Journal of Wildland Fire 2015, 24, 1029–1044. [Google Scholar] [CrossRef]
- Jain, P.; Coogan, S.C.; Subramanian, S.G.; Crowley, M.; Taylor, S.; Flannigan, M.D. A review of machine learning applications in wildfire science and management. Environmental Reviews 2020, 28, 478–505. [Google Scholar] [CrossRef]
- European Centre for Medium-Range Weather Forecasts. 2024 was the warmest year on record, Copernicus data show. https://www.ecmwf.int/en/about/media-centre/news/2025/2024-was-warmest-year-record-copernicus-data-show. Accessed: 2025-1-20.
- Miller, C.; Hilton, J.; Sullivan, A.; Prakash, M. SPARK – A Bushfire Spread Prediction Tool. In Proceedings of the Environmental Software Systems. Infrastructures, Services and Applications, Cham; Denzer, R., Argent, R.M., Schimak, G., Hřebíček, J., Eds.; 2015; pp. 262–271. [Google Scholar] [CrossRef]
- Chandola, V.; Banerjee, A.; Kumar, V. Anomaly detection: A survey. ACM Comput. Surv. 2009, 41. [Google Scholar] [CrossRef]
- Khan, S.S.; Madden, M.G. One-Class Classification: Taxonomy of Study and Review of Techniques. The Knowledge Engineering Review 2014, 29, 345–374. [Google Scholar] [CrossRef]
- Nguyen, P.T.; Nguyen, T.T.; Nguyen, N.C.; Le, T.T. Multiclass breast cancer classification using convolutional neural network. In Proceedings of the 2019 International symposium on electrical and electronics engineering (ISEE). IEEE; 2019; pp. 130–134. [Google Scholar]
- Ismail, F.N.; Woodford, B.J.; Licorish, S.A.; Miller, A.D. An assessment of existing wildfire danger indices in comparison to one-class machine learning models. Natural Hazards 2024, 120, 14837–14868. [Google Scholar] [CrossRef]
- Lundberg, S.M.; Lee, S.I. A unified approach to interpreting model predictions. In Proceedings of the Proceedings of the 31st International Conference on Neural Information Processing Systems, Red Hook, NY, USA, 2017; p. 4768–4777. [CrossRef]
- Ismail, F.N. Novel machine learning approaches for wildfire prediction to overcome the drawbacks of equation-based forecasting. PhD dissertation, University of Otago, 2022.
- Ismail, F.N.; Sengupta, A.; Woodford, B.J.; Licorish, S.A. A Comparison of One-Class Versus Two-Class Machine Learning Models for Wildfire Prediction in California. Data Science and Machine Learning. AusDM 2023 2024, pp. 239–253.
- Ismail, F.N.; Amarasoma, S. One-class Classification-Based Machine Learning Model for Estimating the Probability of Wildfire Risk. Procedia Computer Science 2023, 222, 341–352, International Neural Network SocietyWorkshop on Deep Learning Innovations and Applications (INNS DLIA 2023). [Google Scholar] [CrossRef]
- Cortes, C.; Vapnik, V. Support-Vector Networks. Machine learning 1995, 20, 273–297. [Google Scholar] [CrossRef]
- Tax, D.M.; Duin, R.P. Support vector domain description. Pattern recognition letters 1999, 20, 1191–1199. [Google Scholar] [CrossRef]
- Liu, F.T.; Ting, K.M.; Zhou, Z. Isolation Forest. In Proceedings of the 2008 Eighth IEEE International Conference on Data Mining, Piscataway, NJ; 2008; pp. 413–422. [Google Scholar] [CrossRef]
- Alkhatib, R.; Sahwan, W.; Alkhatieb, A.; Schütt, B. A Brief Review of Machine Learning Algorithms in Forest Fires Science. Applied Sciences 2023, 13. [Google Scholar] [CrossRef]
- Ntinopoulos, N.; Sakellariou, S.; Christopoulou, O.; Sfougaris, A. Fusion of Remotely-Sensed Fire-Related Indices for Wildfire Prediction through the Contribution of Artificial Intelligence. Sustainability 2023, 15. [Google Scholar] [CrossRef]
- Patterson, J.; Gibson, A. Deep learning: A practitioner’s approach; O’Reilly Media, Inc.: Sebastopol, CA, 2017. [Google Scholar]
- Ruff, L.; Vandermeulen, R.; Goernitz, N.; Deecke, L.; Siddiqui, S.A.; Binder, A.; Müller, E.; Kloft, M. Deep One-Class Classification. In Proceedings of the Proceedings of the 35th International Conference on Machine Learning; Dy, J.; Krause, A., Eds., 2018, Vol. 80, Proceedings of Machine Learning Research, pp. 4393–4402.
- Kim, S.; Choi, Y.; Lee, M. Deep learning with support vector data description. Neurocomputing 2015, 165, 111–117. [Google Scholar] [CrossRef]
- Zenati, H.; Romain, M.; Foo, C.S.; Lecouat, B.; Chandrasekhar, V. Adversarially learned anomaly detection. In Proceedings of the 2018 IEEE International Conference on Data Mining (ICDM). IEEE Press; 2018; pp. 727–736. [Google Scholar] [CrossRef]
- Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research 2011, 12, 2825–2830. [Google Scholar] [CrossRef]
- Zhao, Y.; Nasrullah, Z.; Li, Z. PyOD: A Python Toolbox for Scalable Outlier Detection. Journal of Machine Learning Research 2019, 20, 1–7. [Google Scholar]
- Abdollahi, A.; Pradhan, B. Explainable artificial intelligence (XAI) for interpreting the contributing factors feed into the wildfire susceptibility prediction model. Science of The Total Environment 2023, 879, 163004. [Google Scholar] [CrossRef] [PubMed]
- Tonini, M.; D’Andrea, M.; Biondi, G.; Degli Esposti, S.; Trucchia, A.; Fiorucci, P. A Machine Learning-Based Approach for Wildfire Susceptibility Mapping. The Case Study of the Liguria Region in Italy. Geosciences 2020, 10, 105. [Google Scholar] [CrossRef]
- Jaafari, A.; Pourghasemi, H.R. 28 - Factors Influencing Regional-Scale Wildfire Probability in Iran: An Application of Random Forest and Support Vector Machine. In Spatial Modeling in GIS and R for Earth and Environmental Sciences; Pourghasemi, H.R.; Gokceoglu, C., Eds.; 2019; pp. 607–619. [CrossRef]
- Donovan, G.H.; Prestemon, J.P.; Gebert, K. The Effect of Newspaper Coverage and Political Pressure on Wildfire Suppression Costs. Society & Natural Resources 2011, 24, 785–798. [Google Scholar] [CrossRef]
- Jiménez-Ruano, A.; Mimbrero, M.R.; de la Riva Fernández, J. Understanding wildfires in mainland Spain. A comprehensive analysis of fire regime features in a climate-human context. Applied Geography 2017, 89, 100–111. [Google Scholar] [CrossRef]
- Papadopoulos, A.; Paschalidou, A.; Kassomenos, P.; McGregor, G. On the association between synoptic circulation and wildfires in the Eastern Mediterranean. Theoretical and Applied Climatology 2014, 115, 483–501. [Google Scholar] [CrossRef]
- Nunes, A.; Lourenço, L.; Meira Castro, A.C. Exploring spatial patterns and drivers of forest fires in Portugal (1980–2014). Science of The Total Environment 2016, 573, 1190–1202. [Google Scholar] [CrossRef] [PubMed]
- Stein, M.L. Interpolation of Spatial Data: Some Theory for Kriging; Springer Series in Statistics; Springer-Verlag: New York, 1999. [Google Scholar] [CrossRef]
- Tadić, J.M.; Ilić, V.; Biraud, S. Examination of geostatistical and machine-learning techniques as interpolators in anisotropic atmospheric environments. Atmospheric Environment 2015, 111, 28–38. [Google Scholar] [CrossRef]
- Bergstra, J.; Yamins, D.; Cox, D.D. Making a Science of Model Search: Hyperparameter Optimization in Hundreds of Dimensions for Vision Architectures. In Proceedings of the Proceedings of the 30th International Conference on International Conference on Machine Learning - Volume 28, Atlanta, GA, USA, 2013ICML’13, p. I–115–I–123. [CrossRef]
- Tien Bui, D.; Bui, Q.T.; Nguyen, Q.P.; Pradhan, B.; Nampak, H.; Trinh, P.T. A hybrid artificial intelligence approach using GIS-based neural-fuzzy inference system and particle swarm optimization for forest fire susceptibility modeling at a tropical area. Agricultural and Forest Meteorology 2017, 233, 32–44. [Google Scholar] [CrossRef]
- Sayad, Y.O.; Mousannif, H.; Al Moatassime, H. Predictive modeling of wildfires: A new dataset and machine learning approach. Fire Safety Journal 2019, 104, 130–146. [Google Scholar] [CrossRef]
- Nhu, V.H.; Shirzadi, A.; Shahabi, H.; Singh, S.K.; Al-Ansari, N.; Clague, J.J.; Jaafari, A.; Chen, W.; Miraki, S.; Dou, J.; et al. Shallow Landslide Susceptibility Mapping: A Comparison between Logistic Model Tree, Logistic Regression, Naïve Bayes Tree, Artificial Neural Network, and Support Vector Machine Algorithms. International Journal of Environmental Research and Public Health 2020, 17, 2749. [Google Scholar] [CrossRef]
- Ghorbanzadeh, O.; Valizadeh Kamran, K.; Blaschke, T.; Aryal, J.; Naboureh, A.; Einali, J.; Bian, J. Spatial Prediction of Wildfire Susceptibility Using Field Survey GPS Data and Machine Learning Approaches. Fire 2019, 2, 43. [Google Scholar] [CrossRef]
- Michael, Y.; Helman, D.; Glickman, O.; Gabay, D.; Brenner, S.; Lensky, I.M. Forecasting fire risk with machine learning and dynamic information derived from satellite vegetation index time-series. Science of The Total Environment 2021, 764, 142844. [Google Scholar] [CrossRef]
- Goldarag, Y.; Mohammadzadeh, A.; Ardakani, A. Fire Risk Assessment Using Neural Network and Logistic Regression. Journal of the Indian Society of Remote Sensing 2016, 44, 1–10. [Google Scholar] [CrossRef]
- de Bem, P.; de Carvalho Júnior, O.; Matricardi, E.; Guimarães, R.; Gomes, R. Predicting wildfire vulnerability using logistic regression and artificial neural networks: a case study in Brazil. International Journal of Wildland Fire 2018, 28, 35–45. [Google Scholar] [CrossRef]
- Ma, J.; Cheng, J.; Jiang, F.; Gan, V.; Wang, M.; Zhai, C. Real-time detection of wildfire risk caused by powerline vegetation faults using advanced machine learning techniques. Advanced Engineering Informatics 2020, 44, 101070. [Google Scholar] [CrossRef]
- Jolly, W.M.; Freeborn, P.H.; Page, W.G.; Butler, B.W. Severe Fire Danger Index: A Forecastable Metric to Inform Firefighter and Community Wildfire Risk Management. Fire 2019, 2, 47. [Google Scholar] [CrossRef]
- National Interagency Fire Center. National Wildfire Coordinating Group (NWCG). Interagency Standards for Fire and Fire Aviation Operations; Createspace Independent Publishing Platform: Great Basin Cache Supply Office: Boise, ID, USA, 2019. [Google Scholar]
- Sullivan, A.L. Physical Modelling of Wildland Fires. In Encyclopedia of Wildfires and Wildland-Urban Interface (WUI) Fires; Springer International Publishing: Cham, 2019; pp. 1–8. [Google Scholar] [CrossRef]
- Taylor, S.W.; Woolford, D.G.; Dean, C.B.; Martell, D.L. Wildfire Prediction to Inform Fire Management: Statistical Science Challenges. Statistical Science 2013, 28, 586–615. [Google Scholar] [CrossRef]
- Watanabe, S.; Bansal, A.; Hutter, F. PED-ANOVA: Efficiently Quantifying Hyperparameter Importance in Arbitrary Subspaces. In Proceedings of the Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, IJCAI-23; Elkind, E., Ed., California, 8 2023; pp. 4389–4396. [CrossRef]
| 1 | |
| 2 | |
| 3 | |
| 4 | More information on the cost calculation can be found on [12]. |
| 5 | More information on the outcome of the questionnaire can be found on [12]. |





| No. | Feature | Description | Prior Research |
|---|---|---|---|
| 1 | IDATE | Fire Occurrence Date (Month & Date as an Integer) | [26] |
| 2 | LAT | Fire location latitude (degrees) | [26,27,28] |
| 3 | LON | Fire location longitude (degrees) | [26,27,28] |
| 4 | ELEVATION_m | Fire location elevation (in meters) | [26,28,29] |
| 5 | ACRES | Acres burnt (in acres) | |
| 6 | PPT_mm | Precipitation (in mm for the fire incident date) | [26,28,30] |
| 7 | TMIN_c | Minimum temperature (in Celsius for the fire incident date) | [28,30] |
| 8 | TMEAN_c | Mean temperature (in Celsius for the fire incident date) | [28,30] |
| 9 | TMAX_c | Maximum temperature (in Celsius for the fire incident date) | [26,28,30] |
| 10 | TDMEAN_c | Mean dew point temperature (in Celsius for the fire incident date) | [28,30] |
| 11 | VPDMIN_hpa | Minimum vapor pressure (in hectopascals) | [29] |
| 12 | VPDMAX_hpa | Maximum vapor pressure (in hectopascals) | [29] |
| 13 | lfmc_mean | Mean fuel moisture for a particular day (numeric) | [28] |
| 14 | lfmc_stdv | Standard deviation of fuel moisture for a particular day (numeric) | [28] |
| 15 | Mean_Sea_Level _Pressure | Mean sea level pressure of the nearest weather station to the wildfire event (in hectopascals) - (Universal Kriging) | [31] |
| 16 | Mean_Station _Pressure | Nearest mean weather station pressure to the wildfire event (in hectopascasl) - (Universal Kriging) | [31] |
| 17 | Mean_Wind_Speed | Mean wind speed for a given location (numeric mph) - (Universal Kriging) | [26,28,29] |
| 18 | Maximum_Sustained _Wind_Speed | Maximum sustained wind speed for a given location (numeric MPH) - (Universal Kriging) | [28,29] |
| 19 | NAMELSAD | County name (string) | [30] |
| 20 | Population | Number of residents living in the respective county (numeric) | [30,32] |
| No. | Feature | Description | Prior Research |
|---|---|---|---|
| 1 | IDATE | Fire Occurrence Date (Month & Date as an Integer) | [26] |
| 2 | LAT | Fire location latitude (degrees) | [26,27,28] |
| 3 | LON | Fire location longitude (degrees) | [26,27,28] |
| 4 | ELEVATION_m | Fire location elevation (in meters) | [26,28,29] |
| 5 | ACRES | Acres burnt (in acres) | |
| 6 | PPT_mm | Precipitation (in mm for the fire incident date) | [26,28,30] |
| 7 | TMIN_c | Minimum temperature (in Celsius for the fire incident date) | [28,30] |
| 8 | TMEAN_c | Mean temperature (in Celsius for the fire incident date) | [28,30] |
| 9 | TMAX_c | Maximum temperature (in Celsius for the fire incident date) | [26,28,30] |
| 10 | TDMEAN_c | Mean dew point temperature (in Celsius for the fire incident date) | [28,30] |
| 11 | VPD9AM_hpa | Vapor pressure at 9AM (in hectopascals) | [29] |
| 12 | VPD3PM_hpa | Vapor pressure at 3PM (in hectopascals) | [29] |
| 13 | lfmc_mean | Mean fuel moisture for a particular day (numeric) | [28] |
| 14 | lfmc_stdv | Standard deviation of fuel moisture for a particular day (numeric) | [28] |
| 15 | Mean_Sea_Level _Pressure | Mean sea level pressure of the nearest weather station to the wildfire event (in hectopascals) - (Universal Kriging) | [31] |
| 16 | Mean_Station _Pressure | Nearest mean weather station pressure to the wildfire event (in hectopascasl) - (Universal Kriging) | [31] |
| 17 | Mean_Wind_Speed | Mean wind speed for a given location (numeric mph) - (Universal Kriging) | [26,28,29] |
| 18 | Maximum_Sustained _Wind_Speed | Maximum sustained wind speed for a given location (numeric MPH) - (Universal Kriging) | [28,29] |
| 19 | NAMELSAD | County name (string) | [30] |
| 20 | Population | Number of residents living in the respective county (numeric) | [30,32] |
| ML Technique | Data set Type | Data set Count | Inliers | Outliers | Mean Accuracy |
Mean Precision |
Mean Recall |
Mean F1-Score |
20 × Five-Fold CV |
|---|---|---|---|---|---|---|---|---|---|
| OCSVM (sklearn) | Train (80%) | 5868 | 5806 | 62 | 0.989 | 1.000 | 0.989 | 0.994 | 0.990± 0.0030 |
| Test (20%) | 1,467 | 1,443 | 24 | 0.983 | 1.000 | 0.983 | 0.991 | ||
| OCSVM (PyOD) | Train (80%) | 5,868 | 5,809 | 59 | 0.989 | 1.000 | 0.990 | 0.990 | 0.990± 0.0028 |
| Test (20%) | 1,467 | 1,458 | 9 | 0.993 | 1.000 | 0.990 | 1.000 | ||
| AE (PyOD) | Train (80%) | 5,868 | 5,809 | 59 | 0.989 | 1.000 | 0.990 | 0.990 | 0.989± 0.0030 |
| Test (20%) | 1,467 | 1,454 | 13 | 0.991 | 1.000 | 0.990 | 1.000 | ||
| VAE (PyOD) | Train (80%) | 5,868 | 5,809 | 59 | 0.989 | 1.000 | 0.990 | 0.990 | 0.989± 0.0028 |
| Test (20%) | 1,467 | 1,454 | 13 | 0.991 | 1.000 | 0.990 | 1.000 | ||
| IF (PyOD) | Train (80%) | 5,868 | 5,809 | 59 | 0.989 | 1.000 | 0.990 | 0.990 | 0.989± 0.0028 |
| Test (20%) | 1,467 | 1,458 | 9 | 0.993 | 1.000 | 0.990 | 1.000 | ||
| DeepSVDD (PyOD) | Train (80%) | 5,868 | 5,281 | 587 | 0.899 | 1.000 | 0.900 | 0.950 | 0.897± 0.0101 |
| Test (20%) | 1,467 | 1,316 | 151 | 0.897 | 1.000 | 0.900 | 0.950 | ||
| ALAD (PyOD) | Train (80%) | 5,868 | 5,281 | 587 | 0.899 | 1.000 | 0.900 | 0.950 | 0.900± 0.0081 |
| Test (20%) | 1,467 | 1,272 | 195 | 0.867 | 1.000 | 0.870 | 0.930 |
| ML Technique | Data set Type | Data set Count | Inliers | Outliers | Mean Accuracy |
Mean Precision |
Mean Recall |
Mean F1-Score |
20 × Five-Fold CV |
|---|---|---|---|---|---|---|---|---|---|
| OCSVM (sklearn) | Train (80%) | 26,640 | 26,336 | 304 | 0.988 | 1.000 | 0.988 | 0.994 | 0.998± 0.0015 |
| Test (20%) | 6,660 | 6,580 | 80 | 0.987 | 1.000 | 0.987 | 0.993 | ||
| OCSVM (PyOD) | Train (80%) | 26,640 | 26,373 | 267 | 0.989 | 1.000 | 0.989 | 0.994 | 0.989± 0.0012 |
| Test (20%) | 6,660 | 6,652 | 8 | 0.998 | 1.000 | 0.998 | 0.999 | ||
| AE (PyOD) | Train (80%) | 26,640 | 26,373 | 267 | 0.989 | 1.000 | 0.989 | 0.994 | 0.989± 0.0012 |
| Test (20%) | 6,660 | 6,611 | 49 | 0.992 | 1.000 | 0.992 | 0.996 | ||
| VAE (PyOD) | Train (80%) | 26,640 | 26,373 | 267 | 0.989 | 1.000 | 0.989 | 0.994 | 0.989± 0.0012 |
| Test (20%) | 6,660 | 6,611 | 49 | 0.992 | 1.000 | 0.992 | 0.996 | ||
| IF (PyOD) | Train (80%) | 26,640 | 26,378 | 262 | 0.990 | 1.000 | 0.990 | 0.995 | 0.989± 0.0015 |
| Test (20%) | 6,660 | 6,620 | 40 | 0.993 | 1.000 | 0.993 | 0.996 | ||
| DeepSVDD (PyOD) | Train (80%) | 26,640 | 23,976 | 2,664 | 0.900 | 1.000 | 0.900 | 0.950 | 0.899± 0.0047 |
| Test (20%) | 6,660 | 5,865 | 795 | 0.880 | 1.000 | 0.880 | 0.940 | ||
| ALAD (PyOD) | Train (80%) | 26,640 | 23,976 | 2,664 | 0.900 | 1.000 | 0.900 | 0.950 | 0.900± 0.0039 |
| Test (20%) | 6,660 | 6,379 | 281 | 0.957 | 1.000 | 0.960 | 0.980 |
| ML Technique | Data set Type | Data set Count | Mean Accuracy |
Mean Precision |
Mean Recall |
Mean F1-Score |
20 × Five-Fold CV |
|---|---|---|---|---|---|---|---|
| SVM[28,38,39] | Train (80%) | 11,756 | 0.628 | 0.657 | 0.763 | 0.706 | 0.670± 0.0648 |
| Test (20%) | 2,939 | 0.674 | 0.645 | 0.773 | 0.703 | ||
| RF[28,39,40] | Train (80%) | 11,756 | 0.679 | 0.664 | 0.724 | 0.693 | 0.677± 0.0713 |
| Test (20%) | 2,939 | 0.670 | 0.639 | 0.779 | 0.703 | ||
| LogisticRegression[38,41,42] | Train (80%) | 11,756 | 0.676 | 0.651 | 0.756 | 0.697 | 0.676± 0.0743 |
| Test (20%) | 2,939 | 0.676 | 0.651 | 0.756 | 0.699 | ||
| XGBoostRegression[40,43] | Train (80%) | 11,756 | 0.675 | 0.660 | 0.717 | 0.688 | 0.675± 0.0665 |
| Test (20%) | 2,939 | 0.674 | 0.660 | 0.717 | 0.687 | ||
| ANN[38,39,41] | Train (80%) | 11,756 | 0.682 | 0.665 | 0.732 | 0.697 | 0.677± 0.0742 |
| Test (20%) | 2,939 | 0.674 | 0.650 | 0.751 | 0.694 |
| No. | Feature | One-Class SVM (scikit-learn) |
One-class SVM (PyOD) |
AutoEncoder (PyOD) |
Variational AutoEncoder (PyOD) |
Isolation Forest (PyOD) |
|---|---|---|---|---|---|---|
| 1 | IDATE | 6.36% | 9.85% | 1.84% | 1.84% | 10.26% |
| 2 | LAT | 6.37% | 5.90% | 0.64% | 0.51% | 2.17% |
| 3 | LON | 7.42% | 3.74% | 0.5% | 0.59% | 2.10% |
| 4 | ELEVATION_m | 12.39% | 0.28% | 0.57% | 0.41% | 3.23% |
| 5 | ACRES | 0.88% | 4.70% | 15.93% | 15.94% | 14.97% |
| 6 | PPT_mm | 5.47% | 7.13% | 45.24% | 45.40% | 3.28% |
| 7 | TMIN_c | 3.45% | 7.55% | 2.91% | 2.99% | 4.23% |
| 8 | TMEAN_c | 4.45% | 9.59% | 4.01% | 4.23% | 3.21% |
| 9 | TMAX_c | 6.32% | 11.30% | 3.64% | 3.86% | 10.13% |
| 10 | TDMEAN_c | 5.34% | 7.89% | 0.84% | 1.23% | 2.61% |
| 11 | VPDMIN_hpa | 6.77% | 0.81% | 2.12% | 2.10% | 3.86% |
| 12 | VPDMAX_hpa | 6.58% | 5.66% | 2.94% | 2.87% | 7.31% |
| 13 | lfmc_mean | 6.11% | 6.67% | 4.10% | 4.05% | 2.89% |
| 14 | lfmc_stdv | 6.44% | 4.70% | 3.20% | 2.69% | 1.69% |
| 15 | Mean_Sea_Level _Pressure | 3.62% | 7.86% | 1.02% | 0.81% | 4.91% |
| 16 | Mean_Station _Pressure | 4.27% | 3.32% | 0.27% | 0.28% | 2.80% |
| 17 | Mean_Wind_Speed | 2.33% | 1.56% | 3.00% | 2.80% | 3.07% |
| 18 | Maximum_Sustained _Wind_Speed | 2.15% | 0.91% | 3.16% | 3.44% | 4.07% |
| 19 | NAMELSAD | 1.45% | 0.05% | 0.29% | 0.14% | 1.96% |
| 20 | Population | 1.86% | 0.52% | 3.77% | 3.81% | 11.26% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).