Submitted:
08 September 2025
Posted:
09 September 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Methodology
2.1. Study areas
2.2. Data sources
- Remote Sensing Dataset. From the National Agriculture Imagery Program (NAIP) provided by the U.S. Department of Agriculture (USDA) [14], primarily used for agricultural monitoring and land cover analysis. The NAIP dataset contains high-resolution RGB imagery with a sampling resolution of 10m.
- Historical Fire Spot Data. From the Fire Information for Resource Management System (FIRMS) dataset, which provides near-real-time (NRT) active fire data based on observations from MODIS [20] and VIIRS [21,22] satellite sensors. The FIRMS dataset covers the period from November 1, 2000, to the most recent date, with a sampling resolution of 375m.
- Historical Fire Events Data (FireEvents). From MODIS/006/MCD64A1 [23], a MODIS (Moderate Resolution Imaging Spectroradiometer) fire product provided by NASA for monitoring and analyzing global fire activity. This dataset includes burned area data from 2001 to the present and is a key resource for studying fire dynamics and impacts.
- Topographic Data. From the Shuttle Radar Topography Mission (SRTM) [15], with a sampling resolution of 30m.
- ERA5 Weather Data. Provided by the European Centre for Medium-Range Weather Forecasts (ECMWF). ERA5 is the fifth-generation global climate reanalysis dataset widely used in meteorology, climate research, and environmental science, with a sampling resolution of 1km [16].
- Vegetation Data. MODIS NDVI data obtained from NASA’s Terra and Aqua satellites via the MODIS sensor. The specific dataset is MOD13Q1, which provides Normalized Difference Vegetation Index (NDVI) and Enhanced Vegetation Index (EVI) every 16 days, with a sampling resolution of 250m [17].
2.3. Data Aggregation
2.3.1. Single-Modality Baseline Processing (NAIP-WHP)
2.3.2. Fixed-Weight Multimodal Dataset (FIXED)
2.3.3. Geographically Adaptive Dynamic-Weight Dataset (FUSED)
- Climate-driven adaptation. During drought conditions (e.g., precipitation <30th percentile), vegetation and temperature weights increase by 5–10%, while precipitation influence decreases proportionally.
- Topographic feedback. In high-elevation zones (), slope and elevation weights scale with terrain complexity, increasing up to 15% in rugged terrain.
- Vegetation dynamics. NDVI-based fuel moisture thresholds trigger risk weight adjustments—when vegetation dryness exceeds seasonal norms, fire susceptibility weights rise by 8–12% [26].
2.4. Innovation Validation
2.4.1. Comparative Feature Importance Analysis
2.4.2. Quantitative Validation Metrics
2.4.3. Conclusion of Innovation Validation
2.5. Machine Learning Application
2.5.1. Data Preprocessing
2.5.2. Machine Learning Applications
- (1)
- (2)
- (3)
-
Phase 3 - Deep Learning.
- DNN: 256-128 fully connected layers with batch normalization and 30% Dropout (Kaiming initialization) [31].
- BiLSTM: 128-unit hidden layers [34].
All models use AdamW (lr=0.001) and BCELoss. - (4)
3. Results
3.1. Comparative Analysis
3.2. SHAP Analysis of Multimodal Feature Fusion
3.3. Ablation Study
3.4. Real-Fire Event Validation
3.5. Indirect comparison with existing benchmarks
- FireRisk only supports end-to-end image classification tasks.
- Our framework outputs continuous risk probability values (0-1).
- FireRisk uses 7-class discrete risk labels, while we adopt the 5-class WHP2023 standard.
4. Discussion
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Abatzoglou JT, Williams AP. 2016. Impact of anthropogenic climate change on wildfire across western US forests. Proc Natl Acad Sci U S A. 113(42):11770–11775. [CrossRef]
- Jain P, Barber QE, Taylor SW, Whitman E, Castellanos Acuna D, Boulanger Y, Chavardès RD, Chen J, Englefield P, Flannigan M, Girardin MP, et al. 2024. Drivers and impacts of the record-breaking 2023 wildfire season in Canada. Nat Commun. [accessed 2024 Aug 20]; 15:Art. 51154. [CrossRef]
- Cunningham CX, Williamson GJ, Bowman DMJS. 2024. Increasing frequency and intensity of the most extreme wildfires on Earth. Nat Ecol Evol. 8:1420–1425. [CrossRef]
- Burton C, Lampe S, Kelley DI, Thiery W, Hantson S, Christidis N, Gudmundsson L, Forrest M, Burke E, Chang J, et al. 2024. Increasing frequency and intensity of the most extreme wildfires on Earth. Nat Clim Change. [accessed 2024 Oct 21]. [CrossRef]
- Dowdy AJ, Ye H, Pepler AS, Thatcher M, Osbrough SL, Evans JP, Di Virgilio G, McCarthy N. 2019. Australian fire weather as represented by the McArthur Forest Fire Danger Index and the Canadian Forest Fire Weather Index. Clim Dyn. 53(1-2):21–39. [CrossRef]
- Lim C-H, Kim YS, Won M, Kim SJ, Lee WK. 2019. Can satellite-based data substitute for surveyed data to predict the spatial probability of forest fire? A geostatistical approach to forest fire in the Republic of Korea. Geomatics Nat Hazards Risk. 10(1):719–739. [CrossRef]
- Bao Y, Li X, Wang Y, Liu C. 2024. Research trends in wildland fire prediction amidst climate change: a bibliometric analysis. Forests. 15(2):88.
- Yu Y, Mao J, Wullschleger SD, Chen A, Shi X, Wang Y, Hoffman FM, Zhang Y, Pierce E. 2022. Machine learning-based observation-constrained projections reveal elevated global socioeconomic risks from wildfire. Nat Commun. 13:2885.
- Shen S, Seneviratne SI, Wanyan X, Kirley M. 2023. FireRisk: A remote sensing dataset for fire risk assessment with benchmarks using supervised and self-supervised learning. arXiv:2303.07035. [CrossRef]
- Hantson S, Arneth A, Harrison SP, Kelley DI, Prentice IC, Rabin SS, Archibald S, Mouillot F, Arnold SR, Artaxo P, et al. 2016. The status and challenge of global fire modelling. Biogeosciences. 13(11):3359–3375. [CrossRef]
- Turner MG, Brazjunas KH, Hansen WD, Hoecker TJ, Rammer W, Rataiczak Z, Westerling AL, Seidl R. 2022. The magnitude, direction, and tempo of forest change in Greater Yellowstone in a warmer world with more fire. Ecol Monogr. 92(1):e01485. [CrossRef]
- Corbière C, Thome N, Bar-Hen A, Cord M, Pérez P. 2019. Addressing failure prediction by learning model confidence. Adv Neural Inf Process Syst. 32:2905–2915.
- Wen X, Xie Y, Jiang L, Li Y, Ge T. 2022. On the interpretability of machine learning methods in crash frequency modeling and crash modification factor development. Accid Anal Prev. 175:106617. [CrossRef]
- Maxwell AE, Warner TA, Vanderbilt BC, Ramezan CA. 2017. Land cover classification and feature extraction from National Agriculture Imagery Program (NAIP) orthoimagery: A review. Photogramm Eng Remote Sens. 83(11):737–747.
- Farr TG, Rosen PA, Caro E, Crippen R, Duren R, Hensley S, Kobrick M, Paller M, Rodriguez E, Roth L, et al. 2007. The shuttle radar topography mission. Rev Geophys. 45(2):RG2004. [CrossRef]
- Hersbach H, Bell B, Berrisford P, Hirahara S, Horányi A, Muñoz-Sabater J, Nicolas J, Peubey C, Radu R, Schepers D, et al. 2020. The ERA5 global reanalysis. Q J R Meteorol Soc. 146(730):1999–2049. [CrossRef]
- Didan K, Miura T, Solano R, Huete AR. 2015. MODIS Vegetation Index User’s Guide (MOD13 Series). Tucson (AZ): University of Arizona. [accessed 2024 Jun 15]. Available from: https://modis.gsfc.nasa.gov/data/dataprod/mod13.php.
- Wang Y, Liu Z, Wang X, Chen Y, Zhang G, Randerson JT. 2023. Interpretable deep learning for spatial variation in wildfire drivers. Environ Res Lett. 18(3):034033.
- Gorelick N, Hancher M, Dixon M, Ilyushchenko S, Thau D, Moore R. 2017. Google Earth Engine: Planetary-scale geospatial analysis for everyone. Remote Sens Environ. 202:18–27. [CrossRef]
- Giglio L, Justice C. 2015. MOD14A1 MODIS/Terra Thermal Anomalies/Fire Daily L3 Global 1km SIN Grid V006. NASA EOSDIS Land Processes DAAC. [accessed 2024 Jun 15]. Available from: https://lpdaac.usgs.gov/products/mod14a1v006/.
- Didan K, Barreto A. 2018. VIIRS/NPP Vegetation Indices 16-Day L3 Global 500m SIN Grid V001. NASA EOSDIS Land Processes DAAC. [accessed 2024 Jun 15]. Available from: https://lpdaac.usgs.gov/products/vnp13a1v001/.
- Schroeder W, Oliva P, Giglio L, Quayle B, Lorenz E, Morelli F. 2022. VIIRS active fire products: Algorithm improvements and global validation. Remote Sens. 14(3):578. [CrossRef]
- [NASA] National Aeronautics and Space Administration, Land Processes Distributed Active Archive Center. 2021. MOD/MYD C6.1 MCD64A1 burned area product. [accessed 2024 Jun 15]. Available from: https://lpdaac.usgs.gov/products/mcd64a1v006/.
- SpatialReference.org. 2024. EPSG:32611 WGS 84 / UTM Zone 11N. [accessed 2024 Nov 5]. Available from: https://spatialreference.org/ref/epsg/32611/.
- [USDA], U.S. Department of Agriculture, Forest Service. 2023. Wildfire Potential Hazard (WHP) 2023 Classification. ArcGIS Online. [accessed 2024 May 23]. Available from: https://usfs.maps.arcgis.com/home.
- Radeloff VC, Helmers DP, Kramer HA, Mockrin MH, Alexandre PM, Bar-Massada A, Butsic V, Hawbaker TJ, Martinuzzi S, Stewart SI, et al. 2018. Rapid growth of the US wildland-urban interface raises wildfire risk. Proc Natl Acad Sci U S A. 115(13):3314–3319. [CrossRef]
- Breiman, L. 2001. Random forests. Mach Learn. 45(1):5–32. [CrossRef]
- Wang, Q. 2022. Support Vector Machine Algorithm in Machine Learning. In: Proceedings of the 2022 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA); 2022 Jun 24-26; Dalian, China. Piscataway (NJ): IEEE; p. 750 - 756.
- Chen T, Guestrin C. 2016. XGBoost: A scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; 2016 Aug 13–17; San Francisco, CA. New York (NY): Association for Computing Machinery; p. 785–794. [CrossRef]
- Ke G, Meng Q, Finley T, Wang T, Chen W, Ma W, Ye Q, Liu T-Y. 2017. LightGBM: A highly efficient gradient boosting decision tree. Adv Neural Inf Process Syst. 30:3146–3154.
- Paszke A, Gross S, Massa F, Lerer A, Bradbury J, Chanan G, Killeen T, Lin Z, Gimelshein N, Antiga L, et al. 2019. PyTorch: an imperative style, high-performance deep learning library. In: Proceedings of the 33rd International Conference on Neural Information Processing Systems; 2019 Dec 8-14; Vancouver, Canada. Red Hook (NY): Curran Associates; p. 8026-8037. Article No.: 721.
- Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Ł, Polosukhin I. 2017. Attention is all you need. Adv Neural Inf Process Syst. 30:5998–6008.
- Li H, Wang Y, Zhang J, Liu X, Chen Z. 2023. A transformer-based fusion framework for wildfire risk assessment using heterogeneous geospatial data. Remote Sens Environ. 287:113482. [CrossRef]
- Hochreiter S, Schmidhuber J. 1997. Long short-term memory. Neural Comput. 9(8):1735–1780. [CrossRef]
- Tan M, Le QV. 2019. EfficientNet: Rethinking model scaling for convolutional neural networks. In: Proceedings of the 36th International Conference on Machine Learning; 2019 Jun 9–15; Long Beach, CA. Proceedings of Machine Learning Research; vol. 97. p. 6105–6114.
- Tan M, Chen B, Pang R, Vasudevan V, Sandler M, Howard A, Le QV. 2021. EfficientNetV2: Smaller models and faster training. In: Proceedings of the 38th International Conference on Machine Learning; 2021 Jul 18–24; Virtual. Proceedings of Machine Learning Research; vol. 139. p. 10096–10106.
- [IPCC] Intergovernmental Panel on Climate Change. 2023. Climate Change 2023: Synthesis Report. Cambridge (UK): Cambridge University Press. [accessed 2024 Jun 15]. [CrossRef]
- Szpakowski DM, Jensen JLR. 2019. A review of the applications of remote sensing in fire ecology. Remote Sens. 11(22):2638. [CrossRef]













| Risk Levels | Possible values (R) |
|---|---|
| Very Low | |
| Low | |
| Moderate | |
| High | |
| Very High |
| Model | AUC-ROC | AP | Accuracy | Recall | Precision | F1 | Specificity | G-Mean | MCC | Train Time (s) | Inference Time (ms/sample) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| SVM | 0.921 | 0.913 | 0.833 | 0.796 | 0.871 | 0.824 | 0.875 | 0.83 | 0.681 | 0.012 | 1.238 |
| RandomForest | 0.836 | 0.87 | 0.718 | 0.721 | 0.718 | 0.718 | 0.718 | 0.719 | 0.439 | 0.73 | 1.748 |
| Transformer | 0.721 | 0.718 | 0.628 | 0.443 | 0.712 | 0.525 | 0.821 | 0.584 | 0.288 | 0.54 | 1.362 |
| EfficientNet | 0.677 | 0.698 | 0.692 | 0.693 | 0.699 | 0.687 | 0.696 | 0.689 | 0.395 | 0.1 | 0.041 |
| MobileNet | 0.661 | 0.682 | 0.64 | 0.746 | 0.642 | 0.67 | 0.546 | 0.611 | 0.313 | 0.117 | 0.035 |
| PyTorchDNN | 0.651 | 0.694 | 0.602 | 0.618 | 0.607 | 0.609 | 0.589 | 0.599 | 0.208 | 0.276 | 0.062 |
| LSTM | 0.593 | 0.632 | 0.575 | 0.875 | 0.617 | 0.673 | 0.275 | 0.256 | 0.173 | 0.303 | 3.474 |
| Model | AUC-ROC | AP | Accuracy | Recall | Precision | F1 | Specificity | G-Mean | MCC | Train Time (s) | Inference Time (ms/sample) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| RandomForest | 0.801 | 0.825 | 0.705 | 0.636 | 0.765 | 0.674 | 0.768 | 0.686 | 0.428 | 0.716 | 1.746 |
| SVM | 0.738 | 0.774 | 0.679 | 0.536 | 0.768 | 0.62 | 0.818 | 0.655 | 0.379 | 0.036 | 0.186 |
| Transformer | 0.707 | 0.763 | 0.565 | 0.821 | 0.551 | 0.652 | 0.314 | 0.434 | 0.149 | 0.564 | 1.373 |
| MobileNet | 0.699 | 0.765 | 0.655 | 0.721 | 0.639 | 0.666 | 0.579 | 0.608 | 0.329 | 0.121 | 0.044 |
| PyTorchDNN | 0.698 | 0.762 | 0.653 | 0.596 | 0.717 | 0.631 | 0.718 | 0.635 | 0.337 | 0.122 | 1.123 |
| LSTM | 0.646 | 0.73 | 0.628 | 0.582 | 0.72 | 0.585 | 0.65 | 0.482 | 0.267 | 0.301 | 1.226 |
| EfficientNet | 0.642 | 0.731 | 0.602 | 0.536 | 0.628 | 0.575 | 0.668 | 0.593 | 0.207 | 0.12 | 0.048 |
| Model | AUC-ROC | AP | Accuracy | Recall | Precision | F1 | Specificity | G-Mean | MCC | Train Time (s) | Inference Time (ms/sample) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| RandomForest | 0.874 | 0.812 | 0.718 | 0.739 | 0.704 | 0.719 | 0.696 | 0.715 | 0.439 | 0.739 | 1.781 |
| SVM | 0.782 | 0.817 | 0.742 | 0.746 | 0.739 | 0.739 | 0.743 | 0.742 | 0.491 | 0.01 | 1.243 |
| PyTorchDNN | 0.695 | 0.754 | 0.614 | 0.596 | 0.633 | 0.605 | 0.639 | 0.608 | 0.244 | 0.127 | 0.57 |
| Transformer | 0.671 | 0.728 | 0.576 | 0.593 | 0.603 | 0.568 | 0.571 | 0.544 | 0.179 | 0.519 | 1.339 |
| EfficientNet | 0.639 | 0.688 | 0.551 | 0.821 | 0.533 | 0.638 | 0.286 | 0.45 | 0.16 | 0.105 | 0.058 |
| MobileNet | 0.606 | 0.699 | 0.475 | 0.789 | 0.483 | 0.597 | 0.161 | 0.298 | -0.058 | 0.121 | 0.811 |
| LSTM | 0.533 | 0.612 | 0.54 | 0.4 | 0.486 | 0.378 | 0.661 | 0.345 | 0.08 | 0.304 | 0.143 |
| Model | AUC-ROC | AP | Accuracy | Recall | Precision | F1 | Specificity | G-Mean | MCC |
|---|---|---|---|---|---|---|---|---|---|
| G4 | |||||||||
| RandomForest | 0.668 | 0.744 | 0.602 | 0.564 | 0.605 | 0.575 | 0.639 | 0.589 | 0.21 |
| PyTorchDNN | 0.431 | 0.546 | 0.503 | 0.414 | 0.507 | 0.429 | 0.575 | 0.427 | 0.167 |
| SVM | 0.491 | 0.648 | 0.528 | 0.414 | 0.567 | 0.463 | 0.596 | 0.437 | 0.002 |
| LightGBM | 0.655 | 0.699 | 0.514 | 0.95 | 0.507 | 0.661 | 0.075 | 0.171 | 0.049 |
| G4+G1 | |||||||||
| RandomForest | 0.737 | 0.766 | 0.639 | 0.693 | 0.662 | 0.714 | 0.671 | 0.358 | 0.365 |
| PyTorchDNN | 0.564 | 0.642 | 0.449 | 0.9 | 0.472 | 0.619 | 0.757 | 0.303 | 0.562 |
| SVM | 0.709 | 0.713 | 0.679 | 0.664 | 0.692 | 0.668 | 0.689 | 0.365 | 0.667 |
| LightGBM | 0.686 | 0.716 | 0.513 | 0.95 | 0.507 | 0.659 | 0.075 | 0.303 | 0.157 |
| G4+G1+G2 | |||||||||
| RandomForest | 0.794 | 0.841 | 0.718 | 0.668 | 0.753 | 0.7 | 0.764 | 0.443 | 0.708 |
| PyTorchDNN | 0.812 | 0.857 | 0.693 | 0.489 | 0.864 | 0.604 | 0.9 | 0.442 | 0.652 |
| SVM | 0.808 | 0.858 | 0.742 | 0.664 | 0.808 | 0.715 | 0.818 | 0.504 | 0.729 |
| LightGBM | 0.797 | 0.852 | 0.562 | 0.95 | 0.542 | 0.686 | 0.175 | 0.158 | 0.303 |
| G4+G1+G2+G3 | |||||||||
| SVM | 0.901 | 0.913 | 0.806 | 0.743 | 0.824 | 0.80 | 0.822 | 0.801 | 0.641 |
| LightGBM | 0.85 | 0.862 | 0.642 | 0.925 | 0.598 | 0.718 | 0.361 | 0.501 | 0.347 |
| RandomForest | 0.797 | 0.784 | 0.759 | 0.746 | 0.779 | 0.743 | 0.768 | 0.739 | 0.54 |
| PyTorchDNN | 0.713 | 0.803 | 0.552 | 0.404 | 0.75 | 0.426 | 0.7 | 0.165 | 0.443 |
| Region ID | Name | Bounding Box | Fire Pixels | Hits | Hit Rate |
|---|---|---|---|---|---|
| 28 | Carlsbad Caverns, NM | (-104.5,32.0) - (-104.0,32.5) | 37,500 | 32,351 | 86.27% |
| 38 | Big Bend NP, TX | (-103.5,29.0) - (-103.0,29.5) | 20,000 | 16,498 | 82.49% |
| 10 | Arches NP, UT | (-109.5,38.5) - (-109.0,38.9) | 16,250 | 12976 | 79.87% |
| 45 | Gulf Coast Refinery, TX | (-94.5,29.5) - (-94.0,30.0) | 2,063,250 | 1599257 | 77.61% |
| 47 | Lake Tahoe West, NV | (-120.0,39.0) - (-119.5,39.5) | 30,000 | 22542 | 75.14% |
| Characteristics | FireRisk | FireRisk-Multi |
|---|---|---|
| Modality | RGB | Four-layer modality fusion |
| Label Source | WHP2020 | WHP2023 |
| Risk Level | 7-class | Continuous |
| Dynamic Weighting | ✗ | ✓ |
| Accuracy | MAE 65.29% | SVM 83.3% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).