Submitted:
10 October 2024
Posted:
11 October 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. Background: Mean, Variance and Precision of Random Maps
2.2. Mean Estimation from Monte Carlo Simulations
2.3. Burn-Probability Maps from Monte Carlo Simulations
2.4. Using Importance Sampling for Reducing the Cost-to-Precision Ratio
- the cost being a function of the simulated fire:
2.5. Computing Sampling Frequency Multipliers
2.6. The Optimal Proposal Distribution
2.7. Application to Burn-Probability Maps
- when the cost of simulating a fire is proportional to its size, then Importance Sampling makes no difference: the initial distribution is already the optimal proposal distribution;
- when the cost of simulating a fire is constant, then the optimal proposal distribution reweights the probability of each fire by the square root of its size.
2.8. Generalization: Optimal Proposal Distribution for Ignitions
- If is fully determined by (deterministic simulation), then we recover .
- If is uninformative (independent) w.r.t , then we obtain a no-progress optimum of .
2.9. Generalization: Impact-Weighted Distance
2.10. Generalization: Non-Geographic Maps and Non-Binary Outcomes
- a space-time region, like or , in which ;
- a combination of geographic space and fire severity buckets, like ;
- a combination of geographic space and paired scenario outcomes, like:
- need not be binary: for example, might be a density of greenhouse-gas emissions;
- need not be about fire.
2.11. Variant: Sampling from a Poisson Point Process
2.12. Practical Implementation
- Run simulations to draw a sample of ignition and simulated fire , yielding an empirical distribution which we will represent by the random variables .
- Design a parametric family of weights functions .
-
Find the optimal parameter vector by solving the following optimization problem, inspired by Equation (2.4):If does not yield a significant improvement compared to , try a new family of weights functions, or abandon Importance Sampling.
- Adopt as the proposal distribution for Importance Sampling.
2.13. Analysis of Geographic Inequalities in Precision
2.14. Justifying the Poisson Process Approximation
2.15. Importance Sampling versus Stratified or Neglected Ignitions
3. Results
3.1. Simulation Design and Results
3.2. Maximum Potential Efficiency Gain
3.3. Reweighting Pyromes Uniformly
3.4. Duration-Based Parametric Reweighting in Pyrome 33
3.5. Fitting a Machine-Learning Model to Empirical Optimal Weights
- Model A: used all predictor variables listed above.
- Model B: used all predictor variables except for the spatial coordinates .
4. Discussion
- by weighting the metric by value (Section 2.9), e.g. by giving more importance of Highly Valued Resources and Assets;
- by including non-geographic outcomes like burn severity (Section 2.10) and weighting them unevenly;
- by constraining the family of reweighting functions under consideration to be uniform in the tail of large fires.
5. Conclusions
- estimating a map rather than a scalar (Section 2.2), using the distance as the metric of deviation underlying variance (Section 2.1);
- accounting for the variability of computational costs across simulated fires, adopting cost-to-precision as the performance metric (Equation (7)) instead of simply variance;
- deriving the optimal cost-to-precision and the corresponding proposal distribution (Section 2.4), relying on the Poisson process-approximation (Section 2.14) that each fire affects only a small fraction of the area of interest;
- generalizing to proposal distributions constrained to an upstream variable (typically the ignition), see Section 2.8;
- generalizing to impact-weighted distance (Section 2.9);
- generalizing from burn-probability estimation to other types of maps (Section 2.10).
- Section 2.1 observes that the relevant metric for quantifying the efficiency of Monte Carlo simulation is the cost-to-precision ratio , that is the product of variance to expected computational cost (Equation (7)).
- The efficiency characteristics of the Monte Carlo sampling, and in particular the proposal distribution which maximizes convergence speed, are fully determined by the joint distribution of fire size and computational cost (Equation (2.4)). Beyond these two variables, the diversity of shapes and behaviors of simulated fires is irrelevant.
- The optimal proposal distribution (lowest cost-to-precision ratio) reweights the natural probability of each fire by a factor proportional to the square root of the ratio of burned area to computational cost (Equation (25)). When only the distribution of ignitions can be reweighted, the burned area and computational cost must be replaced by their expectation conditional on the ignition (Equation (33)).
- In particular, when computational cost is strictly proportional to burned area, importance sampling can achieve no progress: the natural distribution is already the optimal proposal distribution. This linearity assumption is most dubious at the “small fires” tail of the natural distribution.
- Finding a good proposal distribution is not trivial, as it requires predictive power. Section 2.12 suggests a machine learning approach based on calibration runs.
- However, the best achievable cost-to-precision ratio is easily estimated based on a calibration sample of fire sizes and computational costs, using Equation (26). This allows for quickly assessing whether Importance Sampling is worth pursuing.
- Stratified or neglected ignitions can be seen as approximations to special cases of Importance Sampling, in which the reweighting is piecewise constant (Section 2.15).
Supplementary Materials
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| AOI | Area Of Interest |
| CMC | Crude Monte Carlo |
| PPP | Poisson Point Process |
Appendix A. Mathematical Background
Appendix A.1. Probability Basics
Appendix A.2. The Probabilistic Cauchy-Schwarz Inequality
References
- Rubinstein, R.Y.; Kroese, D.P. Simulation and the Monte Carlo method; John Wiley & Sons, 2016.
- Mell, W.E.; McDermott, R.J.; Forney, G.P. Wildland fire behavior modeling: perspectives, new approaches and applications. Proceedings of 3rd fire behavior and fuels conference, 2010, pp. 25–29.
- Miller, C.; Parisien, M.A.; Ager, A.; Finney, M. Evaluating spatially-explicit burn probabilities for strategic fire management planning. WIT Transactions on Ecology and the Environment 2008, 119, 245–252. [Google Scholar]
- Xue, H.; Gu, F.; Hu, X. Data assimilation using sequential Monte Carlo methods in wildfire spread simulation. ACM Transactions on Modeling and Computer Simulation (TOMACS) 2012, 22, 1–25. [Google Scholar] [CrossRef]
- Mendez, A.; Farazmand, M. Quantifying rare events in spotting: How far do wildfires spread? Fire safety journal 2022, 132, 103630. [Google Scholar] [CrossRef]
- Jensen, J.L.W.V. Sur les fonctions convexes et les inégalités entre les valeurs moyennes. Acta mathematica 1906, 30, 175–193. [Google Scholar] [CrossRef]
- Short, K.C. Spatial wildfire occurrence data for the United States, 1992-2020 [FPA_FOD_20221014], 2022.
- Kallenberg, O., Poisson and Related Processes. In Random Measures, Theory and Applications; Springer International Publishing: Cham, 2017; pp. 70–108. [CrossRef]
- Hastie, T.; Tibshirani, R.; Friedman, J.H.; Friedman, J.H. The elements of statistical learning: data mining, inference, and prediction; Vol. 2, Springer, 2009.
- Waeselynck, V.; Johnson, G.; Schmidt, D.; Moritz, M.A.; Saah, D. Quantifying the sampling error on burn counts in Monte-Carlo wildfire simulations using Poisson and Gamma distributions. Stochastic Environmental Research and Risk Assessment 2024, pp. 1–15.
- Kalton, G. Introduction to survey sampling; Number 35, Sage Publications, 2020.
- Kearns, E.J.; Saah, D.; Levine, C.R.; Lautenberger, C.; Doherty, O.M.; Porter, J.R.; Amodeo, M.; Rudeen, C.; Woodward, K.D.; Johnson, G.W.; others. The construction of probabilistic wildfire risk estimates for individual real estate parcels for the contiguous United States. Fire 2022, 5, 117. [Google Scholar] [CrossRef]
- Tibshirani, R.J.; Efron, B. An introduction to the bootstrap. Monographs on statistics and applied probability 1993, 57, 1–436. [Google Scholar]
- Short, K.C.; Grenfell, I.C.; Riley, K.L.; Vogler, K.C. Pyromes of the conterminous United States, 2020.
- Virtanen, P.; Gommers, R.; Oliphant, T.E.; Haberland, M.; Reddy, T.; Cournapeau, D.; Burovski, E.; Peterson, P.; Weckesser, W.; Bright, J.; van der Walt, S.J.; Brett, M.; Wilson, J.; Millman, K.J.; Mayorov, N.; Nelson, A.R.J.; Jones, E.; Kern, R.; Larson, E.; Carey, C.J.; Polat, İ.; Feng, Y.; Moore, E.W.; VanderPlas, J.; Laxalde, D.; Perktold, J.; Cimrman, R.; Henriksen, I.; Quintero, E.A.; Harris, C.R.; Archibald, A.M.; Ribeiro, A.H.; Pedregosa, F.; van Mulbregt, P.; SciPy 1.0 Contributors. SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nature Methods 2020, 17, 261–272. [Google Scholar] [CrossRef] [PubMed]
- Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; Vanderplas, J.; Passos, A.; Cournapeau, D.; Brucher, M.; Perrot, M.; Duchesnay, E. Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research 2011, 12, 2825–2830. [Google Scholar]
- Ke, G.; Meng, Q.; Finley, T.; Wang, T.; Chen, W.; Ma, W.; Ye, Q.; Liu, T.Y. Lightgbm: A highly efficient gradient boosting decision tree. Advances in neural information processing systems 2017, 30. [Google Scholar]
- Andrews, P.L. The Rothermel surface fire spread model and associated developments: a comprehensive explanation. 2018.
| 1 | We disapprove of the term “Likelihood Ratio estimator”. We find it confusing because it is essentially a ratio of probabilities, not likelihoods. The term “Likelihood ratio” arises in other parts of statistics (for which it is an apt name, e.g. “Likelihood Ratio test”), but the similarity of the Likelihood Ratio estimator to these is only superficial. We follow the convention of using the term “Likelihood Ratio estimator”, but we feel compelled to warn readers of its potential for confusion. |
| 2 | This is why we named it the Poisson Process approximation. |
| 3 | Using of makes it possible to compute expressions like from the empirical sample, without full knowledge of the underlying distribution . |
| 4 | Astute readers will have noticed that this sub-sampling strategy is yet another form of Importance Sampling. |
| 5 | No-slope heading fire with a 10 mi/hr mid-flame wind speed, 1hr/10hr/100hr dead fuel moistures of 3%,4% and 5%, live herbaceous/woody fuel moistures of 40%/80%. |




| Symbol | Type | Meaning |
|---|---|---|
| Same as | Expected value of random1 variable | |
| Variance of | ||
| Precision (inverse variance) of | ||
| Area of Interest | ||
| s | Spatial location | |
| F | (Potentially) simulated fire | |
| Random variable representing the natural distribution of simulated fires () | ||
| Area (or value) burned by fire F | ||
| Binary burn map (locations burned by F) | ||
| computational cost | ||
| Asymptotic burn-probability map (estimation target) | ||
| norm of map | ||
| impact-weighted norm of map | ||
| spatial density of (asset) value | ||
| Ignition or initial conditions | ||
| probability density of the natural distribution | ||
| probability density of a proposal distribution | ||
| Importance weight (ratio ) under proposal distribution q | ||
| Random variable representing the proposal distribution () | ||
| Importance Sampling estimator with proposal distribution q | ||
| Cost-to-precision ratio (inverse efficiency) | ||
| Efficiency gain: | ||
| optimal proposal distribution |
| Pyrome | Frequency multiplier | Model-predicted expected total | ||
| ID | (Same cost) | Burned area (ha/yr) | Runtime (s/yr) | Fires () |
| 1 | 1.00 | 4191.2 | 21.97 | 50.7 |
| 2 | 0.74 | 1509.5 | 14.17 | 19.6 |
| 3 | 0.68 | 16112.7 | 181.65 | 26.1 |
| 4 | 1.14 | 6427.3 | 25.61 | 51.9 |
| 5 | 0.92 | 58477.0 | 359.71 | 98.5 |
| 6 | 1.07 | 19684.2 | 88.78 | 19.0 |
| 7 | 0.72 | 32253.7 | 323.36 | 30.9 |
| 8 | 0.99 | 13646.9 | 73.00 | 22.6 |
| 9 | 1.04 | 20200.9 | 96.46 | 19.9 |
| 10 | 0.92 | 18636.2 | 115.63 | 23.7 |
| 11 | 1.13 | 28773.0 | 118.23 | 26.1 |
| 12 | 0.91 | 5986.2 | 38.03 | 9.6 |
| 13 | 1.07 | 6355.7 | 29.07 | 8.6 |
| 14 | 1.01 | 93709.1 | 477.19 | 107.5 |
| 15 | 1.79 | 77990.2 | 126.73 | 86.8 |
| 16 | 1.20 | 83122.4 | 300.23 | 73.4 |
| 17 | 0.99 | 42864.7 | 226.30 | 38.0 |
| 18 | 1.02 | 7899.4 | 39.51 | 3.6 |
| 19 | 1.47 | 29049.8 | 69.55 | 28.0 |
| 20 | 1.61 | 102857.4 | 206.72 | 70.5 |
| Pyrome | Frequency multiplier | Model-predicted expected total | ||
| ID | (Same cost) | Burned area (ha/yr) | Runtime (s/yr) | Fires () |
| 21 | 1.64 | 79368.3 | 153.83 | 28.9 |
| 22 | 1.59 | 7315.9 | 15.07 | 4.0 |
| 23 | 1.59 | 22119.1 | 45.65 | 4.5 |
| 24 | 1.41 | 57912.5 | 151.65 | 19.6 |
| 25 | 1.52 | 22139.9 | 50.08 | 3.8 |
| 26 | 0.93 | 20068.7 | 119.89 | 12.7 |
| 27 | 0.94 | 8690.8 | 50.76 | 5.5 |
| 28 | 1.27 | 20809.8 | 67.36 | 8.1 |
| 29 | 1.10 | 25451.7 | 109.37 | 15.3 |
| 30 | 1.08 | 21434.3 | 95.46 | 40.3 |
| 31 | 1.05 | 34229.6 | 161.90 | 26.2 |
| 32 | 1.00 | 14104.7 | 73.13 | 16.1 |
| 33 | 1.26 | 26953.8 | 88.63 | 31.5 |
| 34 | 0.82 | 69841.7 | 537.65 | 66.1 |
| 35 | 1.12 | 3954.4 | 16.28 | 4.7 |
| 36 | 1.45 | 23479.8 | 58.22 | 9.1 |
| 37 | 0.89 | 20517.2 | 133.50 | 30.9 |
| 38 | 0.49 | 26340.3 | 569.67 | 19.0 |
| 39 | 1.05 | 104106.2 | 490.07 | 14.9 |
| 40 | 1.26 | 6612.4 | 21.79 | 2.8 |
| 41 | 0.81 | 1688.9 | 13.52 | 3.4 |
| 42 | 0.97 | 12563.8 | 68.93 | 7.9 |
| 43 | 1.07 | 11667.0 | 53.55 | 7.8 |
| 44 | 1.00 | 22069.4 | 115.37 | 14.8 |
| 45 | 1.04 | 7342.0 | 35.48 | 7.4 |
| 46 | 1.18 | 27092.5 | 100.58 | 9.7 |
| 47 | 1.31 | 21751.2 | 66.38 | 4.2 |
| 48 | 0.82 | 4397.1 | 34.08 | 3.7 |
| 49 | 0.91 | 21746.4 | 136.82 | 12.6 |
| 50 | 1.52 | 13686.4 | 30.77 | 5.2 |
| 51 | 1.98 | 14818.8 | 19.67 | 5.7 |
| 52 | 1.54 | 5911.9 | 13.02 | 2.8 |
| 53 | 1.41 | 50181.2 | 131.02 | 23.7 |
| 54 | 1.32 | 23050.1 | 69.03 | 12.9 |
| 55 | 1.34 | 45158.7 | 130.02 | 48.8 |
| 56 | 1.36 | 9767.5 | 27.46 | 11.8 |
| 57 | 1.21 | 21448.6 | 75.86 | 11.9 |
| 58 | 1.45 | 8336.1 | 20.74 | 4.9 |
| 59 | 1.21 | 12332.9 | 43.69 | 9.3 |
| 60 | 1.04 | 5640.0 | 27.25 | 6.9 |
| 61 | 1.08 | 11456.3 | 51.21 | 14.2 |
| 62 | 1.28 | 1152.2 | 3.67 | 3.2 |
| 63 | 1.15 | 1894.4 | 7.40 | 25.2 |
| 64 | 0.73 | 5974.9 | 58.92 | 53.3 |
| 65 | 1.14 | 7788.4 | 31.15 | 4.4 |
| 66 | 1.32 | 18366.4 | 54.87 | 13.2 |
| 67 | 1.47 | 54847.9 | 132.00 | 35.1 |
| 68 | 1.17 | 9244.3 | 35.42 | 15.6 |
| 69 | 1.08 | 17408.7 | 77.19 | 17.9 |
| 70 | 1.11 | 22850.1 | 96.64 | 21.1 |
| 71 | 1.06 | 13200.3 | 61.42 | 9.4 |
| 72 | 0.90 | 35760.5 | 227.94 | 60.4 |
| 73 | 1.05 | 20397.4 | 97.16 | 15.7 |
| 74 | 1.72 | 11097.9 | 19.48 | 6.2 |
| 75 | 1.91 | 8693.3 | 12.47 | 6.3 |
| 76 | 0.50 | 7261.7 | 151.96 | 18.0 |
| 77 | 1.13 | 49.6 | 0.20 | 0.0 |
| 78 | 1.17 | 6659.3 | 25.22 | 4.2 |
| 79 | 1.16 | 2682.5 | 10.45 | 6.8 |
| 80 | 0.98 | 7371.4 | 39.82 | 16.5 |
| Pyrome | Frequency multiplier | Model-predicted expected total | ||
| ID | (Same cost) | Burned area (ha/yr) | Runtime (s/yr) | Fires () |
| 81 | 0.98 | 6986.1 | 37.50 | 42.5 |
| 82 | 1.20 | 10818.0 | 39.03 | 26.2 |
| 83 | 1.01 | 6063.0 | 30.89 | 48.5 |
| 84 | 0.96 | 3881.1 | 21.79 | 20.7 |
| 85 | 1.06 | 5546.6 | 25.64 | 45.9 |
| 86 | 1.06 | 12495.0 | 57.82 | 79.6 |
| 87 | 0.97 | 13980.1 | 77.81 | 67.5 |
| 88 | 0.96 | 6212.0 | 34.99 | 21.8 |
| 89 | 1.04 | 11715.3 | 56.52 | 14.3 |
| 90 | 2.06 | 5425.1 | 6.67 | 2.3 |
| 91 | 1.08 | 3879.9 | 17.43 | 65.4 |
| 92 | 0.97 | 2469.0 | 13.58 | 34.5 |
| 93 | 0.97 | 16873.5 | 93.54 | 627.9 |
| 94 | 1.00 | 3696.6 | 19.13 | 46.7 |
| 95 | 0.40 | 14253.1 | 475.06 | 69.9 |
| 96 | 0.39 | 1101.0 | 38.10 | 2.1 |
| 97 | 0.75 | 7047.1 | 65.41 | 65.5 |
| 98 | 1.04 | 964.2 | 4.67 | 20.5 |
| 99 | 1.06 | 680.5 | 3.18 | 14.5 |
| 100 | 1.02 | 1316.2 | 6.58 | 15.7 |
| 101 | 0.79 | 793.6 | 6.69 | 7.7 |
| 102 | 1.02 | 544.3 | 2.73 | 15.7 |
| 103 | 0.81 | 107.8 | 0.85 | 10.1 |
| 104 | 0.77 | 130.7 | 1.15 | 10.0 |
| 105 | 0.94 | 146.1 | 0.87 | 5.4 |
| 106 | 0.95 | 4321.1 | 24.81 | 91.9 |
| 107 | 1.06 | 2006.8 | 9.22 | 22.8 |
| 108 | 1.11 | 141.3 | 0.60 | 2.0 |
| 109 | 0.92 | 21235.4 | 130.98 | 137.4 |
| 110 | 1.11 | 8691.4 | 36.47 | 103.1 |
| 111 | 0.91 | 13278.6 | 83.02 | 114.6 |
| 112 | 0.86 | 57716.8 | 409.04 | 313.4 |
| 113 | 1.04 | 52045.2 | 252.31 | 45.0 |
| 114 | 0.94 | 1905.3 | 11.24 | 24.4 |
| 115 | 0.98 | 15348.7 | 84.02 | 123.8 |
| 116 | 1.00 | 2356.0 | 12.37 | 23.7 |
| 117 | 0.95 | 1816.4 | 10.42 | 14.0 |
| 118 | 0.97 | 3994.9 | 22.06 | 80.9 |
| 119 | 1.04 | 3806.6 | 18.19 | 26.6 |
| 120 | 1.02 | 11617.3 | 58.66 | 174.3 |
| 121 | 0.98 | 8300.3 | 45.44 | 199.3 |
| 122 | 0.96 | 1127.7 | 6.42 | 21.4 |
| 123 | 0.98 | 236.2 | 1.29 | 5.4 |
| 124 | 0.91 | 916.8 | 5.75 | 18.3 |
| 125 | 1.02 | 309.1 | 1.56 | 9.9 |
| 126 | 1.01 | 6965.1 | 35.50 | 152.1 |
| 127 | 1.08 | 12631.8 | 56.76 | 3.8 |
| 128 | 1.34 | 11486.8 | 33.47 | 11.8 |
| Model | Efficiency gain |
|---|---|
| Model A | 1.25 () |
| Model B | 1.15 () |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).