Submitted:
24 August 2025
Posted:
25 August 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
- Modular and Explainable Framework: EcoWild is structured as a flexible pipeline where each component—DT-based risk estimation, smoke detection, and RL for the adaptive sampling—can be enabled or disabled independently. It supports any ML model that runs efficiently on edge devices, enabling customizable trade-offs between accuracy, energy, and responsiveness.
- Dynamic and Adaptive Sensing: The RL policy adjusts sampling periods in real time based on fire risk, battery level, and solar input, balancing responsiveness and energy conservation without requiring manual tuning.
- Fully Embedded, Energy-Aware Operation: All sensing, inference, and decision-making occur locally on solar-powered embedded devices, supporting long-term autonomy in remote, infrastructure-limited environments.
- Robustness Across Deployment Scenarios: EcoWild maintains reliable performance across seasonal and geographic variations under diverse communication conditions, including multi-node relaying and gateway-adjacent load.
- Quantitative Advantages: Compared to static policies, EcoWild achieves 2.4×–7.7× faster wildfire detection with moderate energy consumption and no battery depletion.
2. Related Work
3. EcoWild Framework
3.1. Sample Weather Sensors
3.2. Risk Assessment Using a Decision Tree
3.3. Smoke Detection
3.4. Communication Decision
- Regular sensor suite: These EcoWild sample the weather sensors and operate as discussed so far. In addition to transmitting their own wildfire alerts and images, they also forward the alerts and images they receive from their neighbors toward the gateway suites.
- Gateway sensor suite: In addition to all hardware in the regular suites, the gateway EcoWild has a cellular network with long-range uplink capability [9,10]. In this way, they can send their own data or the data forwarded by their neighbors directly to a control center. They are placed intermittently (e.g., every towers) since they require extra communication hardware and experience the highest communication burden.
4. Energy-Aware Sensing Scheduling with RL
4.1. Overview of the Proposed RL Technique
4.2. State and Action Spaces
4.3. Reward Function Design
4.4. Learning Strategy
5. Experimental Results
5.1. Experimental Setup
- A Sony IMX219 8-megapixel RGB camera [31] to take environmental images for daytime smoke detection and nighttime fire or glow detection.
- A NVIDIA Jetson Orin Nano [8] embedded device for real-time, on-device inference and adaptive decision-making using reinforcement learning and risk estimation.
- A LoRa radio module [32] for long-range, low-power wireless communication.
- A solar panel [33] and rechargeable battery for continuous energy harvesting and storage to enable long-term, maintenance-free operation. This is achieved by dynamically adapting sensing and communication schedules based on real-time battery levels, sunlight availability, and fire risk—ensuring sustainable energy use without requiring manual recharging or battery replacement.
- Weather and Environmental Logs: Historical temperature, humidity, and wind speed data are collected for each camera location using the Open-Meteo archive API [11]. Weather data goes back up to one year prior to image collection and fire start time in one-minute granularity.
- Smoke Image and Fire Event Labels: Smoke ignition events are sourced from the public FigLib wildfire dataset [12], which provides time-sequenced images from multiple camera locations. Each location contains 81 images captured at 1-minute intervals: 40 images with no smoke, followed by one ignition event, and 40 post-ignition images containing smoke. The dataset is partitioned into 70% training, 15% validation, and 15% testing splits, following the standard configuration used in prior work [23]. Ground-truth fire labels are aligned to the ignition frame for each location to support supervised RL training.
- Solar Energy Data: Solar panel energy harvesting is simulated at each location using the PVlib library [13] and a single-diode photovoltaic model calibrated to a UV-resistant 6V, 2.38W panel [33]. Hourly solar irradiance profiles are generated based on the GPS coordinates of the camera sites provided in the FigLib dataset [12], then interpolated to 1-minute granularity. The solar model incorporates temperature effects, soiling losses, and wiring inefficiencies to reflect realistic panel behavior.
- Active Energy Consumption: We account for the active energy cost of each operation, including weather sensing, image capture, decision-making, SD inference, RL inference, and LoRa-based communication. These values are derived from empirical measurements on embedded hardware platforms, as detailed in our prior work [23]. Component-specific characterization includes the SHT10 temperature and humidity sensor, for which active power consumption is obtained from the manufacturer’s datasheet [29]; the DS6410 anemometer, a passive sensor whose energy usage depends on microcontroller pulse processing, following the method described in [34]; and the LoRa transceiver (STM Nucleo-WL55JC2), where transmission energy was measured and standby draw is based on datasheet specifications [32]. The camera’s active energy was empirically measured, while its standby power is derived from existing literature [35].
- Standby and Leakage Losses: Standby energy drain from all hardware components, along with battery self-discharge, is incorporated into the simulation’s energy model. Standby values for the SHT10 temperature-humidity sensor and LoRa transceiver are taken from respective datasheets [29,32], while the camera’s standby consumption is obtained from prior literature [35]. Battery leakage is modeled using conservative estimates from published work, assuming a low self-discharge rate below 5% per month [36].
- Solar Energy Harvesting: Minute-by-minute solar energy yield is updated based on the interpolated PVlib simulations [13].
- Deployment-Aware Energy Reserve and Losses: To reflect real-world deployment constraints, we provision each sensor suite with a 7-day battery energy reserve, ensuring uninterrupted operation during extended periods of low solar irradiance (e.g., overcast days or shaded locations). Additionally, we model realistic solar harvesting losses due to environmental factors such as dirt accumulation, panel tilt, and shading. In our simulations, we assume a 50% harvesting loss for edge sensor suites (typically at the network perimeter with limited solar exposure), and a 30% loss for relay and gateway-adjacent suites. These deployment-aware assumptions ensure that EcoWild remains robust under practical conditions where harvested solar energy may be significantly reduced.
- Update Environment State: Load the following timestamped weather features (temperature, humidity, wind speed), solar irradiance value, and wildfire label from the offline logs.
-
Execute Action: Based on the selected sampling period, the environment simulates the corresponding system operations:
- Sense: Measure environmental variables (e.g., temperature, humidity, wind speed).
- Risk Assessment using DT: A pre-trained decision tree model processes weather features to estimate wildfire risk in real time.
- Infer: If the DT predicts high risk, the system captures an image and performs smoke detection using a pre-trained SD model. This step incurs additional compute energy.
- Transmit: If smoke is detected, the system transmits the alert and corresponding image using LoRa, incurring communication energy cost.
- Update Battery Level: Increase battery level with harvested solar energy, subtract standby and leakage losses, and account for energy consumed during the step.
- Log Reward and Transition: Compute the reward for the action based on detection performance and energy impact, and store the transition for training or evaluation.
- Invoke RL Agent: Provide the current state (weather, battery level, risk estimate) to the reinforcement learning agent to select an action: whether to sense, run inference, or remain idle.
5.2. Baseline Algorithms
- Fixed baseline captures weather data and images at every fixed interval and transmits them without any local filtering, decision-making, or smoke detection.
- DT-only algorithm uses the same DT used in EcoWild to evaluate wildfire risk from weather data. An image is captured and transmitted only when the estimated risk is high without running the smoke detection algorithm.
- SD-time algorithm takes an image at each interval (without a DT) and performs smoke detection using the aggressive performance SD model (see Table 2). This smoke detection-based filtering prioritizes fast detection but leads to increased communication and energy consumption.
- SD-energy algorithm is the same as the SD-time algorithm (i.e., takes and processes images at every interval), but it uses the conservative (low energy) SD model. It minimizes the communication and energy use, at the potential cost of delayed detection.
- DT-SD-time algorithm combines the DT-based wildfire risk estimation and aggressive ML-based smoke detection (see Table 2). The DT filters out low-risk intervals, and the SD model further refines image transmission decisions by prioritizing fast detection under high-risk conditions.
- DT-SD-energy algorithm performs like the DT-SD-time algorithm, but it uses the conservative (low energy) SD model (see Table 2). This configuration minimizes communication and energy usage while still detecting probable fire events.
5.3. Balancing Responsiveness vs. Sustainability
5.4. Risk-Aware Sampling Behavior
5.5. Multi-Node Evaluation and Sustainability Analysis
- Superior detection time in all scenarios: EcoWild (black star) consistently outperforms the best-performing configuration of each baseline, achieving 2.4×–7.7× faster detection while maintaining moderate energy use.
- Pareto Frontier Breaker: Baseline policies provide a visible Pareto trade-off between detection time and energy. EcoWild lies outside this frontier in all three settings, demonstrating its ability to achieve both goals simultaneously.
- Widening Advantage in High-Cost Settings: As additional communication energy burden increases, the performance gap between EcoWild and the baselines becomes more pronounced, especially for sensor suites close to the gateway that need to forward more messages from their neighbors.
- Sustained battery energy: EcoWild (black star) never depletes the battery energy in any of the considered scenarios in 125 locations. It maintains an average battery energy of 6 Wh (edge), 14 Wh (relay), and 13 Wh (gateway-adjacent), never dropping below 11 Wh in any scenario.
5.6. Per-Location Comparison: Generalizability
6. Conclusions and Future Work
References
- Mohapatra, A.; Trinh, T. Early wildfire detection technologies in practice—a review. Sustainability 2022, 14, 12270. [Google Scholar] [CrossRef]
- CAL FIRE. 2025 California Wildfire Incidents. https://www.fire.ca.gov/incidents/, 2025. Accessed: 2025-04-15.
- Abatzoglou, J.T.; Kolden, C.A.; Cullen, A.C.; Sadegh, M.; Williams, E.L.; Turco, M.; Jones, M.W. Climate change has increased the odds of extreme regional forest fire years globally. Nature Communications 2025, 16, 6390. [Google Scholar] [CrossRef] [PubMed]
- Zhang, A.; Zhang, A.S. Real-time wildfire detection and alerting with a novel machine learning approach. International Journal of Advanced Computer Science and Applications 2022, 13. [Google Scholar] [CrossRef]
- Meimetis, D.; Papaioannou, S.; Katsoni, P.; Lappas, V. An Architecture for Early Wildfire Detection and Spread Estimation Using Unmanned Aerial Vehicles, Base Stations, and Space Assets. Drones and Autonomous Vehicles 2024, 1, 10006. [Google Scholar] [CrossRef]
- Dampage, U.; Bandaranayake, L.; Wanasinghe, R.; Kottahachchi, K.; Jayasanka, B. Forest fire detection system using wireless sensor networks and machine learning. Scientific reports 2022, 12, 46. [Google Scholar] [CrossRef] [PubMed]
- Ding, Y.; Wang, M.; Fu, Y.; Zhang, L.; Wang, X. A wildfire detection algorithm based on the dynamic brightness temperature threshold. Forests 2023, 14, 477. [Google Scholar] [CrossRef]
- NVIDIA Corporation. Jetson Orin Nano Series Data Sheet, 2024. Accessed: 2024-11-19.
- Migabo, E.M.; Djouani, K.D.; Kurien, A.M. The narrowband Internet of Things (NB-IoT) resources management performance state of art, challenges, and opportunities. IEEE Access 2020, 8, 97658–97675. [Google Scholar] [CrossRef]
- Borkar, S.R. Long-term evolution for machines (LTE-M). In LPWAN technologies for IoT and M2M applications; Elsevier, 2020; pp. 145–166.
- Zippenfenig, P. Open-Meteo.com Weather API. https://open-meteo.com/, 2023. [CrossRef]
- Dewangan, A.; Pande, Y.; Braun, H.W.; Vernon, F.; Perez, I.; Altintas, I.; Cottrell, G.W.; Nguyen, M.H. FIgLib & SmokeyNet: Dataset and deep learning model for real-time wildland fire smoke detection. Remote Sensing 2022, 14, 1007. [Google Scholar]
- Anderson, K.; Hansen, C.; Holmgren, W.; Jensen, A.; Mikofski, M.; Driesse, A. pvlib python: 2023 project update. Journal of Open Source Software 2023, 8, 5994. [Google Scholar] [CrossRef]
- Jang, E.; Kang, Y.; Im, J.; Lee, D.W.; Yoon, J.; Kim, S.K. Detection and monitoring of forest fires using Himawari-8 geostationary satellite data in South Korea. Remote Sensing 2019, 11, 271. [Google Scholar] [CrossRef]
- Akhloufi, M.A.; Couturier, A.; Castro, N.A. Unmanned aerial vehicles for wildland fires: Sensing, perception, cooperation and assistance. Drones 2021, 5, 15. [Google Scholar] [CrossRef]
- Yu, L.; Wang, N.; Meng, X. Real-time forest fire detection with wireless sensor networks. In Proceedings of the Proceedings. 2005 International Conference on Wireless Communications, Networking and Mobile Computing, 2005; 2005; Vol. 2, pp. 1214–1217. [Google Scholar] [CrossRef]
- Chen, J.; Jun, S.W.; Mundra, A.; Ta, J. Xyloni: Very Low Power Neural Network Accelerator for Intermittent Remote Visual Detection of Wildfire and Beyond. In Proceedings of the Proceedings of the 29th ACM/IEEE International Symposium on Low Power Electronics and Design, 2024, pp. 1–6.
- Tuncel, Y.; Basaklar, T.; Carpenter-Graffy, D.; Ogras, U. A self-sustained cps design for reliable wildfire monitoring. ACM Transactions on Embedded Computing Systems 2023, 22, 1–23. [Google Scholar] [CrossRef]
- Bonilla-Ormachea, K.; Cuizaga, H.; Salcedo, E.; Castro, S.; Fernandez-Testa, S.; Mamani, M. ForestProtector: An IoT Architecture Integrating Machine Vision and Deep Reinforcement Learning for Efficient Wildfire Monitoring. In Proceedings of the 2025 11th International Conference on Automation, Robotics, and Applications (ICARA). IEEE, 2025; pp. 70–75. [Google Scholar]
- Julian, K.D.; Kochenderfer, M.J. Distributed wildfire surveillance with autonomous aircraft using deep reinforcement learning. Journal of Guidance, Control, and Dynamics 2019, 42, 1768–1778. [Google Scholar] [CrossRef]
- Khoshdel, S.; Luo, Q.; Afghah, F. Pyrotrack: Belief-based deep reinforcement learning path planning for aerial wildfire monitoring in partially observable environments. In Proceedings of the 2024 American Control Conference (ACC). IEEE; 2024; pp. 601–607. [Google Scholar]
- Diaz-Vilor, C.; Lozano, A.; Jafarkhani, H. A Reinforcement Learning Approach for Wildfire Tracking with UAV Swarms. IEEE Transactions on Wireless Communications 2025. [Google Scholar] [CrossRef]
- Yildirim, N.; Kim, G.; Yun, M.; Elango, N.; Park, J.; Ogras, U.Y. Energy-Constrained Optimization for Wildfire Detection Using RGB Images. In Proceedings of the Proceedings of the 13th International Workshop on Energy Harvesting and Energy-Neutral Sensing Systems, 2025, pp. 42–48.
- Contributors, T. ResNet—Torchvision 0.15.2 documentation, 2016. Accessed: 2024-11-19.
- 2024 Ultralytics Inc.. Ultralytics Yolo Docs. Accessed: 2024-11-20.
- Fujimoto, S.; van Hoof, H.; Meger, D. Addressing Function Approximation Error in Actor-Critic Methods. In Proceedings of the 35th International Conference on Machine Learning; Dy, J.; Krause, A., Eds. PMLR, 10–15 Jul 2018, Vol. 80, Proceedings of Machine Learning Research, pp. 587–1596.
- Raffin, A.; Hill, A.; Gleave, A.; Kanervisto, A.; Ernestus, M.; Dormann, N. Stable-Baselines3: Reliable Reinforcement Learning Implementations. Journal of Machine Learning Research 2021, 22, 1–8. [Google Scholar]
- Glorot, X.; Bengio, Y. Understanding the difficulty of training deep feedforward neural networks. Proceedings of the thirteenth international conference on artificial intelligence and statistics.
- Sensirion. SHT1x (SHT10, SHT11, SHT15) Humidity and Temperature Sensor Datasheet. https://cdn.sparkfun.com/assets/8/7/5/9/4/SHT1x_datasheet.pdf, 2008.
- Davis Instruments. In Anemometer for Vantage Pro2™ and EnviroMonitor®: User Manual; Product Number: 6410; Hayward, CA, USA, 2020.
- Sony Semiconductor Solutions Corporation. IMX219PQ CMOS Image Sensor Datasheet, 2019. Accessed: 2024-11-19.
- STMicroelectronics. AN5406: How to Build a LoRa® Application with STM32CubeWL. https://www.st.com/resource/en/application_note/an5406-how-to-build-a-lora-application-with-stm32cubewl-stmicroelectronics.pdf, 2025.
- Voltaic Systems. P126 Solar Panel Datasheet. Accessed: 2024-11-19.
- Leelavinodhan, P.B.; Vecchio, M.; Antonelli, F.; Maestrini, A.; Brunelli, D. Design and implementation of an energy-efficient weather station for wind data collection. Sensors 2021, 21, 3831. [Google Scholar] [CrossRef] [PubMed]
- Lubana, E.S.; Dick, R.P. Digital foveation: An energy-aware machine vision framework. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 2018, 37, 2371–2380. [Google Scholar] [CrossRef]
- Seong, W.M.; Park, K.Y.; Lee, M.H.; Moon, S.; Oh, K.; Park, H.; Lee, S.; Kang, K. Abnormal self-discharge in lithium-ion batteries. Energy & Environmental Science 2018, 11, 970–978. [Google Scholar]
- Srock, A.F.; Charney, J.J.; Potter, B.E.; Goodrick, S.L. The hot-dry-windy index: A new fire weather index. Atmosphere 2018, 9, 279. [Google Scholar] [CrossRef]






| Feature (Unit) | Symbol |
|---|---|
| Temperature (°C) | T |
| Relative Humidity (%) | H |
| Wind Speed (km/h) | W |
| Hot-Dry-Windy Index | HDWI |
| Time of Day (normalized [0–1]) | |
| Season (categorical) | |
| Harvested Energy from Solar Panels (Wh) | |
| Battery Energy (Wh) | |
| Elapsed Time Since Last Image Capture (min) | |
| Previous Sampling Period (min) | |
| Previous SD Result (binary: 1=smoke, 0=none) |
| Model | TPR | FPR |
|---|---|---|
| Aggressive performance (fast detection) | 0.90 | 0.58 |
| Conservative (low energy consumption) | 0.66 | 0.33 |
| Baseline | Fixed Sampling Period | DT | ML | RL |
|---|---|---|---|---|
| Fixed | ✔ | ✗ | ✗ | ✗ |
| DT-only | ✔ | ✔ | ✗ | ✗ |
| SD-time | ✔ | ✗ | ✔ * | ✗ |
| SD-energy | ✔ | ✗ | ✔ ** | ✗ |
| DT-SD-time | ✔ | ✔ | ✔ * | ✗ |
| DT-SD-energy | ✔ | ✔ | ✔ ** | ✗ |
| EcoWild (Ours) | ✗ | ✔ | ✔ | ✔ |
| Parameter | k | |||
|---|---|---|---|---|
| Value | 525600 | 5000 | 0.9 | 100 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).