Submitted:
17 January 2025
Posted:
17 January 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Works
2.1. RL in Crop Simulation Models
2.2. RL Applications in Irrigation Management
2.3. AquaCrop-Gym
2.4. Research Gaps and Contributions
3. Materials and Methods
3.1. Reinforcement Learning Environment
3.2. Simulation Setup and Data
3.3. State and Action Spaces
3.4. Reward Mechanism
- (mm) is the penalty at time step t.
- (mm) is the total water applied up to time t.
- (mm) is the irrigation depth applied at time t.
- DryYield (t/ha) is the final dry yield of the maize crop.
- T is the total number of timesteps in the growing season.
3.5. Proximal Policy Optimization (PPO)
3.6. PPO Hyperparameter Optimization
3.7. Irrigation Strategies
3.7.1. Optimized Strategies
3.7.2. Conventional Strategies
3.8. Evaluation Framework and Performance Metrics
- Dry Yield (t/ha): Final maize yield at the end of the season.
- Total Irrigation (mm): Total volume of irrigation water applied.
- Water Efficiency (kg/ha/mm): Yield produced per millimeter of irrigation water, indicating how effectively water is converted into biomass.
- Profitability ($): Net economic gain, factoring in both crop yield and irrigation costs.
4. Results and Discussion
4.1. PPO Training Progress and Performance at Different Timesteps
4.2. Comparison of Irrigation Strategies
4.2.1. Crop Yield (t/ha)
4.2.2. Irrigation (mm)
4.2.3. Water Efficiency (kg/ha/mm)
4.2.4. Profit ($/ha)
4.3. Long-Term Environmental Benefits
4.4. Bridging Simulation to Practical Implementation
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Food and Agriculture Organization of the United Nations. The Future of Food and Agriculture: Trends and Challenges. Technical report, Food and Agriculture Organization of the United Nations, Rome, 2017.
- Kılıç, O.; Boz, I.; Eryılmaz, G.A. Comparison of conventional and good agricultural practices farms: A socio-economic and technical perspective. J. Clean. Prod. 2020, 258, 120666. [Google Scholar] [CrossRef]
- Siebert, S.; Burke, J.; Faures, J.M.; Frenken, K.; Hoogeveen, J.; Döll, P.; Portmann, F.T. Groundwater use for irrigation–a global inventory. Hydrol. Earth Syst. Sci. 2010, 14, 1863–1880. [Google Scholar] [CrossRef]
- Davis, K.F.; Rulli, M.C.; Seveso, A.; D’Odorico, P. Increased food production and reduced water use through optimized crop distribution. Nat. Geosci. 2017, 10, 919–924. [Google Scholar] [CrossRef]
- Biswas, A.K.; Tortajada, C. Groundwater: an unseen, overused and unappreciated resource, 2024. [CrossRef]
- Zhao, H.; Di, L.; Guo, L.; Zhang, C.; Lin, L. An Automated Data-Driven Irrigation Scheduling Approach Using Model Simulated Soil Moisture and Evapotranspiration. Sustainability 2023, 15, 12908. [Google Scholar] [CrossRef]
- Kelly, T.; Foster, T.; Schultz, D.M. Assessing the value of adapting irrigation strategies within the season. Agric. Water Manag. 2023, 275, 107986. [Google Scholar] [CrossRef]
- Pretty, J.; Benton, T.G.; Bharucha, Z.P.; Dicks, L.V.; Flora, C.B.; Godfray, H.C.J.; Goulson, D.; Hartley, S.; Lampkin, N.; Morris, C.; et al. Global assessment of agricultural system redesign for sustainable intensification. Nat. Sustain. 2018, 1, 441–446. [Google Scholar] [CrossRef]
- Li, M.; Xu, Y.; Fu, Q.; Singh, V.P.; Liu, D.; Li, T. Efficient irrigation water allocation and its impact on agricultural sustainability and water scarcity under uncertainty. J. Hydrol. 2020, 586, 124888. [Google Scholar] [CrossRef]
- Holzworth, D.; Huth, N.I.; Fainges, J.; Brown, H.; Zurcher, E.; Cichota, R.; Verrall, S.; Herrmann, N.I.; Zheng, B.; Snow, V. APSIM Next Generation: Overcoming challenges in modernising a farming systems model. Environ. Model. Softw. 2018, 103, 43–51. [Google Scholar] [CrossRef]
- Hoogenboom, G.; Porter, C.H.; Boote, K.J.; Shelia, V.; Wilkens, P.W.; Singh, U.; White, J.W.; Asseng, S.; Lizaso, J.I.; Moreno, L.P.; et al. The DSSAT crop modeling ecosystem. In Adv. Crop Modelling Sustain. Agric.; Burleigh Dodds Science Publishing, 2019; pp. 173–216.
- Steduto, P.; Hsiao, T.C.; Raes, D.; Fereres, E. AquaCrop—The FAO crop model to simulate yield response to water: I. Concepts and underlying principles. Agron. J. 2009, 101, 426–437. [Google Scholar] [CrossRef]
- Bhatia, A.K. Crop Growth Simulation Modeling. In Modelling and Simulation of Diffusive Processes: Methods and Applications; Basu, S.; Kumar, N., Eds.; Springer International Publishing, 2014; pp. 315–332. [CrossRef]
- Shaikh, T.A.; Rasool, T.; Lone, F.R. Towards leveraging the role of machine learning and artificial intelligence in precision agriculture and smart farming. Comput. Electron. Agric. 2022, 198, 107119. [Google Scholar] [CrossRef]
- Gautron, R.; Maillard, O.A.; Preux, P.; Corbeels, M.; Sabbadin, R. Reinforcement learning for crop management support: Review, prospects and challenges. Comput. Electron. Agric. 2022, 200, 107182. [Google Scholar] [CrossRef]
- Turchetta, M.; Corinzia, L.; Sussex, S.; Burton, A.; Herrera, J.; Athanasiadis, I.; Buhmann, J.M.; Krause, A. Learning Long-Term Crop Management Strategies with CyclesGym. Proceedings of the Adv. Neural Inf. Process. Syst. 2022, 35, 11396–11409. [Google Scholar]
- Overweg, H.; Berghuijs, H.N.; Athanasiadis, I.N. CropGym: a reinforcement learning environment for crop management. arXiv preprint, arXiv:2104.04326 2021. [CrossRef]
- Kallenberg, M.G.; Overweg, H.; van Bree, R.; Athanasiadis, I.N. Nitrogen management with reinforcement learning and crop growth models. Environ. Data Sci. 2023, 2, e34. [Google Scholar] [CrossRef]
- Gautron, R.; Padrón, E.J.; Preux, P.; Bigot, J.; Maillard, O.A.; Emukpere, D. gym-DSSAT: A crop model turned into a Reinforcement Learning environment. arXiv preprint, arXiv:2207.03270 2022. [CrossRef]
- Madondo, M.; Azmat, M.; Dipietro, K.; Horesh, R.; Jacobs, M.; Bawa, A.; Srinivasan, R.; O’Donncha, F. A swat-based reinforcement learning framework for crop management. arXiv preprint, arXiv:2302.04988 2023.
- Kemanian, A.R.; Shi, Y.; White, C.M.; Montes, F.; Stöckle, C.O.; Huggins, D.R.; Cangiano, M.L.; Stefani-Faé, G.; Nydegger Rozum, R.K. The Cycles Agroecosystem Model: Fundamentals, Testing, and Applications. Available at SSRN: https://ssrn.com/abstract=4188402 or http://dx.doi.org/10.2139/ssrn.4188402. 2023. [Google Scholar] [CrossRef]
- Kelly, T.; Foster, T. AquaCrop-OSPy: Bridging the gap between research and practice in crop-water modeling. Agric. Water Manag. 2021, 254, 106976. [Google Scholar] [CrossRef]
- Chen, M.; Cui, Y.; Wang, X.; Xie, H.; Liu, F.; Luo, T.; Zheng, S.; Luo, Y. A reinforcement learning approach to irrigation decision-making for rice using weather forecasts. Agric. Water Manag. 2021, 250, 106838. [Google Scholar] [CrossRef]
- Ding, X.; Du, W. Drlic: Deep reinforcement learning for irrigation control. In Proceedings of the 21st ACM/IEEE Int. Conf. Inf. Process. Sensor Networks (IPSN). IEEE; 2022; pp. 41–53. [Google Scholar] [CrossRef]
- Alibabaei, K.; Gaspar, P.D.; Assunção, E.; Alirezazadeh, S.; Lima, T.M. Irrigation optimization with a deep reinforcement learning model: Case study on a site in Portugal. Agric. Water Manag. 2022, 263, 107480. [Google Scholar] [CrossRef]
- Kelly, T.; Foster, T.; Schultz, D.M. Assessing the value of deep reinforcement learning for irrigation scheduling. Smart Agric. Technol. 2024, 7, 100403. [Google Scholar] [CrossRef]
- Towers, M.; Terry, J.K.; Kwiatkowski, A.; Balis, J.U.; Cola, G.d.; Deleu, T.; Goulão, M.; Kallinteris, A.; KG, A.; Krimmel, M.; et al. Gymnasium, 2023. [CrossRef]
- Brockman, G. OpenAI Gym. arXiv preprint, arXiv:1606.01540 2016.
- Schulman, J.; Wolski, F.; Dhariwal, P.; Radford, A.; Klimov, O. Proximal policy optimization algorithms. arXiv preprint, arXiv:1707.06347 2017. [CrossRef]
- De Wit, A.; Boogaard, H.; Fumagalli, D.; Janssen, S.; Knapen, R.; van Kraalingen, D.; Supit, I.; van der Wijngaart, R.; van Diepen, K. 25 years of the WOFOST cropping systems model. Agric. Syst. 2019, 168, 154–167. [Google Scholar] [CrossRef]
- Bieger, K.; Arnold, J.G.; Rathjens, H.; White, M.J.; Bosch, D.D.; Allen, P.M.; Volk, M.; Srinivasan, R. Introduction to SWAT+, a completely restructured version of the soil and water assessment tool. JAWRA J. Am. Water Resour. Assoc. 2017, 53, 115–130. [Google Scholar] [CrossRef]
- Foster, T.; Brozović, N.; Butler, A.; Neale, C.; Raes, D.; Steduto, P.; Fereres, E.; Hsiao, T.C. AquaCrop-OS: An open source version of FAO’s crop water productivity model. Agric. Water Manag. 2017, 181, 18–22. [Google Scholar] [CrossRef]
- Akiba, T.; Sano, S.; Yanase, T.; Ohta, T.; Koyama, M. Optuna: A next-generation hyperparameter optimization framework. In Proceedings of the Proceedings of the 25th ACM SIGKDD Int. Conf. Knowl. Discov. Data Min., 2019, pp. 2623–2631. [CrossRef]
- Eustace, S.; contributors, T.P. Poetry: Python packaging and dependency management made easy. https://python-poetry.org, 2022. Software; Version 3.09.
- Raffin, A.; Hill, A.; Gleave, A.; Kanervisto, A.; Ernestus, M.; Dormann, N. Stable-baselines3: Reliable reinforcement learning implementations. J. Mach. Learn. Res. 2021, 22, 1–8. [Google Scholar]
- Song, L.; Jin, J.; He, J. Effects of severe water stress on maize growth processes in the field. Sustainability 2019, 11, 5086. [Google Scholar] [CrossRef]






| Hyperparameter | Search Range |
|---|---|
| Learning Rate | Log-uniform between and |
| Number of Steps per Update (n_steps) | {1024, 2048, 4096} |
| Batch Size | {128, 256, 512, 1024} |
| Number of Epochs | Integers from 10 to 50 |
| Discount Factor () | Uniform between 0.90 and 0.99 |
| Clip Range () | Uniform between 0.1 and 0.3 |
| Entropy Coefficient | Log-uniform between and |
| Hyperparameter | Value |
|---|---|
| Learning Rate () | |
| Number of Steps per Update (n_steps) | 2048 |
| Batch Size | 512 |
| Number of Epochs | 23 |
| Discount Factor () | 0.98 |
| Clip Range () | 0.22 |
| Entropy Coefficient |
| Training Steps | Mean Yield | Seasonal Irrigation | Profit | Water Efficiency |
|---|---|---|---|---|
| (tonne/ha) | (mm) | ($) | (kg/ha/mm) | |
| 500,000 | 14.00 ± 0.22 | 240.75 ± 86.59 | 551.10 | 58.15 |
| 1,000,000 | 13.95 ± 0.21 | 226.92 ± 88.15 | 556.81 | 61.49 |
| 1,500,000 | 13.80 ± 0.17 | 179.83 ± 89.44 | 576.91 | 76.76 |
| 2,000,000 | 13.51 ± 0.16 | 166.17 ± 85.95 | 538.27 | 81.33 |
| 2,500,000 | 13.64 ± 0.28 | 166.92 ± 87.26 | 560.41 | 81.72 |
| Strategy | Yield (t/ha) | Irrigation (mm) | Water Efficiency (kg/ha/mm) | Profit ($/ha) |
|---|---|---|---|---|
| PPO | 13.80 ± 0.17 | 179.83 ± 89.44 | 76.76 | 576.91 |
| Thresholds (Optimized) | 13.95 ± 0.20 | 255.00 ± 112.05 | 54.72 | 528.83 |
| Net Irrigation | 13.98 ± 0.23 | 305.61 ± 119.17 | 45.73 | 482.21 |
| Interval-Based | 13.81 ± 0.70 | 380.17 ± 33.69 | 36.32 | 377.42 |
| Random | 14.02 ± 0.24 | 1640.83 ± 141.89 | 8.54 | -845.60 |
| Rainfed | 8.88 ± 3.75 | 0.0 | N/A | -130.19 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).