Submitted:
12 June 2025
Posted:
13 June 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
- Develop a high-accuracy model for predicting rice yield under climate variability.
- Provide interpretable insights into the role of climate variables (temperature, precipitation, humidity, soil moisture).
- Support climate-resilient agricultural strategies aligned with national policies like the Pradhan Mantri Fasal Bima Yojana (PMFBY) and Krishi Vigyan Kendra initiatives [23].
2. Related Work
3. Methodology
3.1. Data Collection
- Climate Data: Daily measurements of temperature (°C), precipitation (mm), humidity (%), and soil moisture (%) from the India Meteorological Department (IMD) [23], covering five agro-climatic zones: Gangetic Plains, Deccan Plateau, Coastal Plains, Himalayan Region, and Arid Zone.
- Agricultural Data: Annual rice yield (tons/ha) from the Ministry of Agriculture, India [24], aggregated at the district level.
3.2. Proposed Model
- Long Short-Term Memory (LSTM): A recurrent neural network designed to capture temporal dependencies in time-series data. The architecture includes:
- o Input Layer: 4 features (temperature, precipitation, humidity, soil moisture) with 1 timestep.
- o LSTM Layer: 64 units, return_sequences=False, to process sequential climate data.
- o Dense Layer: 32 units with ReLU activation for non-linear transformation.
- o Output Layer: 1 unit for continuous yield prediction (tons/ha).
- SHAP (SHapley Additive exPlanations): Computes feature importance scores to explain model predictions, quantifying the contribution of each climate variable to yield variability.
3.3. Experimental Setup
- Data Split: 80% training (9,600 samples), 20% testing (2,400 samples), stratified by agro-climatic zone to ensure balanced representation.
- Hyperparameters: Optimized using grid search:
- o Learning rate: 0.001 (selected from [0.001, 0.01, 0.1]).
- o Epochs: 100 (selected from [50, 100, 200]).
- o Batch size: 32 (selected from [16, 32, 64]).
- o Optimizer: Adam.
4. Results
- LSTM-SHAP:
- o R²: 0.88
- o MAE: 0.11 tons/ha
- o RMSE: 0.15 tons/ha
- o Precision (yield threshold classification, e.g., above/below 2.5 tons/ha): 0.85
- o Recall: 0.82
- Random Forest:
- o R²: 0.82
- o MAE: 0.14 tons/ha
- o RMSE: 0.18 tons/ha
- o Precision: 0.80
- o Recall: 0.78
- SVR:
- o R²: 0.79
- o MAE: 0.16 tons/ha
- o RMSE: 0.20 tons/ha
- o Precision: 0.76
- o Recall: 0.74
- Feature Importance: Temperature contributed 42% to yield predictions, followed by precipitation (33%), humidity (15%), and soil moisture (10%).
- Dependence Plots:
- o Yields decline by up to 20% when temperatures exceed 30°C, with the Gangetic Plains showing the highest sensitivity (25% yield drop above 32°C).
- o Precipitation below 100 mm/month reduces yields by 15%, particularly in the Deccan Plateau.
- Interaction Effects: SHAP interaction plots revealed that high temperatures combined with low precipitation amplify yield losses by up to 30% in arid zones, indicating a synergistic effect.
- Regional Insights:
- o Gangetic Plains: High temperature sensitivity, with SHAP Savi Value Analysis showing a 25% yield drop above 32°C.
- o Deccan Plateau: Precipitation deficits below 80 mm/month reduce yields by 18%.
- o Coastal Plains: Humidity mitigates yield losses in high-precipitation scenarios.

| Model | R² | MAE (tons/ha) | RMSE (tons/ha) | Precision | Recall |
|---|---|---|---|---|---|
| LSTM-SHAP | 0.88 | 0.11 | 0.15 | 0.85 | 0.82 |
| Random Forest | 0.82 | 0.14 | 0.18 | 0.80 | 0.78 |
| SVR | 0.79 | 0.16 | 0.20 | 0.76 | 0.74 |
5. Discussion
- Historical Data: The model relies on historical data, which may not fully capture future climate scenarios (e.g., unprecedented heatwaves).
- Real-Time Data: Lack of IoT or satellite data limits real-time applicability [9].
- Geographical Scope: The model covers five agro-climatic zones but could benefit from village-level granularity.
- Computational Cost: LSTM and SHAP require significant computational resources, which may limit scalability in resource-constrained settings.
- Integrate satellite imagery (e.g., MODIS, Sentinel-2) for real-time monitoring [10].
- Incorporate IoT sensors for dynamic soil and weather data [9].
- Apply transfer learning to extend the model to other crops (e.g., wheat, maize) [11].
- Explore ensemble XAI techniques (e.g., SHAP + LIME) for enhanced interpretability [18].
6. Conclusion
References
- Hochreiter, S. , Schmidhuber, J., 1997. Long short-term memory. Neural Computation, 1735. [Google Scholar]
- Lobell, D.B. , Burke, M.B., 2010. On the use of statistical models to predict crop yields. Agricultural and Forest Meteorology, 1443. [Google Scholar]
- Lobell, D.B. , et al., 2011. Climate trends and global crop production. Science.
- Field, C.B. , 2014. Climate Change 2014: Impacts, Adaptation, and Vulnerability.
- Ribeiro, M.T. , et al., 2016. Why should I trust you? Explaining the predictions of any classifier. KDD, 1144. [Google Scholar]
- Lesk, C. , et al., 2016. Influence of extreme weather disasters on global crop production. Nature.
- Jeong, J.H. , et al., 2016. Random forests for global and regional crop yield predictions. PLOS ONE, 0156. [Google Scholar]
- You, J. , et al., 2017. Deep Gaussian process for crop yield prediction. Nature Communications.
- Greff, K. , et al., 2017. LSTM: A search space odyssey. IEEE Transactions on Neural Networks, 2222. [Google Scholar]
- Zhao, C. , et al., 2017. Temperature increase reduces global yields of major crops. PNAS, 9326. [Google Scholar]
- Pathak, H. , et al., 2018. Climate change impacts on agriculture in India. Current Science.
- Liakos, K.G. , et al., 2018. Machine learning in agriculture: A review. Sensors, 2674. [Google Scholar]
- Adadi, A. , Berrada, M., 2018. Peeking inside the black-box: A survey on explainable AI. IEEE Access, 5216. [Google Scholar]
- Rudin, C. , 2019. Stop explaining black box machine learning models. Nature Machine Intelligence.
- Gunning, D. , 2019. Explainable artificial intelligence (XAI). DARPA Report.
- Reichstein, M. , et al., 2019. Deep learning and process understanding for Earth system modeling. Nature.
- Holzinger, A. , et al., 2019. Causability and explainability of AI in agriculture. Machine Learning and Knowledge Extraction.
- Lundberg, S.M. , Lee, S.I., 2019. A unified approach to interpreting model predictions. NeurIPS, 4774. [Google Scholar]
- Crane-Droesch, A. , 2019. Machine learning methods for crop yield prediction. Environmental Research Letters, 1140. [Google Scholar]
- Ahmad, S. , et al., 2020. Explainable AI for soil moisture prediction. Environmental Modelling & Software.
- Government of India, 2020. Pradhan Mantri Fasal Bima Yojana: Guidelines.
- Linardatos, P. , et al., 2021. Explainable AI: A review of methods. Artificial Intelligence Review, 1301. [Google Scholar]
- Ray, D.K. , et al., 2015. Climate variation explains a third of global crop yield variability. Nature Communications.
- Rolnick, D. , et al., 2022. Tackling climate change with machine learning. ACM Computing Surveys.
- Kumar, S. , et al., 2023. AI-based crop yield prediction under climate variability. Agricultural Systems, 1036. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).