Submitted:
04 June 2025
Posted:
06 June 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Background Work and Preliminaries
A. Crop Yield Prediction

B. Satellite Remote Sensing
- The Moderate Resolution Imaging Spectroradiometer (MODIS): Provides a high temporal resolution with intervals from one day to eight days with moderate spatial resolution (250–1000 m).
- Sentinel-2: Sentinel-2 offers 10–20 m resolution and frequent revisits for vegetation monitoring.
- Landsat: Historical 30-meter data for long-term trend analysis.
C. Vegetation Indices
- Normalized Difference Vegetation Index (NDVI):

- Enhanced Vegetation Index (EVI):
- Normalized Difference Water Index (NDWI):
D. Machine Learning and Deep Learning in Agriculture
- Random Forest (RF): A technique combining multiple decision trees to enhance the prediction accuracy and mitigate overfitting. It can effectively manage nonlinear relationships and high-dimensional data.
- XGBoost: An optimized gradient boosting algorithm, XGBoost excels in speed, accuracy, and handling of struc- tured tabular data. This effectively manages the complex feature interactions.
- Convolutional Neural Networks (CNNs): Convolutional neural networks are highly effective for analyzing spatial data, such as satellite images, to identify features such as the health of plants, the structure of canopies, and the boundaries of crops.
- Long Short-Term Memory (LSTM) Networks: LSTM networks, a variant of RNN, are highly effective at capturing long-term dependencies in sequential data, making them particularly suitable for modeling temporal variations in crop growth stages over an agricultural season.
E. Multisource Data Fusion
3. Literature Survey
| Comparison Table | |||
| Sl. No. | Title | Algorithms | Limitations |
| 1. | Crop Yield Prediction Using a DRL Model for Sustainable Agrarian Applications | DRQN: Deep RL combined with recurrent neural networks | Gradient issues with long sequences and limited generalization |
| 2. | Estimating Crop Yields with Remote Sensing and Deep Learning | DL models using weather data, soil data, and crop calendars | Depends on self-reported yields; performance varies across crops |
| 3. | Estimating soybean yields using causal inference and deep learning approaches using satellite remote sensing data | structural causal models with graph attention networks (SCMGAT) | causal findings need broader validation |
| 4. | Crop Yield Prediction using Deep Learning Algorithm based on CNN-LSTM with attention layer and skip connection | CNN-LSTM with multihead attention and skip connections | small dataset size. The model has not yet reached optimal performance. |
| 5. | Indian Crop Yield Prediction using LSTM Deep Learning Networks | LSTM-based two-phase crop prediction | LSTM is time- and memory-intensive for training |
| 6. | Predicting Agriculture Yields Based on Machine Learning Using Regression and Deep Learning | DT, XGBoost RF, CNN, LSTM | Limited to 10 crops; relies on historical data; lacks satellite or remote sensing data |
| 7. | A Systematic Literature Review on Crop Yield Prediction with Deep Learning and Remote Sensing | CNN, LSTM, CNN-LSTM, YieldNet, BNN | Requires large amounts of data and integration complexity |
| 8. | Rice yield prediction through integration of biophysical parameters with SAR and optical remote sensing data using machine learning models | ML models (XGB, NNET, RF, SVM, KNN); optical & SAR remote sensing integration; prediction at 45, 60, 90 DAT | Study in Udham Singh Nagar; requires validation for other regions, limited to rice |
| 8. | Rice yield prediction through integration of biophysical parameters with SAR and optical remote sensing data using machine learning models | ML models optical & SAR remote sensing integration; prediction at 45, 60, 90 DAT | Study in Udham Singh Nagar; needs validation for other regions; limited to rice |
| 9. | Enhancing crop yield estimation from remote sensing data: A comparative study of the Quartile Clean Image method and vision transformer | Quartile Clean image preprocessing, CNN&LSTM, ViT | Negative R² for corn in some years; ViT’s RMSE is higher than that of CNN and LSTM |
| 10. | Enhancing Crop Yield Prediction Utilizing Machine Learning on Satellite-Based Vegetation Health Indices | sICA, PCA-ML, and EBT (Ensemble Boosted Tree) | PCA impact-limited; VCI/TCI accuracy affected by variability |
| 11. | Automated Rice Crop Yield Prediction using Sine Cosine Algorithm with Weighted Regularized Extreme Learning Machine | SCA-WRELM Model | Focuses only on rice crop, SCA-WRELM can be further improved |
| 12. | A GNN-RNN Approach for Harnessing Geospatial and Temporal Information: Application to Crop Yield Prediction | GNN-RNN Model | Can underpredict high yields, especially in the midwest |
| 13. | A CNN-RNN Framework for Crop Yield Prediction | CNN-RNN hybrid model | Requires data, lacks interpretability and is sensitive to input quality |
| 14. | A deep learning multilayer perceptron and remote sensing approach for soil health-based crop yield estimation | DLMLP for yield; ML models for soil health | Limited by SAR penetration and missing full-season variability |
| 15. | Deep learning-enhanced remote sensing integrated crop modeling for rice yield prediction | Deep Learning crop modeling with an FNN, LSTM, GRU, BLSTM | Lower accuracy, limited interpretability, and region-specific training |
4. Proposed System
5. Methodology
- 1)
-
Data Collection and Preprocessing
-
Gather multisource data, including:
- -
- Satellite imagery (e.g., MODIS, Sentinel, Landsat)
- -
- Vegetation indices (NDVI, EVI, NDWI)
- -
- Weather data (temperature, precipitation, wind speed)
- -
- Soil health parameters
- -
- Historical yield data
- Preprocess and normalize data to handle missing values and ensure consistency.
-
- 2)
-
Deep-learning Architecture
-
Implementation of a hybrid CNN-LSTM model:
- -
- CNN component for spatial feature extraction from satellite imagery.
- -
- LSTM component for modeling temporal crop growth patterns.
- Incorporate attention mechanisms to improve model interpretability.
- Utilize transfer learning techniques for better gen- eralization across regions.
-
- 3)
-
Multitask Learning
- Simultaneously predict crop yield and other agronomic indicators.
- Integrate causal inference techniques to understand relationships between input features.
- 4)
-
Model Training and Optimization
- Use diverse datasets from multiple geographic regions and crop types.
- Implement techniques like exponential decay learning rates and skip connections.
- Employ ensemble methods to improve overall prediction accuracy.
- 5)
-
Validation and Testing
- Perform cross-validation across different regions and crop seasons.
- Evaluate the effectiveness in comparison with con- ventional machine learning models such as Random Forest and SVM.
- Evaluate using metrics such as RMSE, MAE, R2, and correlation coefficients.

6. Results and Discussion


7. Future Work
8. Conclusion
References
- Jeong, S.; Ko, J.; Ban, J.-O.; Shin, T.; Yeom, J.-M. Deep learning-enhanced remote sensing-integrated crop modeling for rice yield prediction. Ecological Informatics 2024, 84, 102886. [Google Scholar] [CrossRef]
- Tripathi, A.; Tiwari, R.K.; Tiwari, S.P. A deep learning multilayer perceptron and remote sensing approach for soil health based crop yield estimation. International Journal of Applied Earth Observation and Geoinformation 2022, 113, 102959. [Google Scholar] [CrossRef]
- Khaki, S.; Archontoulis, S.V.; Wang, L. A CNN-RNN framework for crop yield prediction. Frontiers in Plant Science 2020, 10. [Google Scholar] [CrossRef] [PubMed]
- Fan, J.; Gomes, C.P.; Bai, J.; Ortiz-Bobea, A.; Li, Z. A GNN-RNN Approach for Harnessing Geospatial and Temporal Information: Application to Crop Yield Prediction. Proceedings of the AAAI Conference on Artificial Intelligence. [CrossRef]
- Thirumal, S.; Latha, R. Automated Rice Crop Yield Prediction using Sine Cosine Algorithm with Weighted Regularized Extreme Learning Machine. [CrossRef]
- Elavarasan, D.; Vincent, P.M.D. Crop Yield Prediction Using Deep Reinforcement Learning Model for Sustainable Agrarian Applications. IEEE Access 2020, 8, 86886–86901. [Google Scholar] [CrossRef]
- Cunha, R.L.F.; Silva, B. ESTIMATING CROP YIELDS WITH REMOTE SENSING AND DEEP LEARNING. ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences 2020, IV-3/W2-2020, 59–64. [Google Scholar] [CrossRef]
- Wang, F.; et al. Estimating Soybean Yields using Causal Inference and Deep Learning Approaches with Satellite Remote Sensing Data. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 2024. [Google Scholar] [CrossRef]
- Kalmani, V.H.; Dharwadkar, N.V.; Thapa, V. Crop Yield Prediction using Deep Learning Algorithm based on CNN-LSTM with Attention Layer and Skip Connection. Indian Journal Of Agricultural Research. [CrossRef]
- Kuriakose, S.M.; Singh, T. Indian Crop Yield Prediction using LSTM Deep Learning Networks. [CrossRef]
- Sharma, P.; Aneja, N.; Aneja; Dadheech, P. Predicting agricultural yields based on machine learning using regression and deep learning. IEEE Access 2023, 11, 111255–111264. [Google Scholar] [CrossRef]
- Muruganantham, P.; Islam, N.; Samrat, N.H.; Wibowo, S.; Grandhi, S. A Systematic Literature Review on Crop Yield Predic- tion with Deep Learning and Remote Sensing. Remote Sensing 2022, 14, 1990. [Google Scholar] [CrossRef]
- Sah, S.; Haldar, D.; Singh, R.; Das, B.; Nain, A.S. Rice yield prediction through integration of biophysical parameters with SAR and optical remote sensing data using machine learning models. Scientific Reports 2024, 14. [Google Scholar] [CrossRef] [PubMed]
- Thakkar, M.; Vanzara, R. Enhancing crop yield estimation from remote sensing data: a comparative study of the Quartile Clean Image method and vision transformer. Discover Applied Sciences 2024, 6. [Google Scholar] [CrossRef]
- Pham, T.; Bui, L.K.; Kuhn, M.; Awange, J.; Nguyen, B.V. Enhancing Crop Yield Prediction Utilizing Machine Learning on Satellite-Based Vegetation Health Indices. Sensors 2022, 22, 719. [Google Scholar] [CrossRef] [PubMed]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).