Submitted:
13 June 2025
Posted:
17 June 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Methods
2.1. Understanding of the Agricultural Problem
2.2. Data collection and Processing
2.3. Comparison and Selection of Predictive Algorithms
2.4. Predictive Modeling with XGBoost.
2.4. Mobile Application Development
2.5. Implementation and Functional Validation Testing
3. Results and Discussion
3.1. Performance of the XGBoost Model
- Mean Square Error Medio (MSE): 53.6068
- Root Mean Square Error (RMSE): 7.3217
- Coefficient of Determination (R²): 0.9399
3.2. Variable Importance Analysis
3.3. Technical Evaluation of the Offline Mobile Application
3.4. Participatory Field Validation
3.5. Comparative Analysis with Previous Studies
3.6. General Discussion
4. Conclusions
Author Contributions
Founding
Founding
Acknowledgments
Conflicts of Interest
References
- Charry, A., Perea, C., Ramírez, K., Zambrano, G., Yovera, F., Santos, A., ... & Pulleman, M., «La economía agridulce de los diferentes sistemas de producción de cacao en Colombia, Ecuador y Perú.,» Agricultural Systems, 2025.
- Mohan, R. J., Rayanoothala, P. S., & Sree, R. P., «Next-gen agriculture: integrating AI and XAI for precision crop yield predictions.,» Frontiers in Plant Science, 2025. [CrossRef]
- Pineda, Medina, D., Miranda Cabrera, I., de la Cruz, R. A., Guerra Arzuaga, L., Cuello Portal, S., & Bianchini, M., «A mobile app for detecting potato crop diseases.,» Journal of Imaging, p. 10(2), 2024.
- Shamshiri, R. R., Sturm, B., Weltzien, C., Fulton, J., Khosla, R., Schirrmann, M., ... & Hameed, I. A., «Digitalization of agriculture for sustainable crop production: a use-case review.,» Frontiers in Environmental Science, 2024. [CrossRef]
- Schröer, C., Kruse, F., & Gómez, J. M., «A systematic literature review on applying CRISP-DM process model.,» Procedia Computer Science, 2021. [CrossRef]
- Fan, C., Chen, M., Wang, X., Wang, J. y Huang, B., «A review on data preprocessing techniques toward efficient and reliable knowledge discovery from building operational data.,» Frontiers in energy research, 2021. [CrossRef]
- Chicco, D., Warrens, MJ y Jurman, G., «The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation.,» Peerj computer science, 2021.
- Yu, K., Liu, Y., & Sharma, A., «Analyze the effectiveness of the algorithm for agricultural product delivery vehicle routing problem based on mathematical model.,» International Journal of Agricultural and Environmental Information Systems (IJAEIS),, 2021. [CrossRef]
- Chen, R. C., Dewi, C., Huang, S. W., & Caraka, R. E., «Selecting critical features for data classification based on machine learning methods.,» Journal of Big Data., 2020. [CrossRef]
- Joshua, V., Priyadharson, S. M., & Kannadasan, R., «Exploration of machine learning approaches for paddy yield prediction in eastern part of Tamilnadu.,» Agronomy, 2021. [CrossRef]
- Pechlivani, E. M., Gkogkos, G., Giakoumoglou, N., Hadjigeorgiou, I., & Tzovaras, D., «Towards sustainable farming: a robust decision support system’s architecture for agriculture 4.0.,» International Conference on Digital Signal Processing (DSP), 2023.
- Hacinas, E. A. S., Querol, L. S., Santos, K. L. T., Matira, E. B., Castillo, R. C., Arcelo, M., ... & Rustia, D. J. A., «Rapid Automatic Cacao Pod Borer Detection Using Edge Computing on Low-End Mobile Devices.,» Agronomy, 2024. [CrossRef]
- Nti, IK, Zaman, A., Nyarko-Boateng, O., Adekoya, AF y Keyeremeh, F., «A predictive analytics model for crop suitability and productivity with tree-based ensemble learning.,» Decision Analytics Journal, 2023. [CrossRef]
- Jo, J. S., Kim, D. S., Jo, W. J., Sim, H. S., Lee, H. J., Moon, Y. H., ... & Kim, S. K., «Prediction of strawberry fruit yield based on cultivar-specific growth models in the tunnel-type greenhouse,» Horticulture, Environment, and Biotechnology, 2022.
- Hinojosa, C., Sanchez, K., Camacho, A., & Arguello, H., «AgroTIC: Bridging the gap between farmers, agronomists, and merchants through smartphones and machine learning.,» arXiv preprint arXiv:2305.12418., 2023.
- J. Lowenberg DeBoer, «Economics of adoption for digital automated technologies in agriculture.,» AgEcon SEARCH RESEARCH IN AGRICULTURAL & APPLIED ECONOMICS, 2022.
- T. Marinchenko, Digital transformations in agriculture. In Complex Systems: Innovation and Sustainability in the Digital, Cham: Springer International Publishing., 2021.
- Fan, L.; Fang, S.; Fan, J.; Wang, Y.; Zhan, L, «Rice Yield Estimation Using Machine Learning and Feature Selection in Hilly and Mountainous Chongqing, China,» Agriculture, 2024. [CrossRef]
- Baio, F. H. R., Santana, D. C., Teodoro, L. P. R., Oliveira, I. C. D., Gava, R., de Oliveira, J. L. G., ... & Shiratsuchi, L. S., «Maize yield prediction with machine learning, spectral variables and irrigation management.,» Remote Sensing, 2022.
- Lamos, Díaz, H., Puentes Garzón, DE, & Zarate Caicedo, DA, «Comparación entre modelos de aprendizaje automático para pronóstico de rendimiento en cultivos de cacao en santander, Colombia.,» Revista Facultad de Ingeniería, 2020.
- Chaudhary, M., Gastli, MS, Nassar, L. y Karray, F., «Deep Learning Approaches for Forecasting Strawberry Yield and Prices Using Satellite Imagery and Station-Based Soil Parameters.,» Preimpresión de arXiv:2102.09024, 2021.
- Araújo, S. O., Peres, R. S., Ramalho, J. C., Lidon, F., & Barata, J., «Machine learning applications in agriculture: current trends, challenges, and future perspectives.,» Agronomy, 2023.







| Model | Mean R² | RMSE | MSE |
|---|---|---|---|
| Decision tree | 0.7300 | 9.6600 | 93.35 |
| Random Forest | 0.7471 | 7.3530 | — |
| Gradient Boosting | 0.7926 | 6.5234 | — |
| Support Vector Machine | 0.6405 | 8.9588 | — |
| XGBoost | 0.9399 | 7.3217 | 53.61 |
| Study | Cultivo | Algorithm | MSE | RMSE | R² | Offline | Country |
|---|---|---|---|---|---|---|---|
| This study | Cacao | XGBOOST | 9.63 | 3.10 | 0.939 | Yes | Ecuador |
| Fan and Zhan (2024) | Rice | Random Forest | - | 529.1 kg/ha | 0.85 | No | China |
| Baio (2022) | Maize | Random Forest | - | - | 0.94 | No | Brasil |
| Lamos (2020) | Cacao | Gradient Boosting | - | 20.41 | 0.68 | No | Colombia |
| Chaudhary (2021) | Strawberry | CNN-LSTM | - | - | 0.89 | No | USA |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).