Submitted:
11 February 2025
Posted:
13 February 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. IoT-Driven Processing and Integration Pipeline
- Light Sensors: Monitor light intensity to ensure optimal exposure for photosynthesis.
- Moisture Sensors: Measure soil moisture levels to prevent over- or under-watering.
- Temperature-Humidity Sensors: Track ambient conditions crucial for maintaining favorable growing environments.
- CO₂ Sensors: Monitor carbon dioxide concentrations to optimize photosynthetic efficiency.
- Water-Nutrient Intake Sensors: Analyze nutrient levels, including potassium, phosphorus, and nitrogen, to ensure a balanced supply.
2.1.1. Data Storage of IoT Devices
2.1.2. Correlation Analysis
2.1.3. Cloud-Based Data Processing and Analytics
2.2. AI-Driven Predictive Modeling and Parameter Computation in Crop Management
2.2.1. Predictive Modeling
Support Vector Machines (SVM)
Deep Neural Networks (DNN)
Gradient Boosting
2.2.2. Parameter Computaion
Time Series Analysis
Multivariate Regression for Resource Contributions
Multivariate linear regression
Random Forest Model
2.2.3. Integrated Agricultural Efficiency Metric (IAEM)
Metric Scale
- Poor Performance: IAEM < 0.1
- Indicates significant inefficiencies in resource management, alert accuracy, or latency handling.
- Moderate Performance: 0.1 ≤ IAEM < 0.2
- Reflects average system performance with room for noticeable improvement.
- Good Performance: 0.2 ≤ IAEM < 0.3
- Demonstrates effective optimization across metrics but with potential for further fine-tuning.
- Excellent Performance: IAEM ≥ 0.3
- Represents highly optimized system performance with minimal latency, high accuracy, and efficient resource utilization.
Data Latency Metric (DL)
Real-Time Alert Accuracy (RA)
Cloud Data Availability (CDA)

2.3. Dataset Structure and Model Inputs
Input Parameters
Output Targets
Preprocessing
Algorithm-Specific Inputs/Outputs
2.4. MATLAB Simulink for Real-Time Feedback
2.5. Experimental Setup for Evaluating Plant Growth
3. Results and Discussion
3.1. System Performance
3.1.1. Real-Time IoT-Driven Alert System Responsiveness



3.1.2. IoT-Based Monitoring for Growth
- IoT-based Monitoring (green line): Plants experienced dynamic adjustments in light, moisture, and nutrient levels, resulting in faster growth and higher biomass accumulation over 8 weeks.
- Non-IoT Monitoring (pink line): This scenario represents plant growth under static conditions, where environmental factors are not adjusted in real-time. As a result, the plant grows more slowly compared to the IoT-monitored system.
3.1.3. IoT Data Storage
3.2. Growth Monitoring and Environmental Correlations
3.3. Models performance
3.3.1. Prediction Models
3.3.1. Support Vector Machine
- Week 2: Observed growth peaked at 298 g, and the SVM model accurately predicted the same value, demonstrating high early-stage prediction accuracy.
- Week 6: Growth dropped to 179 g, and the SVM model closely reflected this decline, though with a slight underprediction.
- Week 12: Observed growth reached 301 g, with the SVM model predicting 300 g, showcasing its effectiveness in capturing late-stage growth trends.
3.3.2. Gradient Boosting
- Week 2: Observed crop yield reached 298 g, and the model's final prediction closely aligned, indicating strong early-stage accuracy.
- Week 6: Observed crop yield dropped to 179 g, and the model captured this decline, though earlier iterations slightly overestimated the yield.
- Week 12: Observed yield peaked at 301 g, and the final prediction value reached 300 g, demonstrating the model's ability to accurately track significant growth increases.
3.3.3. Deep Neural Network (DNN)
- Week 2: Observed growth was 298 g, and predictions from both train and test datasets were highly aligned, showing strong model accuracy.
- Week 6: Observed growth dropped to 179 g, but the model overestimated growth at 240 g (train) and underestimated it at 160 g (test). This mid-cycle deviation suggests a need for recalibration, likely due to sensitivity in temperature variations.
- Week 12: Observed growth was 301 g, and predictions were 290 g (train) and 280 g (test), demonstrating better alignment in late-stage growth.
3.3.4. Time Series Analysis
- Week 2: Observed growth was 298 g, closely matching the model's predicted value, confirming high early-stage accuracy.
- Week 6: Observed growth dropped to 179 g, while the model predicted 185 g, remaining within the tolerance range.
- Week 12: Observed growth peaked at 301 g, and the model predicted 295 g, demonstrating robust long-term forecasting capabilities.
3.3.5. Comparison of Growth Prediction Models
- Week 4: Observed growth was 240 g. The best-performing models were Gradient Boosting (≈238 g) and Time Series Analysis (≈239 g), while SVM (≈230 g) and DNN (≈227 g) slightly underpredicted growth.
- Week 12: Observed growth was 320 g. Gradient Boosting (≈319 g) and Time Series Analysis (≈318 g) remained the most accurate, while SVM (≈310 g) and DNN (≈312 g) still slightly underpredicted.
3.3.2. Factor Analysis Models3.3.6. Random Forest Model
- Light intensity had the highest importance (0.778), making it the dominant environmental factor in crop yield.
- Other key factors included Nitrogen (0.069), Moisture (0.048), Temperature (0.037), Phosphorus (0.034), and Potassium (0.033).
3.3.7. Multivariate Regression
- Phosphorus (β=0.54) was the most influential factor.
- Potassium (β=-0.43) had a negative impact on growth, suggesting potential overuse issues.
- Other factors included Temperature (β=0.23), Moisture (β=0.06), Nitrogen (β=0.04), and Light (β=0.03).
3.4. Ressource Efficiency and System Optimization
3.4.1. Comparison of Smart Farming Approaches
3.4.2. Integrated Agricultural Efficiency Metric (IAEM)
Data Latency Metric (DL)
Resource Efficiency (RE)
Real-Time Alert Accuracy (RA)
Cloud Data Availability (CDA)
Pairwise Comparison algorithm using AHP for weights
5. Conclusions
Future Research Directions
- Sensor Network Expansion: Integrate additional sensors for a more granular understanding of environmental conditions.
- AI Model Refinement: Utilize advanced deep learning architectures and ensemble learning methods to improve prediction accuracy and real-time adaptability.
- Scalability: Explore the application of this system in large-scale operations, including open-field farming, to validate its robustness and efficiency in diverse contexts.
- Cost-Effectiveness Analysis: Assess the economic viability of implementing such systems on a broader scale, particularly in resource-limited regions.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
Appendix A. Agriculture Data Table
| Week | Light (lumens) | Moisture_Content (%) | Temperature (°C) | Nitrogen (mg/kg) | Phosphorus (mg/kg) | Potassium (mg/kg) | Crop Yield (g) | Growth (cm) |
| 1.0 | 6872.3 | 78.18 | 15.63 | 140.83 | 20.0 | 40.0 | 214.26 | 30.38 |
| 2.0 | 9753.57 | 66.51 | 27.73 | 73.96 | 22.0 | 42.0 | 298.15 | 42.27 |
| 3.0 | 8659.97 | 76.37 | 21.29 | 64.49 | 24.0 | 41.0 | 265.06 | 37.58 |
| 4.0 | 7993.29 | 73.69 | 25.17 | 98.95 | 25.0 | 44.0 | 245.76 | 34.84 |
| 5.0 | 5780.09 | 55.87 | 33.15 | 148.57 | 28.0 | 45.0 | 184.15 | 26.11 |
| 6.0 | 5779.97 | 75.31 | 19.99 | 74.21 | 26.0 | 43.0 | 179.29 | 25.42 |
| 7.0 | 5290.42 | 25.31 | 23.21 | 117.21 | 30.0 | 46.0 | 164.12 | 23.27 |
| 8.0 | 9330.88 | 31.76 | 30.11 | 126.16 | 32.0 | 48.0 | 287.83 | 40.81 |
| 9.0 | 8005.58 | 22.31 | 19.58 | 73.76 | 31.0 | 47.0 | 243.53 | 34.53 |
| 10.0 | 8540.36 | 39.52 | 16.54 | 122.42 | 27.0 | 44.0 | 264.09 | 37.44 |
| 11.0 | 5102.92 | 43.32 | 20.8 | 86.78 | 29.0 | 45.0 | 159.66 | 22.24 |
| 12.0 | 9849.55 | 36.28 | 18.22 | 113.23 | 33.0 | 49.0 | 301.23 | 42.31 |
Appendix B. Raw Data for Growth Prediction and System Optimization Analysis
| Week | Observed Growth (g) | SVM Prediction (g) | Gradient Boosting Prediction (g) | DNN Prediction (Train/Test) (g) | Time Series Prediction (g) |
| 2 | 298 | 298 | 298 | 298 / 298 | 298 |
| 4 | 240 | 230 | 238 | 227 / 225 | 239 |
| 6 | 179 | 176 | 181 | 240 / 160 | 185 |
| 8 | 260 | 255 | 259 | 250 / 248 | 258 |
| 10 | 280 | 273 | 278 | 275 / 270 | 277 |
| 12 | 301 | 300 | 300 | 290 / 280 | 295 |
Appendix C. Feature Importance from Random Forest Model
| Feature | Importance Score |
| Light Intensity | 0.778 |
| Nitrogen | 0.069 |
| Moisture | 0.048 |
| Temperature | 0.037 |
| Phosphorus | 0.034 |
| Potassium | 0.033 |
Appendix D. Regression Coefficients from Multivariate Analysis
| Factor | Regression Coefficient (β) |
| Phosphorus | 0.54 |
| Potassium | -0.43 |
| Temperature | 0.23 |
| Moisture | 0.06 |
| Nitrogen | 0.04 |
| Light | 0.03 |
Appendix E. System Optimization Metrics Before and After IoT Implementation
| Resource | Before IoT (%) | After IoT (%) | Reduction (%) |
| Light Usage | 100 | 75 | 25 |
| Nitrogen Usage | 100 | 70 | 30 |
| Phosphorus Usage | 100 | 65 | 35 |
| Potassium Usage | 100 | 60 | 40 |
| Water Usage | 100 | 70 | 30 |
Appendix F. Real-Time Alert Accuracy Improvements
| Experiment | Baseline System Accuracy (%) | Optimized System Accuracy (%) | Improvement (%) |
| 1 | 91 | 93 | +2 |
| 2 | 91 | 94 | +3 |
| 3 | 91 | 95 | +4 |
| 4 | 91 | 96 | +5 |
| 5 | 91 | 96 | +5 |
References
- Sahoo, S. , Anand, A., Mohanty, S., & Parvez, A. (2024). Trends in Agriculture Science: Integrating Data Analytics for Enhanced Decision Making by Farmers. Zenodo. [CrossRef]
- Jaeger, S. R. Vertical farming (plant factory with artificial lighting) and its produce: consumer insights. Current Opinion in Food Science 2024, 56, 101145. [Google Scholar] [CrossRef]
- Lakhiar, I. , Gao, J., Syed, T., Chandio, F. A., & Buttar, N. Modern plant cultivation technologies in agriculture under controlled environment: A review on aeroponics. Journal of Plant Interactions 2018, 13, 1472308. [Google Scholar] [CrossRef]
- Dozono, K. , Amalathas, S., & Saravanan, R. (2021). The impact of cloud computing and artificial intelligence in digital agriculture. In Advances in Artificial Intelligence and Cloud Computing (pp. 547–558). [CrossRef]
- Oliveira, M. , Zorzeto-Cesar, T., Attux, R., & Rodrigues, L. H. (2023). Leveraging data from plant monitoring into crop models. [CrossRef]
- Wootton, C. (2016). General Purpose Input/Output (GPIO). In A Practical Guide to Raspberry Pi (pp. 365–386). [CrossRef]
- Klein, S. (2017). Azure Event Hubs. In Pro SQL Server on Microsoft Azure (pp. 407–427). [CrossRef]
- Borra, P. Impact and innovations of Azure IoT: Current applications, services, and future directions. International Journal of Recent Technology and Engineering (IJRTE) 2024, 13, 21–26. [Google Scholar] [CrossRef]
- Papadopoulos, S. (2022). A practical, powerful, robust, and interpretable family of correlation coefficients. SSRN. [CrossRef]
- Wakchaure, S. Overview of Apache NiFi. International Journal of Advanced Research in Science, Communication and Technology2023, 523-530. [CrossRef]
- C. Cortes and V. Vapnik. Support-vector networks. Machine Learning 1995, 20, 273–297. [Google Scholar] [CrossRef]
- LeCun, Y. , Bengio, Y., & Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef] [PubMed]
- Friedman, J. H. Greedy Function Approximation: A Gradient Boosting Machine. The Annals of Statistics 2000, 29, 1189–1232. [Google Scholar] [CrossRef]
- Kumar, S. (2024). Time series analysis. In Python for Accounting and Finance (pp. 381–409). [CrossRef]
- Wei, X. , Liu, X., Fan, Y., Tan, L., & Liu, Q. A unified test for the AR error structure of an autoregressive model. Axioms 2022, 11, 690. [Google Scholar] [CrossRef]
- Delsole, T. , & Tippett, M. (2022). Multivariate linear regression. In [Book Title] (pp. 314–334). [CrossRef]
- Pan, M. , Xia, B., Huang, W., Ren, Y., & Wang, S. PM2.5 concentration prediction model based on Random Forest and SHAP. International Journal of Pattern Recognition and Artificial Intelligence 2024, 38. [Google Scholar] [CrossRef]
- Avila, A. F. D. , & Evenson, R. E. Total Factor Productivity Growth in Agriculture: The Role of Technological Capital. Handbook of Agricultural Economics 2010, 4, 3769–3822. [Google Scholar] [CrossRef]
- Shang, J. , Liu, J., Ma, B., Zhao, T., Jiao, X., Geng, X., Huffman, T., Kovacs, J. M., & Walters, D. Mapping spatial variability of crop growth conditions using RapidEye data in Northern Ontario, Canada. Remote Sensing of Environment 2015, 168, 113–125. [Google Scholar] [CrossRef]
- Hyndman, R. J. , & Koehler, A. B. Another look at measures of forecast accuracy. International Journal of Forecasting 2006, 22, 679–688. [Google Scholar] [CrossRef]
- Bushuev, A. , Litvinov, Y. V., Boikov, V., Bystrov, S., Nuyya, O., & Sergeantova, M. (2024). Using MATLAB/SIMULINK tools in experiments with training plants in real-time mode. [CrossRef]
- Peralta, E. A. , Freeman, J. , & Gil, A. F. Population expansion and intensification from a Malthus-Boserup perspective: A multiproxy approach in Central Western Argentina. Quaternary International 2024, 689–690, 55–65. [Google Scholar] [CrossRef]
- Muhammad, D. , Isah, A. A., Hassan, I., Sale, A., Ibrahim, M., & Hussaini, A. Embedded Internet of Things (IoT) Environmental Monitoring System Using Raspberry Pi. Sule Lamido University Journal of Science & Technology 2023, 6, 247–255. [Google Scholar] [CrossRef]
- Ramakrishna, C. , Venkateshwarlu, B., Srinivas, J., & Srinivas, Dr. S. IoT-based smart farming using cloud computing and machine learning. International Journal of Innovative Technology and Exploring Engineering 2019, 9, 3555–3458. [Google Scholar] [CrossRef]
- Bilal, M. , Tayyab, M., Hamza, A., Shahzadi, K., & Rubab, F. The Internet of Things for smart farming: Measuring productivity and effectiveness. Electronics Communications Scientific Applications 2023, 10, 16012. [Google Scholar] [CrossRef]




















| Algorithm | Input Data | Output | Purpose |
|---|---|---|---|
| Support Vector Machine (SVM) | Weekly sensor data (light, moisture, nutrients) | Binary classification (optimal/suboptimal conditions) | Enables threshold adjustments for alerts and ensures environmental parameters remain optimal. |
| Gradient Boosting | Historical sensor data | Predicted growth (g) | Guides long-term resource allocation by predicting plant development under different conditions. |
| Deep Neural Network (DNN) | Time-series sensor data | Growth forecasts (g) | Proactive environmental adjustments to optimize nutrient and water supply. |
| Time Series Analysis | Daily sensor readings | Growth trends with tolerance bands | Planning & watering scheduling based on predicted fluctuations. |
| Random Forest | Sensor data + growth metrics | Feature importance scores (0–1) | Prioritizing critical environmental factors affecting plant growth. |
| Multivariate Regression | All sensor variables | Regression coefficients (β-values) | Quantifies nutrient/growth relationships for targeted optimization. |
| Algorithm | Input Data | Key Output | Impact on System Optimization |
|---|---|---|---|
| Support Vector Machine (SVM) | Weekly averages of light, moisture, and nutrients. | 96% alert accuracy for suboptimal conditions. | Reduced over-watering by 15% through dynamic moisture threshold adjustments. |
| Gradient Boosting | Historical sensor data aggregated weekly. | Predicted Week 12 growth: 300g (vs. 301g actual). | Guided nitrogen allocation, reducing usage by 30% while maintaining yield. |
| Deep Neural Network (DNN) | Daily time-series sensor data (light, moisture, temperature). | Week 6 growth forecast: 240g (vs. 179g actual). | Highlighted mid-cycle prediction errors, prompting recalibration of temperature controls. |
| Time Series Analysis | Daily sensor readings from Azure Blob Storage. | Growth trends within ±5% tolerance bands. | Optimized irrigation schedules, reducing water usage by 30%. |
| Random Forest | Raw sensor data + growth metrics (biomass, height). | Light importance: 0.778 (highest among factors). | Prioritized light control in IAEM, contributing to a 25% reduction in energy consumption. |
| Multivariate Regression | All sensor variables (light, moisture, temperature, N, P, K). | Phosphorus (β=0.54), Potassium (β=-0.43). | Identified potassium overuse, leading to a 40% reduction and balanced nutrient strategies. |
| System | Our Azure-Based Pipeline (2024) | ThingSpeak-Based System (2023) | IoT Smart Farming (2019) | Cloud-Based Smart Farming (2023) |
| Cloud Platform | Azure IoT Hub, Blob Storage, Event Hub | ThingSpeak Cloud | Generic cloud platform | Cloud computing services |
| Data Processing | Advanced ETL pipeline with MATLAB Simulink | Basic aggregation | ML for irrigation | IoT sensors, centralized processing |
| Data Storage | Blob Storage with containers by sensor type | Centralized in ThingSpeak | General cloud storage | Real-time cloud storage |
| Sensors Used | Light, Moisture, Temp, N, P, K, CO2 sensors | DHT11 for temp/humidity | Soil moisture, temp, UV | Temp, humidity, soil moisture, pH |
| Hardware | Raspberry Pi for sensor integration | Raspberry Pi Model B | Zig bee-enabled sensors | Raspberry Pi with sensors |
| Real-Time Feedback | Dynamic adjustments via MATLAB Simulink | No feedback loop | Automated irrigation | Manual/automated control |
| Predictive Analytics | Gradient Boosting, SVM, Deep Neural Networks | No predictive analytics | ML for irrigation | Basic ML for optimization |
| Monitoring Tools | Power BI dashboards, real-time alerts | Web/mobile apps | Mobile apps, SMS alerts | Web/mobile monitoring |
| Decision Support | Predictive models, real-time alerts | Alerts based on thresholds | Automated decisions | Limited decision support |
| Security | Secure transfer with Azure authentication | Basic security | Secure communication | IoT security protocols |
| Scalability | Highly scalable for large-scale | Small-scale applications | Moderate scalability | Scalable for small/medium farms |
| Innovations | ETL pipeline, predictive analytics | Cost-effective, simple monitoring | IoT, ML for irrigation | Combines IoT, ML, cloud |
| Limitations | Might be Higher cost for larger scale | Limited analytics, basic system | Focuses on irrigation | Limited predictive analytics |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).