2. Related Work
The growing interest in using AI techniques to recognize or detect plant diseases has been analysed by researchers (e.g., Singh (2020) [
5] and Abade et al. (2021) [
6]) in their review works. The works mainly presented a review of the effective use of diverse imaging techniques and computer vision approaches for identifying crop diseases. The use of deep learning (DL), support vector machine (SVM), K-means clustering, and K-nearest neighbours (K-NN) are the major ones used for such task. The works also mention that, with the advent of DL, the shift is moving gradually from the machine learning-based approaches, which mostly employed image processing techniques for feature extractions. It was observed that most image processing techniques used in recognising or identifying plant diseases are mainly employed at the preprocessing stage to help improve the quality of the images (dataset) for the DL model to be trained effectively. Again, the 121 papers reviewed by Abade et al. (2021) [
6] concluded that Convolution Neural Network (CNN) has become state-of-the-art technique for identifying and classifying plant diseases. The papers reviewed by Abade et al. (2021) [
6] were published between 2011 and 2021, spanning 10 years. Therefore, they can serve as a critical guide to understanding the domain and direction of DL for plant disease and pest classification, especially with approaches related to disease detection, dataset characteristics, crop and pathogens investigation, and so on. According to the various literature reviews, CNN-based architecture is an impressive solution for crop disease detection and can contribute to shaping the future of computer vision in this direction. In this section, we present some of the standard works that have employed Deep Learning (DL) for pests and diseases identification, the use of the DL model for tomato and maize and other crops' diseases and pests’ identification, and we finally explore how deep feature extraction approaches are currently deployed to improve accuracy as well as reduce resource consumption of AI-enabled mobile app.
Identification of plant leaf disease using a nine-layer convolutional neural network was deployed by Geetharamani and Arun Pandian (2019) [
7] to identify disease. The Deep CNN model was trained using a publicly available dataset with 39 different classes of plant leaves and background images. To improve the model's performance, the authors employed six image augmentation methods: gamma correction, noise injection, image flipping, PCA colour augmentation, rotation and scaling to have a robust model for disease classification. The proposed model achieved 96.46% accuracy. While the proposed model was robust, the work focused on only plant diseases. Again, the number of classes for each category of plants was small; therefore, new models must be developed to handle higher numbers of diseases per plant category for practical application in real-world scenarios.
Ferentinos (2018) [
8] developed a deep learning model for plant disease detection and diagnoses. Specialized deep-learning models were trained to identify diseased or healthy leaves. In the work, several CNN-based models were implemented to determine which one effectively distinguishes healthy leaves from diseased ones. These models (binary classifiers) were designed to identify one out of two possible states (healthy or diseased) of the leave. VGG architecture achieved 99.53% accuracy, toping all the models used in the experiment. Though the results produced by Ferentino are impressive, current datasets warrant a multi-class model for identifying plant diseases.
Agarwal et al. (2020) [
9] developed a robust model for identifying diseases in tomato crops. According to Agarwal et al. (2020) [
9], pre-trained CNN models perform reasonably well but are computationally expensive (running time and memory demand). Using a dataset from PlantVillage, a publicly available dataset, the proposed lightweight model achieved 98.4% accuracy, performing better than the traditional machine learning techniques relying on handcrafted image feature extraction techniques and some pre-trained models. Interestingly, VGG16, a pre-trained model, achieved 93.5% accuracy, suggesting that lightweight models can deliver the required solution at some point. It must be mentioned that new pre-trained models have recently been developed to help image classification tasks with impressive accuracies. For example, the EfficentNet family of pre-trained models have demonstrated their effectiveness in computational complexity and high accuracy at fewer epochs during the models’ training. Developing solutions using one of the state-of-the-art pre-train models will ensure that such solutions provide acceptable outcomes. However, the generalization of developed models can sometimes be challenging.
The use of hybrid DL-based and ensemble classifier models to improve the classification of diseases in tomato crops has been demonstrated by [
10,
11]. The concept of the ensembled classifiers is to amalgamate the outcomes of weak and relatively more robust models, thereby creating a new classifier known to yield superior results compared to individual classifiers. These ensemble methods operate faster and more effectively and hold promise for AI models on resource-constrained devices. Given the current emphasis within the research community on rapid and accurate methods for diagnosing and classifying various plant types and their diseases, this development is significant [
11]. The models were trained on the publicly available PlantVillage dataset, consisting of tomato images, and subsequently tested using publicly accessible Taiwan tomato leaf data. The most successful ensemble technique achieved an impressive accuracy of 95.98% on the tomato dataset. However, it's worth noting that the dataset used in this study solely comprises images of tomato leaves, indicating a limitation in identifying diseases affecting fruits and pests attacking tomato plants.
The challenge of achieving high accuracy performance for multiclass disease detection influenced the design of a new approach to preprocessing images for Deep Learning models by [
12]. In their work, anisotropic filtering was applied to all images during the preprocessing stages to remove unwanted distortions in images. U-net was further used to segment an image into background and foreground, where the foreground pixels were accepted as pixels belonging to regions of interest. An accuracy of 92.4% was recorded for the DbnetAlexnet model on segmented images, which was impressive compared to the unsegmented images. Bacterial spot, healthy, late blight, early blight, leaf mould, tomato mosaic virus, tomato yellow leaf mould, tomato mosaic virus, and tomato yellow leaf curl virus were the classes used to train the model. The classes used to develop the model were only diseases that affect the leaves of the tomato. As much as the work is a significant contribution to the development of DL models for plant pathology, the work did not focus on diseases and pests that affect the tomato fruits. Again, the accuracy obtained is significant, but more can be done to develop a robust model for the identification of tomato diseases and pests.
Sathiya et al. (2022) [
13] claimed that the current disease detection methodologies are unsuitable for real-time, field scenarios because they are subject to prediction errors. This may be attributed to the controlled environment set up during the creation of the dataset (image acquisition), which may be different during the usage of the model. The study proposed using soft computing techniques (CDD-H_HSC) to automatically identify diseases, especially at the earlier stages where most of the existing solutions fail. Just like Badiger et al. (2023) [
12] and Sathiya et al. (2022) [
13] also employed segmentation to separate the diseased area from the input plant leaf image at the preprocessing stage using the Multi-Swarm Coyote Optimization (MSCO) technique. To deal with the curse of dimension in model development, the Improved Chan-Vese Snake Optimization (ICVSO) algorithm is employed to minimize the dimensionality of the features for training. The Fitness-Distance Balance Deep Neural Network (FDB-DNN) classifier model identifies the leaf and its associated disease. The work demonstrates the benefit of segmentation in improving the identification of disease. The stages involved in this work make it a bit complicated to deploy on low-resource devices such as smartphones.
Shewale et al. (2023) [
14] proposed a deep learning architecture for early plant leaf disease detection. A significant importance of such architecture or model is its ability to detect disease early, enabling the farmer to take action before he loses all his crops when the disease appears late. The model was developed to identify and classify diseases in tomato leaves. The strength of the proposed model in their work is its ability to generalize well compared to existing models deployed in real-time. The model was, however, not integrated into any device for farmers to use in the field for identification or classification of diseases, as mentioned by the authors. There is, therefore, the need to work on models that can easily be deployed on devices such as mobile phones for adoption by farmers. The work again acknowledged that numerous approaches are available for automatic disease recognition. However, more needs to be done to make tools easy and accessible to farmers all over the world.
Using CNN to classify maize leaf disease has also gained momentum over time. A prominent one was proposed in the work by Priyadharshini et al. (2018) [
15]. The work modified the LeNet, a CNN-based architecture, to develop a model that can identify maize disease with an accuracy of 97.89%. The authors claim that CNN models are effective models for classification tasks; therefore, more experiments can be done to develop models that can achieve the perfect level of accuracy. The work demonstrated this by experimenting with various kernel sizes and depths. In the study, the authors inferred that kernels of sizes 3, 9, and 3 are better suited for maize leaf disease classification. A major limitation of the classifier is that the number of diseases used to train the model needs to be bigger to deploy such a model for wide acceptability. A dataset containing a significant number of maize diseases and maize pests can help develop comprehensive AI models that can effectively identify pests and diseases and, therefore, can be deployed for farmer usage.
Esgario et al. (2020) [
16] presented a paper on a mobile app that assists farmers in identifying pests and diseases of coffee leaves. The app has an AI model developed using deep learning architecture to make it possible to identify pests and diseases effectively. The authors achieved these remarkable results by designing a Computer Vision pipeline that segments and classifies leaf lesions and estimates the severity of stress caused by biotic agents in coffee leaves. ResNet50 is deployed for this solution and biotic stress classification; the accuracy rates were greater than 97%. A major limitation acknowledged by the researchers is that their proposed framework could only work when the smartphone with the app is connected to the internet. This challenge has been observed with several mobile app solutions, such as Plantix [
17]. Indeed, there is a trade-off between accuracy and running time, and therefore, lightweight CNN models that can fit on mobile phones cannot predict perfectly as those that are run on the webserver. Nonetheless, there is the need to develop an app that farmers can use without the need to connect to the internet since many farmers in developing countries such as Burkina Faso will not be able to benefit from the power of AI in transforming the domain of agriculture if internet connectivity is always needed for this task.
Jonathan et al. (2022) [
18] presented an innovative in-field plant disease detection solution using a smartphone's deep learning-based trained AI model. Plant Pathology on Palms, as the authors named it, went beyond training models to deploying them on mobile phones. In the work, a lightweight deep learning-based model is developed and integrated into a mobile phone for quick identification of diseases. As mentioned earlier, the reason for developing lightweight models is to mitigate the challenge of effectively running huge CNN models on resource-limited devices. The proposed framework by Jonathan consumes about 1MB and achieved 97%, 97.1%, and 96.4% accuracy on apple, citrus, and tomato leaves datasets respectively. One of their tiny models achieves 93.33% accuracy. The framework was compared with three variants of MobileNetV2 and demonstrated promising performance stability by outperforming most of the settings provided for the MobileNetV2 variants. The work contributes significantly to the discussion of using smartphone and edge computing devices for real-time classification or identification of diseases. However, the solution focused on three crops and some diseases that affect these crops. Expanding the dataset and developing alternative models that will be robust on the expanded dataset will also contribute to pushing machines to assist in the early detection of pests and diseases and, therefore, help farmers take the necessary action before they lose their crops completely.
Kaushik et al. (2023) [
19] proposed a deep learning model-based framework for mobile applications (TomFusioNet) to identify and classify tomato diseases and pests. TomFusioNet's pipeline comprises two modules, namely DeepRec and DeepPred. DeepRec provides preliminary disease recognition results, while DeepPred further identifies the type of disease in the crop. A multi-objective optimization-based, non-dominated sorting genetic algorithm was employed for hyperparameter tuning of DeepPred. The proposed models, DeepPred and DeepRec, achieved an average accuracy of 99.93% and 98.32%, respectively. TomFusioNet demonstrates a superior performance with an AUC value of 99.10% and a convergence loss of 0.021. The framework, however, was developed for effectively analyzing leaves of tomato crops for disease identification and classification.
Nag et al. (2023) [
20] have also proposed a mobile app-based tomato disease identification using fine-tuned CNNs. The study employed SqueexNet 1.1, VGG19, AlexNet, ResNet-50, and DesneNet-121 on a combined created image and images from PlantVillage. DenseNet-121 leads the pack of CNN architectures with 99.85% accuracy. The work primarily focused on the diseases that affect the leaves. The need for an internet connection to operate the mobile app can sometimes be challenging, especially in regions where internet connectivity is poor.
Using pre-train models for developing models for plant pests and disease detection presents a viable alternative to deploying DL on mobile phones [
21]. Wang et al. (2023) [
22] adopted EfficientNetB0 to develop a model to classify diseases in tomato plants. The developed model could achieve 91.4% average accuracy, which the authors believe is impressive. Implementing the pretrained model in the mobile application also exhibited satisfactory computational complexities (time and space) compared with other deep learning models implemented on mobile or smartphones. The authors acknowledged the necessity to improve the model’s accuracy and reliability to encourage farmers to accept the app wholeheartedly as a critical tool for productivity. The challenge again is the sharp drop in performance of models developed using laboratory-captured images (images captured under controlled lighting and uniform background conditions) [23-25]. This has led to the recent direction of creating datasets for training models that can effectively classify diseases in the wild. The need to, therefore, have a dataset that largely mimics the field environment is crucial for the adaptability of mobile apps for plant diseases and pest classification [
25]. Ngugi et al. (2020) [
25] created a dataset using such conditions and deployed a framework with a segmentation mechanism to separate regions of interest from the background. The proposed algorithm records a 94.39% mean F1 score against the rest. Wang (2023) [
22] demonstrates the efficacy of the EfficentNet family of models in the classification of diseases. Developing apps that integrate several EfficientNet models for identifying pests and diseases in three or more cropping systems will results in high demand of computational resources by the app. Leveraging on the strength of EfficientNet and other techniques can help improve the development of efficient apps for the detection of diseases and the identification and classification of pests.
The use of deep-feature techniques to extract the critical characteristics of images and train classifier models such as SVM, KNN, and Extreme Learning Machine (ELM) has also gained popularity due to outstanding performances. Türkoğlu et al. (2019) [
26] used such an approach for plant disease and pest detection. In their work, FC6 layers of AlexNet, VGG16, and VGG19 models produce better accuracy scores when compared with the others. EfficentNet family of models have proven superior to the model proposed by these authors and, therefore, can be deployed to extract features for the effective performance for plant pests and diseases classification.
In this work, we propose EfficentNetB3 deep feature and an ANN to effectively classify pests and diseases in onion, tomato and maize farms. We are able to reduce computation complexity with this approach, as well as improve performance as compared with the state-of-the-art EfficientNet family model.