Submitted:
07 July 2025
Posted:
09 July 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
- Multi-branch Large-kernel Fusion Depthwise (MLFD)module is introduced into the backbone, which utilizes the multi-branch structure to enhance the feature learning ability of the model, and at the same time adds a large convolutional kernel to extract a wide range of contextually complex features.
- MDTA (Multi-scale Dilated Transformer Attention) module: Captures contextual dependencies in both channel and spatial dimensions via dilated self-attention.
- Lo-Head detection head:Reduces model complexity by optimizing the bounding box and classification branches using grouped and depthwise convolution strategies,enabling the model to achieve a better balance between efficiency and performance.
- Multiple datasets such as rice leaves, potato leaves and tomato leaves were used for disease detection, and the results of the generalization experiments were compared, demonstrating the model’s excellent ability to detect diseases across plant classes.
2. Materials and Methods
2.1. Data Set
2.2. General Technical Route
2.3. The Proposed MML-YOLO
2.3.1. MLFD Module
2.3.2. Execution Process of the MDTA Module
2.3.3. Lo-Detect Head.
2.4. Experimental Platform and Evaluation Metrics
3. Results
3.1. Selection of Baseline Model
3.2. Ablation Experiments
3.3. Comparison Experiment
3.4. Generalization Experiment
4. Discussion
5. Conclusions
References
- Liu, X.; Li, Q.; Yin, B.; Yan, H.; Wang, Y. Assessment of macro, trace and toxic element intake from rice: differences between cultivars, pigmented and non-pigmented rice. Scientific Reports 2024, 14, 10398. [Google Scholar] [CrossRef] [PubMed]
- Mukherjee, R.; Ghosh, A.; Chakraborty, C.; De, J.N.; Mishra, D.P. Rice leaf disease identification and classification using machine learning techniques: A comprehensive review. Engineering Applications of Artificial Intelligence 2025, 139, 109639. [Google Scholar] [CrossRef]
- Zhou, H.; Cai, D.; Lin, L.; Huang, D.; Wu, B.-M. Recognition of multi-symptomatic rice leaf blast in dual scenarios by using convolutional neural networks. Smart Agricultural Technology 2025, 11, 100867. [Google Scholar] [CrossRef]
- Simhadri, C.G.; Kondaveeti, H.K.; Vatsavayi, V.K.; Mitra, A.; Ananthachari, P. Deep learning for rice leaf disease detection: A systematic literature review on emerging trends, methodologies and techniques. Information Processing in Agriculture 2024. [Google Scholar] [CrossRef]
- Wengler, M.R.; Talbot, N.J. Mechanisms of regulated cell death during plant infection by the rice blast fungus *Magnaporthe oryzae*. Cell Death & Differentiation 2025, 1–9.
- Lee, H.; Park, Y.; Kim, G.; Lee, J.H. Pre-symptomatic diagnosis of rice blast and brown spot diseases using chlorophyll fluorescence imaging. Plant Phenomics 2025, 100012. [Google Scholar] [CrossRef]
- Wu, B.; Zhang, X.; Zhao, J.; Zeng, B.; Cao, Z. Identification and analysis of miRNA-mRNA regulatory modules associated with resistance to bacterial leaf streak in rice. BMC Genomics 2025, 26. [Google Scholar] [CrossRef]
- Prakasam, V.; Savani, A.K.; Sukesh, P. Unveiling the potential antifungal role of essential oils in the management of *Rhizoctonia solani* causing sheath blight of rice. European Journal of Plant Pathology 2025, 171, 245–256. [Google Scholar] [CrossRef]
- Kumar, B.N.; Sakthivel, S. Rice leaf disease classification using a fusion vision approach. Scientific Reports 2025, 15, 8692. [Google Scholar] [CrossRef]
- Sangaiah, A.K.; Yu, F.-N.; Lin, Y.-B.; Shen, W.-C.; Sharma, A. UAV T-YOLO-rice: An enhanced tiny YOLO networks for rice leaves diseases detection in paddy agronomy. IEEE Transactions on Network Science and Engineering 2024, 11, 5201–5216. [Google Scholar] [CrossRef]
- Chakrabarty, A.; Ahmed, S.T.; Islam, M.F.U.; Aziz, S.M.; Maidin, S.S. An interpretable fusion model integrating lightweight CNN and transformer architectures for rice leaf disease identification. Ecological Informatics 2024, 82, 102718. [Google Scholar] [CrossRef]
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence 2016, 39, 1137–1149. [Google Scholar] [CrossRef]
- He, K.; Gkioxari, G.; Dollár, P.; Girshick, R. Mask R-CNN. In Proceedings of the IEEE International Conference on Computer Vision; 2017; pp. 2961–2969. [Google Scholar]
- Vijayakumar, A.; Vairavasundaram, S. YOLO-based object detection models: A review and its applications. Multimedia Tools and Applications 2024, 83, 83535–83574. [Google Scholar] [CrossRef]
- Liu, W.; Anguelov, D.; Erhan, D.; Szegedy, C.; Reed, S.; Fu, C.-Y.; Berg, A.C. SSD: Single shot multibox detector. Proceedings of ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, 11–14 October 2016; Part I; pp. 21–37. [Google Scholar]
- Gao, W.; Zong, C.; Wang, M.; Zhang, H.; Fang, Y. Intelligent identification of rice leaf disease based on YOLO V5-EFFICIENT. Crop Protection 2024, 183, 106758. [Google Scholar] [CrossRef]
- Ramadan, S.T.Y.; Islam, M.S.; Sakib, T.; Sharmin, N.; Rahman, M.M.; Rahman, M.M. Image-based rice leaf disease detection using CNN and generative adversarial network. Neural Computing and Applications 2025, 37, 439–456. [Google Scholar] [CrossRef]
- Tao, J.; Li, X.; He, Y.; Islam, M.A. CEFW-YOLO: A High-Precision Model for Plant Leaf Disease Detection in Natural Environments. Agriculture 2025, 15, 833. [Google Scholar] [CrossRef]
- Wang, C.; Li, H.; Deng, X.; Liu, Y.; Wu, T.; Liu, W.; Xiao, R.; Wang, Z.; Wang, B. Improved You Only Look Once v. 8 Model Based on Deep Learning: Precision Detection and Recognition of Fresh Leaves from Yunnan Large-Leaf Tea Tree. Agriculture 2024, 14, 2324. [Google Scholar] [CrossRef]
- Xu, K.; Hou, Y.; Sun, W.; Chen, D.; Lv, D.; Xing, J.; Yang, R. A Detection Method for Sweet Potato Leaf Spot Disease and Leaf-Eating Pests. Agriculture 2025, 15, 503. [Google Scholar] [CrossRef]
- Zhou, S.; Yin, W.; He, Y.; Kan, X.; Li, X. Detection of Apple Leaf Gray Spot Disease Based on Improved YOLOv8 Network. Mathematics 2025, 13, 840. [Google Scholar] [CrossRef]
- Shi, H.; Liu, C.; Wu, M.; Zhang, H.; Song, H.; Sun, H.; Li, Y.; Hu, J. Real-time detection of Chinese cabbage seedlings in the field based on YOLO11-CGB. Frontiers in Plant Science 2025, 16, 1558378. [Google Scholar] [CrossRef] [PubMed]
- Woo, S.; Park, J.; Lee, J.-Y.; Kweon, I.S. CBAM: Convolutional Block Attention Module. In Proceedings of the European Conference on Computer Vision (ECCV); 2018; pp. 3–19. [Google Scholar]
- Tan, M.; Pang, R.; Le, Q.V. EfficientDet: Scalable and Efficient Object Detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition; 2020; pp. 10781–10790. [Google Scholar]
- Abulizi, A.; Ye, J.; Abudukelimu, H.; Guo, W. DM-YOLO: Improved YOLOv9 model for tomato leaf disease detection. Frontiers in Plant Science 2025, 15, 1473928. [Google Scholar] [CrossRef] [PubMed]
- Liu, W.; Lu, H.; Fu, H.; Cao, Z. Learning to Upsample by Learning to Sample. In Proceedings of the IEEE/CVF International Conference on Computer Vision; 2023; pp. 6027–6037. [Google Scholar]
- Ma, S.; Xu, Y. MPDIoU: A Loss for Efficient and Accurate Bounding Box Regression. arXiv preprint arXiv:2307.07662, arXiv:2307.07662.
- Zhou, H.; Hu, Y.; Liu, S.; Zhou, G.; Xu, J.; Chen, A.; Wang, Y.; Li, L.; Hu, Y. A Precise Framework for Rice Leaf Disease Image–Text Retrieval Using FHTW-Net. Plant Phenomics 2024, 6, 0168. [Google Scholar] [CrossRef] [PubMed]
- Dosovitskiy, A.; Beyer, L.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; Gelly, S.; et al. An Image is Worth 16×16 Words: Transformers for Image Recognition at Scale. arXiv preprint arXiv:2010.11929, arXiv:2010.11929.
- Devlin, J.; Chang, M.-W.; Lee, K.; Toutanova, K. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers); 2019; pp. 4171–4186. [Google Scholar]
- Yu, X.-H.; Chen, G.-A.; Cheng, S.-X. Dynamic Learning Rate Optimization of the Backpropagation Algorithm. IEEE Transactions on Neural Networks 1995, 6, 669–677. [Google Scholar]
- Deari, S.; Ulukaya, S. A Hybrid Multistage Model Based on YOLO and Modified Inception Network for Rice Leaf Disease Analysis. Arabian Journal for Science and Engineering 2024, 49, 6715–6723. [Google Scholar] [CrossRef]
- Li, P.; Zhou, J.; Sun, H.; Zeng, J. RDRM-YOLO: A High-Accuracy and Lightweight Rice Disease Detection Model for Complex Field Environments Based on Improved YOLOv5. Agriculture 2025, 15, 479. [Google Scholar] [CrossRef]
- Li, Z.; Wu, W.; Wei, B.; Li, H.; Zhan, J.; Deng, S.; Wang, J. Rice Disease Detection: TLI-YOLO Innovative Approach for Enhanced Detection and Mobile Compatibility. Sensors 2025, 25, 2494. [Google Scholar] [CrossRef]
- Fang, K.; Zhou, R.; Deng, N.; Li, C.; Zhu, X. RLDD-YOLOv11n: Research on Rice Leaf Disease Detection Based on YOLOv11. Agronomy 2025, 15, 1266. [Google Scholar] [CrossRef]
- Zhang, H.; Jung, C.; Liu, X. Frequency-Aware Learned Image Compression Using Channel-Wise Attention and Restormer. IEEE Access 2025. [Google Scholar] [CrossRef]
- Kumar, A.; Yadav, S.P.; Kumar, A. An Improved Feature Extraction Algorithm for Robust Swin Transformer Model in High-Dimensional Medical Image Analysis. Computers in Biology and Medicine 2025, 188, 109822. [Google Scholar] [CrossRef]

| Bacterial Leaf Blight | Brown spot | Rice blast |
| Sheath blight | Bacterial Leaf Streak | Rice blast |




| Channel Group Attention | Window Multihead Attention |


| Original image | |
| YOLOv11n | |
| MML-YOLO |


| Model | FLOPs (G) | Params (M) | Precision | Recall | mAP50 | Model Size (MB) |
|---|---|---|---|---|---|---|
| yolov11-n | 6.3 | 2.58 | 0.9627 | 0.9517 | 0.9812 | 5.2 |
| yolov11-s | 21.3 | 9.41 | 0.9814 | 0.9757 | 0.9863 | 18.3 |
| yolov11-m | 67.7 | 20.03 | 0.9912 | 0.9827 | 0.9884 | 38.7 |
| yolov11-l | 86.6 | 25.28 | 0.9853 | 0.9849 | 0.9897 | 48.9 |
| yolov11-x | 194.4 | 56.83 | 0.9926 | 0.9806 | 0.9908 | 109.1 |
| MLFD | MDTA | Lo-Detect | Params(m) | FLOPs(G) | FPS | Precision | Recall | mAP50 | mAP50-95 | ModelSize(MB) |
|---|---|---|---|---|---|---|---|---|---|---|
| × | × | × | 2.58 | 6.3 | 330.35 | 0.9627 | 0.9517 | 0.9812 | 0.7743 | 5.2MB |
| ✓ | × | × | 2.92 | 7.2 | 247.02 | 0.9754 | 0.9623 | 0.9820 | 0.7806 | 6.5MB |
| × | ✓ | × | 2.89 | 7.5 | 208.76 | 0.9632 | 0.9589 | 0.9821 | 0.7841 | 7.1MB |
| × | × | ✓ | 2.31 | 5.1 | 355.00 | 0.9581 | 0.9543 | 0.9802 | 0.7719 | 4.7MB |
| ✓ | ✓ | × | 2.93 | 7.2 | 202.83 | 0.9763 | 0.9559 | 0.9881 | 0.7941 | 7.4MB |
| × | ✓ | ✓ | 2.65 | 6 | 261.11 | 0.9729 | 0.9555 | 0.9836 | 0.7883 | 6MB |
| ✓ | × | ✓ | 2.43 | 5.1 | 252.95 | 0.9634 | 0.9521 | 0.9834 | 0.7747 | 5.5MB |
| ✓ | ✓ | ✓ | 2.66 | 6.2 | 248.67 | 0.9726 | 0.9553 | 0.9872 | 0.7927 | 5.6MB |
| Model | Params(m) | FLOPs(G) | FPS | Precision | Recall | F1-Score | mAP50 | mAP50-95 | Model Size(MB) |
|---|---|---|---|---|---|---|---|---|---|
| YOLOv5n | 2.18 | 5.8 | 279.86 | 0.9629 | 0.9204 | 0.9406 | 0.9731 | 0.7396 | 4.5MB |
| YOLOv8n | 2.69 | 6.8 | 310.07 | 0.9614 | 0.9499 | 0.9555 | 0.9792 | 0.7693 | 5.4MB |
| YOLOv10n | 2.27 | 6.5 | 300.33 | 0.9506 | 0.9418 | 0.946 | 0.9786 | 0.7733 | 5.5MB |
| YOLOv11n | 2.58 | 6.3 | 330.35 | 0.9627 | 0.9517 | 0.9569 | 0.9812 | 0.7743 | 5.2MB |
| Faster-RCNN | 41.36 | 137 | 88.9 | 0.842 | 0.827 | — | 0.9576 | 0.676 | 31.6MB |
| MML-YOLO | 2.66 | 6 | 248.67 | 0.9726 | 0.9553 | 0.9639 | 0.9872 | 0.7927 | 5.6MB |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).