Preprint
Article

This version is not peer-reviewed.

AgriM-LLM: An Agriculture-Specific Multimodal Large Language Model for Intelligent Crop Disease and Pest Management

Submitted:

07 January 2026

Posted:

09 January 2026

You are already at the latest version

Abstract
Crop diseases and pests pose significant threats to global food security, demanding precise and efficient management solutions. While Multimodal Large Language Models (M-LLMs) offer promising avenues for intelligent agricultural diagnosis, general-purpose models often falter due to a lack of specialized visual feature extraction, inadequate understanding of agricultural terminology, and insufficient precision in prevention advice. To address these challenges, this paper introduces AgriM-LLM, a novel agriculture-specific multimodal large language model designed for enhanced crop disease and pest identification and prevention. AgriM-LLM integrates several key innovations: an Enhanced Vision Encoder featuring a Multi-Scale Feature Fusion module for capturing subtle visual symptoms; an Agriculture-Knowledge-Enhanced Q-Former that injects structured agricultural knowledge to guide cross-modal alignment; and a Domain-Adaptive Language Model employing a multi-stage progressive fine-tuning strategy for expert-level advice generation. Furthermore, an efficient LoRA-based fine-tuning strategy ensures practical computational resource utilization. Evaluated on a comprehensive Chinese agricultural multimodal dataset, AgriM-LLM consistently outperforms existing general-purpose and domain-specific baselines. Our ablation studies confirm the critical contribution of each proposed component, and detailed analyses demonstrate superior visual encoding, knowledge integration, and linguistic specialization. AgriM-LLM represents a significant step towards providing timely, accurate, and actionable intelligent decision support for farmers, thereby fostering sustainable agricultural development.
Keywords: 
;  ;  ;  ;  
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2026 MDPI (Basel, Switzerland) unless otherwise stated