Multi-Modal LLMs in Agriculture: A Comprehensive Review
Author(s) -
Ranjan Sapkota,
Rizwan Qureshi,
Muhammad Usman Hadi,
Syed Zohaib Hassan,
Ferhat Sadak,
Maged Shoman,
Muhammad Sajjad,
Fayaz Ali Dharejo,
Achyut Paudel,
Jiajia Li,
Zhichao Meng,
John Shutske,
Manoj Karkee
Publication year - 2025
Publication title -
ieee transactions on automation science and engineering
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 1.314
H-Index - 87
eISSN - 1558-3783
pISSN - 1545-5955
DOI - 10.1109/tase.2025.3612154
Subject(s) - robotics and control systems , power, energy and industry applications , components, circuits, devices and systems
Given the rapid emergence and applications of Multi-Modal Large Language Models (MM-LLMs) across various scientific fields, insights regarding their applicability in agriculture are still only partially explored. This paper conducts an in-depth review of MM-LLMs in agriculture, focusing on understanding how MM-LLMs can be developed and implemented to optimize agricultural processes, increase efficiency, and reduce costs. Recent studies have explored the capabilities of MM-LLMs in agricultural information processing and decision-making. Despite these advancements, significant gaps persist, particularly in addressing domain-specific challenges such as variable data quality and availability, integration with existing agricultural systems, and the creation of robust training datasets that accurately represent complex agricultural environments. Moreover, a comprehensive understanding of the capabilities, challenges, and limitations of MM-LLMs in agricultural information processing and application is still missing. Exploring these areas is crucial to providing the community with a broader perspective and a clearer understanding of MM-LLMs’ applications, establishing a benchmark for the current state and emerging trends in this field. To bridge this gap, this survey reviews the progress of MM-LLMs and their utilization in agriculture, with an additional focus on 11 key research questions (RQs), where 4 RQs are general and 7 RQs are agriculture focused. By addressing these RQs, this review outlines the current opportunities and challenges, limitations, and future roadmap for MM-LLMs in agriculture. The findings indicate that multi-modal MM-LLMs not only simplify complex agricultural challenges but also significantly enhance decision-making and improve the efficiency of agricultural image processing. These advancements position MM-LLMs as an essential tool for the future of farming. For continued research and understanding, an organized and regularly updated list of papers on MM-LLMs is available at https://github.com/JiajiaLi04/Multi-Modal-LLMs-in-Agriculture.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom