
Deployable Deep Learning for Cross-Domain Plant Leaf Disease Detection via Ensemble Learning, Knowledge Distillation, and Quantization
Author(s) -
Mohammad Junayed Hasan,
Suvodeep Mazumdar,
Sifat Momen
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3595390
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
Accurate leaf disease detection via smartphone-based deep learning holds immense potential for mitigating global crop losses. However, significant deployment challenges persist when transitioning from controlled laboratory environments to real-world agricultural conditions. Despite recent advances, three fundamental barriers remain: cross-domain generalization, severe class imbalance, and computational limitations for edge deployment. This study introduces the first open cross-domain benchmark for tomato leaf disease detection, unifying PlantVillage and TomatoVillage datasets into 15 harmonized disease classes to enable reproducible evaluation across domains. We propose a unified optimization approach integrating ensemble learning, knowledge distillation, and quantization across 24 deep learning architectures for edge-compatible disease detection. Strategic data augmentation and ADASYN-based balancing mitigate the severe 75:1 class imbalance, while systematic hyperparameter tuning optimizes model configurations. Our four-model ensemble (DenseNet-121, ResNet-101, DenseNet-201, EfficientNet-B4) achieves 99.15% accuracy via soft-voting. Knowledge distillation transfers ensemble capabilities to compact ShuffleNetV2, maintaining 98.53% accuracy with 163× parameter reduction and 43.6× speedup. INT8 quantization provides 671× compression (1.46 MB) while sustaining 97.46% accuracy, enabling 0.29ms inference. Cross-dataset validation demonstrates robust generalization with only 3.45% performance degradation. Grad-CAM++ and LIME-based explainability confirm biologically grounded attention patterns aligned with plant pathology principles. Finally, field deployment via a multilingual and multi-platform Flutter application validates real-world feasibility, establishing the first scalable framework bridging research and practical agricultural deployment. This work sets a standardized benchmark and extensible methodology for future multi-dataset precision agriculture research. Codes and implementations are publicly available at: https://github.com/junayed-hasan/tomato-leaf-ai.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom