Vision Transformer Algorithm for Plant Disease Detection: A Systematic Literature Review

Tri Handhika Avatar
Vision-Transformer-Algorithm-for-Plant-Disease-Detection_A-Systematic-Literature-Review_watermark

Abstract. Advances in deep learning have significantly impacted various sectors, including agriculture, by improving efficiency and sustainability. Computer vision-based plant disease detection that utilizes deep learning models such as Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) has replaced manual methods that require specialized skills. Vision Transformers (ViTs) with self-attention mechanisms excel at recognizing complex patterns in images of diseased plants, enabling early detection and more effective management. This study conducted a systematic literature review to evaluate the use of feature fusion-based transformer algorithms in plant disease detection and to identify current research trends, challenges, and future opportunities. The results show that ViTs can improve accuracy and efficiency in plant disease diagnosis, particularly under diverse environmental conditions. However, the adoption of this technology faces obstacles such as the need for large annotated datasets, significant computational costs, and variability in field conditions that can affect model performance. This research highlights the need for further innovation to address these challenges and expand the accessibility and reliability of deep learning-based plant disease detection for the global agricultural sector.