1
|
Zhu J, Qiu J, Chen S, Chen S, Zhang H. An application of YOLOv8 integrated with attention mechanisms for detection of grape leaf black rot spots. PLoS One 2025; 20:e0321788. [PMID: 40233077 PMCID: PMC11999144 DOI: 10.1371/journal.pone.0321788] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2024] [Accepted: 03/11/2025] [Indexed: 04/17/2025] Open
Abstract
Aiming at the problems of low recognition rate of small target spots in grape leaf images and low detection accuracy due to low resolution of input images. In this paper, an improved recognition network based on YOLO v8 is constructed. In the constructed network, Spatial Pyramid Dilated Convolution (SPD-Conv) is used to replace each stepwise convolution layer and each pooling layer to better capture the detailed features of small targets. Meanwhile, the Efficient Multi-Scale Attention (EMA) Module is incorporated into the Neck part of YOLO v8 to make full use of the feature information of each detection layer and improve the accuracy of feature representation.The Plant Village dataset and the orchard image set are used to test the network performance of the improved model. The experimental test results show that the improved YOLO v8 has 92.64% precision, 93.28% recall and 96.17% AP. The size of model was a mere 7.1M. Compared to YOLO v8, the improvements are 2.38%, 1.91%, and 1.13%, respectively. Compared with the mainstream networks YOLO v4, YOLO v5, YOLO v6, and YOLO v7, precision is improved by 4.74%, 3.38%, 4.15%, and 4.69%, respectively. Therefore, the improved network proposed in this paper can improve the detection accuracy of small target objects and also identify the black rot disease of grape leaves more accurately.
Collapse
Affiliation(s)
- Jiajun Zhu
- College of Yonyou Digital Intelligence, Nantong Institute of Technology, Nantong, China
| | - Jinlin Qiu
- College of Information Science and Technology, Nantong University, Nantong,China
| | - Shouwen Chen
- Institute of Mathematics and Finance, Chuzhou University, Chuzhou, China
| | - Shuxi Chen
- College of Yonyou Digital Intelligence, Nantong Institute of Technology, Nantong, China
| | - Haifei Zhang
- School of Information Engineering, Nantong Institute of Technology, Nantong, China
| |
Collapse
|
2
|
Xiong H, Li J, Wang T, Zhang F, Wang Z. EResNet-SVM: an overfitting-relieved deep learning model for recognition of plant diseases and pests. JOURNAL OF THE SCIENCE OF FOOD AND AGRICULTURE 2024; 104:6018-6034. [PMID: 38483173 DOI: 10.1002/jsfa.13462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Revised: 02/01/2024] [Accepted: 03/14/2024] [Indexed: 04/28/2024]
Abstract
BACKGROUND The accurate recognition and early warning for plant diseases and pests are a prerequisite of intelligent prevention and control for plant diseases and pests. As a result of the phenotype similarity of the hazarded plant after plant diseases and pests occur, as well as the interference of the external environment, traditional deep learning models often face the overfitting problem in phenotype recognition of plant diseases and pests, which leads to not only the slow convergence speed of the network, but also low recognition accuracy. RESULTS Motivated by the above problems, the present study proposes a deep learning model EResNet-support vector machine (SVM) to alleviate the overfitting for the recognition and classification of plant diseases and pests. First, the feature extraction capability of the model is improved by increasing feature extraction layers in the convolutional neural network. Second, the order-reduced modules are embedded and a sparsely activated function is introduced to reduce model complexity and alleviate overfitting. Finally, a classifier fused by SVM and fully connected layers are introduced to transforms the original non-linear classification problem into a linear classification problem in high-dimensional space to further alleviate the overfitting and improve the recognition accuracy of plant diseases and pests. The ablation experiments further demonstrate that the fused structure can effectively alleviate the overfitting and improve the recognition accuracy. The experimental recognition results for typical plant diseases and pests show that the proposed EResNet-SVM model has 99.30% test accuracy for eight conditions (seven plant diseases and one normal), which is 5.90% higher than the original ResNet18. Compared with the classic AlexNet, GoogLeNet, Xception, SqueezeNet and DenseNet201 models, the accuracy of the EResNet-SVM model has improved by 5.10%, 7%, 8.10%, 6.20% and 1.90%, respectively. The testing accuracy of the EResNet-SVM model for 6 insect pests is 100%, which is 3.90% higher than that of the original ResNet18 model. CONCLUSION This research provides not only useful references for alleviating the overfitting problem in deep learning, but also a theoretical and technical support for the intelligent detection and control of plant diseases and pests. © 2024 Society of Chemical Industry.
Collapse
Affiliation(s)
- Haitao Xiong
- College of Mechanical and Electrical Engineering, Qingdao Agricultural University, Qingdao, China
| | - Juan Li
- College of Mechanical and Electrical Engineering, Qingdao Agricultural University, Qingdao, China
| | - Tiewei Wang
- College of Mechanical and Electrical Engineering, Qingdao Agricultural University, Qingdao, China
- Jimo District Water Conservancy Bureau of Qingdao City, Qingdao, China
| | - Fan Zhang
- College of Mechanical and Electrical Engineering, Qingdao Agricultural University, Qingdao, China
| | - Ziyang Wang
- College of Mechanical and Electrical Engineering, Qingdao Agricultural University, Qingdao, China
| |
Collapse
|
3
|
Karim MJ, Goni MOF, Nahiduzzaman M, Ahsan M, Haider J, Kowalski M. Enhancing agriculture through real-time grape leaf disease classification via an edge device with a lightweight CNN architecture and Grad-CAM. Sci Rep 2024; 14:16022. [PMID: 38992069 PMCID: PMC11239930 DOI: 10.1038/s41598-024-66989-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Accepted: 07/08/2024] [Indexed: 07/13/2024] Open
Abstract
Crop diseases can significantly affect various aspects of crop cultivation, including crop yield, quality, production costs, and crop loss. The utilization of modern technologies such as image analysis via machine learning techniques enables early and precise detection of crop diseases, hence empowering farmers to effectively manage and avoid the occurrence of crop diseases. The proposed methodology involves the use of modified MobileNetV3Large model deployed on edge device for real-time monitoring of grape leaf disease while reducing computational memory demands and ensuring satisfactory classification performance. To enhance applicability of MobileNetV3Large, custom layers consisting of two dense layers were added, each followed by a dropout layer, helped mitigate overfitting and ensured that the model remains efficient. Comparisons among other models showed that the proposed model outperformed those with an average train and test accuracy of 99.66% and 99.42%, with a precision, recall, and F1 score of approximately 99.42%. The model was deployed on an edge device (Nvidia Jetson Nano) using a custom developed GUI app and predicted from both saved and real-time data with high confidence values. Grad-CAM visualization was used to identify and represent image areas that affect the convolutional neural network (CNN) classification decision-making process with high accuracy. This research contributes to the development of plant disease classification technologies for edge devices, which have the potential to enhance the ability of autonomous farming for farmers, agronomists, and researchers to monitor and mitigate plant diseases efficiently and effectively, with a positive impact on global food security.
Collapse
Affiliation(s)
- Md Jawadul Karim
- Department of Electrical & Computer Engineering, Rajshahi University of Engineering & Technology, Rajshahi, 6204, Bangladesh
| | - Md Omaer Faruq Goni
- Department of Electrical & Computer Engineering, Rajshahi University of Engineering & Technology, Rajshahi, 6204, Bangladesh
| | - Md Nahiduzzaman
- Department of Electrical & Computer Engineering, Rajshahi University of Engineering & Technology, Rajshahi, 6204, Bangladesh
| | - Mominul Ahsan
- Department of Computer Science, University of York, Deramore Lane, Heslington, York, YO10 5GH, UK
| | - Julfikar Haider
- Department of Engineering, Manchester Metropolitan University, Chester Street, Manchester, M1 5GD, UK
| | - Marcin Kowalski
- Institute of Optoelectronics, Military University of Technology, Gen. S. Kaliskiego 2, 00-908, Warsaw, Poland.
| |
Collapse
|
4
|
Liu E, Gold KM, Combs D, Cadle-Davidson L, Jiang Y. Deep semantic segmentation for the quantification of grape foliar diseases in the vineyard. FRONTIERS IN PLANT SCIENCE 2022; 13:978761. [PMID: 36161031 PMCID: PMC9501698 DOI: 10.3389/fpls.2022.978761] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/26/2022] [Accepted: 08/18/2022] [Indexed: 06/16/2023]
Abstract
Plant disease evaluation is crucial to pathogen management and plant breeding. Human field scouting has been widely used to monitor disease progress and provide qualitative and quantitative evaluation, which is costly, laborious, subjective, and often imprecise. To improve disease evaluation accuracy, throughput, and objectiveness, an image-based approach with a deep learning-based analysis pipeline was developed to calculate infection severity of grape foliar diseases. The image-based approach used a ground imaging system for field data acquisition, consisting of a custom stereo camera with strobe light for consistent illumination and real time kinematic (RTK) GPS for accurate localization. The deep learning-based pipeline used the hierarchical multiscale attention semantic segmentation (HMASS) model for disease infection segmentation, color filtering for grapevine canopy segmentation, and depth and location information for effective region masking. The resultant infection, canopy, and effective region masks were used to calculate the severity rate of disease infections in an image sequence collected in a given unit (e.g., grapevine panel). Fungicide trials for grape downy mildew (DM) and powdery mildew (PM) were used as case studies to evaluate the developed approach and pipeline. Experimental results showed that the HMASS model achieved acceptable to good segmentation accuracy of DM (mIoU > 0.84) and PM (mIoU > 0.74) infections in testing images, demonstrating the model capability for symptomatic disease segmentation. With the consistent image quality and multimodal metadata provided by the imaging system, the color filter and overlapping region removal could accurately and reliably segment grapevine canopies and identify repeatedly imaged regions between consecutive image frames, leading to critical information for infection severity calculation. Image-derived severity rates were highly correlated (r > 0.95) with human-assessed values, and had comparable statistical power in differentiating fungicide treatment efficacy in both case studies. Therefore, the developed approach and pipeline can be used as an effective and efficient tool to quantify the severity of foliar disease infections, enabling objective, high-throughput disease evaluation for fungicide trial evaluation, genetic mapping, and breeding programs.
Collapse
Affiliation(s)
- Ertai Liu
- Department of Biological and Environmental Engineering, Cornell University, Ithaca, NY, United States
| | - Kaitlin M. Gold
- Plant Pathology and Plant-Microbe Biology Section, School of Integrative Plant Science, Cornell AgriTech, Cornell University, Geneva, NY, United States
| | - David Combs
- Plant Pathology and Plant-Microbe Biology Section, School of Integrative Plant Science, Cornell AgriTech, Cornell University, Geneva, NY, United States
| | - Lance Cadle-Davidson
- Plant Pathology and Plant-Microbe Biology Section, School of Integrative Plant Science, Cornell AgriTech, Cornell University, Geneva, NY, United States
- Grape Genetics Research Unit, United States Department of Agriculture-Agricultural Research Service, Geneva, NY, United States
| | - Yu Jiang
- Horticulture Section, School of Integrative Plant Science, Cornell AgriTech, Cornell University, Geneva, NY, United States
| |
Collapse
|
5
|
Abstract
Crop disease has been a severe issue for agriculture, causing economic loss for growers. Thus, disease identification urgently needs to be addressed, especially for precision agriculture. As of today, deep learning has been widely used for crop disease identification combined with optical imaging sensors. In this study, a lightweight convolutional neural network model is designed and validated on two publicly available imaging datasets and one self-built dataset with 28 types of leaf and leaf disease images of 6 crops as the research object. This model is an improvement of the existing convolutional neural network, reducing the floating-point operations by 65%. In addition, dilated depth-wise convolutions were used to increase the network receptive field and improve the model recognition accuracy without affecting the network computational speed. Meanwhile, two attention mechanisms are optimized to reduce attention module computation, improving the capability of the model to select the correct regions of interest. After training, this model achieved an average accuracy of 99.86%, and the image calculation speed was 0.173 s. Comparing with 11 backbone models and 5 latest crop leaf disease identification studies, the proposed model achieved the highest accuracy. Therefore, this model with an advantage of balancing between the calculation speed and recognition accuracy. Furthermore, the proposed model provides a theoretical basis and technical support for the practical application and mobile terminal applications of crop disease recognition in precision agriculture.
Collapse
|
6
|
Zeng T, Li C, Zhang B, Wang R, Fu W, Wang J, Zhang X. Rubber Leaf Disease Recognition Based on Improved Deep Convolutional Neural Networks With a Cross-Scale Attention Mechanism. FRONTIERS IN PLANT SCIENCE 2022; 13:829479. [PMID: 35295638 PMCID: PMC8918928 DOI: 10.3389/fpls.2022.829479] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/05/2021] [Accepted: 01/24/2022] [Indexed: 06/14/2023]
Abstract
Natural rubber is an essential raw material for industrial products and plays an important role in social development. A variety of diseases can affect the growth of rubber trees, reducing the production and quality of natural rubber. Therefore, it is of great significance to automatically identify rubber leaf disease. However, in practice, different diseases have complex morphological characteristics of spots and symptoms at different stages and scales, and there are subtle interclass differences and large intraclass variation between the symptoms of diseases. To tackle these challenges, a group multi-scale attention network (GMA-Net) was proposed for rubber leaf disease image recognition. The key idea of our method is to develop a group multi-scale dilated convolution (GMDC) module for multi-scale feature extraction as well as a cross-scale attention feature fusion (CAFF) module for multi-scale attention feature fusion. Specifically, the model uses a group convolution structure to reduce model parameters and provide multiple branches and then embeds multiple dilated convolutions to improve the model's adaptability to the scale variability of disease spots. Furthermore, the CAFF module is further designed to drive the network to learn the attentional features of multi-scale diseases and strengthen the disease features fusion at different scales. In this article, a dataset of rubber leaf diseases was constructed, including 2,788 images of four rubber leaf diseases and healthy leaves. Experimental results show that the accuracy of the model is 98.06%, which was better than other state-of-the-art approaches. Moreover, the model parameters of GMA-Net are only 0.65 M, and the model size is only 5.62 MB. Compared with MobileNetV1, V2, and ShuffleNetV1, V2 lightweight models, the model parameters and size are reduced by more than half, but the recognition accuracy is also improved by 3.86-6.1%. In addition, to verify the robustness of this model, we have also verified it on the PlantVillage public dataset. The experimental results show that the recognition accuracy of our proposed model is 99.43% on the PlantVillage dataset, which is also better than other state-of-the-art approaches. The effectiveness of the proposed method is verified, and it can be used for plant disease recognition.
Collapse
Affiliation(s)
- Tiwei Zeng
- School of Information and Communication Engineering, Hainan University, Haikou, China
- Mechanical and Electrical Engineering College, Hainan University, Haikou, China
| | - Chengming Li
- Mechanical and Electrical Engineering College, Hainan University, Haikou, China
| | - Bin Zhang
- School of Information and Communication Engineering, Hainan University, Haikou, China
- Mechanical and Electrical Engineering College, Hainan University, Haikou, China
| | - Rongrong Wang
- Mechanical and Electrical Engineering College, Hainan University, Haikou, China
| | - Wei Fu
- School of Information and Communication Engineering, Hainan University, Haikou, China
- Mechanical and Electrical Engineering College, Hainan University, Haikou, China
| | - Juan Wang
- Mechanical and Electrical Engineering College, Hainan University, Haikou, China
| | - Xirui Zhang
- School of Information and Communication Engineering, Hainan University, Haikou, China
- Mechanical and Electrical Engineering College, Hainan University, Haikou, China
| |
Collapse
|
7
|
Zhang Z, Qiao Y, Guo Y, He D. Deep Learning Based Automatic Grape Downy Mildew Detection. FRONTIERS IN PLANT SCIENCE 2022; 13:872107. [PMID: 35755646 PMCID: PMC9227981 DOI: 10.3389/fpls.2022.872107] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Accepted: 04/27/2022] [Indexed: 05/04/2023]
Abstract
Grape downy mildew (GDM) disease is a common plant leaf disease, and it causes serious damage to grape production, reducing yield and fruit quality. Traditional manual disease detection relies on farm experts and is often time-consuming. Computer vision technologies and artificial intelligence could provide automatic disease detection for real-time controlling the spread of disease on the grapevine in precision viticulture. To achieve the best trade-off between GDM detection accuracy and speed under natural environments, a deep learning based approach named YOLOv5-CA is proposed in this study. Here coordinate attention (CA) mechanism is integrated into YOLOv5, which highlights the downy mildew disease-related visual features to enhance the detection performance. A challenging GDM dataset was acquired in a vineyard under a nature scene (consisting of different illuminations, shadows, and backgrounds) to test the proposed approach. Experimental results show that the proposed YOLOv5-CA achieved a detection precision of 85.59%, a recall of 83.70%, and a mAP@0.5 of 89.55%, which is superior to the popular methods, including Faster R-CNN, YOLOv3, and YOLOv5. Furthermore, our proposed approach with inference occurring at 58.82 frames per second, could be deployed for the real-time disease control requirement. In addition, the proposed YOLOv5-CA based approach could effectively capture leaf disease related visual features resulting in higher GDE detection accuracy. Overall, this study provides a favorable deep learning based approach for the rapid and accurate diagnosis of grape leaf diseases in the field of automatic disease detection.
Collapse
Affiliation(s)
- Zhao Zhang
- College of Mechanical and Electronic Engineering, Northwest A&F University, Xianyang, China
- College of Electronic and Electrical Engineering, Baoji University of Arts and Sciences, Baoji, China
- Key Laboratory of Agricultural Internet of Things, Ministry of Agriculture and Rural Affairs, Northwest A&F University, Xianyang, China
- Shaanxi Key Laboratory of Agricultural Information Perception and Intelligent Service, Northwest A&F University, Xianyang, China
| | - Yongliang Qiao
- Faculty of Engineering, Australian Centre for Field Robotics (ACFR), The University of Sydney, Sydney, NSW, Australia
- *Correspondence: Yongliang Qiao
| | - Yangyang Guo
- College of Mechanical and Electronic Engineering, Northwest A&F University, Xianyang, China
- Key Laboratory of Agricultural Internet of Things, Ministry of Agriculture and Rural Affairs, Northwest A&F University, Xianyang, China
- Shaanxi Key Laboratory of Agricultural Information Perception and Intelligent Service, Northwest A&F University, Xianyang, China
| | - Dongjian He
- College of Mechanical and Electronic Engineering, Northwest A&F University, Xianyang, China
- Key Laboratory of Agricultural Internet of Things, Ministry of Agriculture and Rural Affairs, Northwest A&F University, Xianyang, China
- Shaanxi Key Laboratory of Agricultural Information Perception and Intelligent Service, Northwest A&F University, Xianyang, China
- Dongjian He
| |
Collapse
|