1
|
Baiju BV, Kirupanithi N, Srinivasan S, Kapoor A, Mathivanan SK, Shah MA. Robust CRW crops leaf disease detection and classification in agriculture using hybrid deep learning models. PLANT METHODS 2025; 21:18. [PMID: 39948565 PMCID: PMC11827293 DOI: 10.1186/s13007-025-01332-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2024] [Accepted: 01/25/2025] [Indexed: 02/16/2025]
Abstract
The problem of plant diseases is huge as it affects the crop quality and leads to reduced crop production. Crop-Convolutional neural network (CNN) depiction is that several scholars have used the approaches of machine learning (ML) and deep learning (DL) techniques and have configured their models to specific crops to diagnose plant diseases. In this logic, it is unjustifiable to apply crop-specific models as farmers are resource-poor and possess a low digital literacy level. This study presents a Slender-CNN model of plant disease detection in corn (C), rice (R) and wheat (W) crops. The designed architecture incorporates parallel convolution layers of different dimensions in order to localize the lesions with multiple scales accurately. The experimentation results show that the designed network achieves the accuracy of 88.54% as well as overcomes several benchmark CNN models: VGG19, EfficientNetb6, ResNeXt, DenseNet201, AlexNet, YOLOv5 and MobileNetV3. In addition, the validated model demonstrates its effectiveness as a multi-purpose device by correctly categorizing the healthy and the infected class of individual types of crops, providing 99.81%, 87.11%, and 98.45% accuracy for CRW crops, respectively. Furthermore, considering the best performance values achieved and compactness of the proposed model, it can be employed for on-farm agricultural diseased crops identification finding applications even in resource-limited settings.
Collapse
Affiliation(s)
- B V Baiju
- School of Computer Science and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India
| | - Nancy Kirupanithi
- Sathyabama Institute of Science and Technology, Chennai, Tamil Nadu, India
| | - Saravanan Srinivasan
- Department of Computer Science and Engineering, Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Chennai, 600062, India
| | - Anjali Kapoor
- School of Computer Science and Engineering, Galgotias University, Greater Noida, 203201, India
| | | | - Mohd Asif Shah
- Department of Economics, Kardan University, Parwane Du, 1001, Kabul, Afghanistan.
- Centre for Research Impact & Outcome, Chitkara University Institute of Engineering and Technology, Chitkara University, Rajpura, 140401, Panjab, India.
- Chitkara Centre for Research and Development, Chitkara University, Baddi, Himachal Pradesh, 140401, India.
- Division of Research and Development, Lovely Professional University, Phagwara, Panjab, 140401, India.
| |
Collapse
|
2
|
Mohan RNVJ, Rayanoothala PS, Sree RP. Next-gen agriculture: integrating AI and XAI for precision crop yield predictions. FRONTIERS IN PLANT SCIENCE 2025; 15:1451607. [PMID: 39845494 PMCID: PMC11751057 DOI: 10.3389/fpls.2024.1451607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/19/2024] [Accepted: 12/17/2024] [Indexed: 01/24/2025]
Abstract
Climate change poses significant challenges to global food security by altering precipitation patterns and increasing the frequency of extreme weather events such as droughts, heatwaves, and floods. These phenomena directly affect agricultural productivity, leading to lower crop yields and economic losses for farmers. This study leverages Artificial Intelligence (AI) and Explainable Artificial Intelligence (XAI) techniques to predict crop yields and assess the impacts of climate change on agriculture, providing a novel approach to understanding complex interactions between climatic and agronomic factors. Using Exploratory Data Analysis (EDA), the study identifies temperature as the most critical factor influencing crop yields, with notable interactions observed between rainfall patterns and macronutrient levels. Advanced regression models, including Decision Tree Regressor, Random Forest Regressor, and LightGBM Regressor, achieved exceptional predictive performance, with R² scores reaching 0.92, mean squared errors as low as 0.02, and mean absolute errors of 0.015. Additionally, XAI techniques such as SHAP (SHapley Additive exPlanations) and LIME (Local Interpretable Model-agnostic Explanations) enhanced the interpretability of the predictions, offering actionable insights into the relative importance of key features. These insights inform strategies for agricultural decision-making and climate adaptation. By integrating AI-driven predictions with XAI-based interpretability, this research presents a robust and transparent framework for mitigating the adverse effects of climate change on agriculture, emphasizing its potential for scalable application in precision farming and policy development.
Collapse
Affiliation(s)
- R. N. V. Jagan Mohan
- Department of Computer Science and Engineering, Sagi Rama Krishnam Raju Engineering College, Bhimavaram, India
| | - Pravallika Sree Rayanoothala
- Department of Plant Pathology, MS Swaminathan School of Agriculture, Centurion University of Technology and Management, Odisha, India
| | - R. Praneetha Sree
- Department of Computer Science and Engineering, Indian Institute of Information Technology Design and Manufacturing (III TDM), Kurnool, Andhrapradesh, India
| |
Collapse
|
3
|
Amudha M, Brindha K. Effective feature selection based HOBS pruned- ELM model for tomato plant leaf disease classification. PLoS One 2024; 19:e0315031. [PMID: 39637070 PMCID: PMC11620619 DOI: 10.1371/journal.pone.0315031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Accepted: 11/17/2024] [Indexed: 12/07/2024] Open
Abstract
Tomato cultivation is expanding rapidly, but the tomato sector faces significant challenges from various sources, including environmental (abiotic stress) and biological (biotic stress or disease) threats, which adversely impact the crop's growth, reproduction, and overall yield potential. The objective of this work is to build deep learning based lightweight convolutional neural network (CNN) architecture for the real-time classification of biotic stress in tomato plant leaves. This model proposes to address the drawbacks of conventional CNNs, which are resource-intensive and time-consuming, by using optimization methods that reduce processing complexity and enhance classification accuracy. Traditional plant disease classification methods predominantly utilize CNN based deep learning techniques, originally developed for fundamental image classification tasks. It relies on computationally intensive CNNs, hindering real-time application due to long training times. To address this, a lighter CNN framework is proposed to enhance with two key components. Firstly, an Elephant Herding Optimization (EHO) algorithm selects pertinent features for classification tasks. The classification module integrates a Hessian-based Optimal Brain Surgeon (HOBS) approach with a pruned Extreme Learning Machine (ELM), optimizing network parameters while reducing computational complexity. The proposed pruned model gives an accuracy of 95.73%, Cohen's kappa of 0.81%, training time of 2.35sec on Plant Village dataset, comprising 8,000 leaf images across 10 distinct classes of tomato plant, which demonstrates that this framework effectively reduces the model's size of 9.2Mb and parameters by reducing irrelevant connections in the classification layer. The proposed classifier performance was compared to existing deep learning models, the experimental results show that the pruned DenseNet achieves an accuracy of 86.64% with a model size of 10.6 MB, while GhostNet reaches an accuracy of 92.15% at 10.9 MB. CACPNET demonstrates an accuracy of 92.4% with a model size of 18.0 MB. In contrast, the proposed approach significantly outperforms these models in terms of accuracy and processing time.
Collapse
Affiliation(s)
- M. Amudha
- School of Computer Science Engineering and Information Systems, Vellore Institute of Technology, Vellore, Tamil Nādu, India
| | - K. Brindha
- School of Computer Science Engineering and Information Systems, Vellore Institute of Technology, Vellore, Tamil Nādu, India
| |
Collapse
|
4
|
Li Y, Lu Y, Liu H, Bai J, Yang C, Yuan H, Li X, Xiao Q. Tea leaf disease and insect identification based on improved MobileNetV3. FRONTIERS IN PLANT SCIENCE 2024; 15:1459292. [PMID: 39399539 PMCID: PMC11466808 DOI: 10.3389/fpls.2024.1459292] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/04/2024] [Accepted: 09/10/2024] [Indexed: 10/15/2024]
Abstract
Accurate detection of tea leaf diseases and insects is crucial for their scientific and effective prevention and control, essential for ensuring the quality and yield of tea. Traditional methods for identifying tea leaf diseases and insects primarily rely on professional technicians, which are difficult to apply in various scenarios. This study proposes a recognition method for tea leaf diseases and insects based on improved MobileNetV3. Initially, a dataset containing images of 17 different types of tea leaf diseases and insects was curated, with data augmentation techniques utilized to broaden recognition scenarios. Subsequently, the network structure of MobileNetV3 was enhanced by integrating the CA (coordinate attention) module to improve the perception of location information. Moreover, a fine-tuning transfer learning strategy was employed to optimize model training and accelerate convergence. Experimental results on the constructed dataset reveal that the initial recognition accuracy of MobileNetV3 is 94.45%, with an F1-score of 94.12%. Without transfer learning, the recognition accuracy of MobileNetV3-CA reaches 94.58%, while with transfer learning, it reaches 95.88%. Through comparative experiments, this study compares the improved algorithm with the original MobileNetV3 model and other classical image classification models (ResNet18, AlexNet, VGG16, SqueezeNet, and ShuffleNetV2). The findings show that MobileNetV3-CA based on transfer learning achieves higher accuracy in identifying tea leaf diseases and insects. Finally, a tea diseases and insects identification application was developed based on this model. The model showed strong robustness and could provide a reliable reference for intelligent diagnosis of tea diseases and insects.
Collapse
Affiliation(s)
- Yang Li
- Key Laboratory of Tea Quality and Safety Control, Ministry of Agriculture and Rural Affairs, Tea Research Institute, Chinese Academy of Agricultural Sciences, Hangzhou, China
| | - Yuheng Lu
- Hangzhou Ruikun Technology Co., Ltd., Hangzhou, China
| | - Haoyang Liu
- Key Laboratory of Tea Quality and Safety Control, Ministry of Agriculture and Rural Affairs, Tea Research Institute, Chinese Academy of Agricultural Sciences, Hangzhou, China
| | - Jiahe Bai
- Tea Station of Xinchang County, Shaoxing, China
| | - Chen Yang
- Hangzhou Ruikun Technology Co., Ltd., Hangzhou, China
| | - Haiyan Yuan
- Tea Station of Xinchang County, Shaoxing, China
| | - Xin Li
- Key Laboratory of Tea Quality and Safety Control, Ministry of Agriculture and Rural Affairs, Tea Research Institute, Chinese Academy of Agricultural Sciences, Hangzhou, China
| | - Qiang Xiao
- Key Laboratory of Tea Quality and Safety Control, Ministry of Agriculture and Rural Affairs, Tea Research Institute, Chinese Academy of Agricultural Sciences, Hangzhou, China
| |
Collapse
|
5
|
Zhang X, Feng Q, Zhu D, Liang X, Zhang J. Compressing recognition network of cotton disease with spot-adaptive knowledge distillation. FRONTIERS IN PLANT SCIENCE 2024; 15:1433543. [PMID: 39391779 PMCID: PMC11464345 DOI: 10.3389/fpls.2024.1433543] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/16/2024] [Accepted: 09/05/2024] [Indexed: 10/12/2024]
Abstract
Deep networks play a crucial role in the recognition of agricultural diseases. However, these networks often come with numerous parameters and large sizes, posing a challenge for direct deployment on resource-limited edge computing devices for plant protection robots. To tackle this challenge for recognizing cotton diseases on the edge device, we adopt knowledge distillation to compress the big networks, aiming to reduce the number of parameters and the computational complexity of the networks. In order to get excellent performance, we conduct combined comparison experiments from three aspects: teacher network, student network and distillation algorithm. The teacher networks contain three classical convolutional neural networks, while the student networks include six lightweight networks in two categories of homogeneous and heterogeneous structures. In addition, we investigate nine distillation algorithms using spot-adaptive strategy. The results demonstrate that the combination of DenseNet40 as the teacher and ShuffleNetV2 as the student show best performance when using NST algorithm, yielding a recognition accuracy of 90.59% and reducing FLOPs from 0.29 G to 0.045 G. The proposed method can facilitate the lightweighting of the model for recognizing cotton diseases while maintaining high recognition accuracy and offer a practical solution for deploying deep models on edge computing devices.
Collapse
Affiliation(s)
- Xinwen Zhang
- School of Mechanical and Electrical Engineering, Gansu Agricultural University, Lanzhou, China
| | - Quan Feng
- School of Mechanical and Electrical Engineering, Gansu Agricultural University, Lanzhou, China
| | - Dongqin Zhu
- School of Mechanical and Electrical Engineering, Gansu Agricultural University, Lanzhou, China
| | - Xue Liang
- School of Mechanical and Electrical Engineering, Gansu Agricultural University, Lanzhou, China
| | - Jianhua Zhang
- Agricultural Information Institute, Chinese Academy of Agricultural Sciences, Beijing, China
- National Nanfan Research Institute, Chinese Academy of Agricultural Sciences, Sanya, China
| |
Collapse
|
6
|
Peng S, Rajjou L. Advancing plant biology through deep learning-powered natural language processing. PLANT CELL REPORTS 2024; 43:208. [PMID: 39102077 DOI: 10.1007/s00299-024-03294-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/07/2024] [Accepted: 07/19/2024] [Indexed: 08/06/2024]
Abstract
The application of deep learning methods, specifically the utilization of Large Language Models (LLMs), in the field of plant biology holds significant promise for generating novel knowledge on plant cell systems. The LLM framework exhibits exceptional potential, particularly with the development of Protein Language Models (PLMs), allowing for in-depth analyses of nucleic acid and protein sequences. This analytical capacity facilitates the discernment of intricate patterns and relationships within biological data, encompassing multi-scale information within DNA or protein sequences. The contribution of PLMs extends beyond mere sequence patterns and structure--function recognition; it also supports advancements in genetic improvements for agriculture. The integration of deep learning approaches into the domain of plant sciences offers opportunities for major breakthroughs in basic research across multi-scale plant traits. Consequently, the strategic application of deep learning methodologies, particularly leveraging the potential of LLMs, will undoubtedly play a pivotal role in advancing plant sciences, plant production, plant uses and propelling the trajectory toward sustainable agroecological and agro-food transitions.
Collapse
Affiliation(s)
- Shuang Peng
- Université Paris-Saclay, INRAE, AgroParisTech, Institut Jean-Pierre Bourgin for Plant Sciences (IJPB), 78000, Versailles, France
| | - Loïc Rajjou
- Université Paris-Saclay, INRAE, AgroParisTech, Institut Jean-Pierre Bourgin for Plant Sciences (IJPB), 78000, Versailles, France.
| |
Collapse
|
7
|
Qu H, Zheng C, Ji H, Huang R, Wei D, Annis S, Drummond F. A deep multi-task learning approach to identifying mummy berry infection sites, the disease stage, and severity. FRONTIERS IN PLANT SCIENCE 2024; 15:1340884. [PMID: 38606063 PMCID: PMC11007028 DOI: 10.3389/fpls.2024.1340884] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/19/2023] [Accepted: 02/26/2024] [Indexed: 04/13/2024]
Abstract
Introduction Mummy berry is a serious disease that may result in up to 70 percent of yield loss for lowbush blueberries. Practical mummy berry disease detection, stage classification and severity estimation remain great challenges for computer vision-based approaches because images taken in lowbush blueberry fields are usually a mixture of different plant parts (leaves, bud, flowers and fruits) with a very complex background. Specifically, typical problems hindering this effort included data scarcity due to high manual labelling cost, tiny and low contrast disease features interfered and occluded by healthy plant parts, and over-complicated deep neural networks which made deployment of a predictive system difficult. Methods Using real and raw blueberry field images, this research proposed a deep multi-task learning (MTL) approach to simultaneously accomplish three disease detection tasks: identification of infection sites, classification of disease stage, and severity estimation. By further incorporating novel superimposed attention mechanism modules and grouped convolutions to the deep neural network, enabled disease feature extraction from both channel and spatial perspectives, achieving better detection performance in open and complex environments, while having lower computational cost and faster convergence rate. Results Experimental results demonstrated that our approach achieved higher detection efficiency compared with the state-of-the-art deep learning models in terms of detection accuracy, while having three main advantages: 1) field images mixed with various types of lowbush blueberry plant organs under a complex background can be used for disease detection; 2) parameter sharing among different tasks greatly reduced the size of training samples and saved 60% training time than when the three tasks (data preparation, model development and exploration) were trained separately; and 3) only one-sixth of the network parameter size (23.98M vs. 138.36M) and one-fifteenth of the computational cost (1.13G vs. 15.48G FLOPs) were used when compared with the most popular Convolutional Neural Network VGG16. Discussion These features make our solution very promising for future mobile deployment such as a drone carried task unit for real-time field surveillance. As an automatic approach to fast disease diagnosis, it can be a useful technical tool to provide growers real time disease information that can prevent further disease transmission and more severe effects on yield due to fruit mummification.
Collapse
Affiliation(s)
- Hongchun Qu
- Institute of Ecological Safety and College of Automation, Chongqing University of Posts and Telecommunications, Chongqing, China
- College of Computer Science, Chongqing University of Posts and Telecommunications, Chongqing, China
| | - Chaofang Zheng
- Institute of Ecological Safety and College of Automation, Chongqing University of Posts and Telecommunications, Chongqing, China
| | - Hao Ji
- Institute of Ecological Safety and College of Automation, Chongqing University of Posts and Telecommunications, Chongqing, China
- College of Computer Science, Chongqing University of Posts and Telecommunications, Chongqing, China
| | - Rui Huang
- College of Computer Science, Chongqing University of Posts and Telecommunications, Chongqing, China
| | - Dianwen Wei
- Institute of Natural Resources and Ecology, Heilongjiang Academy of Sciences, Harbin, China
| | - Seanna Annis
- School of Biology and Ecology, University of Maine, Orono, ME, United States
- Cooperative Extension, University of Maine, Orono, ME, United States
| | - Francis Drummond
- School of Biology and Ecology, University of Maine, Orono, ME, United States
- Cooperative Extension, University of Maine, Orono, ME, United States
| |
Collapse
|
8
|
Mamba Kabala D, Hafiane A, Bobelin L, Canals R. Image-based crop disease detection with federated learning. Sci Rep 2023; 13:19220. [PMID: 37932344 PMCID: PMC10628142 DOI: 10.1038/s41598-023-46218-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Accepted: 10/30/2023] [Indexed: 11/08/2023] Open
Abstract
Crop disease detection and management is critical to improving productivity, reducing costs, and promoting environmentally friendly crop treatment methods. Modern technologies, such as data mining and machine learning algorithms, have been used to develop automated crop disease detection systems. However, centralized approach to data collection and model training induces challenges in terms of data privacy, availability, and transfer costs. To address these challenges, federated learning appears to be a promising solution. In this paper, we explored the application of federated learning for crop disease classification using image analysis. We developed and studied convolutional neural network (CNN) models and those based on attention mechanisms, in this case vision transformers (ViT), using federated learning, leveraging an open access image dataset from the "PlantVillage" platform. Experiments conducted concluded that the performance of models trained by federated learning is influenced by the number of learners involved, the number of communication rounds, the number of local iterations and the quality of the data. With the objective of highlighting the potential of federated learning in crop disease classification, among the CNN models tested, ResNet50 performed better in several experiments than the other models, and proved to be an optimal choice, but also the most suitable for a federated learning scenario. The ViT_B16 and ViT_B32 Vision Transformers require more computational time, making them less suitable in a federated learning scenario, where computational time and communication costs are key parameters. The paper provides a state-of-the-art analysis, presents our methodology and experimental results, and concludes with ideas and future directions for our research on using federated learning in the context of crop disease classification.
Collapse
Affiliation(s)
- Denis Mamba Kabala
- INSA CVL, University of Orleans, PRISME Laboratory EA 4229, 88 Boulevard Lahitolle, 18000, Bourges, France.
| | - Adel Hafiane
- INSA CVL, University of Orleans, PRISME Laboratory EA 4229, 88 Boulevard Lahitolle, 18000, Bourges, France
| | - Laurent Bobelin
- INSA CVL, University of Orleans, LIFO Laboratory EA 4022, 88 Boulevard Lahitolle, 18000, Bourges, France
| | - Raphaël Canals
- University of Orleans, INSA CVL, PRISME Laboratory EA 4229, 12 Rue de Blois, 45067, Orléans, France
| |
Collapse
|
9
|
Quan S, Wang J, Jia Z, Yang M, Xu Q. MS-Net: a novel lightweight and precise model for plant disease identification. FRONTIERS IN PLANT SCIENCE 2023; 14:1276728. [PMID: 37965007 PMCID: PMC10641454 DOI: 10.3389/fpls.2023.1276728] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/12/2023] [Accepted: 10/11/2023] [Indexed: 11/16/2023]
Abstract
The rapid development of image processing technology and the improvement of computing power in recent years have made deep learning one of the main methods for plant disease identification. Currently, many neural network models have shown better performance in plant disease identification. Typically, the performance improvement of the model needs to be achieved by increasing the depth of the network. However, this also increases the computational complexity, memory requirements, and training time, which will be detrimental to the deployment of the model on mobile devices. To address this problem, a novel lightweight convolutional neural network has been proposed for plant disease detection. Skip connections are introduced into the conventional MobileNetV3 network to enrich the input features of the deep network, and the feature fusion weight parameters in the skip connections are optimized using an improved whale optimization algorithm to achieve higher classification accuracy. In addition, the bias loss substitutes the conventional cross-entropy loss to reduce the interference caused by redundant data during the learning process. The proposed model is pre-trained on the plant classification task dataset instead of using the classical ImageNet for pre-training, which further enhances the performance and robustness of the model. The constructed network achieved high performance with fewer parameters, reaching an accuracy of 99.8% on the PlantVillage dataset. Encouragingly, it also achieved a prediction accuracy of 97.8% on an apple leaf disease dataset with a complex outdoor background. The experimental results show that compared with existing advanced plant disease diagnosis models, the proposed model has fewer parameters, higher recognition accuracy, and lower complexity.
Collapse
Affiliation(s)
- Siyu Quan
- School of Computer Science and Technology, Xinjiang University, Urumqi, China
- Xinjiang Uygur Autonomous Region Signal Detection and Processing Key Laboratory, Xinjiang University, Urumqi, China
| | - Jiajia Wang
- School of Computer Science and Technology, Xinjiang University, Urumqi, China
- Xinjiang Uygur Autonomous Region Signal Detection and Processing Key Laboratory, Xinjiang University, Urumqi, China
| | - Zhenhong Jia
- School of Computer Science and Technology, Xinjiang University, Urumqi, China
- Xinjiang Uygur Autonomous Region Signal Detection and Processing Key Laboratory, Xinjiang University, Urumqi, China
| | - Mengge Yang
- School of Computer Science and Technology, Xinjiang University, Urumqi, China
- Xinjiang Uygur Autonomous Region Signal Detection and Processing Key Laboratory, Xinjiang University, Urumqi, China
| | - Qiqi Xu
- School of Computer Science and Technology, Xinjiang University, Urumqi, China
- Xinjiang Uygur Autonomous Region Signal Detection and Processing Key Laboratory, Xinjiang University, Urumqi, China
| |
Collapse
|
10
|
Lu B, Lu J, Xu X, Jin Y. MixSeg: a lightweight and accurate mix structure network for semantic segmentation of apple leaf disease in complex environments. FRONTIERS IN PLANT SCIENCE 2023; 14:1233241. [PMID: 37780516 PMCID: PMC10535114 DOI: 10.3389/fpls.2023.1233241] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Accepted: 08/18/2023] [Indexed: 10/03/2023]
Abstract
Introduction Semantic segmentation is effective in dealing with complex environments. However, the most popular semantic segmentation methods are usually based on a single structure, they are inefficient and inaccurate. In this work, we propose a mix structure network called MixSeg, which fully combines the advantages of convolutional neural network, Transformer, and multi-layer perception architectures. Methods Specifically, MixSeg is an end-to-end semantic segmentation network, consisting of an encoder and a decoder. In the encoder, the Mix Transformer is designed to model globally and inject local bias into the model with less computational cost. The position indexer is developed to dynamically index absolute position information on the feature map. The local optimization module is designed to optimize the segmentation effect of the model on local edges and details. In the decoder, shallow and deep features are fused to output accurate segmentation results. Results Taking the apple leaf disease segmentation task in the real scene as an example, the segmentation effect of the MixSeg is verified. The experimental results show that MixSeg has the best segmentation effect and the lowest parameters and floating point operations compared with the mainstream semantic segmentation methods on small datasets. On apple alternaria blotch and apple grey spot leaf image datasets, the most lightweight MixSeg-T achieves 98.22%, 98.09% intersection over union for leaf segmentation and 87.40%, 86.20% intersection over union for disease segmentation. Discussion Thus, the performance of MixSeg demonstrates that it can provide a more efficient and stable method for accurate segmentation of leaves and diseases in complex environments.
Collapse
|