1
|
Liu H, Yang J, Jiang C, He S, Fu Y, Zhang S, Hu X, Fang J, Ji W. S2DA-Net: Spatial and spectral-learning double-branch aggregation network for liver tumor segmentation in CT images. Comput Biol Med 2024; 174:108400. [PMID: 38613888 DOI: 10.1016/j.compbiomed.2024.108400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2023] [Revised: 03/10/2024] [Accepted: 04/01/2024] [Indexed: 04/15/2024]
Abstract
Accurate liver tumor segmentation is crucial for aiding radiologists in hepatocellular carcinoma evaluation and surgical planning. While convolutional neural networks (CNNs) have been successful in medical image segmentation, they face challenges in capturing long-term dependencies among pixels. On the other hand, Transformer-based models demand a high number of parameters and involve significant computational costs. To address these issues, we propose the Spatial and Spectral-learning Double-branched Aggregation Network (S2DA-Net) for liver tumor segmentation. S2DA-Net consists of a double-branched encoder and a decoder with a Group Multi-Head Cross-Attention Aggregation (GMCA) module, Two branches in the encoder consist of a Fourier Spectral-learning Multi-scale Fusion (FSMF) branch and a Multi-axis Aggregation Hadamard Attention (MAHA) branch. The FSMF branch employs a Fourier-based network to learn amplitude and phase information, capturing richer features and detailed information without introducing an excessive number of parameters. The FSMF branch utilizes a Fourier-based network to capture amplitude and phase information, enriching features without introducing excessive parameters. The MAHA branch incorporates spatial information, enhancing discriminative features while minimizing computational costs. In the decoding path, a GMCA module extracts local information and establishes long-term dependencies, improving localization capabilities by amalgamating features from diverse branches. Experimental results on the public LiTS2017 liver tumor datasets show that the proposed segmentation model achieves significant improvements compared to the state-of-the-art methods, obtaining dice per case (DPC) 69.4 % and global dice (DG) 80.0 % for liver tumor segmentation on the LiTS2017 dataset. Meanwhile, the pre-trained model based on the LiTS2017 datasets obtain, DPC 73.4 % and an DG 82.2 % on the 3DIRCADb dataset.
Collapse
Affiliation(s)
- Huaxiang Liu
- Department Radiology of Taizhou Hospital, Zhejiang University, Taizhou, 318000, Zhejiang, China; Institute of Intelligent Information Processing, Taizhou University, Taizhou, 318000, Zhejiang, China; Key Laboratory of Evidence-based Radiology of Taizhou, Taizhou, 317000, Zhejiang, China
| | - Jie Yang
- School of Geophysics and Measurement and Control Technology, East China University of Technology, Nanchang, 330013, China
| | - Chao Jiang
- School of Geophysics and Measurement and Control Technology, East China University of Technology, Nanchang, 330013, China
| | - Sailing He
- Department Radiology of Taizhou Hospital, Zhejiang University, Taizhou, 318000, Zhejiang, China
| | - Youyao Fu
- Institute of Intelligent Information Processing, Taizhou University, Taizhou, 318000, Zhejiang, China
| | - Shiqing Zhang
- Institute of Intelligent Information Processing, Taizhou University, Taizhou, 318000, Zhejiang, China
| | - Xudong Hu
- Key Laboratory of Evidence-based Radiology of Taizhou, Taizhou, 317000, Zhejiang, China
| | - Jiangxiong Fang
- Institute of Intelligent Information Processing, Taizhou University, Taizhou, 318000, Zhejiang, China.
| | - Wenbin Ji
- Department Radiology of Taizhou Hospital, Zhejiang University, Taizhou, 318000, Zhejiang, China; Key Laboratory of Evidence-based Radiology of Taizhou, Taizhou, 317000, Zhejiang, China.
| |
Collapse
|
2
|
Yang L, Shao D, Huang Z, Geng M, Zhang N, Chen L, Wang X, Liang D, Pang ZF, Hu Z. Few-shot segmentation framework for lung nodules via an optimized active contour model. Med Phys 2024; 51:2788-2805. [PMID: 38189528 DOI: 10.1002/mp.16933] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2023] [Revised: 11/07/2023] [Accepted: 12/15/2023] [Indexed: 01/09/2024] Open
Abstract
BACKGROUND Accurate segmentation of lung nodules is crucial for the early diagnosis and treatment of lung cancer in clinical practice. However, the similarity between lung nodules and surrounding tissues has made their segmentation a longstanding challenge. PURPOSE Existing deep learning and active contour models each have their limitations. This paper aims to integrate the strengths of both approaches while mitigating their respective shortcomings. METHODS In this paper, we propose a few-shot segmentation framework that combines a deep neural network with an active contour model. We introduce heat kernel convolutions and high-order total variation into the active contour model and solve the challenging nonsmooth optimization problem using the alternating direction method of multipliers. Additionally, we use the presegmentation results obtained from training a deep neural network on a small sample set as the initial contours for our optimized active contour model, addressing the difficulty of manually setting the initial contours. RESULTS We compared our proposed method with state-of-the-art methods for segmentation effectiveness using clinical computed tomography (CT) images acquired from two different hospitals and the publicly available LIDC dataset. The results demonstrate that our proposed method achieved outstanding segmentation performance according to both visual and quantitative indicators. CONCLUSION Our approach utilizes the output of few-shot network training as prior information, avoiding the need to select the initial contour in the active contour model. Additionally, it provides mathematical interpretability to the deep learning, reducing its dependency on the quantity of training samples.
Collapse
Affiliation(s)
- Lin Yang
- Lauterbur Research Center for Biomedical Imaging, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- College of Mathematics and Statistics, Henan University, Kaifeng, China
| | - Dan Shao
- Department of Nuclear Medicine, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, China
| | - Zhenxing Huang
- Lauterbur Research Center for Biomedical Imaging, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Mengxiao Geng
- Lauterbur Research Center for Biomedical Imaging, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- College of Mathematics and Statistics, Henan University, Kaifeng, China
| | - Na Zhang
- Lauterbur Research Center for Biomedical Imaging, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- Key Laboratory of Biomedical Imaging Science and System, Chinese Academy of Sciences, Shenzhen, China
| | - Long Chen
- Department of PET/CT Center and the Department of Thoracic Cancer I, Cancer Center of Yunnan Province, Yunnan Cancer Hospital, The Third Affiliated Hospital of Kunming Medical University, Kunming, China
| | - Xi Wang
- Department of PET/CT Center and the Department of Thoracic Cancer I, Cancer Center of Yunnan Province, Yunnan Cancer Hospital, The Third Affiliated Hospital of Kunming Medical University, Kunming, China
| | - Dong Liang
- Lauterbur Research Center for Biomedical Imaging, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- Key Laboratory of Biomedical Imaging Science and System, Chinese Academy of Sciences, Shenzhen, China
| | - Zhi-Feng Pang
- College of Mathematics and Statistics, Henan University, Kaifeng, China
| | - Zhanli Hu
- Lauterbur Research Center for Biomedical Imaging, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- Key Laboratory of Biomedical Imaging Science and System, Chinese Academy of Sciences, Shenzhen, China
| |
Collapse
|
3
|
Zhang Y, Chen Z, Yang X. Light-M: An efficient lightweight medical image segmentation framework for resource-constrained IoMT. Comput Biol Med 2024; 170:108088. [PMID: 38320339 DOI: 10.1016/j.compbiomed.2024.108088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Revised: 12/22/2023] [Accepted: 01/27/2024] [Indexed: 02/08/2024]
Abstract
The Internet of Medical Things (IoMT) is being incorporated into current healthcare systems. This technology intends to connect patients, IoMT devices, and hospitals over mobile networks, allowing for more secure, quick, and convenient health monitoring and intelligent healthcare services. However, existing intelligent healthcare applications typically rely on large-scale AI models, and standard IoMT devices have significant resource constraints. To alleviate this paradox, in this paper, we propose a Knowledge Distillation (KD)-based IoMT end-edge-cloud orchestrated architecture for medical image segmentation tasks, called Light-M, aiming to deploy a lightweight medical model in resource-constrained IoMT devices. Specifically, Light-M trains a large teacher model in the cloud server and employs computation in local nodes through imitation of the performance of the teacher model using knowledge distillation. Light-M contains two KD strategies: (1) active exploration and passive transfer (AEPT) and (2) self-attention-based inter-class feature variation (AIFV) distillation for the medical image segmentation task. The AEPT encourages the student model to learn undiscovered knowledge/features of the teacher model without additional feature layers, aiming to explore new features and outperform the teacher. To improve the distinguishability of the student for different classes, the student learns the self-attention-based feature variation (AIFV) between classes. Since the proposed AEPT and AIFV only appear in the training process, our framework does not involve any additional computation burden for a student model during the segmentation task deployment. Extensive experiments on cardiac images and public real-scene datasets demonstrate that our approach improves student model learning representations and outperforms state-of-the-art methods by combining two knowledge distillation strategies. Moreover, when deployed on the IoT device, the distilled student model takes only 29.6 ms for one sample at the inference step.
Collapse
Affiliation(s)
- Yifan Zhang
- Shenzhen University, 3688 Nanhai Ave., Shenzhen, 518060, Guangdong, China
| | - Zhuangzhuang Chen
- Shenzhen University, 3688 Nanhai Ave., Shenzhen, 518060, Guangdong, China
| | - Xuan Yang
- Shenzhen University, 3688 Nanhai Ave., Shenzhen, 518060, Guangdong, China.
| |
Collapse
|
4
|
Soomro S, Niaz A, Soomro TA, Kim J, Manzoor A, Choi KN. Selective image segmentation driven by region, edge and saliency functions. PLoS One 2023; 18:e0294789. [PMID: 38100430 PMCID: PMC10723724 DOI: 10.1371/journal.pone.0294789] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Accepted: 11/07/2023] [Indexed: 12/17/2023] Open
Abstract
Present active contour methods often struggle with the segmentation of regions displaying variations in texture, color, or intensity a phenomenon referred to as inhomogeneities. These limitation impairs their ability to precisely distinguish and outline diverse components within an image. Further some of these methods employ intricate mathematical formulations for energy minimization. Such complexity introduces computational sluggishness, making these methods unsuitable for tasks requiring real-time processing or rapid segmentation. Moreover, these methods are susceptible to being trapped in energy configurations corresponding to local minimum points. Consequently, the segmentation process fails to converge to the desired outcome. Additionally, the efficacy of these methods diminishes when confronted with regions exhibiting weak or subtle boundaries. To address these limitations comprehensively, our proposed approach introduces a fresh paradigm for image segmentation through the synchronization of region-based, edge-based, and saliency-based segmentation techniques. Initially, we adapt an intensity edge term based on the zero crossing feature detector (ZCD), which is used to highlight significant edges of an image. Secondly, a saliency function is formulated to detect salient regions from an image. We have also included a globally tuned region based SPF (signed pressure force) term to move contour away and capture homogeneous regions. ZCD, saliency and global SPF are jointly incorporated with some scaled value for the level set evolution to develop an effective image segmentation model. In addition, proposed method is capable to perform selective object segmentation, which enables us to choose any single or multiple objects inside an image. Saliency function and ZCD detector are considered feature enhancement tools, which are used to get important features of an image, so this method has a solid capacity to segment nature images (homogeneous or inhomogeneous) precisely. Finally, the adaption of the Gaussian kernel removes the need of any penalization term for level set reinitialization. Experimental results will exhibit the efficiency of the proposed method.
Collapse
Affiliation(s)
- Shafiullah Soomro
- Department of Computer Science and Engineering, Chung-Ang University, Seoul, Republic of Korea
- Department of Computer Science and Media Technology, Linnaeus University, Vaxjo, Sweden
| | - Asim Niaz
- Department of Computer Science and Engineering, Chung-Ang University, Seoul, Republic of Korea
| | | | - Jin Kim
- SecuLayer Inc., Seoul, South Korea
| | - Adnan Manzoor
- Department of Artificial Intelligence, Quaid-e-Awam University of Engineering Science and Technology, Nawabshah, Sindh, Pakistan
| | - Kwang Nam Choi
- Department of Computer Science and Engineering, Chung-Ang University, Seoul, Republic of Korea
| |
Collapse
|
5
|
Fang J, Jiang H, Zhang S, Sun L, Hu X, Liu J, Gong M, Liu H, Fu Y. BAF-Net: Bidirectional attention fusion network via CNN and transformers for the pepper leaf segmentation. FRONTIERS IN PLANT SCIENCE 2023; 14:1123410. [PMID: 37051074 PMCID: PMC10083316 DOI: 10.3389/fpls.2023.1123410] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/10/2023] [Accepted: 03/03/2023] [Indexed: 06/19/2023]
Abstract
The segmentation of pepper leaves from pepper images is of great significance for the accurate control of pepper leaf diseases. To address the issue, we propose a bidirectional attention fusion network combing the convolution neural network (CNN) and Swin Transformer, called BAF-Net, to segment the pepper leaf image. Specially, BAF-Net first uses a multi-scale fusion feature (MSFF) branch to extract the long-range dependencies by constructing the cascaded Swin Transformer-based and CNN-based block, which is based on the U-shape architecture. Then, it uses a full-scale feature fusion (FSFF) branch to enhance the boundary information and attain the detailed information. Finally, an adaptive bidirectional attention module is designed to bridge the relation of the MSFF and FSFF features. The results on four pepper leaf datasets demonstrated that our model obtains F1 scores of 96.75%, 91.10%, 97.34% and 94.42%, and IoU of 95.68%, 86.76%, 96.12% and 91.44%, respectively. Compared to the state-of-the-art models, the proposed model achieves better segmentation performance. The code will be available at the website: https://github.com/fangchj2002/BAF-Net.
Collapse
Affiliation(s)
- Jiangxiong Fang
- Institute of Intelligent Information Processing, Taizhou University, Taizhou, Zhejiang, China
| | - Houtao Jiang
- Jiangxi Engineering Laboratory on Radioactive Geoscience and Big Data Technology, East China University of Technology, Nanchang, China
| | - Shiqing Zhang
- Institute of Intelligent Information Processing, Taizhou University, Taizhou, Zhejiang, China
| | - Lin Sun
- Jiangxi Engineering Laboratory on Radioactive Geoscience and Big Data Technology, East China University of Technology, Nanchang, China
| | - Xudong Hu
- Jiangxi Engineering Laboratory on Radioactive Geoscience and Big Data Technology, East China University of Technology, Nanchang, China
- Engineering Research Center of Development and Management for Low to Ultra-Low Permeability Oil & Gas Reservoirs in West China, Xi’an Shiyou University, Xi’an, China
| | - Jun Liu
- College of Mechanical Engineering, Quzhou University, Quzhou, Zhejiang, China
| | - Meng Gong
- Jiangxi Engineering Laboratory on Radioactive Geoscience and Big Data Technology, East China University of Technology, Nanchang, China
| | - Huaxiang Liu
- Institute of Intelligent Information Processing, Taizhou University, Taizhou, Zhejiang, China
| | - Youyao Fu
- Institute of Intelligent Information Processing, Taizhou University, Taizhou, Zhejiang, China
| |
Collapse
|
6
|
Liu H, Fu Y, Zhang S, Liu J, Wang Y, Wang G, Fang J. GCHA-Net: Global context and hybrid attention network for automatic liver segmentation. Comput Biol Med 2023; 152:106352. [PMID: 36481761 DOI: 10.1016/j.compbiomed.2022.106352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Revised: 11/15/2022] [Accepted: 11/23/2022] [Indexed: 11/27/2022]
Abstract
Liver segmentation is a critical step in liver cancer diagnosis and surgical planning. The U-Net's architecture is one of the most efficient deep networks for medical image segmentation. However, the continuous downsampling operators in U-Net causes the loss of spatial information. To solve these problems, we propose a global context and hybrid attention network, called GCHA-Net, to adaptive capture the structural and detailed features. To capture the global features, a global attention module (GAM) is designed to model the channel and positional dimensions of the interdependencies. To capture the local features, a feature aggregation module (FAM) is designed, where a local attention module (LAM) is proposed to capture the spatial information. LAM can make our model focus on the local liver regions and suppress irrelevant information. The experimental results on the dataset LiTS2017 show that the dice per case (DPC) value and dice global (DG) value of liver were 96.5% and 96.9%, respectively. Compared with the state-of-the-art models, our model has superior performance in liver segmentation. Meanwhile, we test the experiment results on the 3Dircadb dataset, and it shows our model can obtain the highest accuracy compared with the closely related models. From these results, it can been seen that the proposed model can effectively capture the global context information and build the correlation between different convolutional layers. The code is available at the website: https://github.com/HuaxiangLiu/GCAU-Net.
Collapse
Affiliation(s)
- Huaxiang Liu
- Institute of Intelligent Information Processing, Taizhou University, Taizhou, 318000, Zhejiang, China
| | - Youyao Fu
- Institute of Intelligent Information Processing, Taizhou University, Taizhou, 318000, Zhejiang, China
| | - Shiqing Zhang
- Institute of Intelligent Information Processing, Taizhou University, Taizhou, 318000, Zhejiang, China
| | - Jun Liu
- College of Mechanical Engineering, Quzhou University, Quzhou, 324000, Zhejiang, China
| | - Yong Wang
- School of Aeronautics and Astronautics, Sun Yat Sen University, Guangzhou, 510275, Guangdong, China
| | - Guoyu Wang
- Institute of Intelligent Information Processing, Taizhou University, Taizhou, 318000, Zhejiang, China
| | - Jiangxiong Fang
- Institute of Intelligent Information Processing, Taizhou University, Taizhou, 318000, Zhejiang, China; College of Mechanical Engineering, Quzhou University, Quzhou, 324000, Zhejiang, China.
| |
Collapse
|
7
|
Wu C, Wang Z. Robust fuzzy dual-local information clustering with kernel metric and quadratic surface prototype for image segmentation. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03690-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
8
|
Segmentation of ultrasound image sequences by combing a novel deep siamese network with a deformable contour model. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-07054-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|