1
|
Liu L, Aviles-Rivero AI, Schonlieb CB. Contrastive Registration for Unsupervised Medical Image Segmentation. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:147-159. [PMID: 37983143 DOI: 10.1109/tnnls.2023.3332003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2023]
Abstract
Medical image segmentation is an important task in medical imaging, as it serves as the first step for clinical diagnosis and treatment planning. While major success has been reported using deep learning supervised techniques, they assume a large and well-representative labeled set. This is a strong assumption in the medical domain where annotations are expensive, time-consuming, and inherent to human bias. To address this problem, unsupervised segmentation techniques have been proposed in the literature. Yet, none of the existing unsupervised segmentation techniques reach accuracies that come even near to the state-of-the-art of supervised segmentation methods. In this work, we present a novel optimization model framed in a new convolutional neural network (CNN)-based contrastive registration architecture for unsupervised medical image segmentation called CLMorph. The core idea of our approach is to exploit image-level registration and feature-level contrastive learning, to perform registration-based segmentation. First, we propose an architecture to capture the image-to-image transformation mapping via registration for unsupervised medical image segmentation. Second, we embed a contrastive learning mechanism in the registration architecture to enhance the discriminative capacity of the network at the feature level. We show that our proposed CLMorph technique mitigates the major drawbacks of existing unsupervised techniques. We demonstrate, through numerical and visual experiments, that our technique substantially outperforms the current state-of-the-art unsupervised segmentation methods on two major medical image datasets.
Collapse
|
2
|
Chen H, Fu J, Liu X, Zheng Z, Luo X, Zhou K, Xu Z, Geng D. A Parkinson's disease-related nuclei segmentation network based on CNN-Transformer interleaved encoder with feature fusion. Comput Med Imaging Graph 2024; 118:102465. [PMID: 39591710 DOI: 10.1016/j.compmedimag.2024.102465] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2024] [Revised: 10/03/2024] [Accepted: 11/03/2024] [Indexed: 11/28/2024]
Abstract
Automatic segmentation of Parkinson's disease (PD) related deep gray matter (DGM) nuclei based on brain magnetic resonance imaging (MRI) is significant in assisting the diagnosis of PD. However, due to the degenerative-induced changes in appearance, low tissue contrast, and tiny DGM nuclei size in elders' brain MRI images, many existing segmentation models are limited in the application. To address these challenges, this paper proposes a PD-related DGM nuclei segmentation network to provide precise prior knowledge for aiding diagnosis PD. The encoder of network is designed as an alternating encoding structure where the convolutional neural network (CNN) captures spatial and depth texture features, while the Transformer complements global position information between DGM nuclei. Moreover, we propose a cascaded channel-spatial-wise block to fuse features extracted by the CNN and Transformer, thereby achieving more precise DGM nuclei segmentation. The decoder incorporates a symmetrical boundary attention module, leveraging the symmetrical structures of bilateral nuclei regions by constructing signed distance maps for symmetric differences, which optimizes segmentation boundaries. Furthermore, we employ a dynamic adaptive region of interests weighted Dice loss to enhance sensitivity towards smaller structures, thereby improving segmentation accuracy. In qualitative analysis, our method achieved optimal average values for PD-related DGM nuclei (DSC: 0.854, IOU: 0.750, HD95: 1.691 mm, ASD: 0.195 mm). Experiments conducted on multi-center clinical datasets and public datasets demonstrate the good generalizability of the proposed method. Furthermore, a volumetric analysis of segmentation results reveals significant differences between HCs and PDs. Our method holds promise for assisting clinicians in the rapid and accurate diagnosis of PD, offering a practical method for the imaging analysis of neurodegenerative diseases.
Collapse
Affiliation(s)
- Hongyi Chen
- Academy for Engineering and Technology, Fudan University, Shanghai, 200433, China.
| | - Junyan Fu
- Department of Radiology, Huashan Hospital, Fudan University, Shanghai, 200040, China.
| | - Xiao Liu
- Department of Radiology, Huashan Hospital, Fudan University, Shanghai, 200040, China; Shanghai Engineering Research Center of Intelligent Imaging for Critical Brain Diseases, Shanghai, 200040, China; Institute of Functional and Molecular Medical Imaging, Fudan University, Shanghai, 200040, China.
| | - Zhiji Zheng
- Academy for Engineering and Technology, Fudan University, Shanghai, 200433, China.
| | - Xiao Luo
- Academy for Engineering and Technology, Fudan University, Shanghai, 200433, China.
| | - Kun Zhou
- Academy for Engineering and Technology, Fudan University, Shanghai, 200433, China.
| | - Zhijian Xu
- Academy for Engineering and Technology, Fudan University, Shanghai, 200433, China.
| | - Daoying Geng
- Academy for Engineering and Technology, Fudan University, Shanghai, 200433, China; Department of Radiology, Huashan Hospital, Fudan University, Shanghai, 200040, China; Shanghai Engineering Research Center of Intelligent Imaging for Critical Brain Diseases, Shanghai, 200040, China; Institute of Functional and Molecular Medical Imaging, Fudan University, Shanghai, 200040, China.
| |
Collapse
|
3
|
Zhang X, Tian L, Guo S, Liu Y. STF-Net: sparsification transformer coding guided network for subcortical brain structure segmentation. BIOMED ENG-BIOMED TE 2024; 69:465-480. [PMID: 38712825 DOI: 10.1515/bmt-2023-0121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2023] [Accepted: 04/15/2024] [Indexed: 05/08/2024]
Abstract
Subcortical brain structure segmentation plays an important role in the diagnosis of neuroimaging and has become the basis of computer-aided diagnosis. Due to the blurred boundaries and complex shapes of subcortical brain structures, labeling these structures by hand becomes a time-consuming and subjective task, greatly limiting their potential for clinical applications. Thus, this paper proposes the sparsification transformer (STF) module for accurate brain structure segmentation. The self-attention mechanism is used to establish global dependencies to efficiently extract the global information of the feature map with low computational complexity. Also, the shallow network is used to compensate for low-level detail information through the localization of convolutional operations to promote the representation capability of the network. In addition, a hybrid residual dilated convolution (HRDC) module is introduced at the bottom layer of the network to extend the receptive field and extract multi-scale contextual information. Meanwhile, the octave convolution edge feature extraction (OCT) module is applied at the skip connections of the network to pay more attention to the edge features of brain structures. The proposed network is trained with a hybrid loss function. The experimental evaluation on two public datasets: IBSR and MALC, shows outstanding performance in terms of objective and subjective quality.
Collapse
Affiliation(s)
- Xiufeng Zhang
- School of Mechanical and Electrical Engineering, 66455 Dalian Minzu University , Dalian, Liaoning, China
| | - Lingzhuo Tian
- School of Mechanical and Electrical Engineering, 66455 Dalian Minzu University , Dalian, Liaoning, China
| | - Shengjin Guo
- School of Mechanical and Electrical Engineering, 66455 Dalian Minzu University , Dalian, Liaoning, China
| | - Yansong Liu
- School of Mechanical and Electrical Engineering, 66455 Dalian Minzu University , Dalian, Liaoning, China
| |
Collapse
|
4
|
Zhong T, Wang Y, Xu X, Wu X, Liang S, Ning Z, Wang L, Niu Y, Li G, Zhang Y. A brain subcortical segmentation tool based on anatomy attentional fusion network for developing macaques. Comput Med Imaging Graph 2024; 116:102404. [PMID: 38870599 DOI: 10.1016/j.compmedimag.2024.102404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 05/21/2024] [Accepted: 05/22/2024] [Indexed: 06/15/2024]
Abstract
Magnetic Resonance Imaging (MRI) plays a pivotal role in the accurate measurement of brain subcortical structures in macaques, which is crucial for unraveling the complexities of brain structure and function, thereby enhancing our understanding of neurodegenerative diseases and brain development. However, due to significant differences in brain size, structure, and imaging characteristics between humans and macaques, computational tools developed for human neuroimaging studies often encounter obstacles when applied to macaques. In this context, we propose an Anatomy Attentional Fusion Network (AAF-Net), which integrates multimodal MRI data with anatomical constraints in a multi-scale framework to address the challenges posed by the dynamic development, regional heterogeneity, and age-related size variations of the juvenile macaque brain, thus achieving precise subcortical segmentation. Specifically, we generate a Signed Distance Map (SDM) based on the initial rough segmentation of the subcortical region by a network as an anatomical constraint, providing comprehensive information on positions, structures, and morphology. Then we construct AAF-Net to fully fuse the SDM anatomical constraints and multimodal images for refined segmentation. To thoroughly evaluate the performance of our proposed tool, over 700 macaque MRIs from 19 datasets were used in this study. Specifically, we employed two manually labeled longitudinal macaque datasets to develop the tool and complete four-fold cross-validations. Furthermore, we incorporated various external datasets to demonstrate the proposed tool's generalization capabilities and promise in brain development research. We have made this tool available as an open-source resource at https://github.com/TaoZhong11/Macaque_subcortical_segmentation for direct application.
Collapse
Affiliation(s)
- Tao Zhong
- School of Biomedical Engineering, Guangdong Provincial Key Laboratory of Medical Image Processing and Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, China
| | - Ya Wang
- Department of Radiology and Biomedical Research Imaging Center, University of North Carolina at Chapel Hill, USA
| | - Xiaotong Xu
- School of Biomedical Engineering, Guangdong Provincial Key Laboratory of Medical Image Processing and Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, China
| | - Xueyang Wu
- School of Biomedical Engineering, Guangdong Provincial Key Laboratory of Medical Image Processing and Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, China
| | - Shujun Liang
- School of Biomedical Engineering, Guangdong Provincial Key Laboratory of Medical Image Processing and Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, China
| | - Zhenyuan Ning
- School of Biomedical Engineering, Guangdong Provincial Key Laboratory of Medical Image Processing and Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, China
| | - Li Wang
- Department of Radiology and Biomedical Research Imaging Center, University of North Carolina at Chapel Hill, USA
| | - Yuyu Niu
- Yunnan Key Laboratory of Primate Biomedical Research, Institute of Primate Translational Medicine, Kunming University of Science and Technology, China
| | - Gang Li
- Department of Radiology and Biomedical Research Imaging Center, University of North Carolina at Chapel Hill, USA.
| | - Yu Zhang
- School of Biomedical Engineering, Guangdong Provincial Key Laboratory of Medical Image Processing and Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, China.
| |
Collapse
|
5
|
Coupeau P, Fasquel JB, Hertz-Pannier L, Dinomais M. GNN-based structural information to improve DNN-based basal ganglia segmentation in children following early brain lesion. Comput Med Imaging Graph 2024; 115:102396. [PMID: 38744197 DOI: 10.1016/j.compmedimag.2024.102396] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Revised: 04/26/2024] [Accepted: 04/30/2024] [Indexed: 05/16/2024]
Abstract
Analyzing the basal ganglia following an early brain lesion is crucial due to their noteworthy role in sensory-motor functions. However, the segmentation of these subcortical structures on MRI is challenging in children and is further complicated by the presence of a lesion. Although current deep neural networks (DNN) perform well in segmenting subcortical brain structures in healthy brains, they lack robustness when faced with lesion variability, leading to structural inconsistencies. Given the established spatial organization of the basal ganglia, we propose enhancing the DNN-based segmentation through post-processing with a graph neural network (GNN). The GNN conducts node classification on graphs encoding both class probabilities and spatial information regarding the regions segmented by the DNN. In this study, we focus on neonatal arterial ischemic stroke (NAIS) in children. The approach is evaluated on both healthy children and children after NAIS using three DNN backbones: U-Net, UNETr, and MSGSE-Net. The results show an improvement in segmentation performance, with an increase in the median Dice score by up to 4% and a reduction in the median Hausdorff distance (HD) by up to 93% for healthy children (from 36.45 to 2.57) and up to 91% for children suffering from NAIS (from 40.64 to 3.50). The performance of the method is compared with atlas-based methods. Severe cases of neonatal stroke result in a decline in performance in the injured hemisphere, without negatively affecting the segmentation of the contra-injured hemisphere. Furthermore, the approach demonstrates resilience to small training datasets, a widespread challenge in the medical field, particularly in pediatrics and for rare pathologies.
Collapse
Affiliation(s)
- Patty Coupeau
- Universite d'Angers, LARIS, SFR MATHSTIC, F-49000 Angers, France.
| | | | - Lucie Hertz-Pannier
- UNIACT/Neurospin/JOLIOT/DRF/CEA-Saclay, and U1141 NeuroDiderot/Inserm, CEA, Paris University, France
| | - Mickaël Dinomais
- Universite d'Angers, LARIS, SFR MATHSTIC, F-49000 Angers, France; Departement de medecine physique et de readaptation, Centre Hospitalier Universitaire d'Angers, France
| |
Collapse
|
6
|
Wang M, Jiang H. Memory-Net: Coupling feature maps extraction and hierarchical feature maps reuse for efficient and effective PET/CT multi-modality image-based tumor segmentation. Knowl Based Syst 2023. [DOI: 10.1016/j.knosys.2023.110399] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/18/2023]
|
7
|
Wang M, Jiang H, Shi T, Yao YD. SCL-Net: Structured Collaborative Learning for PET/CT Based Tumor Segmentation. IEEE J Biomed Health Inform 2023; 27:1048-1059. [PMID: 37015562 DOI: 10.1109/jbhi.2022.3226475] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Collaborative learning methods for medical image segmentation are often variants of UNet, where the constructions of classifiers depend on each other and their outputs are supervised independently. However, they cannot explicitly ensure that optimizing auxiliary classifier heads leads to improved segmentation of target classifier. To resolve this problem, we propose a structured collaborative learning (SCL) method, which consists of a context-aware structured classifier population generation (CA-SCPG) module, where the feature propagation of the target classifier path is directly enhanced by the outputs of auxiliary classifiers via a light-weighted high-level context-aware dense connection (HLCA-DC) mechanism, and a knowledge-aware structured classifier population supervision (KA-SCPS) module, where the auxiliary classifiers are properly supervised under the guidance of target classifier's segmentations. Specifically, SCL is proposed based on a recurrent-dense-siamese decoder (RDS-Decoder), which consists of multiple siamese-decoder paths. CA-SCPG enhances the feature propagation of the decoder paths by HLCA-DC, which densely reuses previous decoder paths' output predictions to belong to the target classes as inputs to the latter decoder paths. KA-SCPS supervises the classifier heads simultaneously with KA-SCPS loss, which consists of a generalized weighted cross-entropy loss for deep class-imbalanced learning and a novel knowledge-aware Dice loss (KA-DL). KA-DL is a weighted Dice loss broadcasting knowledges learnt by the target classifier to other classifier heads, harmonizing the learning process of the classifier population. Experiments are performed based on PET/CT volumes with malignant melanoma, lymphoma, or lung cancer. Experimental results demonstrate the superiority of our SCL, when compared to the state-of-the-art methods and baselines.
Collapse
|
8
|
Li X, Wei Y, Hu Q, Wang C, Yang J. Learning to segment subcortical structures from noisy annotations with a novel uncertainty-reliability aware learning framework. Comput Biol Med 2022; 151:106326. [PMID: 36442274 DOI: 10.1016/j.compbiomed.2022.106326] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2022] [Revised: 10/24/2022] [Accepted: 11/14/2022] [Indexed: 11/17/2022]
Abstract
Accurate segmentation of subcortical structures is an important task in quantitative brain image analysis. Convolutional neural networks (CNNs) have achieved remarkable results in medical image segmentation. However, due to the difficulty of acquiring high-quality annotations of brain subcortical structures, learning segmentation networks using noisy annotations is an inevitable topic. A common practice is to select images or pixels with reliable annotations for training, which usually may not make full use of the information from the training samples, thus affecting the performance of the learned segmentation model. To address the above problem, in this work, we propose a novel robust learning method and denote it as uncertainty-reliability awareness learning (URAL), which can make sufficient use of all training pixels. At each training iteration, the proposed method first selects training pixels with reliable annotations from the set of pixels with uncertain network prediction, by utilizing a small clean validation set following a meta-learning paradigm. Meanwhile, we propose the online prototypical soft label correction (PSLC) method to estimate the pseudo-labels of label-unreliable pixels. Then, the segmentation loss of label-reliable pixels and the semi-supervised segmentation loss of label-unreliable pixels are used to calibrate the total segmentation loss. Finally, we propose a category-wise contrastive regularization to learn compact feature representations of all uncertain training pixels. Comprehensive experiments are performed on two publicly available brain MRI datasets. The proposed method achieves the best Dice scores and MHD values on both datasets compared to several recent state-of-the-art methods under all label noise settings. Our code is available at https://github.com/neulxlx/URAL.
Collapse
Affiliation(s)
- Xiang Li
- College of Information Science and Engineering, Northeastern University, Shenyang, 110819, China.
| | - Ying Wei
- College of Information Science and Engineering, Northeastern University, Shenyang, 110819, China; Information Technology R&D Innovation Center of Peking University, Shaoxing, China; Changsha Hisense Intelligent System Research Institute Co., Ltd., China.
| | - Qian Hu
- College of Information Science and Engineering, Northeastern University, Shenyang, 110819, China.
| | - Chuyuan Wang
- College of Information Science and Engineering, Northeastern University, Shenyang, 110819, China.
| | - Jingjing Yang
- College of Information Science and Engineering, Northeastern University, Shenyang, 110819, China.
| |
Collapse
|
9
|
Hu Q, Wei Y, Li X, Wang C, Li J, Wang Y. EA-Net: Edge-aware network for brain structure segmentation via decoupled high and low frequency features. Comput Biol Med 2022; 150:106139. [PMID: 36209556 DOI: 10.1016/j.compbiomed.2022.106139] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Revised: 09/08/2022] [Accepted: 09/18/2022] [Indexed: 11/21/2022]
Abstract
Automatic brain structure segmentation in Magnetic Resonance Image (MRI) plays an important role in the diagnosis of various neuropsychiatric diseases. However, most existing methods yield unsatisfactory results due to blurred boundaries and complex structures. Improving the segmentation ability requires the model to be explicit about the spatial localization and shape appearance of targets, which correspond to the low-frequency content features and the high-frequency edge features, respectively. Therefore, in this paper, to extract rich edge and content feature representations, we focus on the composition of the feature and utilize a frequency decoupling (FD) block to separate the low-frequency and high-frequency parts of the feature. Further, a novel edge-aware network (EA-Net) is proposed for jointly learning to segment brain structures and detect object edges. First, an encoder-decoder sub-network is utilized to obtain multi-level information from the input MRI, which is then sent to the FD block to complete the frequency separation. Further, we use different mechanisms to optimize both the low-frequency and high-frequency features. Finally, these two parts are fused to generate the final prediction. In particular, we extract the content mask and the edge mask from the optimization feature with different supervisions, which forces the network to learn the boundary features of the object. Extensive experiments are performed on two public brain MRI T1 scan datasets (the IBSR dataset and the MALC dataset) to evaluate the effectiveness of the proposed algorithm. The experiments show that the EA-Net achieves the best performance compared with the state-of-the-art methods, and improves the segmentation DSC score by 1.31% at most compared with the U-Net model and its variants. Moreover, we evaluate the EA-Net under different noise disturbances, and the results demonstrate the robustness and superiority of our method under low-quality noisy MRI. Code is available at https://github.com/huqian999/EA-Net.
Collapse
Affiliation(s)
- Qian Hu
- College of Information Science and Engineering, Northeastern University, Shenyang 110819, China.
| | - Ying Wei
- College of Information Science and Engineering, Northeastern University, Shenyang 110819, China; Information Technology R&D Innovation Center of Peking University, Shaoxing, China; Changsha Hisense Intelligent System Research Institute Co., Ltd., China.
| | - Xiang Li
- College of Information Science and Engineering, Northeastern University, Shenyang 110819, China.
| | - Chuyuan Wang
- College of Information Science and Engineering, Northeastern University, Shenyang 110819, China.
| | - Jiaguang Li
- College of Information Science and Engineering, Northeastern University, Shenyang 110819, China.
| | - Yuefeng Wang
- College of Information Science and Engineering, Northeastern University, Shenyang 110819, China.
| |
Collapse
|
10
|
Cao X, Chen H, Li Y, Peng Y, Zhou Y, Cheng L, Liu T, Shen D. Auto-DenseUNet: Searchable neural network architecture for mass segmentation in 3D automated breast ultrasound. Med Image Anal 2022; 82:102589. [DOI: 10.1016/j.media.2022.102589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2021] [Revised: 07/18/2022] [Accepted: 08/17/2022] [Indexed: 11/15/2022]
|
11
|
Li X, Wei Y, Wang C, Hu Q, Liu C. Contextual-wise discriminative feature extraction and robust network learning for subcortical structure segmentation. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03848-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
12
|
MASS: Modality-collaborative semi-supervised segmentation by exploiting cross-modal consistency from unpaired CT and MRI images. Med Image Anal 2022; 80:102506. [DOI: 10.1016/j.media.2022.102506] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Revised: 05/25/2022] [Accepted: 05/27/2022] [Indexed: 11/20/2022]
|
13
|
Wang Y, Haghpanah FS, Zhang X, Santamaria K, da Costa Aguiar Alves GK, Bruno E, Aw N, Maddocks A, Duarte CS, Monk C, Laine A, Posner J. ID-Seg: an infant deep learning-based segmentation framework to improve limbic structure estimates. Brain Inform 2022; 9:12. [PMID: 35633447 PMCID: PMC9148335 DOI: 10.1186/s40708-022-00161-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Accepted: 05/05/2022] [Indexed: 11/10/2022] Open
Abstract
Infant brain magnetic resonance imaging (MRI) is a promising approach for studying early neurodevelopment. However, segmenting small regions such as limbic structures is challenging due to their low inter-regional contrast and high curvature. MRI studies of the adult brain have successfully applied deep learning techniques to segment limbic structures, and similar deep learning models are being leveraged for infant studies. However, these deep learning-based infant MRI segmentation models have generally been derived from small datasets, and may suffer from generalization problems. Moreover, the accuracy of segmentations derived from these deep learning models relative to more standard Expectation-Maximization approaches has not been characterized. To address these challenges, we leveraged a large, public infant MRI dataset (n = 473) and the transfer-learning technique to first pre-train a deep convolutional neural network model on two limbic structures: amygdala and hippocampus. Then we used a leave-one-out cross-validation strategy to fine-tune the pre-trained model and evaluated it separately on two independent datasets with manual labels. We term this new approach the Infant Deep learning SEGmentation Framework (ID-Seg). ID-Seg performed well on both datasets with a mean dice similarity score (DSC) of 0.87, a mean intra-class correlation (ICC) of 0.93, and a mean average surface distance (ASD) of 0.31 mm. Compared to the Developmental Human Connectome pipeline (dHCP) pipeline, ID-Seg significantly improved segmentation accuracy. In a third infant MRI dataset (n = 50), we used ID-Seg and dHCP separately to estimate amygdala and hippocampus volumes and shapes. The estimates derived from ID-seg, relative to those from the dHCP, showed stronger associations with behavioral problems assessed in these infants at age 2. In sum, ID-Seg consistently performed well on two different datasets with an 0.87 DSC, however, multi-site testing and extension for brain regions beyond the amygdala and hippocampus are still needed.
Collapse
Affiliation(s)
- Yun Wang
- Department of Psychiatry and Behavioral Sciences, Duke University, Durham, NC, USA
- New York State Psychiatric Institute, New York, NY, USA
| | | | - Xuzhe Zhang
- Department of Biomedical Engineering, Columbia University, New York, NY, USA
| | | | | | | | - Natalie Aw
- New York State Psychiatric Institute, New York, NY, USA
| | - Alexis Maddocks
- Department of Radiology, Columbia University, New York, NY, USA
| | | | - Catherine Monk
- New York State Psychiatric Institute, New York, NY, USA
- Department of Obstetrics and Gynecology, Columbia University, New York, NY, USA
| | - Andrew Laine
- Department of Biomedical Engineering, Columbia University, New York, NY, USA
| | - Jonathan Posner
- Department of Psychiatry and Behavioral Sciences, Duke University, Durham, NC, USA.
- New York State Psychiatric Institute, New York, NY, USA.
| |
Collapse
|
14
|
MSGSE-Net: Multi-scale guided squeeze-and-excitation network for subcortical brain structure segmentation. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2021.07.018] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
15
|
Bao Q, Mi S, Gang B, Yang W, Chen J, Liao Q. MDAN: Mirror Difference Aware Network for Brain Stroke Lesion Segmentation. IEEE J Biomed Health Inform 2021; 26:1628-1639. [PMID: 34543208 DOI: 10.1109/jbhi.2021.3113460] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Brain stroke lesion segmentation is of great importance for stroke rehabilitation neuroimaging analysis. Due to the large variance of stroke lesion shapes and similarities of tissue intensity distribution, it remains a challenging task. To help detect abnormalities, the anatomical symmetries of brain magnetic resonance (MR) images have been widely used as visual cues for clinical practices. However, most methods do not fully utilize structural symmetry information in brain images segmentation. This paper presents a novel mirror difference aware network (MDAN) for stroke lesion segmentation in an encoder-decoder architecture, aiming at holistically exploiting the symmetries of image features. Specifically, a differential feature augmentation (DFA) module is developed in the encoding path to highlight the semantically pathological asymmetries of the features in abnormalities. In the DFA module, a Siamese contrastive supervised loss is designed to enhance discriminative features, and a mirror position-based difference augmentation (MDA) module is used to further magnify the discrepancy information. Moreover, mirror feature fusion (MFF) modules are applied to fuse and transfer the information both of the original input and the horizontally flipped features to the decoding path. Extensive experiments on the Anatomical Tracings of Lesions After Stroke (ATLAS) dataset show the proposed MDAN outperforms the state-of-the-art methods.
Collapse
|
16
|
Kushibar K, Salem M, Valverde S, Rovira À, Salvi J, Oliver A, Lladó X. Transductive Transfer Learning for Domain Adaptation in Brain Magnetic Resonance Image Segmentation. Front Neurosci 2021; 15:608808. [PMID: 33994917 PMCID: PMC8116893 DOI: 10.3389/fnins.2021.608808] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2020] [Accepted: 03/26/2021] [Indexed: 11/13/2022] Open
Abstract
Segmentation of brain images from Magnetic Resonance Images (MRI) is an indispensable step in clinical practice. Morphological changes of sub-cortical brain structures and quantification of brain lesions are considered biomarkers of neurological and neurodegenerative disorders and used for diagnosis, treatment planning, and monitoring disease progression. In recent years, deep learning methods showed an outstanding performance in medical image segmentation. However, these methods suffer from generalisability problem due to inter-centre and inter-scanner variabilities of the MRI images. The main objective of the study is to develop an automated deep learning segmentation approach that is accurate and robust to the variabilities in scanner and acquisition protocols. In this paper, we propose a transductive transfer learning approach for domain adaptation to reduce the domain-shift effect in brain MRI segmentation. The transductive scenario assumes that there are sets of images from two different domains: (1) source-images with manually annotated labels; and (2) target-images without expert annotations. Then, the network is jointly optimised integrating both source and target images into the transductive training process to segment the regions of interest and to minimise the domain-shift effect. We proposed to use a histogram loss in the feature level to carry out the latter optimisation problem. In order to demonstrate the benefit of the proposed approach, the method has been tested in two different brain MRI image segmentation problems using multi-centre and multi-scanner databases for: (1) sub-cortical brain structure segmentation; and (2) white matter hyperintensities segmentation. The experiments showed that the segmentation performance of a pre-trained model could be significantly improved by up to 10%. For the first segmentation problem it was possible to achieve a maximum improvement from 0.680 to 0.799 in average Dice Similarity Coefficient (DSC) metric and for the second problem the average DSC improved from 0.504 to 0.602. Moreover, the improvements after domain adaptation were on par or showed better performance compared to the commonly used traditional unsupervised segmentation methods (FIRST and LST), also achieving faster execution time. Taking this into account, this work presents one more step toward the practical implementation of deep learning algorithms into the clinical routine.
Collapse
Affiliation(s)
- Kaisar Kushibar
- Institute of Computer Vision and Robotics, University of Girona, Girona, Spain
| | - Mostafa Salem
- Institute of Computer Vision and Robotics, University of Girona, Girona, Spain.,Computer Science Department, Faculty of Computers and Information, Assiut University, Asyut, Egypt
| | - Sergi Valverde
- Institute of Computer Vision and Robotics, University of Girona, Girona, Spain
| | - Àlex Rovira
- Magnetic Resonance Unit, Department of Radiology, Vall d'Hebron University Hospital, Barcelona, Spain
| | - Joaquim Salvi
- Institute of Computer Vision and Robotics, University of Girona, Girona, Spain
| | - Arnau Oliver
- Institute of Computer Vision and Robotics, University of Girona, Girona, Spain
| | - Xavier Lladó
- Institute of Computer Vision and Robotics, University of Girona, Girona, Spain
| |
Collapse
|
17
|
Su R, Zhang D, Liu J, Cheng C. MSU-Net: Multi-Scale U-Net for 2D Medical Image Segmentation. Front Genet 2021; 12:639930. [PMID: 33679900 PMCID: PMC7928319 DOI: 10.3389/fgene.2021.639930] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Accepted: 01/20/2021] [Indexed: 11/15/2022] Open
Abstract
Aiming at the limitation of the convolution kernel with a fixed receptive field and unknown prior to optimal network width in U-Net, multi-scale U-Net (MSU-Net) is proposed by us for medical image segmentation. First, multiple convolution sequence is used to extract more semantic features from the images. Second, the convolution kernel with different receptive fields is used to make features more diverse. The problem of unknown network width is alleviated by efficient integration of convolution kernel with different receptive fields. In addition, the multi-scale block is extended to other variants of the original U-Net to verify its universality. Five different medical image segmentation datasets are used to evaluate MSU-Net. A variety of imaging modalities are included in these datasets, such as electron microscopy, dermoscope, ultrasound, etc. Intersection over Union (IoU) of MSU-Net on each dataset are 0.771, 0.867, 0.708, 0.900, and 0.702, respectively. Experimental results show that MSU-Net achieves the best performance on different datasets. Our implementation is available at https://github.com/CN-zdy/MSU_Net.
Collapse
Affiliation(s)
- Run Su
- Institute of Intelligent Machines, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, China
- Science Island Branch of Graduate School, University of Science and Technology of China, Hefei, China
| | - Deyun Zhang
- School of Engineering, Anhui Agricultural University, Hefei, China
| | - Jinhuai Liu
- Institute of Intelligent Machines, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, China
- Science Island Branch of Graduate School, University of Science and Technology of China, Hefei, China
| | - Chuandong Cheng
- Department of Neurosurgery, The First Affiliated Hospital of University of Science and Technology of China (USTC), Hefei, China
- Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, China
- Anhui Province Key Laboratory of Brain Function and Brain Disease, Hefei, China
| |
Collapse
|
18
|
A Deep Spatial Context Guided Framework for Infant Brain Subcortical Segmentation. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION : MICCAI ... INTERNATIONAL CONFERENCE ON MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION 2020; 12267:646-656. [PMID: 33564753 DOI: 10.1007/978-3-030-59728-3_63] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
Accurate subcortical segmentation of infant brain magnetic resonance (MR) images is crucial for studying early subcortical structural growth patterns and related diseases diagnosis. However, dynamic intensity changes, low tissue contrast, and small subcortical size of infant brain MR images make subcortical segmentation a challenging task. In this paper, we propose a spatial context guided, coarse-to-fine deep convolutional neural network (CNN) based framework for accurate infant subcortical segmentation. At the coarse stage, we propose a signed distance map (SDM) learning UNet (SDM-UNet) to predict SDMs from the original multi-modal images, including T1w, T2w, and T1w/T2w images. By doing this, the spatial context information, including the relative position information across different structures and the shape information of the segmented structures contained in the ground-truth SDMs, is used for supervising the SDM-UNet to remedy the bad influence from the low tissue contrast in infant brain MR images and generate high-quality SDMs. To improve the robustness to outliers, a Correntropy based loss is introduced in SDM-UNet to penalize the difference between the ground-truth SDMs and predicted SDMs in training. At the fine stage, the predicted SDMs, which contains spatial context information of subcortical structures, are combined with the multi-modal images, and then fed into a multi-source and multi-path UNet (M2-UNet) for delivering refined segmentation. We validate our method on an infant brain MR image dataset with 24 scans by evaluating the Dice ratio between our segmentation and the manual delineation. Compared to four state-of-the-art methods, our method consistently achieves better performances in both qualitative and quantitative evaluations.
Collapse
|