1
|
Xu X, Wang H, Lu Y, Zhang H, Tan T, Xu F, Lei J. Joint segmentation of retinal layers and fluid lesions in optical coherence tomography with cross-dataset learning. Artif Intell Med 2025; 162:103096. [PMID: 39999658 DOI: 10.1016/j.artmed.2025.103096] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2024] [Revised: 12/24/2024] [Accepted: 02/19/2025] [Indexed: 02/27/2025]
Abstract
BACKGROUND AND OBJECTIVES Age-related macular degeneration (AMD) is the leading cause of irreversible vision loss among people over 50 years old, which manifests in the retina through various changes of retinal layers and pathological lesions. The accurate segmentation of optical coherence tomography (OCT) image features is crucial for the identification and tracking of AMD. Although the recent developments in deep neural network have brought profound progress in this area, accurately segmenting retinal layers and pathological lesions remains a challenging task because of the interaction between these two tasks. METHODS In this study, we propose a three-branch, hierarchical multi-task framework that enables joint segmentation of seven retinal layers and three types of pathological lesions. A regression guidance module is introduced to provide explicit shape guidance between sub-tasks. We also propose a cross-dataset learning strategy to leverage public datasets with partial labels. The proposed framework was evaluated on a clinical dataset consisting of 140 OCT B-scans with pixel-level annotations of seven retinal layers and three types of lesions. Additionally, we compared its performance with the state-of-the-art methods on two public datasets. RESULTS Comprehensive ablation showed that the proposed hierarchical architecture significantly improved performance for most retinal layers and pathological lesions, achieving the highest mean DSC of 76.88 %. The IRF also achieved the best performance with a DSC of 68.15 %. Comparative studies demonstrated that the hierarchical multi-task architecture could significantly enhance segmentation accuracy and outperform state-of-the-art methods. CONCLUSION The proposed framework could also be generalized to other medical image segmentation tasks with interdependent relationships.
Collapse
Affiliation(s)
- Xiayu Xu
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an 710049, PR China; Bioinspired Engineering and Biomechanics Center (BEBC), Xi'an Jiaotong University, Xi'an 710049, PR China.
| | - Hualin Wang
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an 710049, PR China; Bioinspired Engineering and Biomechanics Center (BEBC), Xi'an Jiaotong University, Xi'an 710049, PR China
| | - Yulei Lu
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an 710049, PR China; Bioinspired Engineering and Biomechanics Center (BEBC), Xi'an Jiaotong University, Xi'an 710049, PR China
| | - Hanze Zhang
- Department of Ophthalmology, First Affiliated Hospital of Xi'an Jiaotong University, Xi'an 710049, PR China
| | - Tao Tan
- Faculty of Applied Sciences, Macao Polytechnic University, 999078, Macao
| | - Feng Xu
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an 710049, PR China; Bioinspired Engineering and Biomechanics Center (BEBC), Xi'an Jiaotong University, Xi'an 710049, PR China
| | - Jianqin Lei
- Department of Ophthalmology, First Affiliated Hospital of Xi'an Jiaotong University, Xi'an 710049, PR China.
| |
Collapse
|
2
|
Liu X, Li X, Zhang Y, Wang M, Yao J, Tang J. Boundary-Repairing Dual-Path Network for Retinal Layer Segmentation in OCT Image with Pigment Epithelial Detachment. JOURNAL OF IMAGING INFORMATICS IN MEDICINE 2024; 37:3101-3130. [PMID: 38740662 PMCID: PMC11612104 DOI: 10.1007/s10278-024-01093-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Revised: 03/05/2024] [Accepted: 03/06/2024] [Indexed: 05/16/2024]
Abstract
Automatic retinal layer segmentation in optical coherence tomography (OCT) images is crucial for the diagnosis of ocular diseases. Currently, automatic retinal layer segmentation works well with normal OCT images. However, pigment epithelial detachment (PED) dramatically alters the retinal structure, causing blurred boundaries and partial disappearance of the Bruch's Membrane (BM), thus posing challenges to the segmentation. To tackle these problems, we propose a novel dual-path U-shaped network for simultaneous layer segmentation and boundary regression. This network first designs a feature interaction fusion (FIF) module to strengthen the boundary shape constraints in the layer path. To address the challenge posed by partial BM disappearance and boundary-blurring, we propose a layer boundary repair (LBR) module. This module aims to use contrastive loss to enhance the confidence of blurred boundary regions and refine the segmentation of layer boundaries through the re-prediction head. In addition, we introduce a novel bilateral threshold distance map (BTDM) designed for the boundary path. The BTDM serves to emphasize information within boundary regions. This map, combined with the updated probability map, culminates in topology-guaranteed segmentation results achieved through a topology correction (TC) module. We investigated the proposed network on two severely deformed datasets (i.e., OCTA-500 and Aier-PED) and one slightly deformed dataset (i.e., DUKE). The proposed method achieves an average Dice score of 94.26% on the OCTA-500 dataset, which was 1.5% higher than BAU-Net and outperformed other methods. In the DUKE and Aier-PED datasets, the proposed method achieved average Dice scores of 91.65% and 95.75%, respectively.
Collapse
Affiliation(s)
- Xiaoming Liu
- School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan, 430065, China.
- Hubei Province Key Laboratory of Intelligent Information Processing and Real-Time Industrial System, Wuhan, 430065, China.
| | - Xiao Li
- School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan, 430065, China
- Hubei Province Key Laboratory of Intelligent Information Processing and Real-Time Industrial System, Wuhan, 430065, China
| | - Ying Zhang
- Wuhan Aier Eye Hospital of Wuhan University, Wuhan, China
| | - Man Wang
- Wuhan Aier Eye Hospital of Wuhan University, Wuhan, China
| | - Junping Yao
- Department of Ophthalmology, Tianyou Hospital Affiliated to Wuhan University of Science and Technology, Wuhan, China
| | - Jinshan Tang
- Department of Health Administration and Policy, College of Health and Human Services, George Mason University, Fairfax, VA, 22030, USA
| |
Collapse
|
3
|
Baştuğ BT, Güneri G, Yıldırım MS, Çorbacı K, Dandıl E. Fully Automated Detection of the Appendix Using U-Net Deep Learning Architecture in CT Scans. J Clin Med 2024; 13:5893. [PMID: 39407953 PMCID: PMC11478302 DOI: 10.3390/jcm13195893] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2024] [Revised: 09/29/2024] [Accepted: 09/30/2024] [Indexed: 10/20/2024] Open
Abstract
Background: The accurate segmentation of the appendix with well-defined boundaries is critical for diagnosing conditions such as acute appendicitis. The manual identification of the appendix is time-consuming and highly dependent on the expertise of the radiologist. Method: In this study, we propose a fully automated approach to the detection of the appendix using deep learning architecture based on the U-Net with specific training parameters in CT scans. The proposed U-Net architecture is trained on an annotated original dataset of abdominal CT scans to segment the appendix efficiently and with high performance. In addition, to extend the training set, data augmentation techniques are applied for the created dataset. Results: In experimental studies, the proposed U-Net model is implemented using hyperparameter optimization and the performance of the model is evaluated using key metrics to measure diagnostic reliability. The trained U-Net model achieved the segmentation performance for the detection of the appendix in CT slices with a Dice Similarity Coefficient (DSC), Volumetric Overlap Error (VOE), Average Symmetric Surface Distance (ASSD), Hausdorff Distance 95 (HD95), Precision (PRE) and Recall (REC) of 85.94%, 23.29%, 1.24 mm, 5.43 mm, 86.83% and 86.62%, respectively. Moreover, our model outperforms other methods by leveraging the U-Net's ability to capture spatial context through encoder-decoder structures and skip connections, providing a correct segmentation output. Conclusions: The proposed U-Net model showed reliable performance in segmenting the appendix region, with some limitations in cases where the appendix was close to other structures. These improvements highlight the potential of deep learning to significantly improve clinical outcomes in appendix detection.
Collapse
Affiliation(s)
- Betül Tiryaki Baştuğ
- Department of Radiology, Medical Faculty, Bilecik Şeyh Edebali University, Bilecik 11230, Türkiye
| | - Gürkan Güneri
- Department of General Surgery, Medical Faculty, Bilecik Şeyh Edebali University, Bilecik 11230, Türkiye;
| | - Mehmet Süleyman Yıldırım
- Department of Sogut Vocational School, Computer Technology, Bilecik Şeyh Edebali University, Bilecik 11600, Türkiye;
| | - Kadir Çorbacı
- Department of General Surgery, Bilecik Osmaneli Mustafa Selahattin Çetintaş Hospital, Bilecik 11500, Türkiye;
| | - Emre Dandıl
- Department of Computer Engineering, Faculty of Engineering, Bilecik Seyh Edebali University, Bilecik 11230, Türkiye
| |
Collapse
|
4
|
Ma F, Wang S, Guo Y, Dai C, Meng J. Image segmentation of mouse eye in vivo with optical coherence tomography based on Bayesian classification. BIOMED ENG-BIOMED TE 2024; 69:307-315. [PMID: 38178615 DOI: 10.1515/bmt-2023-0266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Accepted: 12/22/2023] [Indexed: 01/06/2024]
Abstract
OBJECTIVES Optical coherence tomography (OCT) is a new imaging technology that uses an optical analog of ultrasound imaging for biological tissues. Image segmentation plays an important role in dealing with quantitative analysis of medical images. METHODS We have proposed a novel framework to deal with the low intensity problem, based on the labeled patches and Bayesian classification (LPBC) model. The proposed method includes training and testing phases. During the training phase, firstly, we manually select the sub-images of background and Region of Interest (ROI) from the training image, and then extract features by patches. Finally, we train the Bayesian model with the features. The segmentation threshold of each patch is computed by the learned Bayesian model. RESULTS In addition, we have collected a new dataset of mouse eyes in vivo with OCT, named MEVOCT, which can be found at URL https://17861318579.github.io/LPBC. MEVOCT consists of 20 high-resolution images. The resolution of every image is 2048 × 2048 pixels. CONCLUSIONS The experimental results demonstrate the effectiveness of the LPBC method on the new MEVOCT dataset. The ROI segmentation is of great importance for the distortion correction.
Collapse
Affiliation(s)
- Fei Ma
- School of Computer Science, Qufu Normal University, Rizhao, Shandong, China
| | - Shengbo Wang
- School of Computer Science, Qufu Normal University, Rizhao, Shandong, China
| | - Yanfei Guo
- School of Computer Science, Qufu Normal University, Rizhao, Shandong, China
| | - Cuixia Dai
- Department of College Science, Shanghai Institute of Technology, Shanghai, Shanghai, China
| | - Jing Meng
- School of Computer Science, Qufu Normal University, Rizhao, Shandong, China
| |
Collapse
|
5
|
Zheng H, Hung ALY, Miao Q, Song W, Scalzo F, Raman SS, Zhao K, Sung K. AtPCa-Net: anatomical-aware prostate cancer detection network on multi-parametric MRI. Sci Rep 2024; 14:5740. [PMID: 38459100 PMCID: PMC10923873 DOI: 10.1038/s41598-024-56405-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Accepted: 03/06/2024] [Indexed: 03/10/2024] Open
Abstract
Multi-parametric MRI (mpMRI) is widely used for prostate cancer (PCa) diagnosis. Deep learning models show good performance in detecting PCa on mpMRI, but domain-specific PCa-related anatomical information is sometimes overlooked and not fully explored even by state-of-the-art deep learning models, causing potential suboptimal performances in PCa detection. Symmetric-related anatomical information is commonly used when distinguishing PCa lesions from other visually similar but benign prostate tissue. In addition, different combinations of mpMRI findings are used for evaluating the aggressiveness of PCa for abnormal findings allocated in different prostate zones. In this study, we investigate these domain-specific anatomical properties in PCa diagnosis and how we can adopt them into the deep learning framework to improve the model's detection performance. We propose an anatomical-aware PCa detection Network (AtPCa-Net) for PCa detection on mpMRI. Experiments show that the AtPCa-Net can better utilize the anatomical-related information, and the proposed anatomical-aware designs help improve the overall model performance on both PCa detection and patient-level classification.
Collapse
Affiliation(s)
- Haoxin Zheng
- Radiological Sciences, University of California, Los Angeles, Los Angeles, 90095, USA.
- Computer Science, University of California, Los Angeles, Los Angeles, 90095, USA.
| | - Alex Ling Yu Hung
- Radiological Sciences, University of California, Los Angeles, Los Angeles, 90095, USA
- Computer Science, University of California, Los Angeles, Los Angeles, 90095, USA
| | - Qi Miao
- Radiological Sciences, University of California, Los Angeles, Los Angeles, 90095, USA
| | - Weinan Song
- Electrical and Computer Engineering, University of California, Los Angeles, Los Angeles, 90095, USA
| | - Fabien Scalzo
- Computer Science, University of California, Los Angeles, Los Angeles, 90095, USA
- The Seaver College, Pepperdine University, Los Angeles, 90363, USA
| | - Steven S Raman
- Radiological Sciences, University of California, Los Angeles, Los Angeles, 90095, USA
| | - Kai Zhao
- Radiological Sciences, University of California, Los Angeles, Los Angeles, 90095, USA
| | - Kyunghyun Sung
- Radiological Sciences, University of California, Los Angeles, Los Angeles, 90095, USA
| |
Collapse
|
6
|
Ma D, Deng W, Khera Z, Sajitha TA, Wang X, Wollstein G, Schuman JS, Lee S, Shi H, Ju MJ, Matsubara J, Beg MF, Sarunic M, Sappington RM, Chan KC. Early inner plexiform layer thinning and retinal nerve fiber layer thickening in excitotoxic retinal injury using deep learning-assisted optical coherence tomography. Acta Neuropathol Commun 2024; 12:19. [PMID: 38303097 PMCID: PMC10835918 DOI: 10.1186/s40478-024-01732-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Accepted: 01/14/2024] [Indexed: 02/03/2024] Open
Abstract
Excitotoxicity from the impairment of glutamate uptake constitutes an important mechanism in neurodegenerative diseases such as Alzheimer's, multiple sclerosis, and Parkinson's disease. Within the eye, excitotoxicity is thought to play a critical role in retinal ganglion cell death in glaucoma, diabetic retinopathy, retinal ischemia, and optic nerve injury, yet how excitotoxic injury impacts different retinal layers is not well understood. Here, we investigated the longitudinal effects of N-methyl-D-aspartate (NMDA)-induced excitotoxic retinal injury in a rat model using deep learning-assisted retinal layer thickness estimation. Before and after unilateral intravitreal NMDA injection in nine adult Long Evans rats, spectral-domain optical coherence tomography (OCT) was used to acquire volumetric retinal images in both eyes over 4 weeks. Ten retinal layers were automatically segmented from the OCT data using our deep learning-based algorithm. Retinal degeneration was evaluated using layer-specific retinal thickness changes at each time point (before, and at 3, 7, and 28 days after NMDA injection). Within the inner retina, our OCT results showed that retinal thinning occurred first in the inner plexiform layer at 3 days after NMDA injection, followed by the inner nuclear layer at 7 days post-injury. In contrast, the retinal nerve fiber layer exhibited an initial thickening 3 days after NMDA injection, followed by normalization and thinning up to 4 weeks post-injury. Our results demonstrated the pathological cascades of NMDA-induced neurotoxicity across different layers of the retina. The early inner plexiform layer thinning suggests early dendritic shrinkage, whereas the initial retinal nerve fiber layer thickening before subsequent normalization and thinning indicates early inflammation before axonal loss and cell death. These findings implicate the inner plexiform layer as an early imaging biomarker of excitotoxic retinal degeneration, whereas caution is warranted when interpreting the ganglion cell complex combining retinal nerve fiber layer, ganglion cell layer, and inner plexiform layer thicknesses in conventional OCT measures. Deep learning-assisted retinal layer segmentation and longitudinal OCT monitoring can help evaluate the different phases of retinal layer damage upon excitotoxicity.
Collapse
Affiliation(s)
- Da Ma
- Wake Forest University School of Medicine, 1 Medical Center Blvd, Winston-Salem, NC, 27157, USA.
- Wake Forest University Health Sciences, Winston-Salem, NC, USA.
- Translational Eye and Vision Research Center, Wake Forest University School of Medicine, Winston-Salem, NC, USA.
- School of Engineering Science, Simon Fraser University, Burnaby, BC, Canada.
| | - Wenyu Deng
- Department of Ophthalmology, NYU Grossman School of Medicine, NYU Langone Health, New York University, New York, NY, USA
- Department of Ophthalmology, SUNY Downstate Medical Center, Brooklyn, NY, USA
| | - Zain Khera
- Department of Ophthalmology, NYU Grossman School of Medicine, NYU Langone Health, New York University, New York, NY, USA
| | - Thajunnisa A Sajitha
- Department of Ophthalmology, NYU Grossman School of Medicine, NYU Langone Health, New York University, New York, NY, USA
| | - Xinlei Wang
- Department of Ophthalmology, NYU Grossman School of Medicine, NYU Langone Health, New York University, New York, NY, USA
| | - Gadi Wollstein
- Department of Ophthalmology, NYU Grossman School of Medicine, NYU Langone Health, New York University, New York, NY, USA
- Center for Neural Science, College of Arts and Science, New York University, New York, NY, USA
- Department of Biomedical Engineering, Tandon School of Engineering, New York University, Brooklyn, NY, USA
| | - Joel S Schuman
- Department of Ophthalmology, NYU Grossman School of Medicine, NYU Langone Health, New York University, New York, NY, USA
- Center for Neural Science, College of Arts and Science, New York University, New York, NY, USA
- Department of Biomedical Engineering, Tandon School of Engineering, New York University, Brooklyn, NY, USA
- Wills Eye Hospital, Philadelphia, PA, USA
- Department of Biomedical Engineering, Drexel University, Philadelphia, PA, USA
- Neuroscience Institute, NYU Grossman School of Medicine, NYU Langone Health, New York University, New York, NY, USA
| | - Sieun Lee
- School of Engineering Science, Simon Fraser University, Burnaby, BC, Canada
- Department of Ophthalmology and Visual Sciences, The University of British Columbia, Vancouver, BC, Canada
- Mental Health and Clinical Neurosciences, School of Medicine, University of Nottingham, Nottingham, UK
| | - Haolun Shi
- Department of Statistics and Actuarial Science, Simon Fraser University, Burnaby, BC, Canada
| | - Myeong Jin Ju
- Department of Ophthalmology and Visual Sciences, The University of British Columbia, Vancouver, BC, Canada
| | - Joanne Matsubara
- Department of Ophthalmology and Visual Sciences, The University of British Columbia, Vancouver, BC, Canada
| | - Mirza Faisal Beg
- School of Engineering Science, Simon Fraser University, Burnaby, BC, Canada
| | - Marinko Sarunic
- Institute of Ophthalmology, University College London, London, UK
- Department of Medical Physics and Biomedical Engineering, University College London, London, UK
| | - Rebecca M Sappington
- Wake Forest University School of Medicine, 1 Medical Center Blvd, Winston-Salem, NC, 27157, USA
- Wake Forest University Health Sciences, Winston-Salem, NC, USA
- Translational Eye and Vision Research Center, Wake Forest University School of Medicine, Winston-Salem, NC, USA
| | - Kevin C Chan
- Department of Ophthalmology, NYU Grossman School of Medicine, NYU Langone Health, New York University, New York, NY, USA.
- Center for Neural Science, College of Arts and Science, New York University, New York, NY, USA.
- Department of Biomedical Engineering, Tandon School of Engineering, New York University, Brooklyn, NY, USA.
- Neuroscience Institute, NYU Grossman School of Medicine, NYU Langone Health, New York University, New York, NY, USA.
- Department of Radiology, NYU Grossman School of Medicine, NYU Langone Health, New York University, New York, NY, USA.
| |
Collapse
|
7
|
Sun K, Chen Y, Dong F, Wu Q, Geng J, Chen Y. Retinal vessel segmentation method based on RSP-SA Unet network. Med Biol Eng Comput 2024; 62:605-620. [PMID: 37964177 DOI: 10.1007/s11517-023-02960-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Accepted: 10/28/2023] [Indexed: 11/16/2023]
Abstract
Segmenting retinal vessels plays a significant role in the diagnosis of fundus disorders. However, there are two problems in the retinal vessel segmentation methods. First, fine-grained features of fine blood vessels are difficult to be extracted. Second, it is easy to lose track of the details of blood vessel edges. To solve the problems above, the Residual SimAM Pyramid-Spatial Attention Unet (RSP-SA Unet) is proposed, in which the encoding, decoding, and upsampling layers of the Unet are mainly improved. Firstly, the RSP structure proposed in this paper approximates a residual structure combined with SimAM and Pyramid Segmentation Attention (PSA), which is applied to the encoding and decoding parts to extract multi-scale spatial information and important features across dimensions at a finer level. Secondly, the spatial attention (SA) is used in the upsampling layer to perform multi-attention mapping on the input feature map, which could enhance the segmentation effect of small blood vessels with low contrast. Finally, the RSP-SA Unet is verified on the CHASE_DB1, DRIVE, and STARE datasets, and the segmentation accuracy (ACC) of the RSP-SA Unet could reach 0.9763, 0.9704, and 0.9724, respectively. Area under the ROC curve (AUC) could reach 0.9896, 0.9858, and 0.9906, respectively. The RSP-SA Unet overall performance is better than the comparison methods.
Collapse
Affiliation(s)
- Kun Sun
- The Higher Educational Key Laboratory for Measuring & Control Technology and Instrumentation of Heilongjiang Province, Harbin University of Science and Technology, Harbin, China
- Teaching Demonstration Center for Measurement and Control Technology and Instrumentation, National Experimental, Harbin University of Science and Technology, Harbin, China
| | - Yang Chen
- The Higher Educational Key Laboratory for Measuring & Control Technology and Instrumentation of Heilongjiang Province, Harbin University of Science and Technology, Harbin, China
- Teaching Demonstration Center for Measurement and Control Technology and Instrumentation, National Experimental, Harbin University of Science and Technology, Harbin, China
| | - Fuxuan Dong
- The Higher Educational Key Laboratory for Measuring & Control Technology and Instrumentation of Heilongjiang Province, Harbin University of Science and Technology, Harbin, China
- Teaching Demonstration Center for Measurement and Control Technology and Instrumentation, National Experimental, Harbin University of Science and Technology, Harbin, China
| | - Qing Wu
- The Higher Educational Key Laboratory for Measuring & Control Technology and Instrumentation of Heilongjiang Province, Harbin University of Science and Technology, Harbin, China.
- Teaching Demonstration Center for Measurement and Control Technology and Instrumentation, National Experimental, Harbin University of Science and Technology, Harbin, China.
- Heilongjiang Province Key Laboratory of Laser Spectroscopy Technology and Application, Harbin University of Science and Technology, Harbin, China.
| | - Jiameng Geng
- The Higher Educational Key Laboratory for Measuring & Control Technology and Instrumentation of Heilongjiang Province, Harbin University of Science and Technology, Harbin, China
- Teaching Demonstration Center for Measurement and Control Technology and Instrumentation, National Experimental, Harbin University of Science and Technology, Harbin, China
| | - Yinsheng Chen
- The Higher Educational Key Laboratory for Measuring & Control Technology and Instrumentation of Heilongjiang Province, Harbin University of Science and Technology, Harbin, China
- Teaching Demonstration Center for Measurement and Control Technology and Instrumentation, National Experimental, Harbin University of Science and Technology, Harbin, China
| |
Collapse
|
8
|
Wei X, Sui R. A Review of Machine Learning Algorithms for Retinal Cyst Segmentation on Optical Coherence Tomography. SENSORS (BASEL, SWITZERLAND) 2023; 23:3144. [PMID: 36991857 PMCID: PMC10054815 DOI: 10.3390/s23063144] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Revised: 03/02/2023] [Accepted: 03/07/2023] [Indexed: 06/19/2023]
Abstract
Optical coherence tomography (OCT) is an emerging imaging technique for diagnosing ophthalmic diseases and the visual analysis of retinal structure changes, such as exudates, cysts, and fluid. In recent years, researchers have increasingly focused on applying machine learning algorithms, including classical machine learning and deep learning methods, to automate retinal cysts/fluid segmentation. These automated techniques can provide ophthalmologists with valuable tools for improved interpretation and quantification of retinal features, leading to more accurate diagnosis and informed treatment decisions for retinal diseases. This review summarized the state-of-the-art algorithms for the three essential steps of cyst/fluid segmentation: image denoising, layer segmentation, and cyst/fluid segmentation, while emphasizing the significance of machine learning techniques. Additionally, we provided a summary of the publicly available OCT datasets for cyst/fluid segmentation. Furthermore, the challenges, opportunities, and future directions of artificial intelligence (AI) in OCT cyst segmentation are discussed. This review is intended to summarize the key parameters for the development of a cyst/fluid segmentation system and the design of novel segmentation algorithms and has the potential to serve as a valuable resource for imaging researchers in the development of assessment systems related to ocular diseases exhibiting cyst/fluid in OCT imaging.
Collapse
|
9
|
Chen S, Ma D, Lee S, Yu TTL, Xu G, Lu D, Popuri K, Ju MJ, Sarunic MV, Beg MF. Segmentation-guided domain adaptation and data harmonization of multi-device retinal optical coherence tomography using cycle-consistent generative adversarial networks. Comput Biol Med 2023; 159:106595. [PMID: 37087780 DOI: 10.1016/j.compbiomed.2023.106595] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Revised: 12/26/2022] [Accepted: 01/22/2023] [Indexed: 03/06/2023]
Abstract
BACKGROUND Medical images such as Optical Coherence Tomography (OCT) images acquired from different devices may show significantly different intensity profiles. An automatic segmentation model trained on images from one device may perform poorly when applied to images acquired using another device, resulting in a lack of generalizability. This study addresses this issue using domain adaptation methods improved by Cycle-Consistent Generative Adversarial Networks (CycleGAN), especially when the ground-truth labels are only available in the source domain. METHODS A two-stage pipeline is proposed to generate segmentation in the target domain. The first stage involves the training of a state-of-the-art segmentation model in the source domain. The second stage aims to adapt the images from the target domain to the source domain. The adapted target domain images are segmented using the model in the first stage. Ablation tests were performed with integration of different loss functions, and the statistical significance of these models is reported. Both the segmentation performance and the adapted image quality metrics were evaluated. RESULTS Regarding the segmentation Dice score, the proposed model ssppg achieves a significant improvement of 46.24% compared to without adaptation and reaches 87.4% of the upper limit of the segmentation performance. Furthermore, image quality metrics, including FID and KID scores, indicate that adapted images with better segmentation also have better image qualities. CONCLUSION The proposed method demonstrates the effectiveness of segmentation-driven domain adaptation in retinal imaging processing. It reduces the labor cost of manual labeling, incorporates prior anatomic information to regulate and guide domain adaptation, and provides insights into improving segmentation qualities in image domains without labels.
Collapse
Affiliation(s)
- Shuo Chen
- School of Engineering Science, Simon Fraser University, Burnaby, BC, Canada.
| | - Da Ma
- School of Engineering Science, Simon Fraser University, Burnaby, BC, Canada; Department of Internal Medicine, Section of Gerontology and Geriatric Medicine, Wake Forest University School of Medicine, Winston-Salem, NC, USA; Center for Biomedical Informatics, Wake Forest School of Medicine, Winston-Salem, North Carolina, USA; Alzheimer's Disease Research Center, Wake Forest School of Medicine, Winston-Salem, North Carolina, USA.
| | - Sieun Lee
- Mental Health & Clinical Neuroscience, School of Medicine, University of Nottingham, Nottingham, UK; Precision Imaging Beacon, University of Nottingham, Nottingham, UK
| | - Timothy T L Yu
- School of Engineering Science, Simon Fraser University, Burnaby, BC, Canada
| | - Gavin Xu
- School of Engineering Science, Simon Fraser University, Burnaby, BC, Canada
| | - Donghuan Lu
- School of Engineering Science, Simon Fraser University, Burnaby, BC, Canada; Tencent Jarvis Lab, Shenzhen, China
| | - Karteek Popuri
- School of Engineering Science, Simon Fraser University, Burnaby, BC, Canada; Department of Computer Science, Memorial University of Newfoundland, St. John's, NL, Canada
| | - Myeong Jin Ju
- School of Biomedical Engineering, University of British Columbia, BC, Canada; Department of Ophthalmology & Visual Sciences, The University of British Columbia, Vancouver, BC, Canada
| | - Marinko V Sarunic
- School of Engineering Science, Simon Fraser University, Burnaby, BC, Canada; Institute of Ophthalmology, University College London, London, UK; Department of Medical Physics and Biomedical Engineering, University College London, UK
| | - Mirza Faisal Beg
- School of Engineering Science, Simon Fraser University, Burnaby, BC, Canada.
| |
Collapse
|
10
|
Philippi D, Rothaus K, Castelli M. A vision transformer architecture for the automated segmentation of retinal lesions in spectral domain optical coherence tomography images. Sci Rep 2023; 13:517. [PMID: 36627357 PMCID: PMC9832034 DOI: 10.1038/s41598-023-27616-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2022] [Accepted: 01/04/2023] [Indexed: 01/12/2023] Open
Abstract
Neovascular age-related macular degeneration (nAMD) is one of the major causes of irreversible blindness and is characterized by accumulations of different lesions inside the retina. AMD biomarkers enable experts to grade the AMD and could be used for therapy prognosis and individualized treatment decisions. In particular, intra-retinal fluid (IRF), sub-retinal fluid (SRF), and pigment epithelium detachment (PED) are prominent biomarkers for grading neovascular AMD. Spectral-domain optical coherence tomography (SD-OCT) revolutionized nAMD early diagnosis by providing cross-sectional images of the retina. Automatic segmentation and quantification of IRF, SRF, and PED in SD-OCT images can be extremely useful for clinical decision-making. Despite the excellent performance of convolutional neural network (CNN)-based methods, the task still presents some challenges due to relevant variations in the location, size, shape, and texture of the lesions. This work adopts a transformer-based method to automatically segment retinal lesion from SD-OCT images and qualitatively and quantitatively evaluate its performance against CNN-based methods. The method combines the efficient long-range feature extraction and aggregation capabilities of Vision Transformers with data-efficient training of CNNs. The proposed method was tested on a private dataset containing 3842 2-dimensional SD-OCT retina images, manually labeled by experts of the Franziskus Eye-Center, Muenster. While one of the competitors presents a better performance in terms of Dice score, the proposed method is significantly less computationally expensive. Thus, future research will focus on the proposed network's architecture to increase its segmentation performance while maintaining its computational efficiency.
Collapse
Affiliation(s)
- Daniel Philippi
- grid.10772.330000000121511713NOVA Information Management School (NOVA IMS), Universidade Nova de Lisboa, 1070-312 Lisbon, Portugal
| | - Kai Rothaus
- grid.416655.5Department of Ophthalmology, St. Franziskus Hospital, 48145 Muenster, Germany
| | - Mauro Castelli
- NOVA Information Management School (NOVA IMS), Universidade Nova de Lisboa, 1070-312, Lisbon, Portugal. .,School of Economics and Business, University of Ljubljana, Ljubljana, Slovenia.
| |
Collapse
|
11
|
Ma D, Pasquale LR, Girard MJA, Leung CKS, Jia Y, Sarunic MV, Sappington RM, Chan KC. Reverse translation of artificial intelligence in glaucoma: Connecting basic science with clinical applications. FRONTIERS IN OPHTHALMOLOGY 2023; 2:1057896. [PMID: 36866233 PMCID: PMC9976697 DOI: 10.3389/fopht.2022.1057896] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Accepted: 12/05/2022] [Indexed: 04/16/2023]
Abstract
Artificial intelligence (AI) has been approved for biomedical research in diverse areas from bedside clinical studies to benchtop basic scientific research. For ophthalmic research, in particular glaucoma, AI applications are rapidly growing for potential clinical translation given the vast data available and the introduction of federated learning. Conversely, AI for basic science remains limited despite its useful power in providing mechanistic insight. In this perspective, we discuss recent progress, opportunities, and challenges in the application of AI in glaucoma for scientific discoveries. Specifically, we focus on the research paradigm of reverse translation, in which clinical data are first used for patient-centered hypothesis generation followed by transitioning into basic science studies for hypothesis validation. We elaborate on several distinctive areas of research opportunities for reverse translation of AI in glaucoma including disease risk and progression prediction, pathology characterization, and sub-phenotype identification. We conclude with current challenges and future opportunities for AI research in basic science for glaucoma such as inter-species diversity, AI model generalizability and explainability, as well as AI applications using advanced ocular imaging and genomic data.
Collapse
Affiliation(s)
- Da Ma
- School of Medicine, Wake Forest University, Winston-Salem, NC, United States
- Atrium Health Wake Forest Baptist Medical Center, Winston-Salem, NC, United States
- School of Engineering Science, Simon Fraser University, Burnaby, BC, Canada
| | - Louis R. Pasquale
- Department of Ophthalmology, Icahn School of Medicine at Mount Sinai, New York, NY, United States
| | - Michaël J. A. Girard
- Ophthalmic Engineering & Innovation Laboratory (OEIL), Singapore Eye Research Institute, Singapore National Eye Centre, Singapore, Singapore
- Duke-NUS Medical School, Singapore, Singapore
- Institute for Molecular and Clinical Ophthalmology, Basel, Switzerland
| | | | - Yali Jia
- Casey Eye Institute, Oregon Health & Science University, Portland, OR, United States
| | - Marinko V. Sarunic
- School of Engineering Science, Simon Fraser University, Burnaby, BC, Canada
- Institute of Ophthalmology, University College London, London, United Kingdom
| | - Rebecca M. Sappington
- School of Medicine, Wake Forest University, Winston-Salem, NC, United States
- Atrium Health Wake Forest Baptist Medical Center, Winston-Salem, NC, United States
| | - Kevin C. Chan
- Departments of Ophthalmology and Radiology, Neuroscience Institute, NYU Grossman School of Medicine, NYU Langone Health, New York University, New York, NY, United States
- Department of Biomedical Engineering, Tandon School of Engineering, New York University, New York, NY, United States
| |
Collapse
|
12
|
Wu J, Liu S, Xiao Z, Zhang F, Geng L. Joint segmentation of retinal layers and macular edema in optical coherence tomography scans based on RLMENet. Med Phys 2022; 49:7150-7166. [PMID: 36574592 DOI: 10.1002/mp.15866] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Revised: 07/05/2022] [Accepted: 07/09/2022] [Indexed: 12/29/2022] Open
Abstract
PURPOSE The segmentation of retinal layers and fluid lesions on retinal optical coherence tomography (OCT) images is an important component of screening and diagnosing retinopathy in clinical ophthalmic treatment. We designed a novel network for accurate segmentation of the seven tissue layers of the retina and lesion areas of diabetic macular edema (DME), which can assist doctors to quantitatively analyze the disease. METHODS In this article, we propose the Retinal Layer Macular Edema Network (RLMENet) model to achieve end-to-end joint segmentation of retinal layers and fluids. The network employs dense multi-scale attention to enhance the extraction of retinal layer and fluid detail information and achieve efficient long-range modeling, which improves the receptive field and obtains multi-scale features. As the more complex decoder part is designed, which integrates more low-level feature information on the decoder side, more features are extracted to gradually restore the resolution of the feature map and improve the segmentation accuracy. RESULTS We used part of the OCT2017 dataset to train and verify the model to divide the data into a training set, validation set, and test set and set it to a 7:2:1 ratio. We evaluated our method on the ISIC2017 dataset. Experimental results showed that the RLMENet model designed in this work can accurately segment seven retinal tissue layers and DME lesions on the retinal OCT dataset. Finally, the MIoU value in the test set reached 86.55%. The model can be extended to other medical image segmentation datasets to achieve better segmentation performance. CONCLUSIONS The proposed method was superior to the existing segmentation methods, achieved a more refined segmentation effect, and provided an auxiliary analysis tool for clinical diagnosis and treatment.
Collapse
Affiliation(s)
- Jun Wu
- School of Electronics and Information Engineering, TianGong University, Tianjin, China
| | - Shuang Liu
- School of Electronics and Information Engineering, TianGong University, Tianjin, China
| | - Zhitao Xiao
- School of Life Sciences, TianGong University, Tianjin, China
| | - Fang Zhang
- School of Life Sciences, TianGong University, Tianjin, China
| | - Lei Geng
- School of Life Sciences, TianGong University, Tianjin, China
| |
Collapse
|
13
|
Miao Y, Song J, Ju MJ. Image-based cross-calibration method for multiple spectrometer-based OCT. OPTICS LETTERS 2022; 47:5096-5099. [PMID: 36181195 DOI: 10.1364/ol.468707] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Accepted: 08/31/2022] [Indexed: 06/16/2023]
Abstract
A fast and practical computational cross-calibration of multiple spectrometers is described. A signal correlation matrix (CM) can be constructed from paired B-scans in a multiple-spectrometer optical coherence tomography (OCT), where the wavelength-corresponding pixels are indicated by high cross correlation. The CM can be used to either guide the physical alignment of spectrometers or to numerically match the spectra in the post-process. The performance is comparable to the previously reported optimization approach, as demonstrated by the mirror tests, qualitative comparison of OCT and optical coherence tomography angiography (OCTA) images, and quantitative comparison of image metrics.
Collapse
|
14
|
Recent Advanced Deep Learning Architectures for Retinal Fluid Segmentation on Optical Coherence Tomography Images. SENSORS 2022; 22:s22083055. [PMID: 35459040 PMCID: PMC9029682 DOI: 10.3390/s22083055] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Revised: 04/10/2022] [Accepted: 04/13/2022] [Indexed: 11/16/2022]
Abstract
With non-invasive and high-resolution properties, optical coherence tomography (OCT) has been widely used as a retinal imaging modality for the effective diagnosis of ophthalmic diseases. The retinal fluid is often segmented by medical experts as a pivotal biomarker to assist in the clinical diagnosis of age-related macular diseases, diabetic macular edema, and retinal vein occlusion. In recent years, the advanced machine learning methods, such as deep learning paradigms, have attracted more and more attention from academia in the retinal fluid segmentation applications. The automatic retinal fluid segmentation based on deep learning can improve the semantic segmentation accuracy and efficiency of macular change analysis, which has potential clinical implications for ophthalmic pathology detection. This article summarizes several different deep learning paradigms reported in the up-to-date literature for the retinal fluid segmentation in OCT images. The deep learning architectures include the backbone of convolutional neural network (CNN), fully convolutional network (FCN), U-shape network (U-Net), and the other hybrid computational methods. The article also provides a survey on the prevailing OCT image datasets used in recent retinal segmentation investigations. The future perspectives and some potential retinal segmentation directions are discussed in the concluding context.
Collapse
|
15
|
Ma D, Kumar M, Khetan V, Sen P, Bhende M, Chen S, Yu TTL, Lee S, Navajas EV, Matsubara JA, Ju MJ, Sarunic MV, Raman R, Beg MF. Clinical explainable differential diagnosis of polypoidal choroidal vasculopathy and age-related macular degeneration using deep learning. Comput Biol Med 2022; 143:105319. [PMID: 35220077 DOI: 10.1016/j.compbiomed.2022.105319] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2021] [Revised: 02/11/2022] [Accepted: 02/11/2022] [Indexed: 02/06/2023]
Abstract
BACKGROUND This study aims to achieve an automatic differential diagnosis between two types of retinal pathologies with similar pathological features - Polypoidal choroidal vasculopathy (PCV) and wet age-related macular degeneration (AMD) from volumetric optical coherence tomography (OCT) images, and identify clinically-relevant pathological features, using an explainable deep-learning-based framework. METHODS This is a retrospective study with data from a cross-sectional cohort. The OCT volume of 73 eyes from 59 patients was included in this study. Disease differentiation was achieved through single-B-scan-based classification followed by a volumetric probability prediction aggregation step. We compared different labeling strategies with and without identifying pathological B-scans within each OCT volume. Clinical interpretability was achieved through normalized aggregation of B-scan-based saliency maps followed by maximum-intensity-projection onto the en face plane. We derived the PCV score from the proposed differential diagnosis framework with different labeling strategies. The en face projection of saliency map was validated with the pathologies identified in Indocyanine green angiography (ICGA). RESULTS Model trained with both labeling strategies achieved similar level differentiation power (>90%), with good correspondence between pathological features detected from the projected en face saliency map and ICGA. CONCLUSIONS This study demonstrated the potential clinical application of non-invasive differential diagnosis using AI-driven OCT-based analysis, with minimal requirement of labeling efforts, along with clinical explainability achieved through automatically detected disease-related pathologies.
Collapse
Affiliation(s)
- Da Ma
- Wake Forest University School of Medicine, Medical Center Boulevard, Winston-Salem, NC, USA; School of Engineering Science, Simon Fraser University, Burnaby, BC, Canada.
| | - Meenakshi Kumar
- Shri Bhagwan Mahavir Vitreoretinal Service, Medical Research Foundation, Sankara Nethralaya, Chennai, India
| | - Vikas Khetan
- Shri Bhagwan Mahavir Vitreoretinal Service, Medical Research Foundation, Sankara Nethralaya, Chennai, India
| | - Parveen Sen
- Shri Bhagwan Mahavir Vitreoretinal Service, Medical Research Foundation, Sankara Nethralaya, Chennai, India
| | - Muna Bhende
- Shri Bhagwan Mahavir Vitreoretinal Service, Medical Research Foundation, Sankara Nethralaya, Chennai, India
| | - Shuo Chen
- School of Engineering Science, Simon Fraser University, Burnaby, BC, Canada
| | - Timothy T L Yu
- School of Engineering Science, Simon Fraser University, Burnaby, BC, Canada
| | - Sieun Lee
- School of Engineering Science, Simon Fraser University, Burnaby, BC, Canada; Mental Health & Clinical Neurosciences, School of Medicine, University of Nottingham, Nottingham, United Kingdom
| | - Eduardo V Navajas
- Department of Ophthalmology & Visual Sciences, The University of British Columbia, Vancouver, BC, Canada; University of British Columbia Vancouver General Hospital, Eye Care Centre, Vancouver, BC, Canada
| | - Joanne A Matsubara
- Department of Ophthalmology & Visual Sciences, The University of British Columbia, Vancouver, BC, Canada; University of British Columbia Vancouver General Hospital, Eye Care Centre, Vancouver, BC, Canada
| | - Myeong Jin Ju
- Department of Ophthalmology & Visual Sciences, The University of British Columbia, Vancouver, BC, Canada; University of British Columbia Vancouver General Hospital, Eye Care Centre, Vancouver, BC, Canada; School of Biomedical Engineering, University of British Columbia, BC, Canada
| | - Marinko V Sarunic
- School of Engineering Science, Simon Fraser University, Burnaby, BC, Canada; Institute of Ophthalmology, University College London, London, UK; Department of Medical Physics and Biomedical Engineering, University College London, United Kingdom
| | - Rajiv Raman
- Shri Bhagwan Mahavir Vitreoretinal Service, Medical Research Foundation, Sankara Nethralaya, Chennai, India.
| | - Mirza Faisal Beg
- School of Engineering Science, Simon Fraser University, Burnaby, BC, Canada.
| |
Collapse
|