1
|
Atik ME, Kocak İ, Sayin N, Bayramoglu SE, Ozyigit A. Integration of Optical Coherence Tomography Images and Real-Life Clinical Data for Deep Learning Modeling: A Unified Approach in Prognostication of Diabetic Macular Edema. JOURNAL OF BIOPHOTONICS 2025; 18:e202400315. [PMID: 39737652 DOI: 10.1002/jbio.202400315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/09/2024] [Revised: 11/06/2024] [Accepted: 12/18/2024] [Indexed: 01/01/2025]
Abstract
The primary ocular effect of diabetes is diabetic retinopathy (DR), which is associated with diabetic microangiopathy. Diabetic macular edema (DME) can cause vision loss for people with DR. For this reason, deciding on the appropriate treatment and follow-up has a critical role in terms of curing the disease. Current artificial intelligence (AI) approaches focus on OCT images and may ignore clinical, laboratory, and demographic information obtained by the specialist. This study presents a novel deep learning (DL) framework for evaluating the visual outcome of the TREX anti-VEGF intravitreal injection regimen. DL models are trained to extract deep features from OCT and ILM topographic images and the obtained deep features are combined with patients' demographic, clinical, and laboratory findings to predict the direction of the treatment process. When the ResNet-18 network is used, the proposed DL framework is able to predict the prognosis status of patients with the highest accuracy.
Collapse
Affiliation(s)
- Muhammed Enes Atik
- Faculty of Civil Engineering, Department of Geomatics Engineering, Istanbul Technical University, Istanbul, Türkiye
| | - İbrahim Kocak
- Medical Faculty, Ophthalmology Department, University of Health Sciences, Istanbul, Türkiye
| | - Nihat Sayin
- Medical Faculty, Ophthalmology Department, University of Health Sciences, Istanbul, Türkiye
| | - Sadik Etka Bayramoglu
- Medical Faculty, Ophthalmology Department, University of Health Sciences, Istanbul, Türkiye
| | - Ahmet Ozyigit
- Medical Faculty, Ophthalmology Department, University of Health Sciences, Istanbul, Türkiye
| |
Collapse
|
2
|
Gencer G, Gencer K. Advanced retinal disease detection from OCT images using a hybrid squeeze and excitation enhanced model. PLoS One 2025; 20:e0318657. [PMID: 39919140 PMCID: PMC11805419 DOI: 10.1371/journal.pone.0318657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2024] [Accepted: 01/20/2025] [Indexed: 02/09/2025] Open
Abstract
BACKGROUND Retinal problems are critical because they can cause severe vision loss if not treated. Traditional methods for diagnosing retinal disorders often rely heavily on manual interpretation of optical coherence tomography (OCT) images, which can be time-consuming and dependent on the expertise of ophthalmologists. This leads to challenges in early diagnosis, especially as retinal diseases like diabetic macular edema (DME), Drusen, and Choroidal neovascularization (CNV) become more prevalent. OCT helps ophthalmologists diagnose patients more accurately by allowing for early detection. This paper offers a hybrid SE (Squeeze-and-Excitation)-Enhanced Hybrid Model for detecting retinal disorders from OCT images, including DME, Drusen, and CNV, using artificial intelligence and deep learning. METHODS The model integrates SE blocks with EfficientNetB0 and Xception architectures, which provide high success in image classification tasks. EfficientNetB0 achieves high accuracy with fewer parameters through model scaling strategies, while Xception offers powerful feature extraction using deep separable convolutions. The combination of these architectures enhances both the efficiency and classification performance of the model, enabling more accurate detection of retinal disorders from OCT images. Additionally, SE blocks increase the representational ability of the network by adaptively recalibrating per-channel feature responses. RESULTS The combined features from EfficientNetB0 and Xception are processed via fully connected layers and categorized using the Softmax algorithm. The methodology was tested on UCSD and Duke's OCT datasets and produced excellent results. The proposed SE-Improved Hybrid Model outperformed the current best-known approaches, with accuracy rates of 99.58% on the UCSD dataset and 99.18% on the Duke dataset. CONCLUSION These findings emphasize the model's ability to effectively diagnose retinal disorders using OCT images and indicate substantial promise for the development of computer-aided diagnostic tools in the field of ophthalmology.
Collapse
Affiliation(s)
- Gülcan Gencer
- Department of Biostatistics and Medical Informatics, Faculty of Medicine, Afyonkarahisar Health Sciences University, Afyonkarahisar, Turkey
| | - Kerem Gencer
- Department of Computer Engineering, Faculty of Engineering, Afyon Kocatepe University, Afyonkarahisar, Turkey
| |
Collapse
|
3
|
Kucukgoz B, Zou K, Murphy DC, Steel DH, Obara B, Fu H. Uncertainty-aware regression model to predict post-operative visual acuity in patients with macular holes. Comput Med Imaging Graph 2025; 119:102461. [PMID: 39615266 DOI: 10.1016/j.compmedimag.2024.102461] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2024] [Revised: 10/01/2024] [Accepted: 11/03/2024] [Indexed: 01/11/2025]
Abstract
Full-thickness macular holes are a relatively common and visually disabling condition with a prevalence of approximately 0.5% in the over-40-year-old age group. If left untreated, the hole typically enlarges, reducing visual acuity (VA) below the definition of blindness in the eye affected. They are now routinely treated with surgery, which can close the hole and improve vision in most cases. The extent of improvement, however, is variable and dependent on the size of the hole and other features which can be discerned in spectral-domain optical coherence tomography imaging, which is now routinely available in eye clinics globally. Artificial intelligence (AI) models have been developed to enable surgical decision-making and have achieved relatively high predictive performance. However, their black-box behavior is opaque to users and uncertainty associated with their predictions is not typically stated, leading to a lack of trust among clinicians and patients. In this paper, we describe an uncertainty-aware regression model (U-ARM) for predicting VA for people undergoing macular hole surgery using preoperative spectral-domain optical coherence tomography images, achieving an MAE of 6.07, RMSE of 9.11 and R2 of 0.47 in internal tests, and an MAE of 6.49, RMSE of 9.49, and R2 of 0.42 in external tests. In addition to predicting VA following surgery, U-ARM displays its associated uncertainty, a p-value of <0.005 in internal and external tests, showing the predictions are not due to random chance. We then qualitatively evaluated the performance of U-ARM. Lastly, we demonstrate out-of-sample data performance, generalizing well to data outside the training distribution, low-quality images, and unseen instances not encountered during training. The results show that U-ARM outperforms commonly used methods in terms of prediction and reliability. U-ARM is thus a promising approach for clinical settings and can improve the reliability of AI models in predicting VA.
Collapse
Affiliation(s)
- Burak Kucukgoz
- School of Computing, Newcastle University, Newcastle upon Tyne, UK; Institute of High-Performance Computing, A*STAR, Singapore, 138632, Singapore
| | - Ke Zou
- Institute of High-Performance Computing, A*STAR, Singapore, 138632, Singapore; National Key Laboratory of Fundamental Science on Synthetic Vision and College of Computer Science, Sichuan University, Chengdu, 610065, China
| | - Declan C Murphy
- Biosciences Institute, Newcastle University, Newcastle upon Tyne, UK
| | - David H Steel
- Biosciences Institute, Newcastle University, Newcastle upon Tyne, UK; Sunderland Eye Infirmary, National Health Service, Sunderland, UK
| | - Boguslaw Obara
- School of Computing, Newcastle University, Newcastle upon Tyne, UK; Biosciences Institute, Newcastle University, Newcastle upon Tyne, UK.
| | - Huazhu Fu
- Institute of High-Performance Computing, A*STAR, Singapore, 138632, Singapore.
| |
Collapse
|
4
|
Oghbaie M, Araújo T, Schmidt-Erfurth U, Bogunović H. VLFATRollout: Fully transformer-based classifier for retinal OCT volumes. Comput Med Imaging Graph 2024; 118:102452. [PMID: 39489098 DOI: 10.1016/j.compmedimag.2024.102452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2024] [Revised: 09/20/2024] [Accepted: 10/12/2024] [Indexed: 11/05/2024]
Abstract
BACKGROUND AND OBJECTIVE Despite the promising capabilities of 3D transformer architectures in video analysis, their application to high-resolution 3D medical volumes encounters several challenges. One major limitation is the high number of 3D patches, which reduces the efficiency of the global self-attention mechanisms of transformers. Additionally, background information can distract vision transformers from focusing on crucial areas of the input image, thereby introducing noise into the final representation. Moreover, the variability in the number of slices per volume complicates the development of models capable of processing input volumes of any resolution while simple solutions like subsampling may risk losing essential diagnostic details. METHODS To address these challenges, we introduce an end-to-end transformer-based framework, variable length feature aggregator transformer rollout (VLFATRollout), to classify volumetric data. The proposed VLFATRollout enjoys several merits. First, the proposed VLFATRollout can effectively mine slice-level fore-background information with the help of transformer's attention matrices. Second, randomization of volume-wise resolution (i.e. the number of slices) during training enhances the learning capacity of the learnable positional embedding (PE) assigned to each volume slice. This technique allows the PEs to generalize across neighboring slices, facilitating the handling of high-resolution volumes at the test time. RESULTS VLFATRollout was thoroughly tested on the retinal optical coherence tomography (OCT) volume classification task, demonstrating a notable average improvement of 5.47% in balanced accuracy over the leading convolutional models for a 5-class diagnostic task. These results emphasize the effectiveness of our framework in enhancing slice-level representation and its adaptability across different volume resolutions, paving the way for advanced transformer applications in medical image analysis. The code is available at https://github.com/marziehoghbaie/VLFATRollout/.
Collapse
Affiliation(s)
- Marzieh Oghbaie
- Christian Doppler Laboratory for Artificial Intelligence in Retina, Department of Ophthalmology and Optometry, Medical University of Vienna, Austria; Institute of Artificial Intelligence, Center for Medical Data Science, Medical University of Vienna, Austria.
| | - Teresa Araújo
- Christian Doppler Laboratory for Artificial Intelligence in Retina, Department of Ophthalmology and Optometry, Medical University of Vienna, Austria; Institute of Artificial Intelligence, Center for Medical Data Science, Medical University of Vienna, Austria
| | | | - Hrvoje Bogunović
- Christian Doppler Laboratory for Artificial Intelligence in Retina, Department of Ophthalmology and Optometry, Medical University of Vienna, Austria; Institute of Artificial Intelligence, Center for Medical Data Science, Medical University of Vienna, Austria
| |
Collapse
|
5
|
Linde G, Rodrigues de Souza W, Chalakkal R, Danesh-Meyer HV, O'Keeffe B, Chiong Hong S. A comparative evaluation of deep learning approaches for ophthalmology. Sci Rep 2024; 14:21829. [PMID: 39294275 PMCID: PMC11410932 DOI: 10.1038/s41598-024-72752-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Accepted: 09/09/2024] [Indexed: 09/20/2024] Open
Abstract
There is a growing number of publicly available ophthalmic imaging datasets and open-source code for Machine Learning algorithms. This allows ophthalmic researchers and practitioners to independently perform various deep-learning tasks. With the advancement in artificial intelligence (AI) and in the field of imaging, the choice of the most appropriate AI architecture for different tasks will vary greatly. The best-performing AI-dataset combination will depend on the specific problem that needs to be solved and the type of data available. The article discusses different machine learning models and deep learning architectures currently used for various ophthalmic imaging modalities and for different machine learning tasks. It also proposes the most appropriate models based on accuracy and other important factors such as training time, the ability to deploy the model on clinical devices/smartphones, heatmaps that enhance the self-explanatory nature of classification decisions, and the ability to train/adapt on small image datasets to determine if further data collection is worthwhile. The article extensively reviews the existing state-of-the-art AI methods focused on useful machine-learning applications for ophthalmology. It estimates their performance and viability through training and evaluating architectures with different public and private image datasets of different modalities, such as full-color retinal images, OCT images, and 3D OCT scans. The article is expected to benefit the readers by enriching their knowledge of artificial intelligence applied to ophthalmology.
Collapse
Affiliation(s)
- Glenn Linde
- oDocs Eye Care Research, Dunedin, New Zealand
| | - Waldir Rodrigues de Souza
- Department of Ophthalmology, Dunedin Hospital, Te Whatu Ora Southern, Dunedin, New Zealand
- Department of Medicine, Ophthalmology Section, University of Otago, Dunedin, New Zealand
| | | | | | | | - Sheng Chiong Hong
- oDocs Eye Care Research, Dunedin, New Zealand
- Department of Ophthalmology, Dunedin Hospital, Te Whatu Ora Southern, Dunedin, New Zealand
| |
Collapse
|
6
|
Li Y, Jie C, Wang J, Zhang W, Wang J, Deng Y, Liu Z, Hou X, Bi X. Global research trends and future directions in diabetic macular edema research: A bibliometric and visualized analysis. Medicine (Baltimore) 2024; 103:e38596. [PMID: 38905408 PMCID: PMC11191902 DOI: 10.1097/md.0000000000038596] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/06/2024] [Accepted: 05/24/2024] [Indexed: 06/23/2024] Open
Abstract
BACKGROUND Diabetic Macular Edema (DME) significantly impairs vision in diabetics, with varied patient responses to current treatments like anti-vascular endothelial growth factor (VEGF) therapy underscoring the necessity for continued research into more effective strategies. This study aims to evaluate global research trends and identify emerging frontiers in DME to guide future research and clinical management. METHODS A qualitative and quantitative analysis of publications related to diabetic macular edema retrieved from the Web of Science Core Collection (WoSCC) between its inception and September 4, 2023, was conducted. Microsoft Excel, CiteSpace, VOSviewer, Bibliometrix Package, and Tableau were used for the bibliometric analysis and visualization. This encompasses an examination of the overall distribution of annual output, major countries, regions, institutions, authors, core journals, co-cited references, and keyword analyses. RESULTS Overall, 5624 publications were analyzed, indicating an increasing trend in DME research. The United States was identified as the leading country in DME research, with the highest h-index of 135 and 91,841 citations. Francesco Bandello emerged as the most prolific author with 97 publications. Neil M. Bressler has the highest h-index and highest total citation count of 46 and 9692, respectively. The journals "Retina - the Journal of Retinal and Vitreous Diseases" and "Ophthalmology" were highlighted as the most prominent in this field. "Retina" leads with 354 publications, a citation count of 11,872, and an h-index of 59. Meanwhile, "Ophthalmology" stands out with the highest overall citation count of 31,558 and the highest h-index of 90. The primary research focal points in diabetic macular edema included "prevalence and risk factors," "pathological mechanisms," "imaging modalities," "treatment strategies," and "clinical trials." Emerging research areas encompassed "deep learning and artificial intelligence," "novel treatment modalities," and "biomarkers." CONCLUSION Our bibliometric analysis delineates the leading role of the United States in DME research. We identified current research hotspots, including epidemiological studies, pathophysiological mechanisms, imaging advancements, and treatment innovations. Emerging trends, such as the integration of artificial intelligence and novel therapeutic approaches, highlight future directions. These insights underscore the importance of collaborative and interdisciplinary approaches in advancing DME research and clinical management.
Collapse
Affiliation(s)
- Yuanyuan Li
- Eye Hospital China Academy of Chinese Medical Sciences, Beijing, China
| | - Chuanhong Jie
- Eye Hospital China Academy of Chinese Medical Sciences, Beijing, China
| | - Jianwei Wang
- Eye Hospital China Academy of Chinese Medical Sciences, Beijing, China
| | - Weiqiong Zhang
- Eye Hospital China Academy of Chinese Medical Sciences, Beijing, China
| | - Jingying Wang
- Eye Hospital China Academy of Chinese Medical Sciences, Beijing, China
| | - Yu Deng
- Eye Hospital China Academy of Chinese Medical Sciences, Beijing, China
| | - Ziqiang Liu
- Eye Hospital China Academy of Chinese Medical Sciences, Beijing, China
| | - Xiaoyu Hou
- Eye Hospital China Academy of Chinese Medical Sciences, Beijing, China
| | - Xuqi Bi
- Eye Hospital China Academy of Chinese Medical Sciences, Beijing, China
| |
Collapse
|
7
|
Alizadeh Eghtedar R, Vard A, Malekahmadi M, Peyman A. A new computer-aided diagnosis tool based on deep learning methods for automatic detection of retinal disorders from OCT images. Int Ophthalmol 2024; 44:110. [PMID: 38396074 DOI: 10.1007/s10792-024-03033-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Accepted: 01/11/2024] [Indexed: 02/25/2024]
Abstract
PURPOSE Early detection of retinal disorders using optical coherence tomography (OCT) images can prevent vision loss. Since manual screening can be time-consuming, tedious, and fallible, we present a reliable computer-aided diagnosis (CAD) software based on deep learning. Also, we made efforts to increase the interpretability of the deep learning methods, overcome their vague and black box nature, and also understand their behavior in the diagnosis. METHODS We propose a novel method to improve the interpretability of the used deep neural network by embedding the rich semantic information of abnormal areas based on the ophthalmologists' interpretations and medical descriptions in the OCT images. Finally, we trained the classification network on a small subset of the online publicly available University of California San Diego (UCSD) dataset with an overall of 29,800 OCT images. RESULTS The experimental results on the 1000 test OCT images show that the proposed method achieves the overall precision, accuracy, sensitivity, and f1-score of 97.6%, 97.6%, 97.6%, and 97.59%, respectively. Also, the heat map images provide a clear region of interest which indicates that the interpretability of the proposed method is increased dramatically. CONCLUSION The proposed software can help ophthalmologists in providing a second opinion to make a decision, and primitive automated diagnoses of retinal diseases and even it can be used as a screening tool, in eye clinics. Also, the improvement of the interpretability of the proposed method causes to increase in the model generalization, and therefore, it will work properly on a wide range of other OCT datasets.
Collapse
Affiliation(s)
- Reza Alizadeh Eghtedar
- Department of Bioelectrics and Biomedical Engineering, School of Advanced Technologies in Medicine, Isfahan University of Medical Sciences, Isfahan, Iran
- Student Research Committee, Isfahan University of Medical Sciences, Isfahan, Iran
| | - Alireza Vard
- Department of Bioelectrics and Biomedical Engineering, School of Advanced Technologies in Medicine, Isfahan University of Medical Sciences, Isfahan, Iran.
- Medical Image & Signal Processing Research Center, School of Advanced Technologies in Medicine, Isfahan University of Medical Sciences, Isfahan, Iran.
| | - Mohammad Malekahmadi
- Department of Ophthalmology, School of Medicine, Isfahan University of Medical Sciences, Isfahan, Iran
- Isfahan Eye Research Center, Department of Ophthalmology, Isfahan University of Medical Sciences, Isfahan, Iran
| | - Alireza Peyman
- Department of Ophthalmology, School of Medicine, Isfahan University of Medical Sciences, Isfahan, Iran
- Isfahan Eye Research Center, Department of Ophthalmology, Isfahan University of Medical Sciences, Isfahan, Iran
| |
Collapse
|
8
|
Talcott KE, Valentim CCS, Perkins SW, Ren H, Manivannan N, Zhang Q, Bagherinia H, Lee G, Yu S, D'Souza N, Jarugula H, Patel K, Singh RP. Automated Detection of Abnormal Optical Coherence Tomography B-scans Using a Deep Learning Artificial Intelligence Neural Network Platform. Int Ophthalmol Clin 2024; 64:115-127. [PMID: 38146885 DOI: 10.1097/iio.0000000000000519] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2023]
|
9
|
Cai L, Wen C, Jiang J, Liang C, Zheng H, Su Y, Chen C. Classification of diabetic maculopathy based on optical coherence tomography images using a Vision Transformer model. BMJ Open Ophthalmol 2023; 8:e001423. [PMID: 38135350 DOI: 10.1136/bmjophth-2023-001423] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Accepted: 11/22/2023] [Indexed: 12/24/2023] Open
Abstract
PURPOSE To develop a Vision Transformer model to detect different stages of diabetic maculopathy (DM) based on optical coherence tomography (OCT) images. METHODS After removing images with poor quality, a total of 3319 OCT images were extracted from the Eye Center of the Renmin Hospital of Wuhan University and randomly split the images into training and validation sets in a 7:3 ratio. All macular cross-sectional scan OCT images were collected retrospectively from the eyes of DM patients from 2016 to 2022. One of the OCT stages of DM, including early diabetic macular oedema (DME), advanced DME, severe DME and atrophic maculopathy, was labelled on the collected images, respectively. A deep learning (DL) model based on Vision Transformer was trained to detect four OCT grading of DM. RESULTS The model proposed in our paper can provide an impressive detection performance. We achieved an accuracy of 82.00%, an F1 score of 83.11%, an area under the receiver operating characteristic curve (AUC) of 0.96. The AUC for the detection of four OCT grading (ie, early DME, advanced DME, severe DME and atrophic maculopathy) was 0.96, 0.95, 0.87 and 0.98, respectively, with an accuracy of 90.87%, 89.96%, 94.42% and 95.13%, respectively, a precision of 88.46%, 80.31%, 89.42% and 87.74%, respectively, a sensitivity of 87.03%, 88.18%, 63.39% and 89.42%, respectively, a specificity of 93.02%, 90.72%, 98.40% and 96.66%, respectively and an F1 score of 87.74%, 84.06%, 88.18% and 88.57%, respectively. CONCLUSION Our DL model based on Vision Transformer demonstrated a relatively high accuracy in the detection of OCT grading of DM, which can help with patients in a preliminary screening to identify groups with serious conditions. These patients need a further test for an accurate diagnosis, and a timely treatment to obtain a good visual prognosis. These results emphasised the potential of artificial intelligence in assisting clinicians in developing therapeutic strategies with DM in the future.
Collapse
Affiliation(s)
- Liwei Cai
- Department of Ophthalmology, Renmin Hospital of Wuhan University, Wuhan, Hubei, China
| | - Chi Wen
- Wuhan University School of Computer Science, Wuhan, Hubei, China
| | - Jingwen Jiang
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangdong Provincial Clinical Research Center for Ocular Diseases, Guangzhou, Guangdong, China
| | - Congbi Liang
- Department of Ophthalmology, Renmin Hospital of Wuhan University, Wuhan, Hubei, China
| | - Hongmei Zheng
- Department of Ophthalmology, Renmin Hospital of Wuhan University, Wuhan, Hubei, China
| | - Yu Su
- Department of Ophthalmology, Renmin Hospital of Wuhan University, Wuhan, Hubei, China
| | - Changzheng Chen
- Department of Ophthalmology, Renmin Hospital of Wuhan University, Wuhan, Hubei, China
| |
Collapse
|
10
|
Araújo T, Aresta G, Schmidt-Erfurth U, Bogunović H. Few-shot out-of-distribution detection for automated screening in retinal OCT images using deep learning. Sci Rep 2023; 13:16231. [PMID: 37758754 PMCID: PMC10533534 DOI: 10.1038/s41598-023-43018-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Accepted: 09/15/2023] [Indexed: 09/29/2023] Open
Abstract
Deep neural networks have been increasingly proposed for automated screening and diagnosis of retinal diseases from optical coherence tomography (OCT), but often provide high-confidence predictions on out-of-distribution (OOD) cases, compromising their clinical usage. With this in mind, we performed an in-depth comparative analysis of the state-of-the-art uncertainty estimation methods for OOD detection in retinal OCT imaging. The analysis was performed within the use-case of automated screening and staging of age-related macular degeneration (AMD), one of the leading causes of blindness worldwide, where we achieved a macro-average area under the curve (AUC) of 0.981 for AMD classification. We focus on a few-shot Outlier Exposure (OE) method and the detection of near-OOD cases that share pathomorphological characteristics with the inlier AMD classes. Scoring the OOD case based on the Cosine distance in the feature space from the penultimate network layer proved to be a robust approach for OOD detection, especially in combination with the OE. Using Cosine distance and only 8 outliers exposed per class, we were able to improve the near-OOD detection performance of the OE with Reject Bucket method by [Formula: see text] 10% compared to without OE, reaching an AUC of 0.937. The Cosine distance served as a robust metric for OOD detection of both known and unknown classes and should thus be considered as an alternative to the reject bucket class probability in OE approaches, especially in the few-shot scenario. The inclusion of these methodologies did not come at the expense of classification performance, and can substantially improve the reliability and trustworthiness of the resulting deep learning-based diagnostic systems in the context of retinal OCT.
Collapse
Affiliation(s)
- Teresa Araújo
- Christian Doppler Laboratory for Artificial Intelligence in Retina, Department of Ophthalmology and Optometry, Medical University of Vienna, Vienna, Austria.
| | - Guilherme Aresta
- Christian Doppler Laboratory for Artificial Intelligence in Retina, Department of Ophthalmology and Optometry, Medical University of Vienna, Vienna, Austria
| | - Ursula Schmidt-Erfurth
- Christian Doppler Laboratory for Artificial Intelligence in Retina, Department of Ophthalmology and Optometry, Medical University of Vienna, Vienna, Austria
| | - Hrvoje Bogunović
- Christian Doppler Laboratory for Artificial Intelligence in Retina, Department of Ophthalmology and Optometry, Medical University of Vienna, Vienna, Austria.
| |
Collapse
|
11
|
Hanson RLW, Airody A, Sivaprasad S, Gale RP. Optical coherence tomography imaging biomarkers associated with neovascular age-related macular degeneration: a systematic review. Eye (Lond) 2023; 37:2438-2453. [PMID: 36526863 PMCID: PMC9871156 DOI: 10.1038/s41433-022-02360-4] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2022] [Revised: 10/13/2022] [Accepted: 12/06/2022] [Indexed: 12/23/2022] Open
Abstract
The aim of this systematic literature review is twofold, (1) detail the impact of retinal biomarkers identifiable via optical coherence tomography (OCT) on disease progression and response to treatment in neovascular age-related macular degeneration (nAMD) and (2) establish which biomarkers are currently identifiable by artificial intelligence (AI) models and the utilisation of this technology. Following the PRISMA guidelines, PubMed was searched for peer-reviewed publications dated between January 2016 and January 2022. POPULATION Patients diagnosed with nAMD with OCT imaging. SETTINGS Comparable settings to NHS hospitals. STUDY DESIGNS Randomised controlled trials, prospective/retrospective cohort studies and review articles. From 228 articles, 130 were full-text reviewed, 50 were removed for falling outside the scope of this review with 10 added from the author's inventory, resulting in the inclusion of 90 articles. From 9 biomarkers identified; intraretinal fluid (IRF), subretinal fluid, pigment epithelial detachment, subretinal hyperreflective material (SHRM), retinal pigmental epithelial (RPE) atrophy, drusen, outer retinal tabulation (ORT), hyperreflective foci (HF) and retinal thickness, 5 are considered pertinent to nAMD disease progression; IRF, SHRM, drusen, ORT and HF. A number of these biomarkers can be classified using current AI models. Significant retinal biomarkers pertinent to disease activity and progression in nAMD are identifiable via OCT; IRF being the most important in terms of the significant impact on visual outcome. Incorporating AI into ophthalmology practice is a promising advancement towards automated and reproducible analyses of OCT data with the ability to diagnose disease and predict future disease conversion. SYSTEMATIC REVIEW REGISTRATION This review has been registered with PROSPERO (registration ID: CRD42021233200).
Collapse
Affiliation(s)
- Rachel L W Hanson
- Academic Unit of Ophthalmology, York and Scarborough Teaching Hospitals NHS Foundation Trust, York, UK
| | - Archana Airody
- Academic Unit of Ophthalmology, York and Scarborough Teaching Hospitals NHS Foundation Trust, York, UK
| | - Sobha Sivaprasad
- Moorfields National Institute of Health Research, Biomedical Research Centre, London, UK
| | - Richard P Gale
- Academic Unit of Ophthalmology, York and Scarborough Teaching Hospitals NHS Foundation Trust, York, UK.
- Hull York Medical School, University of York, York, UK.
- York Biomedical Research Institute, University of York, York, UK.
| |
Collapse
|
12
|
Akinniyi O, Rahman MM, Sandhu HS, El-Baz A, Khalifa F. Multi-Stage Classification of Retinal OCT Using Multi-Scale Ensemble Deep Architecture. Bioengineering (Basel) 2023; 10:823. [PMID: 37508850 PMCID: PMC10376573 DOI: 10.3390/bioengineering10070823] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Revised: 07/01/2023] [Accepted: 07/07/2023] [Indexed: 07/30/2023] Open
Abstract
Accurate noninvasive diagnosis of retinal disorders is required for appropriate treatment or precision medicine. This work proposes a multi-stage classification network built on a multi-scale (pyramidal) feature ensemble architecture for retinal image classification using optical coherence tomography (OCT) images. First, a scale-adaptive neural network is developed to produce multi-scale inputs for feature extraction and ensemble learning. The larger input sizes yield more global information, while the smaller input sizes focus on local details. Then, a feature-rich pyramidal architecture is designed to extract multi-scale features as inputs using DenseNet as the backbone. The advantage of the hierarchical structure is that it allows the system to extract multi-scale, information-rich features for the accurate classification of retinal disorders. Evaluation on two public OCT datasets containing normal and abnormal retinas (e.g., diabetic macular edema (DME), choroidal neovascularization (CNV), age-related macular degeneration (AMD), and Drusen) and comparison against recent networks demonstrates the advantages of the proposed architecture's ability to produce feature-rich classification with average accuracy of 97.78%, 96.83%, and 94.26% for the first (binary) stage, second (three-class) stage, and all-at-once (four-class) classification, respectively, using cross-validation experiments using the first dataset. In the second dataset, our system showed an overall accuracy, sensitivity, and specificity of 99.69%, 99.71%, and 99.87%, respectively. Overall, the tangible advantages of the proposed network for enhanced feature learning might be used in various medical image classification tasks where scale-invariant features are crucial for precise diagnosis.
Collapse
Affiliation(s)
- Oluwatunmise Akinniyi
- Department of Computer Science, School of Computer, Mathematical and Natural Sciences, Morgan State University, Baltimore, MD 21251, USA
| | - Md Mahmudur Rahman
- Department of Computer Science, School of Computer, Mathematical and Natural Sciences, Morgan State University, Baltimore, MD 21251, USA
| | - Harpal Singh Sandhu
- Bioengineering Department, University of Louisville, Louisville, KY 20292, USA
| | - Ayman El-Baz
- Bioengineering Department, University of Louisville, Louisville, KY 20292, USA
| | - Fahmi Khalifa
- Electronics and Communications Engineering Department, Mansoura University, Mansoura 35516, Egypt
- Electrical and Computer Engineering Department, Morgan State University, Baltimore MD 21251, USA
| |
Collapse
|
13
|
Feng H, Chen J, Zhang Z, Lou Y, Zhang S, Yang W. A bibliometric analysis of artificial intelligence applications in macular edema: exploring research hotspots and Frontiers. Front Cell Dev Biol 2023; 11:1174936. [PMID: 37255600 PMCID: PMC10225517 DOI: 10.3389/fcell.2023.1174936] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Accepted: 05/02/2023] [Indexed: 06/01/2023] Open
Abstract
Background: Artificial intelligence (AI) is used in ophthalmological disease screening and diagnostics, medical image diagnostics, and predicting late-disease progression rates. We reviewed all AI publications associated with macular edema (ME) research Between 2011 and 2022 and performed modeling, quantitative, and qualitative investigations. Methods: On 1st February 2023, we screened the Web of Science Core Collection for AI applications related to ME, from which 297 studies were identified and analyzed (2011-2022). We collected information on: publications, institutions, country/region, keywords, journal name, references, and research hotspots. Literature clustering networks and Frontier knowledge bases were investigated using bibliometrix-BiblioShiny, VOSviewer, and CiteSpace bibliometric platforms. We used the R "bibliometrix" package to synopsize our observations, enumerate keywords, visualize collaboration networks between countries/regions, and generate a topic trends plot. VOSviewer was used to examine cooperation between institutions and identify citation relationships between journals. We used CiteSpace to identify clustering keywords over the timeline and identify keywords with the strongest citation bursts. Results: In total, 47 countries published AI studies related to ME; the United States had the highest H-index, thus the greatest influence. China and the United States cooperated most closely between all countries. Also, 613 institutions generated publications - the Medical University of Vienna had the highest number of studies. This publication record and H-index meant the university was the most influential in the ME field. Reference clusters were also categorized into 10 headings: retinal Optical Coherence Tomography (OCT) fluid detection, convolutional network models, deep learning (DL)-based single-shot predictions, retinal vascular disease, diabetic retinopathy (DR), convolutional neural networks (CNNs), automated macular pathology diagnosis, dry age-related macular degeneration (DARMD), class weight, and advanced DL architecture systems. Frontier keywords were represented by diabetic macular edema (DME) (2021-2022). Conclusion: Our review of the AI-related ME literature was comprehensive, systematic, and objective, and identified future trends and current hotspots. With increased DL outputs, the ME research focus has gradually shifted from manual ME examinations to automatic ME detection and associated symptoms. In this review, we present a comprehensive and dynamic overview of AI in ME and identify future research areas.
Collapse
Affiliation(s)
- Haiwen Feng
- Department of Software Engineering, School of Software, Shenyang University of Technology, Shenyang, Liaoning, China
| | - Jiaqi Chen
- Department of Software Engineering, School of Software, Shenyang University of Technology, Shenyang, Liaoning, China
| | - Zhichang Zhang
- Department of Computer, School of Intelligent Medicine, China Medical University, Shenyang, Liaoning, China
| | - Yan Lou
- Department of Computer, School of Intelligent Medicine, China Medical University, Shenyang, Liaoning, China
| | - Shaochong Zhang
- Shenzhen Eye Institute, Shenzhen Eye Hospital, Jinan University, Shenzhen, China
| | - Weihua Yang
- Shenzhen Eye Institute, Shenzhen Eye Hospital, Jinan University, Shenzhen, China
| |
Collapse
|
14
|
Manikandan S, Raman R, Rajalakshmi R, Tamilselvi S, Surya RJ. Deep learning-based detection of diabetic macular edema using optical coherence tomography and fundus images: A meta-analysis. Indian J Ophthalmol 2023; 71:1783-1796. [PMID: 37203031 PMCID: PMC10391382 DOI: 10.4103/ijo.ijo_2614_22] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/20/2023] Open
Abstract
Diabetic macular edema (DME) is an important cause of visual impairment in the working-age group. Deep learning methods have been developed to detect DME from two-dimensional retinal images and also from optical coherence tomography (OCT) images. The performances of these algorithms vary and often create doubt regarding their clinical utility. In resource-constrained health-care systems, these algorithms may play an important role in determining referral and treatment. The survey provides a diversified overview of macular edema detection methods, including cutting-edge research, with the objective of providing pertinent information to research groups, health-care professionals, and diabetic patients about the applications of deep learning in retinal image detection and classification process. Electronic databases such as PubMed, IEEE Explore, BioMed, and Google Scholar were searched from inception to March 31, 2022, and the reference lists of published papers were also searched. The study followed the preferred reporting items for systematic review and meta-analysis (PRISMA) reporting guidelines. Examination of various deep learning models and their exhibition regarding precision, epochs, their capacity to detect anomalies for less training data, concepts, and challenges that go deep into the applications were analyzed. A total of 53 studies were included that evaluated the performance of deep learning models in a total of 1,414,169°CT volumes, B-scans, patients, and 472,328 fundus images. The overall area under the receiver operating characteristic curve (AUROC) was 0.9727. The overall sensitivity for detecting DME using OCT images was 96% (95% confidence interval [CI]: 0.94-0.98). The overall sensitivity for detecting DME using fundus images was 94% (95% CI: 0.90-0.96).
Collapse
Affiliation(s)
- Suchetha Manikandan
- Professor & Deputy Director, Centre for Healthcare Advancement, Innovation ! Research, Vellore Institute of Technology, Chennai, Tamil Nadu, India
| | - Rajiv Raman
- Senior Consultant, Shri Bhagwan Mahavir Vitreoretinal Services, Sankara Nethralaya, Chennai, Tamil Nadu, India
| | - Ramachandran Rajalakshmi
- Head Medical Retina, Dr. Mohan's Diabetes Specialties Centre and Madras Diabetes Research Foundation, Chennai, Tamil Nadu, India
| | - S Tamilselvi
- Junior Research Fellow, Centre for Healthcare Advancement, Innovation & Research, Vellore Institute of Technology, Chennai, Tamil Nadu, India
| | - R Janani Surya
- Research Associate, Vision Research Foundation, Chennai, Tamil Nadu, India
| |
Collapse
|
15
|
Kayadibi İ, Güraksın GE. An Explainable Fully Dense Fusion Neural Network with Deep Support Vector Machine for Retinal Disease Determination. INT J COMPUT INT SYS 2023. [DOI: 10.1007/s44196-023-00210-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/11/2023] Open
Abstract
AbstractRetinal issues are crucial because they result in visual loss. Early diagnosis can aid physicians in initiating treatment and preventing visual loss. Optical coherence tomography (OCT), which portrays retinal morphology cross-sectionally and noninvasively, is used to identify retinal abnormalities. The process of analyzing OCT images, on the other hand, takes time. This study has proposed a hybrid approach based on a fully dense fusion neural network (FD-CNN) and dual preprocessing to identify retinal diseases, such as choroidal neovascularization, diabetic macular edema, drusen from OCT images. A dual preprocessing methodology, in other words, a hybrid speckle reduction filter was initially used to diminish speckle noise present in OCT images. Secondly, the FD-CNN architecture was trained, and the features obtained from this architecture were extracted. Then Deep Support Vector Machine (D-SVM) and Deep K-Nearest Neighbor (D-KNN) classifiers were proposed to reclassify those features and tested on University of California San Diego (UCSD) and Duke OCT datasets. D-SVM demonstrated the best performance in both datasets. D-SVM achieved 99.60% accuracy, 99.60% sensitivity, 99.87% specificity, 99.60% precision and 99.60% F1 score in the UCSD dataset. It achieved 97.50% accuracy, 97.64% sensitivity, 98.91% specificity, 96.61% precision, and 97.03% F1 score in Duke dataset. Additionally, the results were compared to state-of-the-art works on the both datasets. The D-SVM was demonstrated to be an efficient and productive strategy for improving the robustness of automatic retinal disease classification. Also, in this study, it is shown that the unboxing of how AI systems' black-box choices is made by generating heat maps using the local interpretable model-agnostic explanation method, which is an explainable artificial intelligence (XAI) technique. Heat maps, in particular, may contribute to the development of more stable deep learning-based systems, as well as enhancing the confidence in the diagnosis of retinal disease in the analysis of OCT image for ophthalmologists.
Collapse
|
16
|
He J, Wang J, Han Z, Ma J, Wang C, Qi M. An interpretable transformer network for the retinal disease classification using optical coherence tomography. Sci Rep 2023; 13:3637. [PMID: 36869160 PMCID: PMC9984386 DOI: 10.1038/s41598-023-30853-z] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Accepted: 03/02/2023] [Indexed: 03/05/2023] Open
Abstract
Retinal illnesses such as age-related macular degeneration and diabetic macular edema will lead to irreversible blindness. With optical coherence tomography (OCT), doctors are able to see cross-sections of the retinal layers and provide patients with a diagnosis. Manual reading of OCT images is time-consuming, labor-intensive and even error-prone. Computer-aided diagnosis algorithms improve efficiency by automatically analyzing and diagnosing retinal OCT images. However, the accuracy and interpretability of these algorithms can be further improved through effective feature extraction, loss optimization and visualization analysis. In this paper, we propose an interpretable Swin-Poly Transformer network for performing automatically retinal OCT image classification. By shifting the window partition, the Swin-Poly Transformer constructs connections between neighboring non-overlapping windows in the previous layer and thus has the flexibility to model multi-scale features. Besides, the Swin-Poly Transformer modifies the importance of polynomial bases to refine cross entropy for better retinal OCT image classification. In addition, the proposed method also provides confidence score maps, assisting medical practitioners to understand the models' decision-making process. Experiments in OCT2017 and OCT-C8 reveal that the proposed method outperforms both the convolutional neural network approach and ViT, with an accuracy of 99.80% and an AUC of 99.99%.
Collapse
Affiliation(s)
- Jingzhen He
- Department of Radiology, Qilu Hospital of Shandong University, Jinan, 250012, China.
| | - Junxia Wang
- School of Information Science and Engineering, Shandong Normal University, Jinan, 250358, China
| | - Zeyu Han
- School of Mathematics and Statistics, Shandong University, Weihai, 264209, China
| | - Jun Ma
- School of Cyber Science and Engineering, Southeast University, Nanjing, 211189, China
| | - Chongjing Wang
- China Academy of Information and Communications Technology, Beijing, 100191, China
| | - Meng Qi
- School of Information Science and Engineering, Shandong Normal University, Jinan, 250358, China.
| |
Collapse
|
17
|
Tampu IE, Eklund A, Johansson K, Gimm O, Haj-Hosseini N. Diseased thyroid tissue classification in OCT images using deep learning: Towards surgical decision support. JOURNAL OF BIOPHOTONICS 2023; 16:e202200227. [PMID: 36203247 DOI: 10.1002/jbio.202200227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Revised: 09/15/2022] [Accepted: 09/16/2022] [Indexed: 06/16/2023]
Abstract
Intraoperative guidance tools for thyroid surgery based on optical coherence tomography (OCT) could aid distinguish between normal and diseased tissue. However, OCT images are difficult to interpret, thus, real-time automatic analysis could support the clinical decision-making. In this study, several deep learning models were investigated for thyroid disease classification on 2D and 3D OCT data obtained from ex vivo specimens of 22 patients undergoing surgery and diagnosed with several thyroid pathologies. Additionally, two open-access datasets were used to evaluate the custom models. On the thyroid dataset, the best performance was achieved by the 3D vision transformer model with a Matthew's correlation coefficient (MCC) of 0.79 (accuracy = 0.90) for the normal-versus-abnormal classification. On the open-access datasets, the custom models achieved the best performance (MCC > 0.88, accuracy > 0.96). Results obtained for the normal-versus-abnormal classification suggest OCT, complemented with deep learning-based analysis, as a tool for real-time automatic diseased tissue identification in thyroid surgery.
Collapse
Affiliation(s)
- Iulian Emil Tampu
- Department of Biomedical Engineering, Linköping University, Linköping, Sweden
- Center for Medical Image Science and Visualization, Linköping University, Linköping, Sweden
| | - Anders Eklund
- Department of Biomedical Engineering, Linköping University, Linköping, Sweden
- Center for Medical Image Science and Visualization, Linköping University, Linköping, Sweden
- Division of Statistics & Machine Learning, Department of Computer and Information Science, Linköping University, Linköping, Sweden
| | - Kenth Johansson
- Department of Surgery, Västervik Hospital, Västervik, Sweden
- Department of Surgery, Örebro University Hospital, Örebro, Sweden
| | - Oliver Gimm
- Department of Surgery, Linköping University Hospital, Linköping, Sweden
- Department of Biomedical and Clinical Sciences, Linköping University, Linköping, Sweden
| | - Neda Haj-Hosseini
- Department of Biomedical Engineering, Linköping University, Linköping, Sweden
- Center for Medical Image Science and Visualization, Linköping University, Linköping, Sweden
| |
Collapse
|
18
|
Pavithra K, Kumar P, Geetha M, Bhandary SV. Computer aided diagnosis of diabetic macular edema in retinal fundus and OCT images: A review. Biocybern Biomed Eng 2023. [DOI: 10.1016/j.bbe.2022.12.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
|
19
|
Celebi ARC, Bulut E, Sezer A. Artificial intelligence based detection of age-related macular degeneration using optical coherence tomography with unique image preprocessing. Eur J Ophthalmol 2023; 33:65-73. [PMID: 35469472 DOI: 10.1177/11206721221096294] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
PURPOSE The aim of the study is to improve the accuracy of age related macular degeneration (AMD) disease in its earlier phases with proposed Capsule Network (CapsNet) architecture trained on speckle noise reduced spectral domain optical coherence tomography (SD-OCT) images based on an optimized Bayesian non-local mean (OBNLM) filter augmentation techniques. METHODS A total of 726 local SD-OCT images were collected and labelled as 159 drusen, 145 dry AMD, 156 wet AMD and 266 normal. Region of interest (ROI) was identified. Speckle noise in SD-OCT images were reduced based on OBNLM filter. The processed images were fed to proposed CapsNet architecture to clasify SD-OCT images. Accuracy rates were calculated in both public and local dataset. RESULTS Accuracy rate of local SD-OCT image dataset classification was achieved to a value of 96.39% after performing data augmentation and speckle noise reduction with OBNLM. The performance of proposed CapsNet was also evaluated on the public Kaggle dataset under the same processing procedures and the accuracy rate was calculated as 98.07%. The sensitivity and specificity rates were 96.72% and 99.98%, respectively. CONCLUSIONS The classification success of proposed CapsNet may be improved with robust pre-processing steps like; determination of ROI and denoised SD-OCT images based on OBNLM. These impactful image preprocessing steps yielded higher accuracy rates for determining different types of AMD including its precursor lesion on the both local and public dataset with proposed CapsNet architecture.
Collapse
Affiliation(s)
- Ali Riza Cenk Celebi
- Department of Ophthalmology, Acibadem University School of Medicine, Istanbul, Turkey
| | - Erkan Bulut
- Department of Ophthalmology, Beylikduzu Public Hospital, Istanbul, Turkey
| | - Aysun Sezer
- United'Informatique et d'Ingenierie des Systemes, 52849ENSTA-ParisTech, Universite de Paris-Saclay, Villefranche Sur Mer, Provence-Alpes-Côte d'azur, France
| |
Collapse
|
20
|
Li HY, Wang DX, Dong L, Wei WB. Deep learning algorithms for detection of diabetic macular edema in OCT images: A systematic review and meta-analysis. Eur J Ophthalmol 2023; 33:278-290. [PMID: 35473414 DOI: 10.1177/11206721221094786] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
PURPOSE Artificial intelligence (AI) can detect diabetic macular edema (DME) from optical coherence tomography (OCT) images. We aimed to evaluate the performance of deep learning neural networks in DME detection. METHODS Embase, Pubmed, the Cochrane Library, and IEEE Xplore were searched up to August 14, 2021. We included studies using deep learning algorithms to detect DME from OCT images. Two reviewers extracted the data independently, and the Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) tool was applied to assess the risk of bias. The study is reported according to Preferred Reporting Items for a Systematic Review and Meta-analysis of Diagnostic Test Accuracy Studies (PRISMA-DTA). RESULTS Ninteen studies involving 41005 subjects were included. The pooled sensitivity and specificity were 96.0% (95% confidence interval (CI): 93.9% to 97.3%) and 99.3% (95% CI: 98.2% to 99.7%), respectively. Subgroup analyses found that data set selection, sample size of training set and the choice of OCT devices contributed to the heterogeneity (all P < 0.05). While there was no association between the diagnostic accuracy and transfer learning adoption or image management (all P > 0.05). CONCLUSIONS Deep learning methods, particularly the convolutional neural networks (CNNs) could effectively detect clinically significant DME, which can provide referral suggestions to the patients.
Collapse
Affiliation(s)
- He-Yan Li
- Beijing Tongren Eye Center, Beijing Key Laboratory of Intraocular Tumor Diagnosis and Treatment, Beijing Ophthalmology & Visual Sciences Key Lab, Medical Artificial Intelligence Research and Verification Key Laboratory of the Ministry of Industry and Information Technology, 117902Beijing Tongren Hospital, Capital Medical University, Beijing, China
| | - Dai-Xi Wang
- 12517Capital Medical University, Beijing, China
| | - Li Dong
- Beijing Tongren Eye Center, Beijing Key Laboratory of Intraocular Tumor Diagnosis and Treatment, Beijing Ophthalmology & Visual Sciences Key Lab, Medical Artificial Intelligence Research and Verification Key Laboratory of the Ministry of Industry and Information Technology, 117902Beijing Tongren Hospital, Capital Medical University, Beijing, China
| | - Wen-Bin Wei
- Beijing Tongren Eye Center, Beijing Key Laboratory of Intraocular Tumor Diagnosis and Treatment, Beijing Ophthalmology & Visual Sciences Key Lab, Medical Artificial Intelligence Research and Verification Key Laboratory of the Ministry of Industry and Information Technology, 117902Beijing Tongren Hospital, Capital Medical University, Beijing, China
| |
Collapse
|
21
|
Potapenko I, Thiesson B, Kristensen M, Hajari JN, Ilginis T, Fuchs J, Hamann S, la Cour M. Automated artificial intelligence-based system for clinical follow-up of patients with age-related macular degeneration. Acta Ophthalmol 2022; 100:927-936. [PMID: 35322564 PMCID: PMC9790353 DOI: 10.1111/aos.15133] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2021] [Revised: 02/05/2022] [Accepted: 03/12/2022] [Indexed: 12/30/2022]
Abstract
PURPOSE In this study, we investigate the potential of a novel artificial intelligence-based system for autonomous follow-up of patients treated for neovascular age-related macular degeneration (AMD). METHODS A temporal deep learning model was trained on a data set of 84 489 optical coherence tomography scans from AMD patients to recognize disease activity, and its performance was compared with a published non-temporal model trained on the same data (Acta Ophthalmol, 2021). An autonomous follow-up system was created by augmenting the AI model with deterministic logic to suggest treatment according to the observe-and-plan regimen. To validate the AI-based system, a data set comprising clinical decisions and imaging data from 200 follow-up consultations was collected prospectively. In each case, both the autonomous AI decision and original clinical decision were compared with an expert panel consensus. RESULTS The temporal AI model proved superior at detecting disease activity compared with the model without temporal input (area under the curve 0.900 (95% CI 0.894-0.906) and 0.857 (95% CI 0.846-0.867) respectively). The AI-based follow-up system could make an autonomous decision in 73% of the cases, 91.8% of which were in agreement with expert consensus. This was on par with the 87.7% agreement rate between decisions made in the clinic and expert consensus (p = 0.33). CONCLUSIONS The proposed autonomous follow-up system was shown to be safe and compliant with expert consensus on par with clinical practice. The system could in the future ease the pressure on public ophthalmology services from an increasing number of AMD patients.
Collapse
Affiliation(s)
- Ivan Potapenko
- Department of OphthalmologyRigshospitaletCopenhagenDenmark,Faculty of Health and Medical SciencesUniversity of CopenhagenCopenhagenDenmark
| | - Bo Thiesson
- Enversion A/SAarhusDenmark,Department of EngineeringAarhus UniversityAarhusDenmark
| | | | | | - Tomas Ilginis
- Department of OphthalmologyRigshospitaletCopenhagenDenmark
| | - Josefine Fuchs
- Department of OphthalmologyRigshospitaletCopenhagenDenmark
| | - Steffen Hamann
- Department of OphthalmologyRigshospitaletCopenhagenDenmark,Faculty of Health and Medical SciencesUniversity of CopenhagenCopenhagenDenmark
| | - Morten la Cour
- Department of OphthalmologyRigshospitaletCopenhagenDenmark,Faculty of Health and Medical SciencesUniversity of CopenhagenCopenhagenDenmark
| |
Collapse
|
22
|
Sun LC, Pao SI, Huang KH, Wei CY, Lin KF, Chen PN. Generative adversarial network-based deep learning approach in classification of retinal conditions with optical coherence tomography images. Graefes Arch Clin Exp Ophthalmol 2022; 261:1399-1412. [PMID: 36441228 DOI: 10.1007/s00417-022-05919-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2022] [Revised: 09/21/2022] [Accepted: 11/22/2022] [Indexed: 11/29/2022] Open
Abstract
PURPOSE To determine whether a deep learning approach using generative adversarial networks (GANs) is beneficial for the classification of retinal conditions with Optical coherence tomography (OCT) images. METHODS Our study utilized 84,452 retinal OCT images obtained from a publicly available dataset (Kermany Dataset). Employing GAN, synthetic OCT images are produced to balance classes of retinal disorders. A deep learning classification model is constructed using pretrained deep neural networks (DNNs), and outcomes are evaluated using 2082 images collected from patients who visited the Department of Ophthalmology and the Department of Endocrinology and Metabolism at the Tri-service General Hospital in Taipei from January 2017 to December 2021. RESULTS The highest classification accuracies accomplished by deep learning machines trained on the unbalanced dataset for its training set, validation set, fivefold cross validation (CV), Kermany test set, and TSGH test set were 97.73%, 96.51%, 97.14%, 99.59%, and 81.03%, respectively. The highest classification accuracies accomplished by deep learning machines trained on the synthesis-balanced dataset for its training set, validation set, fivefold CV, Kermany test set, and TSGH test set were 98.60%, 98.41%, 98.52%, 99.38%, and 84.92%, respectively. In comparing the highest accuracies, deep learning machines trained on the synthesis-balanced dataset outperformed deep learning machines trained on the unbalanced dataset for the training set, validation set, fivefold CV, and TSGH test set. CONCLUSIONS Overall, deep learning machines on a synthesis-balanced dataset demonstrated to be advantageous over deep learning machines trained on an unbalanced dataset for the classification of retinal conditions.
Collapse
Affiliation(s)
- Ling-Chun Sun
- School of Medicine, National Defense Medical Center, Taipei, Taiwan
| | - Shu-I Pao
- Department of Ophthalmology, Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan
| | - Ke-Hao Huang
- Department of Ophthalmology, Song-Shan Branch of Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan
| | - Chih-Yuan Wei
- Graduate Institute of Life Sciences, National Defense Medical Center, Taipei, Taiwan
| | - Ke-Feng Lin
- Medical Informatics Office, Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan
- School of Public Health, National Defense Medical Center, Taipei, Taiwan
| | - Ping-Nan Chen
- Department of Biomedical Engineering, National Defense Medical Center, No.161, Sec.6, Minchiuan E. Rd., Neihu Dist, Taipei, 11490, Taiwan.
| |
Collapse
|
23
|
Tampu IE, Eklund A, Haj-Hosseini N. Inflation of test accuracy due to data leakage in deep learning-based classification of OCT images. Sci Data 2022; 9:580. [PMID: 36138025 PMCID: PMC9500039 DOI: 10.1038/s41597-022-01618-6] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Accepted: 08/09/2022] [Indexed: 11/10/2022] Open
Abstract
In the application of deep learning on optical coherence tomography (OCT) data, it is common to train classification networks using 2D images originating from volumetric data. Given the micrometer resolution of OCT systems, consecutive images are often very similar in both visible structures and noise. Thus, an inappropriate data split can result in overlap between the training and testing sets, with a large portion of the literature overlooking this aspect. In this study, the effect of improper dataset splitting on model evaluation is demonstrated for three classification tasks using three OCT open-access datasets extensively used, Kermany's and Srinivasan's ophthalmology datasets, and AIIMS breast tissue dataset. Results show that the classification performance is inflated by 0.07 up to 0.43 in terms of Matthews Correlation Coefficient (accuracy: 5% to 30%) for models tested on datasets with improper splitting, highlighting the considerable effect of dataset handling on model evaluation. This study intends to raise awareness on the importance of dataset splitting given the increased research interest in implementing deep learning on OCT data.
Collapse
Affiliation(s)
- Iulian Emil Tampu
- Department of Biomedical Engineering, Linköping University, 581 85, Linköping, Sweden. .,Center for Medical Image Science and Visualization, Linköping University, 581 85, Linköping, Sweden.
| | - Anders Eklund
- Department of Biomedical Engineering, Linköping University, 581 85, Linköping, Sweden.,Center for Medical Image Science and Visualization, Linköping University, 581 85, Linköping, Sweden.,Division of Statistics & Machine Learning, Department of Computer and Information Science, Linköping University, 581 83, Linköping, Sweden
| | - Neda Haj-Hosseini
- Department of Biomedical Engineering, Linköping University, 581 85, Linköping, Sweden.,Center for Medical Image Science and Visualization, Linköping University, 581 85, Linköping, Sweden
| |
Collapse
|
24
|
Padilla-Pantoja FD, Sanchez YD, Quijano-Nieto BA, Perdomo OJ, Gonzalez FA. Etiology of Macular Edema Defined by Deep Learning in Optical Coherence Tomography Scans. Transl Vis Sci Technol 2022; 11:29. [PMID: 36169966 PMCID: PMC9526369 DOI: 10.1167/tvst.11.9.29] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Purpose To develop an automated method based on deep learning (DL) to classify macular edema (ME) from the evaluation of optical coherence tomography (OCT) scans. Methods A total of 4230 images were obtained from data repositories of patients attended in an ophthalmology clinic in Colombia and two free open-access databases. They were annotated with four biomarkers (BMs) as intraretinal fluid, subretinal fluid, hyperreflective foci/tissue, and drusen. Then the scans were labeled as control or ocular disease among diabetic macular edema (DME), neovascular age-related macular degeneration (nAMD), and retinal vein occlusion (RVO) by two expert ophthalmologists. Our method was developed by following four consecutive phases: segmentation of BMs, the combination of BMs, feature extraction with convolutional neural networks to achieve binary classification for each disease, and, finally, multiclass classification of diseases and control images. Results The accuracy of our model for nAMD was 97%, and for DME, RVO, and control were 94%, 93%, and 93%, respectively. Area under curve values were 0.99, 0.98, 0.96, and 0.97, respectively. The mean Cohen's kappa coefficient for the multiclass classification task was 0.84. Conclusions The proposed DL model may identify OCT scans as normal and ME. In addition, it may classify its cause among three major exudative retinal diseases with high accuracy and reliability. Translational Relevance Our DL approach can optimize the efficiency and timeliness of appropriate etiological diagnosis of ME, thus improving patient access and clinical decision making. It could be useful in places with a shortage of specialists and for readers that evaluate OCT scans remotely.
Collapse
Affiliation(s)
| | - Yeison D Sanchez
- MindLab Research Group, Universidad Nacional de Colombia, Bogotá, Colombia
| | | | - Oscar J Perdomo
- School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
| | - Fabio A Gonzalez
- MindLab Research Group, Universidad Nacional de Colombia, Bogotá, Colombia
| |
Collapse
|
25
|
Almasi R, Vafaei A, Kazeminasab E, Rabbani H. Automatic detection of microaneurysms in optical coherence tomography images of retina using convolutional neural networks and transfer learning. Sci Rep 2022; 12:13975. [PMID: 35978087 PMCID: PMC9385621 DOI: 10.1038/s41598-022-18206-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Accepted: 08/08/2022] [Indexed: 11/15/2022] Open
Abstract
Microaneurysms (MAs) are pathognomonic signs that help clinicians to detect diabetic retinopathy (DR) in the early stages. Automatic detection of MA in retinal images is an active area of research due to its application in screening processes for DR which is one of the main reasons of blindness amongst the working-age population. The focus of these works is on the automatic detection of MAs in en face retinal images like fundus color and Fluorescein Angiography (FA). On the other hand, detection of MAs from Optical Coherence Tomography (OCT) images has 2 main advantages: first, OCT is a non-invasive imaging technique that does not require injection, therefore is safer. Secondly, because of the proven application of OCT in detection of Age-Related Macular Degeneration, Diabetic Macular Edema, and normal cases, thanks to detecting MAs in OCT, extensive information is obtained by using this imaging technique. In this research, the concentration is on the diagnosis of MAs using deep learning in the OCT images which represent in-depth structure of retinal layers. To this end, OCT B-scans should be divided into strips and MA patterns should be searched in the resulted strips. Since we need a dataset comprising OCT image strips with suitable labels and such large labelled datasets are not yet available, we have created it. For this purpose, an exact registration method is utilized to align OCT images with FA photographs. Then, with the help of corresponding FA images, OCT image strips are created from OCT B-scans in four labels, namely MA, normal, abnormal, and vessel. Once the dataset of image strips is prepared, a stacked generalization (stacking) ensemble of four fine-tuned, pre-trained convolutional neural networks is trained to classify the strips of OCT images into the mentioned classes. FA images are used once to create OCT strips for training process and they are no longer needed for subsequent steps. Once the stacking ensemble model is obtained, it will be used to classify the OCT strips in the test process. The results demonstrate that the proposed framework classifies overall OCT image strips and OCT strips containing MAs with accuracy scores of 0.982 and 0.987, respectively.
Collapse
Affiliation(s)
- Ramin Almasi
- Department of Computer Engineering, Faculty of Engineering, University of Isfahan, Isfahan, Iran
| | - Abbas Vafaei
- Department of Computer Engineering, Faculty of Engineering, University of Isfahan, Isfahan, Iran.
| | - Elahe Kazeminasab
- Department of Computer Engineering, Faculty of Engineering, University of Isfahan, Isfahan, Iran
| | - Hossein Rabbani
- Medical Image and Signal Processing Research Center, Isfahan University of Medical Sciences, Isfahan, Iran.
| |
Collapse
|
26
|
Ma Z, Xie Q, Xie P, Fan F, Gao X, Zhu J. HCTNet: A Hybrid ConvNet-Transformer Network for Retinal Optical Coherence Tomography Image Classification. BIOSENSORS 2022; 12:542. [PMID: 35884345 PMCID: PMC9313149 DOI: 10.3390/bios12070542] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Revised: 07/13/2022] [Accepted: 07/18/2022] [Indexed: 06/15/2023]
Abstract
Automatic and accurate optical coherence tomography (OCT) image classification is of great significance to computer-assisted diagnosis of retinal disease. In this study, we propose a hybrid ConvNet-Transformer network (HCTNet) and verify the feasibility of a Transformer-based method for retinal OCT image classification. The HCTNet first utilizes a low-level feature extraction module based on the residual dense block to generate low-level features for facilitating the network training. Then, two parallel branches of the Transformer and the ConvNet are designed to exploit the global and local context of the OCT images. Finally, a feature fusion module based on an adaptive re-weighting mechanism is employed to combine the extracted global and local features for predicting the category of OCT images in the testing datasets. The HCTNet combines the advantage of the convolutional neural network in extracting local features and the advantage of the vision Transformer in establishing long-range dependencies. A verification on two public retinal OCT datasets shows that our HCTNet method achieves an overall accuracy of 91.56% and 86.18%, respectively, outperforming the pure ViT and several ConvNet-based classification methods.
Collapse
Affiliation(s)
- Zongqing Ma
- Key Laboratory of the Ministry of Education for Optoelectronic Measurement Technology and Instrument, Beijing Information Science and Technology University, Beijing 100192, China; (Z.M.); (Q.X.); (F.F.)
- Beijing Laboratory of Biomedical Testing Technology and Instruments, Beijing Information Science and Technology University, Beijing 100192, China
| | - Qiaoxue Xie
- Key Laboratory of the Ministry of Education for Optoelectronic Measurement Technology and Instrument, Beijing Information Science and Technology University, Beijing 100192, China; (Z.M.); (Q.X.); (F.F.)
- Beijing Laboratory of Biomedical Testing Technology and Instruments, Beijing Information Science and Technology University, Beijing 100192, China
| | - Pinxue Xie
- Beijing Anzhen Hospital, Capital Medical University, Beijing 100029, China; (P.X.); (X.G.)
| | - Fan Fan
- Key Laboratory of the Ministry of Education for Optoelectronic Measurement Technology and Instrument, Beijing Information Science and Technology University, Beijing 100192, China; (Z.M.); (Q.X.); (F.F.)
- Beijing Laboratory of Biomedical Testing Technology and Instruments, Beijing Information Science and Technology University, Beijing 100192, China
| | - Xinxiao Gao
- Beijing Anzhen Hospital, Capital Medical University, Beijing 100029, China; (P.X.); (X.G.)
| | - Jiang Zhu
- Key Laboratory of the Ministry of Education for Optoelectronic Measurement Technology and Instrument, Beijing Information Science and Technology University, Beijing 100192, China; (Z.M.); (Q.X.); (F.F.)
- Beijing Laboratory of Biomedical Testing Technology and Instruments, Beijing Information Science and Technology University, Beijing 100192, China
| |
Collapse
|
27
|
Ara RK, Matiolański A, Dziech A, Baran R, Domin P, Wieczorkiewicz A. Fast and Efficient Method for Optical Coherence Tomography Images Classification Using Deep Learning Approach. SENSORS (BASEL, SWITZERLAND) 2022; 22:4675. [PMID: 35808169 PMCID: PMC9269557 DOI: 10.3390/s22134675] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Revised: 06/13/2022] [Accepted: 06/16/2022] [Indexed: 05/18/2023]
Abstract
The use of optical coherence tomography (OCT) in medical diagnostics is now common. The growing amount of data leads us to propose an automated support system for medical staff. The key part of the system is a classification algorithm developed with modern machine learning techniques. The main contribution is to present a new approach for the classification of eye diseases using the convolutional neural network model. The research concerns the classification of patients on the basis of OCT B-scans into one of four categories: Diabetic Macular Edema (DME), Choroidal Neovascularization (CNV), Drusen, and Normal. Those categories are available in a publicly available dataset of above 84,000 images utilized for the research. After several tested architectures, our 5-layer neural network gives us a promising result. We compared them to the other available solutions which proves the high quality of our algorithm. Equally important for the application of the algorithm is the computational time, which is reduced by the limited size of the model. In addition, the article presents a detailed method of image data augmentation and its impact on the classification results. The results of the experiments were also presented for several derived models of convolutional network architectures that were tested during the research. Improving processes in medical treatment is important. The algorithm cannot replace a doctor but, for example, can be a valuable tool for speeding up the process of diagnosis during screening tests.
Collapse
Affiliation(s)
- Rouhollah Kian Ara
- Institute of Telecommunications, AGH University of Science and Technology, 30-059 Krakow, Poland; (R.K.A.); (A.D.)
| | - Andrzej Matiolański
- Institute of Telecommunications, AGH University of Science and Technology, 30-059 Krakow, Poland; (R.K.A.); (A.D.)
| | - Andrzej Dziech
- Institute of Telecommunications, AGH University of Science and Technology, 30-059 Krakow, Poland; (R.K.A.); (A.D.)
| | - Remigiusz Baran
- Faculty of Electrical Engineering, Automatic Control and Computer Science, Kielce University of Technology, 25-314 Kielce, Poland;
| | - Paweł Domin
- Consultronix S.A., 32-083 Balice, Poland; (P.D.); (A.W.)
| | | |
Collapse
|
28
|
Toğaçar M, Ergen B, Tümen V. Use of dominant activations obtained by processing OCT images with the CNNs and slime mold method in retinal disease detection. Biocybern Biomed Eng 2022. [DOI: 10.1016/j.bbe.2022.05.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
29
|
Gour N, Tanveer M, Khanna P. Challenges for ocular disease identification in the era of artificial intelligence. Neural Comput Appl 2022. [DOI: 10.1007/s00521-021-06770-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
30
|
Roychowdhury S, Tang KS, Ashok M, Sanka A. SISE-PC: Semi-supervised Image Subsampling for Explainable Pathology Classification. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2021; 2021:2806-2809. [PMID: 34891832 DOI: 10.1109/embc46164.2021.9630682] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Although automated pathology classification using deep learning (DL) has proved to be predictively efficient, DL methods are found to be data and compute cost intensive. In this work, we aim to reduce DL training costs by pre-training a ResNet feature extractor using SimCLR contrastive loss for latent encoding of OCT images. We propose a novel active learning framework that identifies a minimal sub-sampled dataset containing the most uncertain OCT image samples using label propagation on the SimCLR latent encodings. The pre-trained ResNet model is then fine-tuned with the labelled minimal sub-sampled data and the underlying pathological sites are visually explained. Our framework identifies upto 2% of OCT images to be most uncertain that need prioritized specialist attention and that can fine-tune a ResNet model to achieve upto 97% classification accuracy. The proposed method can be extended to other medical images to minimize prediction costs.
Collapse
|
31
|
Chakroborty S, Gupta M, Devishamani CS, Patel K, Ankit C, Ganesh Babu TC, Raman R. Narrative review of artificial intelligence in diabetic macular edema: Diagnosis and predicting treatment response using optical coherence tomography. Indian J Ophthalmol 2021; 69:2999-3008. [PMID: 34708735 PMCID: PMC8725112 DOI: 10.4103/ijo.ijo_1482_21] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open
Abstract
Diabetic macular edema (DME), being a frequent manifestation of DR, disrupts the retinal symmetry. This event is particularly triggered by vascular endothelial growth factors (VEGF). Intravitreal injections of anti-VEGFs have been the most practiced treatment but an expensive option. A major challenge associated with this treatment is determining an optimal treatment regimen and differentiating patients who do not respond to anti-VEGF. As it has a significant burden for both the patient and the health care providers if the patient is not responding, any clinically acceptable method to predict the treatment outcomes holds huge value in the efficient management of DME. In such situations, artificial intelligence (AI) or machine learning (ML)-based algorithms come useful as they can analyze past clinical details of the patients and help clinicians to predict the patient's response to an anti-VEGF agent. The work presented here attempts to review the literature that is available from the peer research community to discuss solutions provided by AI/ML methodologies to tackle challenges in DME management. Lastly, a possibility for using two different types of data has been proposed, which is believed to be the key differentiators as compared to the similar and recent contributions from the peer research community.
Collapse
Affiliation(s)
- Sandipan Chakroborty
- Center for Applications and Research in India, Carl Zeiss India (Bangalore) Pvt. Ltd., Bengaluru, Karnataka, India
| | - Mansi Gupta
- Center for Applications and Research in India, Carl Zeiss India (Bangalore) Pvt. Ltd., Bengaluru, Karnataka, India
| | | | - Krunalkumar Patel
- Center for Applications and Research in India, Carl Zeiss India (Bangalore) Pvt. Ltd., Bengaluru, Karnataka, India
| | - Chavan Ankit
- Center for Applications and Research in India, Carl Zeiss India (Bangalore) Pvt. Ltd., Bengaluru, Karnataka, India
| | - T C Ganesh Babu
- Center for Applications and Research in India, Carl Zeiss India (Bangalore) Pvt. Ltd., Bengaluru, Karnataka, India
| | - Rajiv Raman
- Shri Bhagwan Mahavir Vitreoretinal Services, Sankara Nethralaya, Chennai, Tamil Nadu, India
| |
Collapse
|
32
|
Tang F, Wang X, Ran AR, Chan CKM, Ho M, Yip W, Young AL, Lok J, Szeto S, Chan J, Yip F, Wong R, Tang Z, Yang D, Ng DS, Chen LJ, Brelén M, Chu V, Li K, Lai THT, Tan GS, Ting DSW, Huang H, Chen H, Ma JH, Tang S, Leng T, Kakavand S, Mannil SS, Chang RT, Liew G, Gopinath B, Lai TYY, Pang CP, Scanlon PH, Wong TY, Tham CC, Chen H, Heng PA, Cheung CY. A Multitask Deep-Learning System to Classify Diabetic Macular Edema for Different Optical Coherence Tomography Devices: A Multicenter Analysis. Diabetes Care 2021; 44:2078-2088. [PMID: 34315698 PMCID: PMC8740924 DOI: 10.2337/dc20-3064] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/17/2020] [Accepted: 05/29/2021] [Indexed: 02/03/2023]
Abstract
OBJECTIVE Diabetic macular edema (DME) is the primary cause of vision loss among individuals with diabetes mellitus (DM). We developed, validated, and tested a deep learning (DL) system for classifying DME using images from three common commercially available optical coherence tomography (OCT) devices. RESEARCH DESIGN AND METHODS We trained and validated two versions of a multitask convolution neural network (CNN) to classify DME (center-involved DME [CI-DME], non-CI-DME, or absence of DME) using three-dimensional (3D) volume scans and 2D B-scans, respectively. For both 3D and 2D CNNs, we used the residual network (ResNet) as the backbone. For the 3D CNN, we used a 3D version of ResNet-34 with the last fully connected layer removed as the feature extraction module. A total of 73,746 OCT images were used for training and primary validation. External testing was performed using 26,981 images across seven independent data sets from Singapore, Hong Kong, the U.S., China, and Australia. RESULTS In classifying the presence or absence of DME, the DL system achieved area under the receiver operating characteristic curves (AUROCs) of 0.937 (95% CI 0.920-0.954), 0.958 (0.930-0.977), and 0.965 (0.948-0.977) for the primary data set obtained from CIRRUS, SPECTRALIS, and Triton OCTs, respectively, in addition to AUROCs >0.906 for the external data sets. For further classification of the CI-DME and non-CI-DME subgroups, the AUROCs were 0.968 (0.940-0.995), 0.951 (0.898-0.982), and 0.975 (0.947-0.991) for the primary data set and >0.894 for the external data sets. CONCLUSIONS We demonstrated excellent performance with a DL system for the automated classification of DME, highlighting its potential as a promising second-line screening tool for patients with DM, which may potentially create a more effective triaging mechanism to eye clinics.
Collapse
Affiliation(s)
- Fangyao Tang
- Department of Ophthalmology and Visual Sciences, The Chinese University of Hong Kong, Hong Kong SAR
| | - Xi Wang
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong SAR
| | - An-Ran Ran
- Department of Ophthalmology and Visual Sciences, The Chinese University of Hong Kong, Hong Kong SAR
| | | | - Mary Ho
- Department of Ophthalmology and Visual Sciences, Prince of Wales Hospital, Hong Kong SAR.,Alice Ho Miu Ling Nethersole Hospital, Hong Kong SAR
| | - Wilson Yip
- Department of Ophthalmology and Visual Sciences, Prince of Wales Hospital, Hong Kong SAR.,Alice Ho Miu Ling Nethersole Hospital, Hong Kong SAR
| | - Alvin L Young
- Department of Ophthalmology and Visual Sciences, Prince of Wales Hospital, Hong Kong SAR.,Alice Ho Miu Ling Nethersole Hospital, Hong Kong SAR
| | - Jerry Lok
- Hong Kong Eye Hospital, Hong Kong SAR
| | | | | | - Fanny Yip
- Hong Kong Eye Hospital, Hong Kong SAR
| | | | - Ziqi Tang
- Department of Ophthalmology and Visual Sciences, The Chinese University of Hong Kong, Hong Kong SAR
| | - Dawei Yang
- Department of Ophthalmology and Visual Sciences, The Chinese University of Hong Kong, Hong Kong SAR
| | - Danny S Ng
- Department of Ophthalmology and Visual Sciences, The Chinese University of Hong Kong, Hong Kong SAR.,Hong Kong Eye Hospital, Hong Kong SAR
| | - Li Jia Chen
- Department of Ophthalmology and Visual Sciences, The Chinese University of Hong Kong, Hong Kong SAR.,Department of Ophthalmology and Visual Sciences, Prince of Wales Hospital, Hong Kong SAR
| | - Marten Brelén
- Department of Ophthalmology and Visual Sciences, The Chinese University of Hong Kong, Hong Kong SAR
| | - Victor Chu
- United Christian Hospital, Hong Kong SAR
| | - Kenneth Li
- United Christian Hospital, Hong Kong SAR
| | | | - Gavin S Tan
- Singapore Eye Research Institute, Singapore National Eye Centre, Singapore
| | - Daniel S W Ting
- Singapore Eye Research Institute, Singapore National Eye Centre, Singapore
| | - Haifan Huang
- Joint Shantou International Eye Center, Shantou University and The Chinese University of Hong Kong, Shantou, Guangdong, China
| | - Haoyu Chen
- Joint Shantou International Eye Center, Shantou University and The Chinese University of Hong Kong, Shantou, Guangdong, China
| | - Jacey Hongjie Ma
- Aier School of Ophthalmology, Central South University, Changsha, Hunan, China
| | - Shibo Tang
- Aier School of Ophthalmology, Central South University, Changsha, Hunan, China
| | - Theodore Leng
- Byers Eye Institute at Stanford, Stanford University School of Medicine, Palo Alto, CA
| | - Schahrouz Kakavand
- Byers Eye Institute at Stanford, Stanford University School of Medicine, Palo Alto, CA
| | - Suria S Mannil
- Byers Eye Institute at Stanford, Stanford University School of Medicine, Palo Alto, CA
| | - Robert T Chang
- Byers Eye Institute at Stanford, Stanford University School of Medicine, Palo Alto, CA
| | - Gerald Liew
- Department of Ophthalmology, Westmead Institute for Medical Research, University of Sydney, Sydney, NSW, Australia
| | - Bamini Gopinath
- Department of Ophthalmology, Westmead Institute for Medical Research, University of Sydney, Sydney, NSW, Australia.,Macquarie University Hearing, Department of Linguistics, Macquarie University, Sydney, New South Wales, Australia
| | - Timothy Y Y Lai
- Department of Ophthalmology and Visual Sciences, The Chinese University of Hong Kong, Hong Kong SAR
| | - Chi Pui Pang
- Department of Ophthalmology and Visual Sciences, The Chinese University of Hong Kong, Hong Kong SAR
| | - Peter H Scanlon
- Gloucestershire Retinal Research Group, Gloucestershire Hospitals NHS Foundation Trust, Gloucester, U.K
| | - Tien Yin Wong
- Singapore Eye Research Institute, Singapore National Eye Centre, Singapore
| | - Clement C Tham
- Department of Ophthalmology and Visual Sciences, The Chinese University of Hong Kong, Hong Kong SAR.,Hong Kong Eye Hospital, Hong Kong SAR.,Department of Ophthalmology and Visual Sciences, Prince of Wales Hospital, Hong Kong SAR
| | - Hao Chen
- Department of Computer Science and Engineering, The Hong Kong University of Sciences and Technology, Hong Kong SAR
| | - Pheng-Ann Heng
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong SAR
| | - Carol Y Cheung
- Department of Ophthalmology and Visual Sciences, The Chinese University of Hong Kong, Hong Kong SAR
| |
Collapse
|
33
|
Ran A, Cheung CY. Deep Learning-Based Optical Coherence Tomography and Optical Coherence Tomography Angiography Image Analysis: An Updated Summary. Asia Pac J Ophthalmol (Phila) 2021; 10:253-260. [PMID: 34383717 DOI: 10.1097/apo.0000000000000405] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
ABSTRACT Deep learning (DL) is a subset of artificial intelligence based on deep neural networks. It has made remarkable breakthroughs in medical imaging, particularly for image classification and pattern recognition. In ophthalmology, there are rising interests in applying DL methods to analyze optical coherence tomography (OCT) and optical coherence tomography angiography (OCTA) images. Studies showed that OCT and OCTA image evaluation by DL algorithms achieved good performance for disease detection, prognosis prediction, and image quality control, suggesting that the incorporation of DL technology could potentially enhance the accuracy of disease evaluation and the efficiency of clinical workflow. However, substantial issues, such as small training sample size, data preprocessing standardization, model robustness, results explanation, and performance cross-validation, are yet to be tackled before deploying these DL models in real-time clinics. This review summarized recent studies on DL-based image analysis models for OCT and OCTA images and discussed the potential challenges of clinical deployment and future research directions.
Collapse
Affiliation(s)
- Anran Ran
- Department of Ophthalmology and Visual Sciences, the Chinese University of Hong Kong, Hong Kong SAR
| | | |
Collapse
|