1
|
Li F, Wang D, Yang Z, Zhang Y, Jiang J, Liu X, Kong K, Zhou F, Tham CC, Medeiros F, Han Y, Grzybowski A, Zangwill LM, Lam DSC, Zhang X. The AI revolution in glaucoma: Bridging challenges with opportunities. Prog Retin Eye Res 2024; 103:101291. [PMID: 39186968 DOI: 10.1016/j.preteyeres.2024.101291] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Revised: 08/19/2024] [Accepted: 08/19/2024] [Indexed: 08/28/2024]
Abstract
Recent advancements in artificial intelligence (AI) herald transformative potentials for reshaping glaucoma clinical management, improving screening efficacy, sharpening diagnosis precision, and refining the detection of disease progression. However, incorporating AI into healthcare usages faces significant hurdles in terms of developing algorithms and putting them into practice. When creating algorithms, issues arise due to the intensive effort required to label data, inconsistent diagnostic standards, and a lack of thorough testing, which often limits the algorithms' widespread applicability. Additionally, the "black box" nature of AI algorithms may cause doctors to be wary or skeptical. When it comes to using these tools, challenges include dealing with lower-quality images in real situations and the systems' limited ability to work well with diverse ethnic groups and different diagnostic equipment. Looking ahead, new developments aim to protect data privacy through federated learning paradigms, improving algorithm generalizability by diversifying input data modalities, and augmenting datasets with synthetic imagery. The integration of smartphones appears promising for using AI algorithms in both clinical and non-clinical settings. Furthermore, bringing in large language models (LLMs) to act as interactive tool in medicine may signify a significant change in how healthcare will be delivered in the future. By navigating through these challenges and leveraging on these as opportunities, the field of glaucoma AI will not only have improved algorithmic accuracy and optimized data integration but also a paradigmatic shift towards enhanced clinical acceptance and a transformative improvement in glaucoma care.
Collapse
Affiliation(s)
- Fei Li
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangdong Provincial Clinical Research Center for Ocular Diseases, Guangzhou 510060, China.
| | - Deming Wang
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangdong Provincial Clinical Research Center for Ocular Diseases, Guangzhou 510060, China.
| | - Zefeng Yang
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangdong Provincial Clinical Research Center for Ocular Diseases, Guangzhou 510060, China.
| | - Yinhang Zhang
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangdong Provincial Clinical Research Center for Ocular Diseases, Guangzhou 510060, China.
| | - Jiaxuan Jiang
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangdong Provincial Clinical Research Center for Ocular Diseases, Guangzhou 510060, China.
| | - Xiaoyi Liu
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangdong Provincial Clinical Research Center for Ocular Diseases, Guangzhou 510060, China.
| | - Kangjie Kong
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangdong Provincial Clinical Research Center for Ocular Diseases, Guangzhou 510060, China.
| | - Fengqi Zhou
- Ophthalmology, Mayo Clinic Health System, Eau Claire, WI, USA.
| | - Clement C Tham
- Department of Ophthalmology and Visual Sciences, The Chinese University of Hong Kong, Hong Kong SAR, China.
| | - Felipe Medeiros
- Bascom Palmer Eye Institute, University of Miami Miller School of Medicine, Miami, FL, USA.
| | - Ying Han
- University of California, San Francisco, Department of Ophthalmology, San Francisco, CA, USA; The Francis I. Proctor Foundation for Research in Ophthalmology, University of California, San Francisco, CA, USA.
| | - Andrzej Grzybowski
- Institute for Research in Ophthalmology, Foundation for Ophthalmology Development, Poznan, Poland.
| | - Linda M Zangwill
- Hamilton Glaucoma Center, Viterbi Family Department of Ophthalmology, Shiley Eye Institute, University of California, San Diego, CA, USA.
| | - Dennis S C Lam
- The International Eye Research Institute of the Chinese University of Hong Kong (Shenzhen), Shenzhen, China; The C-MER Dennis Lam & Partners Eye Center, C-MER International Eye Care Group, Hong Kong, China.
| | - Xiulan Zhang
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangdong Provincial Clinical Research Center for Ocular Diseases, Guangzhou 510060, China.
| |
Collapse
|
2
|
Chaurasia AK, Greatbatch CJ, Han X, Gharahkhani P, Mackey DA, MacGregor S, Craig JE, Hewitt AW. Highly Accurate and Precise Automated Cup-to-Disc Ratio Quantification for Glaucoma Screening. OPHTHALMOLOGY SCIENCE 2024; 4:100540. [PMID: 39051045 PMCID: PMC11268341 DOI: 10.1016/j.xops.2024.100540] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 03/26/2024] [Accepted: 04/22/2024] [Indexed: 07/27/2024]
Abstract
Objective An enlarged cup-to-disc ratio (CDR) is a hallmark of glaucomatous optic neuropathy. Manual assessment of the CDR may be less accurate and more time-consuming than automated methods. Here, we sought to develop and validate a deep learning-based algorithm to automatically determine the CDR from fundus images. Design Algorithm development for estimating CDR using fundus data from a population-based observational study. Participants A total of 181 768 fundus images from the United Kingdom Biobank (UKBB), Drishti_GS, and EyePACS. Methods FastAI and PyTorch libraries were used to train a convolutional neural network-based model on fundus images from the UKBB. Models were constructed to determine image gradability (classification analysis) as well as to estimate CDR (regression analysis). The best-performing model was then validated for use in glaucoma screening using a multiethnic dataset from EyePACS and Drishti_GS. Main Outcome Measures The area under the receiver operating characteristic curve and coefficient of determination. Results Our gradability model vgg19_batch normalization (bn) achieved an accuracy of 97.13% on a validation set of 16 045 images, with 99.26% precision and area under the receiver operating characteristic curve of 96.56%. Using regression analysis, our best-performing model (trained on the vgg19_bn architecture) attained a coefficient of determination of 0.8514 (95% confidence interval [CI]: 0.8459-0.8568), while the mean squared error was 0.0050 (95% CI: 0.0048-0.0051) and mean absolute error was 0.0551 (95% CI: 0.0543-0.0559) on a validation set of 12 183 images for determining CDR. The regression point was converted into classification metrics using a tolerance of 0.2 for 20 classes; the classification metrics achieved an accuracy of 99.20%. The EyePACS dataset (98 172 healthy, 3270 glaucoma) was then used to externally validate the model for glaucoma classification, with an accuracy, sensitivity, and specificity of 82.49%, 72.02%, and 82.83%, respectively. Conclusions Our models were precise in determining image gradability and estimating CDR. Although our artificial intelligence-derived CDR estimates achieve high accuracy, the CDR threshold for glaucoma screening will vary depending on other clinical parameters. Financial Disclosures Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.
Collapse
Affiliation(s)
- Abadh K. Chaurasia
- Menzies Institute for Medical Research, University of Tasmania, Hobart, Australia
| | - Connor J. Greatbatch
- Menzies Institute for Medical Research, University of Tasmania, Hobart, Australia
| | - Xikun Han
- QIMR Berghofer Medical Research Institute, Brisbane, Australia
- School of Medicine, University of Queensland, Brisbane, Australia
| | - Puya Gharahkhani
- QIMR Berghofer Medical Research Institute, Brisbane, Australia
- School of Medicine, University of Queensland, Brisbane, Australia
- Faculty of Health, School of Biomedical Sciences, Queensland University of Technology, Brisbane, Queensland, Australia
| | - David A. Mackey
- Lions Eye Institute, Centre for Vision Sciences, University of Western Australia, Nedlands, Australia
| | - Stuart MacGregor
- QIMR Berghofer Medical Research Institute, Brisbane, Australia
- School of Medicine, University of Queensland, Brisbane, Australia
| | - Jamie E. Craig
- Department of Ophthalmology, Flinders University, Flinders Medical Centre, Bedford Park, Australia
| | - Alex W. Hewitt
- Menzies Institute for Medical Research, University of Tasmania, Hobart, Australia
- Centre for Eye Research Australia, University of Melbourne, Melbourne, Australia
| |
Collapse
|
3
|
Correia Barão R, Hemelings R, Abegão Pinto L, Pazos M, Stalmans I. Artificial intelligence for glaucoma: state of the art and future perspectives. Curr Opin Ophthalmol 2024; 35:104-110. [PMID: 38018807 DOI: 10.1097/icu.0000000000001022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2023]
Abstract
PURPOSE OF REVIEW To address the current role of artificial intelligence (AI) in the field of glaucoma. RECENT FINDINGS Current deep learning (DL) models concerning glaucoma diagnosis have shown consistently improving diagnostic capabilities, primarily based on color fundus photography and optical coherence tomography, but also with multimodal strategies. Recent models have also suggested that AI may be helpful in detecting and estimating visual field progression from different input data. Moreover, with the emergence of newer DL architectures and synthetic data, challenges such as model generalizability and explainability have begun to be tackled. SUMMARY While some challenges remain before AI is routinely employed in clinical practice, new research has expanded the range in which it can be used in the context of glaucoma management and underlined the relevance of this research avenue.
Collapse
Affiliation(s)
- Rafael Correia Barão
- Department of Ophthalmology, Hospital de Santa Maria, CHULN
- Visual Sciences Study Center, Faculty of Medicine, University of Lisbon, Lisbon, Portugal
| | - Ruben Hemelings
- Department of Neurosciences, Research Group Ophthalmology, KU Leuven, Leuven, Belgium
- Singapore Eye Research Institute, Singapore National Eye Centre
- SERI-NTU Advanced Ocular Engineering (STANCE) Programme, Singapore, Singapore
| | - Luís Abegão Pinto
- Department of Ophthalmology, Hospital de Santa Maria, CHULN
- Visual Sciences Study Center, Faculty of Medicine, University of Lisbon, Lisbon, Portugal
| | - Marta Pazos
- Institute of Ophthalmology, Hospital Clínic de Barcelona, Barcelona, Spain
| | - Ingeborg Stalmans
- Department of Neurosciences, Research Group Ophthalmology, KU Leuven, Leuven, Belgium
- Department of Ophthalmology, University Hospitals UZ Leuven, Leuven, Belgium
| |
Collapse
|
4
|
Guo X, Li R, Lu X, Zhang X, Wu Q, Tian Q, Guo B, Tang G, Xu J, Feng J, Zhao L, Ling S, Dong Z, Song J, Bi H. Quantization of Optic Disc Characteristics in Young Adults Based on Artificial Intelligence. Curr Eye Res 2023; 48:1068-1077. [PMID: 37555317 DOI: 10.1080/02713683.2023.2244700] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Revised: 07/21/2023] [Accepted: 07/28/2023] [Indexed: 08/10/2023]
Abstract
PURPOSE This study aimed to automatically and quantitatively analyse the characteristics of the optic disc by applying artificial intelligence (AI) to fundus images. METHODS A total of 1084 undergraduates were recruited in this cross-sectional study. The optic disc area, cup-to-disc ratio (C/D), optic disc tilt, and the area, width, and height of peripapillary atrophy (PPA) were automatically and quantitatively detected using AI. Based on axial length (AL), participants were divided into five groups: Group 1 (AL ≤ 23 mm); Group 2 (23 mm < AL≤ 24 mm); Group 3 (24 mm < AL≤ 25 mm); Group 4 (25 mm < AL< 26 mm) and Group 5 (AL ≥ 26 mm). Relationships between ocular parameters and optic disc characteristics were analysed. RESULT A total of 999 undergraduates were included in the analysis. The prevalence of optic disc tilting and PPA were 47.1% and 92.5%, respectively, and increased with the severity of myopia. The mean optic disc area, PPA area, C/D, and optic disc tilt ratio were 1.97 ± 0.46 mm2, 0.84 ± 0.59 mm2, 0.18 ± 0.07, and 0.81 ± 0.08, respectively. In Group 5, the average optic disc area (1.84 ± 0.41 mm2) and optic disc tilt ratio (0.79 ± 0.08) were significantly smaller and the PPA area (1.12 ± 0.61 mm2) was significantly larger than those in the other groups. AL was negatively correlated with optic disc area and optic disc tilt ratio (r=-0.271, -0.219; both p < 0.001) and positively correlated with PPA area, width, and height (r = 0.421, 0.426, 0.345; all p < 0.01). A greater AL (β = 0.284, p < 0.01) and a smaller optic disc tilt ratio (β=-0.516, p < 0.01) were related to a larger PPA area. CONCLUSION The characteristics of the optic disc can be feasibly and efficiently extracted using AI. The quantization of the optic disc might provide new indicators for clinicians to evaluate the degree of myopia.
Collapse
Affiliation(s)
- Xiaoxiao Guo
- Shandong University of Traditional Chinese Medicine, Jinan, P. R. China
| | - Runkuan Li
- Shandong University of Traditional Chinese Medicine, Jinan, P. R. China
| | - Xiuzhen Lu
- Affiliated Eye Hospital of Shandong, University of Traditional Chinese Medicine, Jinan, P. R. China
- Shandong Provincial Key Laboratory of Integrated Traditional Chinese and Western Medicine for Prevention and Therapy of Ocular Diseases, Key Laboratory of Integrated Traditional Chinese and Western Medicine for Prevention and Therapy of Ocular Diseases, Universities of Shandong; Eye Institute of Shandong University of Traditional Chinese Medicine, Jinan, P. R. China
| | - Xiuyan Zhang
- Affiliated Eye Hospital of Shandong, University of Traditional Chinese Medicine, Jinan, P. R. China
- Shandong Provincial Key Laboratory of Integrated Traditional Chinese and Western Medicine for Prevention and Therapy of Ocular Diseases, Key Laboratory of Integrated Traditional Chinese and Western Medicine for Prevention and Therapy of Ocular Diseases, Universities of Shandong; Eye Institute of Shandong University of Traditional Chinese Medicine, Jinan, P. R. China
| | - Qiuxin Wu
- Affiliated Eye Hospital of Shandong, University of Traditional Chinese Medicine, Jinan, P. R. China
- Shandong Provincial Key Laboratory of Integrated Traditional Chinese and Western Medicine for Prevention and Therapy of Ocular Diseases, Key Laboratory of Integrated Traditional Chinese and Western Medicine for Prevention and Therapy of Ocular Diseases, Universities of Shandong; Eye Institute of Shandong University of Traditional Chinese Medicine, Jinan, P. R. China
| | - Qingmei Tian
- Affiliated Eye Hospital of Shandong, University of Traditional Chinese Medicine, Jinan, P. R. China
- Shandong Provincial Key Laboratory of Integrated Traditional Chinese and Western Medicine for Prevention and Therapy of Ocular Diseases, Key Laboratory of Integrated Traditional Chinese and Western Medicine for Prevention and Therapy of Ocular Diseases, Universities of Shandong; Eye Institute of Shandong University of Traditional Chinese Medicine, Jinan, P. R. China
| | - Bin Guo
- Shandong University of Traditional Chinese Medicine, Jinan, P. R. China
- Shandong Provincial Key Laboratory of Integrated Traditional Chinese and Western Medicine for Prevention and Therapy of Ocular Diseases, Key Laboratory of Integrated Traditional Chinese and Western Medicine for Prevention and Therapy of Ocular Diseases, Universities of Shandong; Eye Institute of Shandong University of Traditional Chinese Medicine, Jinan, P. R. China
| | - Guodong Tang
- Affiliated Eye Hospital of Shandong, University of Traditional Chinese Medicine, Jinan, P. R. China
- Shandong Provincial Key Laboratory of Integrated Traditional Chinese and Western Medicine for Prevention and Therapy of Ocular Diseases, Key Laboratory of Integrated Traditional Chinese and Western Medicine for Prevention and Therapy of Ocular Diseases, Universities of Shandong; Eye Institute of Shandong University of Traditional Chinese Medicine, Jinan, P. R. China
| | - Jing Xu
- Affiliated Eye Hospital of Shandong, University of Traditional Chinese Medicine, Jinan, P. R. China
- Shandong Provincial Key Laboratory of Integrated Traditional Chinese and Western Medicine for Prevention and Therapy of Ocular Diseases, Key Laboratory of Integrated Traditional Chinese and Western Medicine for Prevention and Therapy of Ocular Diseases, Universities of Shandong; Eye Institute of Shandong University of Traditional Chinese Medicine, Jinan, P. R. China
| | - Jiaojiao Feng
- Shandong University of Traditional Chinese Medicine, Jinan, P. R. China
| | - Lili Zhao
- Shandong University of Traditional Chinese Medicine, Jinan, P. R. China
| | - Saiguang Ling
- EVision Technology (Beijing) Co., Ltd, Beijing, P. R. China
| | - Zhou Dong
- EVision Technology (Beijing) Co., Ltd, Beijing, P. R. China
| | - Jike Song
- Shandong University of Traditional Chinese Medicine, Jinan, P. R. China
- Shandong Provincial Key Laboratory of Integrated Traditional Chinese and Western Medicine for Prevention and Therapy of Ocular Diseases, Key Laboratory of Integrated Traditional Chinese and Western Medicine for Prevention and Therapy of Ocular Diseases, Universities of Shandong; Eye Institute of Shandong University of Traditional Chinese Medicine, Jinan, P. R. China
| | - Hongsheng Bi
- Affiliated Eye Hospital of Shandong, University of Traditional Chinese Medicine, Jinan, P. R. China
- Shandong Provincial Key Laboratory of Integrated Traditional Chinese and Western Medicine for Prevention and Therapy of Ocular Diseases, Key Laboratory of Integrated Traditional Chinese and Western Medicine for Prevention and Therapy of Ocular Diseases, Universities of Shandong; Eye Institute of Shandong University of Traditional Chinese Medicine, Jinan, P. R. China
| |
Collapse
|
5
|
Yi Y, Jiang Y, Zhou B, Zhang N, Dai J, Huang X, Zeng Q, Zhou W. C2FTFNet: Coarse-to-fine transformer network for joint optic disc and cup segmentation. Comput Biol Med 2023; 164:107215. [PMID: 37481947 DOI: 10.1016/j.compbiomed.2023.107215] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Revised: 06/07/2023] [Accepted: 06/25/2023] [Indexed: 07/25/2023]
Abstract
Glaucoma is a leading cause of worldwide blindness and visual impairment, making early screening and diagnosis is crucial to prevent vision loss. Cup-to-Disk Ratio (CDR) evaluation serves as a widely applied approach for effective glaucoma screening. At present, deep learning methods have exhibited outstanding performance in optic disk (OD) and optic cup (OC) segmentation and maturely deployed in CAD system. However, owning to the complexity of clinical data, these techniques could be constrained. Therefore, an original Coarse-to-Fine Transformer Network (C2FTFNet) is designed to segment OD and OC jointly , which is composed of two stages. In the coarse stage, to eliminate the effects of irrelevant organization on the segmented OC and OD regions, we employ U-Net and Circular Hough Transform (CHT) to segment the Region of Interest (ROI) of OD. Meanwhile, a TransUnet3+ model is designed in the fine segmentation stage to extract the OC and OD regions more accurately from ROI. In this model, to alleviate the limitation of the receptive field caused by traditional convolutional methods, a Transformer module is introduced into the backbone to capture long-distance dependent features for retaining more global information. Then, a Multi-Scale Dense Skip Connection (MSDC) module is proposed to fuse the low-level and high-level features from different layers for reducing the semantic gap among different level features. Comprehensive experiments conducted on DRIONS-DB, Drishti-GS, and REFUGE datasets validate the superior effectiveness of the proposed C2FTFNet compared to existing state-of-the-art approaches.
Collapse
Affiliation(s)
- Yugen Yi
- School of Software, Jiangxi Normal University, Nanchang, 330022, China; Jiangxi Provincial Engineering Research Center of Blockchain Data Security and Governance, Nanchang, 330022, China
| | - Yan Jiang
- School of Software, Jiangxi Normal University, Nanchang, 330022, China
| | - Bin Zhou
- School of Software, Jiangxi Normal University, Nanchang, 330022, China
| | - Ningyi Zhang
- School of Software, Jiangxi Normal University, Nanchang, 330022, China
| | - Jiangyan Dai
- School of Computer Engineering, Weifang University, 261061, China.
| | - Xin Huang
- School of Software, Jiangxi Normal University, Nanchang, 330022, China
| | - Qinqin Zeng
- Department of Ophthalmology, The Second Affiliated Hospital of Nanchang University, Nanchang, 330006, China
| | - Wei Zhou
- College of Computer Science, Shenyang Aerospace University, Shenyang, 110136, China.
| |
Collapse
|
6
|
Alfonso-Francia G, Pedraza-Ortega JC, Badillo-Fernández M, Toledano-Ayala M, Aceves-Fernandez MA, Rodriguez-Resendiz J, Ko SB, Tovar-Arriaga S. Performance Evaluation of Different Object Detection Models for the Segmentation of Optical Cups and Discs. Diagnostics (Basel) 2022; 12:diagnostics12123031. [PMID: 36553037 PMCID: PMC9777130 DOI: 10.3390/diagnostics12123031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Revised: 11/22/2022] [Accepted: 11/25/2022] [Indexed: 12/12/2022] Open
Abstract
Glaucoma is an eye disease that gradually deteriorates vision. Much research focuses on extracting information from the optic disc and optic cup, the structure used for measuring the cup-to-disc ratio. These structures are commonly segmented with deeplearning techniques, primarily using Encoder-Decoder models, which are hard to train and time-consuming. Object detection models using convolutional neural networks can extract features from fundus retinal images with good precision. However, the superiority of one model over another for a specific task is still being determined. The main goal of our approach is to compare object detection model performance to automate segment cups and discs on fundus images. This study brings the novelty of seeing the behavior of different object detection models in the detection and segmentation of the disc and the optical cup (Mask R-CNN, MS R-CNN, CARAFE, Cascade Mask R-CNN, GCNet, SOLO, Point_Rend), evaluated on Retinal Fundus Images for Glaucoma Analysis (REFUGE), and G1020 datasets. Reported metrics were Average Precision (AP), F1-score, IoU, and AUCPR. Several models achieved the highest AP with a perfect 1.000 when the threshold for IoU was set up at 0.50 on REFUGE, and the lowest was Cascade Mask R-CNN with an AP of 0.997. On the G1020 dataset, the best model was Point_Rend with an AP of 0.956, and the worst was SOLO with 0.906. It was concluded that the methods reviewed achieved excellent performance with high precision and recall values, showing efficiency and effectiveness. The problem of how many images are needed was addressed with an initial value of 100, with excellent results. Data augmentation, multi-scale handling, and anchor box size brought improvements. The capability to translate knowledge from one database to another shows promising results too.
Collapse
Affiliation(s)
- Gendry Alfonso-Francia
- Faculty of Engineering, Autonomous University of Querétaro, Santiago de Querétaro 76010, Mexico
- Department of Electrical and Computer Engineering, University of Saskatchewan, 57 Campus Drive, Saskatoon, SK S7N 5A9, Canada
| | | | - Mariana Badillo-Fernández
- Instituto Mexicano de Oftalmología (IMO) I.A.P., Circuito Exterior Estadio Corregidora sn, Centro Sur, Santiago de Querétaro 76010, Mexico
| | - Manuel Toledano-Ayala
- Faculty of Engineering, Autonomous University of Querétaro, Santiago de Querétaro 76010, Mexico
| | | | | | - Seok-Bum Ko
- Department of Electrical and Computer Engineering, University of Saskatchewan, 57 Campus Drive, Saskatoon, SK S7N 5A9, Canada
| | - Saul Tovar-Arriaga
- Faculty of Engineering, Autonomous University of Querétaro, Santiago de Querétaro 76010, Mexico
- Correspondence:
| |
Collapse
|