1
|
Yu M, Fang Y, Liu Y, Bozoki AC, Xiao S, Yue L, Liu M. Hybrid multi-modality multi-task learning for forecasting progression trajectories in subjective cognitive decline. Neural Netw 2025; 186:107263. [PMID: 39985974 PMCID: PMC11893250 DOI: 10.1016/j.neunet.2025.107263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2024] [Revised: 12/11/2024] [Accepted: 02/07/2025] [Indexed: 02/24/2025]
Abstract
While numerous studies strive to exploit the complementary potential of MRI and PET using learning-based methods, the effective fusion of the two modalities remains a tricky problem due to their inherently distinctive properties. In addition, current studies often face the problem of small sample sizes and missing PET data due to factors such as patient withdrawal or low image quality. To this end, we propose a hybrid multi-modality multi-task learning (HM2L) framework with cross-domain knowledge transfer for forecasting trajectories of SCD progression. Our HM2L comprises (1) missing PET imputation, (2) multi-modality feature extraction for MRI and PET feature learning with a novel softmax-triplet constraint, (3) attention-based multi-modality fusion of MRI and PET features, and (4) multi-task prediction of category labels and clinical scores such as Mini-Mental State Examination (MMSE) and Geriatric Depression Scale (GDS). To handle problems with small sample sizes, a transfer learning strategy is developed to enable knowledge transfer from a relatively large scale dataset with MRI and PET from 795 subjects to two small-scale SCD cohorts with a total of 136 subjects. Experimental results indicate HM2L surpasses several state-of-the-art methods in jointly predicting category labels and clinical scores of subjective cognitive decline. Results show that the MMSE scores of SCD subjects who develop mild cognitive impairment during the 2-year/7-year follow-up are significantly lower than those of subjects who remain stable, while there exists a complex relationship between SCD progression with GDS.
Collapse
Affiliation(s)
- Minhui Yu
- Department of Radiology and BRIC, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA; Joint Department of Biomedical Engineering, University of North Carolina at Chapel Hill and North Carolina State University, Chapel Hill, NC 27599, USA
| | - Yuqi Fang
- Department of Radiology and BRIC, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Yunbi Liu
- School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Andrea C Bozoki
- Department of Neurology, University of North Carolina at Chapel Hill, NC 27599, USA
| | - Shifu Xiao
- Department of Geriatric Psychiatry, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai 200240, China
| | - Ling Yue
- Department of Geriatric Psychiatry, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai 200240, China.
| | - Mingxia Liu
- Department of Radiology and BRIC, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.
| |
Collapse
|
2
|
Duenias D, Nichyporuk B, Arbel T, Riklin Raviv T. Hyperfusion: A hypernetwork approach to multimodal integration of tabular and medical imaging data for predictive modeling. Med Image Anal 2025; 102:103503. [PMID: 40037055 DOI: 10.1016/j.media.2025.103503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 01/11/2025] [Accepted: 02/10/2025] [Indexed: 03/06/2025]
Abstract
The integration of diverse clinical modalities such as medical imaging and the tabular data extracted from patients' Electronic Health Records (EHRs) is a crucial aspect of modern healthcare. Integrative analysis of multiple sources can provide a comprehensive understanding of the clinical condition of a patient, improving diagnosis and treatment decision. Deep Neural Networks (DNNs) consistently demonstrate outstanding performance in a wide range of multimodal tasks in the medical domain. However, the complex endeavor of effectively merging medical imaging with clinical, demographic and genetic information represented as numerical tabular data remains a highly active and ongoing research pursuit. We present a novel framework based on hypernetworks to fuse clinical imaging and tabular data by conditioning the image processing on the EHR's values and measurements. This approach aims to leverage the complementary information present in these modalities to enhance the accuracy of various medical applications. We demonstrate the strength and generality of our method on two different brain Magnetic Resonance Imaging (MRI) analysis tasks, namely, brain age prediction conditioned by subject's sex and multi-class Alzheimer's Disease (AD) classification conditioned by tabular data. We show that our framework outperforms both single-modality models and state-of-the-art MRI tabular data fusion methods. A link to our code can be found at https://github.com/daniel4725/HyperFusion.
Collapse
Affiliation(s)
- Daniel Duenias
- Ben Gurion University of the Negev, blvd 1, Beer Sheva 84105, Israel
| | - Brennan Nichyporuk
- Centre for Intelligent Machines, McGill University, 3480 University St, Montreal, QC, H3A 0E9, Canada; Mila - Quebec AI Institute, 6666 Rue Saint-Urbain, Montréal, QC H2S 3H1, Canada
| | - Tal Arbel
- Centre for Intelligent Machines, McGill University, 3480 University St, Montreal, QC, H3A 0E9, Canada; Mila - Quebec AI Institute, 6666 Rue Saint-Urbain, Montréal, QC H2S 3H1, Canada
| | | |
Collapse
|
3
|
Gan Q, Wang C, Zhong Z, Wu J, Ge Q, Shi L, Shang J, Liu C. Cross-modal attention model integrating tongue images and descriptions: a novel intelligent TCM approach for pathological organ diagnosis. Front Physiol 2025; 16:1580985. [PMID: 40342857 PMCID: PMC12059375 DOI: 10.3389/fphys.2025.1580985] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2025] [Accepted: 04/07/2025] [Indexed: 05/11/2025] Open
Abstract
Introduction Tongue diagnosis is a fundamental technique in traditional Chinese medicine (TCM), where clinicians evaluate the tongue's appearance to infer the condition of pathological organs. However, most existing research on intelligent tongue diagnosis primarily focuses on analyzing tongue images, often neglecting the important descriptive text that accompanies these images. This text is an essential component of clinical diagnosis. To overcome this gap, we propose a novel Cross-Modal Pathological Organ Diagnosis Model that integrates tongue images and textual descriptions for more accurate pathological classification. Methods Our model extracts features from both the tongue images and the corresponding textual descriptions. These features are then fused using a cross-modal attention mechanism to enhance the classification of pathological organs. The cross-modal attention mechanism enables the model to effectively combine visual and textual information, addressing the limitations of using either modality alone. Results We conducted experiments using a self-constructed dataset to evaluate our model's performance. The results demonstrate that our model outperforms common models regarding overall accuracy. Additionally, ablation studies, where either tongue images or textual descriptions were used alone, confirmed the significant benefit of multimodal fusion in improving diagnostic accuracy. Discussion This study introduces a new perspective on intelligent tongue diagnosis in TCM by incorporating visual and textual data. The experimental findings highlight the importance of cross-modal feature fusion for improving the accuracy of pathological diagnosis. Our approach not only contributes to the development of more effective diagnostic systems but also paves the way for future advancements in the automation of TCM diagnosis.
Collapse
Affiliation(s)
- Quan Gan
- School of Computer Engineering, Jiangsu Ocean University, Lianyungang, China
| | - Chen Wang
- School of Computer Engineering, Jiangsu Ocean University, Lianyungang, China
| | - Zhaoman Zhong
- School of Computer Engineering, Jiangsu Ocean University, Lianyungang, China
| | - Jiaying Wu
- School of Computer Engineering, Jiangsu Ocean University, Lianyungang, China
| | - Qiwei Ge
- The Graduate school of East Asian Studies, Yamaguchi University, Yamaguchi-shi, Japan
| | - Lei Shi
- School of Computer Engineering, Jiangsu Ocean University, Lianyungang, China
| | - Jiaqing Shang
- School of Computer Engineering, Jiangsu Ocean University, Lianyungang, China
| | - Chuanxia Liu
- School of Foreign Languages, Jiangsu Ocean University, Lianyungang, China
| |
Collapse
|
4
|
Sun Y, Shao S, Huang J, Shi H, Yan L, Lu Y, Liu P, Jiang Y, Qiao J, Zhang L. Development and validation of a deep learning model based on cascade mask regional convolutional neural network to noninvasively and accurately identify human round spermatids. J Adv Res 2025:S2090-1232(25)00219-X. [PMID: 40185275 DOI: 10.1016/j.jare.2025.03.059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2024] [Revised: 03/31/2025] [Accepted: 03/31/2025] [Indexed: 04/07/2025] Open
Abstract
INTRODUCTION The difficulty of identifying human round spermatids (hRSs) has impeded applications of the human round spermatid injection (ROSI) technique. RSs can be accurately screened through flow cytometric analysis utilizing the Hoechst fluorescence profile reflecting DNA, but this method is not suitable for isolating hRSs due to the toxicity associated with Hoechst staining. OBJECTIVE To evaluate the capacity of a deep learning model grounded in a cascade mask region-based convolutional neural network (R-CNN) for the noninvasive and accurate identification of hRSs. METHODS In this study, we presented the development and validation of a deep learning model for identifying hRSs through the analysis of 3457 optical light microscope images of sorted hRSs obtained via flow cytometric analysis. The model's accuracy and specificity were evaluated by calculating the mean average precision (mAP). Furthermore, a double-blind experiment was conducted to access the reliability of the proposed model in accurately identifying hRSs. It detected the expression of protamine (PRM1) and/or peanut lectin (PNA), which are established markers for RSs. RESULTS Our deep learning-based model demonstrated a high precision, achieving a mAP of over 0.80 for isolating hRSs in test datasets. The expression of PRM1 and/or PNA was observed in all cells noninvasively selected by our AI model during an independent double-blind test. This phenomenon confirmed the accuracy and effectiveness of the proposed model. The model's capability for noninvasive and accurate isolation of hRSs among spermatogenic cells highlighted its robustness and generalizability for clinical applications. CONCLUSION The deep learning AI model based on a cascade R-CNN has the ability to accurately identify hRSs among spermatogenic cells. The application of this noninvasive method, which requires no additional procedures in clinical practice, is able to facilitate the widespread implementation of ROSI technique. Therefore, it can provide patients with spermatogenic arrest the opportunity to become biological fathers.
Collapse
Affiliation(s)
- Yujiao Sun
- Institute of Genetics and Development Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Shihao Shao
- School of Basic Medical Sciences Peking University, Beijing 100101, China
| | - Jiangwei Huang
- Institute of Genetics and Development Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Hao Shi
- National Center for Protein Sciences at Peking University, School of Life Sciences, Peking University, Beijing 100191, China
| | - Liying Yan
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing 100191, China; National Clinical Research Center for Obstetrics and Gynecology, Beijing 100191, China; Key Laboratory of Assisted Reproduction (Peking University), Ministry of Education, Beijing 100191, China; Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproductive Technology, Beijing 100191, China; National Clinical Key Specialty Construction Program, Beijing 100191, China
| | - Yongjie Lu
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing 100191, China; National Clinical Research Center for Obstetrics and Gynecology, Beijing 100191, China; Key Laboratory of Assisted Reproduction (Peking University), Ministry of Education, Beijing 100191, China; Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproductive Technology, Beijing 100191, China; National Clinical Key Specialty Construction Program, Beijing 100191, China
| | - Ping Liu
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing 100191, China; National Clinical Research Center for Obstetrics and Gynecology, Beijing 100191, China; Key Laboratory of Assisted Reproduction (Peking University), Ministry of Education, Beijing 100191, China; Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproductive Technology, Beijing 100191, China
| | - Yuqiang Jiang
- Institute of Genetics and Development Biology, Chinese Academy of Sciences, Beijing 100101, China.
| | - Jie Qiao
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing 100191, China; National Clinical Research Center for Obstetrics and Gynecology, Beijing 100191, China; Key Laboratory of Assisted Reproduction (Peking University), Ministry of Education, Beijing 100191, China; Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproductive Technology, Beijing 100191, China; National Clinical Key Specialty Construction Program, Beijing 100191, China; Beijing Advanced Innovation Center for Genomics, Beijing 100191, China; Peking-Tsinghua Center for Life Sciences, Peking University, Beijing 100191, China.
| | - Li Zhang
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing 100191, China; National Clinical Research Center for Obstetrics and Gynecology, Beijing 100191, China; Key Laboratory of Assisted Reproduction (Peking University), Ministry of Education, Beijing 100191, China; Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproductive Technology, Beijing 100191, China; National Clinical Key Specialty Construction Program, Beijing 100191, China.
| |
Collapse
|
5
|
Wang B, Xie R, Qi W, Yao J, Shi Y, Lou X, Dong C, Zhu X, Wang B, He D, Chen Y, Cao S. Advancing Alzheimer's disease risk prediction: development and validation of a machine learning-based preclinical screening model in a cross-sectional study. BMJ Open 2025; 15:e092293. [PMID: 39922598 DOI: 10.1136/bmjopen-2024-092293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/10/2025] Open
Abstract
OBJECTIVES Alzheimer's disease (AD) poses a significant challenge for individuals aged 65 and older, being the most prevalent form of dementia. Although existing AD risk prediction tools demonstrate high accuracy, their complexity and limited accessibility restrict practical application. This study aimed to develop a convenience, efficient prediction model for AD risk using machine learning techniques. DESIGN AND SETTING We conducted a cross-sectional study with participants aged 60 and older from the National Alzheimer's Coordinating Center. We selected personal characteristics, clinical data and psychosocial factors as baseline predictors for AD (March 2015 to December 2021). The study utilised Random Forest and Extreme Gradient Boosting (XGBoost) algorithms alongside traditional logistic regression for modelling. An oversampling method was applied to balance the data set. INTERVENTIONS This study has no interventions. PARTICIPANTS The study included 2379 participants, of whom 507 were diagnosed with AD. PRIMARY AND SECONDARY OUTCOME MEASURES Including accuracy, precision, recall, F1 score, etc. RESULTS: 11 variables were critical in the training phase, including educational level, depression, insomnia, age, Body Mass Index (BMI), medication count, gender, stenting, systolic blood pressure (sbp), neurosis and rapid eye movement. The XGBoost model exhibited superior performance compared with other models, achieving area under the curve of 0.915, sensitivity of 76.2% and specificity of 92.9%. The most influential predictors were educational level, total medication count, age, sbp and BMI. CONCLUSIONS The proposed classifier can help guide preclinical screening of AD in the elderly population.
Collapse
Affiliation(s)
- Bingsheng Wang
- School of Nursing, Hangzhou Normal University, Hangzhou, Zhejiang, China
| | - Ruihan Xie
- Department of Information Engineering, The Chinese University of Hong Kong, Hong Kong, China
| | - Wenhao Qi
- School of Nursing, Hangzhou Normal University, Hangzhou, Zhejiang, China
| | - Jiani Yao
- School of Nursing, Hangzhou Normal University, Hangzhou, Zhejiang, China
| | - Yankai Shi
- School of Nursing, Hangzhou Normal University, Hangzhou, Zhejiang, China
| | - Xiajing Lou
- School of Nursing, Hangzhou Normal University, Hangzhou, Zhejiang, China
| | - Chaoqun Dong
- School of Nursing, Hangzhou Normal University, Hangzhou, Zhejiang, China
| | - Xiaohong Zhu
- School of Nursing, Hangzhou Normal University, Hangzhou, Zhejiang, China
| | - Bing Wang
- School of Nursing, Hangzhou Normal University, Hangzhou, Zhejiang, China
| | - Danni He
- Zhejiang Provincial People's Hospital, Hangzhou, Zhejiang, China
| | - Yanfei Chen
- Hangzhou Normal University Affiliated Hospital, Hangzhou, Zhejiang, China
| | - Shihua Cao
- School of Nursing, Hangzhou Normal University, Hangzhou, Zhejiang, China
- Engineering Research Center of Mobile Health Management System, Ministry of Education, Hangzhou, Zhejiang, China
| |
Collapse
|
6
|
Fang J, Zhang DF, Xie K, Xu L, Bi XA. Bilinear Perceptual Fusion Algorithm Based on Brain Functional and Structural Data for ASD Diagnosis and Regions of Interest Identification. Interdiscip Sci 2024; 16:936-950. [PMID: 39254805 DOI: 10.1007/s12539-024-00651-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Revised: 08/10/2024] [Accepted: 08/12/2024] [Indexed: 09/11/2024]
Abstract
Autism spectrum disorder (ASD) is a serious mental disorder with a complex pathogenesis mechanism and variable presentation among individuals. Although many deep learning algorithms have been used to diagnose ASD, most of them focus on a single modality of data, resulting in limited information extraction and poor stability. In this paper, we propose a bilinear perceptual fusion (BPF) algorithm that leverages data from multiple modalities. In our algorithm, different schemes are used to extract features according to the characteristics of functional and structural data. Through bilinear operations, the associations between the functional and structural features of each region of interest (ROI) are captured. Then the associations are used to integrate the feature representation. Graph convolutional neural networks (GCNs) can effectively utilize topology and node features in brain network analysis. Therefore, we design a deep learning framework called BPF-GCN and conduct experiments on publicly available ASD dataset. The results show that the classification accuracy of BPF-GCN reached 82.35%, surpassing existing methods. This demonstrates the superiority of its classification performance, and the framework can extract ROIs related to ASD. Our work provides a valuable reference for the timely diagnosis and treatment of ASD.
Collapse
Affiliation(s)
- Jinxiong Fang
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, China
| | - Da-Fang Zhang
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, China.
| | - Kun Xie
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, China
| | - Luyun Xu
- College of Business, Hunan Normal University, Changsha, 410081, China
| | - Xia-An Bi
- College of Information Science and Engineering, Hunan Normal University, Changsha, 410081, China.
| |
Collapse
|
7
|
Chen J, Li Q, Liu F, Wen Y. M3T-LM: A multi-modal multi-task learning model for jointly predicting patient length of stay and mortality. Comput Biol Med 2024; 183:109237. [PMID: 39378581 DOI: 10.1016/j.compbiomed.2024.109237] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2024] [Revised: 09/29/2024] [Accepted: 09/30/2024] [Indexed: 10/10/2024]
Abstract
Ensuring accurate predictions of inpatient length of stay (LoS) and mortality rates is essential for enhancing hospital service efficiency, particularly in light of the constraints posed by limited healthcare resources. Integrative analysis of heterogeneous clinic record data from different sources can hold great promise for improving the prognosis and diagnosis level of LoS and mortality. Currently, most existing studies solely focus on single data modality or tend to single-task learning, i.e., training LoS and mortality tasks separately. This limits the utilization of available multi-modal data and prevents the sharing of feature representations that could capture correlations between different tasks, ultimately hindering the model's performance. To address the challenge, this study proposes a novel Multi-Modal Multi-Task learning model, termed as M3T-LM, to integrate clinic records to predict inpatients' LoS and mortality simultaneously. The M3T-LM framework incorporates multiple data modalities by constructing sub-models tailored to each modality. Specifically, a novel attention-embedded one-dimensional (1D) convolutional neural network (CNN) is designed to handle numerical data. For clinical notes, they are converted into sequence data, and then two long short-term memory (LSTM) networks are exploited to model on textual sequence data. A two-dimensional (2D) CNN architecture, noted as CRXMDL, is designed to extract high-level features from chest X-ray (CXR) images. Subsequently, multiple sub-models are integrated to formulate the M3T-LM to capture the correlations between patient LoS and modality prediction tasks. The efficiency of the proposed method is validated on the MIMIC-IV dataset. The proposed method attained a test MAE of 5.54 for LoS prediction and a test F1 of 0.876 for mortality prediction. The experimental results demonstrate that our approach outperforms state-of-the-art (SOTA) methods in tackling mixed regression and classification tasks.
Collapse
Affiliation(s)
- Junde Chen
- Dale E. and Sarah Ann Fowler School of Engineering, Chapman University, Orange, CA 92866, USA.
| | - Qing Li
- Department of Industrial and Manufacturing Systems Engineering, Iowa State University, Ames, IA 50011, USA.
| | - Feng Liu
- School of Systems and Enterprises, Stevens Institute of Technology, Hoboken, NJ 07030, USA.
| | - Yuxin Wen
- Dale E. and Sarah Ann Fowler School of Engineering, Chapman University, Orange, CA 92866, USA.
| |
Collapse
|
8
|
Thulasimani V, Shanmugavadivel K, Cho J, Veerappampalayam Easwaramoorthy S. A Review of Datasets, Optimization Strategies, and Learning Algorithms for Analyzing Alzheimer's Dementia Detection. Neuropsychiatr Dis Treat 2024; 20:2203-2225. [PMID: 39588176 PMCID: PMC11586527 DOI: 10.2147/ndt.s496307] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/14/2024] [Accepted: 11/14/2024] [Indexed: 11/27/2024] Open
Abstract
Alzheimer's Dementia (AD) is a progressive neurological disorder that affects memory and cognitive function, necessitating early detection for its effective management. This poses a significant challenge to global public health. The early and accurate detection of dementia is crucial for several reasons. First, timely detection facilitates early intervention and planning of treatment. Second, precise diagnostic methods are essential for distinguishing dementia from other cognitive disorders and medical conditions that may present with similar symptoms. Continuous analysis and improvements in detection methods have contributed to advancements in medical research. It helps to identify new biomarkers, refine existing diagnostic tools, and foster the development of innovative technologies, ultimately leading to more accurate and efficient diagnostic approaches for dementia. This paper presents a critical analysis of multimodal imaging datasets, learning algorithms, and optimisation techniques utilised in the context of Alzheimer's dementia detection. The focus is on understanding the advancements and challenges in employing diverse imaging modalities, such as MRI (Magnetic Resonance Imaging), PET (Positron Emission Tomography), and EEG (ElectroEncephaloGram). This study evaluated various machine learning algorithms, deep learning models, transfer learning techniques, and generative adversarial networks for the effective analysis of multi-modality imaging data for dementia detection. In addition, a critical examination of optimisation techniques encompassing optimisation algorithms and hyperparameter tuning strategies for processing and analysing images is presented in this study to discern their influence on model performance and generalisation. Thorough examination and enhancement of methods for dementia detection are fundamental for addressing the healthcare challenges posed by dementia, facilitating timely interventions, improving diagnostic accuracy, and advancing research in neurodegenerative diseases.
Collapse
Affiliation(s)
- Vanaja Thulasimani
- Department of Artificial Intelligence, Kongu Engineering College, Perundurai, Tamilnadu, India
| | | | - Jaehyuk Cho
- Department of Software Engineering and Division of Electronics and Information Engineering, Jeonbuk National University, Jeonju-Si, Republic of Korea
| | | |
Collapse
|
9
|
Sun J, Han JDJ, Chen W. Exploring the relationship among Alzheimer's disease, aging and cognitive scores through neuroimaging-based approach. Sci Rep 2024; 14:27472. [PMID: 39523370 PMCID: PMC11551169 DOI: 10.1038/s41598-024-78712-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2024] [Accepted: 11/04/2024] [Indexed: 11/16/2024] Open
Abstract
Alzheimer's disease (AD) is a fatal neurodegenerative disorder, with the Mini-Mental State Examination (MMSE) and Clinical Dementia Rating (CDR) serving significant roles in monitoring its progression. We hypothesize that while cognitive assessment scores can detect AD-related brain changes, the targeted brain regions may differ. Additionally, given AD's strong association with aging, we propose that specific brain regions are influenced by both AD pathology and aging, exhibiting strong correlations with both. To test these hypotheses, we developed a 3D convolutional network with a mixed-attention mechanism to recognize AD subjects from structural magnetic resonance imaging (sMRI) data and utilize 3D convolutional methods to pinpoint brain regions significantly correlated with the AD, MMSE, CDR and age. All models were trained and internally validated on 417 samples from the Alzheimer's Disease Neuroimaging Initiative (ADNI), and the classification model was externally validated on 382 samples from the Australian Imaging and Lifestyle flagship (AIBL). This approach provided robust support for using MMSE and CDR in assessing AD progression and visually illustrated the relationship between aging and AD. The analysis revealed correlations among the four identification tasks (AD, MMSE, CDR and age) and highlighted asymmetric brain lesions in both AD and aging. Notably, we found that AD can accelerate aging to some extent, and a significant correlation exists between the rate of aging and cognitive assessment scores. This offers new insights into the relationship between AD and aging.
Collapse
Affiliation(s)
- Jinhui Sun
- School of Cyber Science and Engineering, Qufu Normal University, Qufu, 273165, People's Republic of China
| | - Jing-Dong J Han
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Center for Quantitative Biology (CQB), Peking University, Beijing, 100871, People's Republic of China.
| | - Weiyang Chen
- School of Cyber Science and Engineering, Qufu Normal University, Qufu, 273165, People's Republic of China.
| |
Collapse
|
10
|
Nguyen TNQ, García-Rudolph A, Saurí J, Kelleher JD. Multi-task learning for predicting quality-of-life and independence in activities of daily living after stroke: a proof-of-concept study. Front Neurol 2024; 15:1449234. [PMID: 39399874 PMCID: PMC11469734 DOI: 10.3389/fneur.2024.1449234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2024] [Accepted: 08/28/2024] [Indexed: 10/15/2024] Open
Abstract
A health-related (HR) profile is a set of multiple health-related items recording the status of the patient at different follow-up times post-stroke. In order to support clinicians in designing rehabilitation treatment programs, we propose a novel multi-task learning (MTL) strategy for predicting post-stroke patient HR profiles. The HR profile in this study is measured by the Barthel index (BI) assessment or by the EQ-5D-3L questionnaire. Three datasets are used in this work and for each dataset six neural network architectures are developed and tested. Results indicate that an MTL architecture combining a pre-trained network for all tasks with a concatenation strategy conditioned by a task grouping method is a promising approach for predicting the HR profile of a patient with stroke at different phases of the patient journey. These models obtained a mean F1-score of 0.434 (standard deviation 0.022, confidence interval at 95% [0.428, 0.44]) calculated across all the items when predicting BI at 3 months after stroke (MaS), 0.388 (standard deviation 0.029, confidence interval at 95% [0.38, 0.397]) when predicting EQ-5D-3L at 6MaS, and 0.462 (standard deviation 0.029, confidence interval at 95% [0.454, 0.47]) when predicting the EQ-5D-3L at 18MaS. Furthermore, our MTL architecture outperforms the reference single-task learning models and the classic MTL of all tasks in 8 out of 10 tasks when predicting BI at 3MaS and has better prediction performance than the reference models on all tasks when predicting EQ-5D-3L at 6 and 18MaS. The models we present in this paper are the first models to predict the components of the BI or the EQ-5D-3L, and our results demonstrate the potential benefits of using MTL in a health context to predict patient profiles.
Collapse
Affiliation(s)
- Thi Nguyet Que Nguyen
- Research Hub 4 - Digital Futures Research Hub, Technological University Dublin, Dublin, Ireland
- Artificial Intelligence in Digital Health and Medicine (AIDHM), Technological University Dublin, Dublin, Ireland
| | - Alejandro García-Rudolph
- Department of Research and Innovation, Institut Guttmann, Institut Universitari de Neurorehabilitació adscrit ala UAB, Barcelona, Spain
- Departament de Medicina, Universitat Autonoma De Barcelona, Bellaterra, Spain
- Fundació Institut d'Investigació en Ciències de la Salut Germans Trias i Pujol, Barcelona, Spain
| | - Joan Saurí
- Department of Research and Innovation, Institut Guttmann, Institut Universitari de Neurorehabilitació adscrit ala UAB, Barcelona, Spain
- Departament de Medicina, Universitat Autonoma De Barcelona, Bellaterra, Spain
- Fundació Institut d'Investigació en Ciències de la Salut Germans Trias i Pujol, Barcelona, Spain
| | - John D. Kelleher
- Artificial Intelligence in Digital Health and Medicine (AIDHM), Technological University Dublin, Dublin, Ireland
- School of Computer Science and Statistics, Trinity College Dublin, ADAPT Research Centre, Dublin, Ireland
| |
Collapse
|
11
|
Yu Q, Ma Q, Da L, Li J, Wang M, Xu A, Li Z, Li W. A transformer-based unified multimodal framework for Alzheimer's disease assessment. Comput Biol Med 2024; 180:108979. [PMID: 39098237 DOI: 10.1016/j.compbiomed.2024.108979] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2024] [Revised: 07/03/2024] [Accepted: 07/31/2024] [Indexed: 08/06/2024]
Abstract
In Alzheimer's disease (AD) assessment, traditional deep learning approaches have often employed separate methodologies to handle the diverse modalities of input data. Recognizing the critical need for a cohesive and interconnected analytical framework, we propose the AD-Transformer, a novel transformer-based unified deep learning model. This innovative framework seamlessly integrates structural magnetic resonance imaging (sMRI), clinical, and genetic data from the extensive Alzheimer's Disease Neuroimaging Initiative (ADNI) database, encompassing 1651 subjects. By employing a Patch-CNN block, the AD-Transformer efficiently transforms image data into image tokens, while a linear projection layer adeptly converts non-image data into corresponding tokens. As the core, a transformer block learns comprehensive representations of the input data, capturing the intricate interplay between modalities. The AD-Transformer sets a new benchmark in AD diagnosis and Mild Cognitive Impairment (MCI) conversion prediction, achieving remarkable average area under curve (AUC) values of 0.993 and 0.845, respectively, surpassing those of traditional image-only models and non-unified multimodal models. Our experimental results confirmed the potential of the AD-Transformer as a potent tool in AD diagnosis and MCI conversion prediction. By providing a unified framework that jointly learns holistic representations of both image and non-image data, the AD-Transformer paves the way for more effective and precise clinical assessments, offering a clinically adaptable strategy for leveraging diverse data modalities in the battle against AD.
Collapse
Affiliation(s)
- Qi Yu
- Department of Big Data in Health Science, School of Public Health and Center of Clinical Big Data and Analytics of The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
| | - Qian Ma
- Department of Big Data in Health Science, School of Public Health and Center of Clinical Big Data and Analytics of The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
| | - Lijuan Da
- Department of Big Data in Health Science, School of Public Health and Center of Clinical Big Data and Analytics of The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
| | - Jiahui Li
- Department of Big Data in Health Science, School of Public Health and Center of Clinical Big Data and Analytics of The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
| | - Mengying Wang
- Department of Big Data in Health Science, School of Public Health and Center of Clinical Big Data and Analytics of The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
| | - Andi Xu
- Department of Big Data in Health Science, School of Public Health and Center of Clinical Big Data and Analytics of The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
| | - Zilin Li
- School of Mathematics and Statistics, Northeast Normal University, Changchun, 130024, Jilin, China
| | - Wenyuan Li
- Department of Big Data in Health Science, School of Public Health and Center of Clinical Big Data and Analytics of The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China.
| |
Collapse
|
12
|
Tchetchenian A, Zekelman L, Chen Y, Rushmore J, Zhang F, Yeterian EH, Makris N, Rathi Y, Meijering E, Song Y, O'Donnell LJ. Deep multimodal saliency parcellation of cerebellar pathways: Linking microstructure and individual function through explainable multitask learning. Hum Brain Mapp 2024; 45:e70008. [PMID: 39185598 PMCID: PMC11345609 DOI: 10.1002/hbm.70008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Revised: 07/18/2024] [Accepted: 08/10/2024] [Indexed: 08/27/2024] Open
Abstract
Parcellation of human cerebellar pathways is essential for advancing our understanding of the human brain. Existing diffusion magnetic resonance imaging tractography parcellation methods have been successful in defining major cerebellar fibre tracts, while relying solely on fibre tract structure. However, each fibre tract may relay information related to multiple cognitive and motor functions of the cerebellum. Hence, it may be beneficial for parcellation to consider the potential importance of the fibre tracts for individual motor and cognitive functional performance measures. In this work, we propose a multimodal data-driven method for cerebellar pathway parcellation, which incorporates both measures of microstructure and connectivity, and measures of individual functional performance. Our method involves first training a multitask deep network to predict various cognitive and motor measures from a set of fibre tract structural features. The importance of each structural feature for predicting each functional measure is then computed, resulting in a set of structure-function saliency values that are clustered to parcellate cerebellar pathways. We refer to our method as Deep Multimodal Saliency Parcellation (DeepMSP), as it computes the saliency of structural measures for predicting cognitive and motor functional performance, with these saliencies being applied to the task of parcellation. Applying DeepMSP to a large-scale dataset from the Human Connectome Project Young Adult study (n = 1065), we found that it was feasible to identify multiple cerebellar pathway parcels with unique structure-function saliency patterns that were stable across training folds. We thoroughly experimented with all stages of the DeepMSP pipeline, including network selection, structure-function saliency representation, clustering algorithm, and cluster count. We found that a 1D convolutional neural network architecture and a transformer network architecture both performed comparably for the multitask prediction of endurance, strength, reading decoding, and vocabulary comprehension, with both architectures outperforming a fully connected network architecture. Quantitative experiments demonstrated that a proposed low-dimensional saliency representation with an explicit measure of motor versus cognitive category bias achieved the best parcellation results, while a parcel count of four was most successful according to standard cluster quality metrics. Our results suggested that motor and cognitive saliencies are distributed across the cerebellar white matter pathways. Inspection of the final k = 4 parcellation revealed that the highest-saliency parcel was most salient for the prediction of both motor and cognitive performance scores and included parts of the middle and superior cerebellar peduncles. Our proposed saliency-based parcellation framework, DeepMSP, enables multimodal, data-driven tractography parcellation. Through utilising both structural features and functional performance measures, this parcellation strategy may have the potential to enhance the study of structure-function relationships of the cerebellar pathways.
Collapse
Affiliation(s)
- Ari Tchetchenian
- Biomedical Image Computing Group, School of Computer Science and EngineeringUniversity of New South Wales (UNSW)SydneyNew South WalesAustralia
| | - Leo Zekelman
- Department of Radiology, Brigham and Women's HospitalHarvard Medical SchoolBostonMassachusettsUSA
- Harvard UniversityCambridgeMassachusettsUSA
| | - Yuqian Chen
- Department of Radiology, Brigham and Women's HospitalHarvard Medical SchoolBostonMassachusettsUSA
| | - Jarrett Rushmore
- Department of PsychiatryMassachusetts General Hospital, Harvard Medical SchoolBostonMassachusettsUSA
- Department of NeurologyMassachusetts General Hospital, Harvard Medical SchoolBostonMassachusettsUSA
- Department of RadiologyMassachusetts General Hospital, Harvard Medical SchoolBostonMassachusettsUSA
- Department of Anatomy and NeurobiologyBoston University School of MedicineBostonMassachusettsUSA
| | - Fan Zhang
- School of Information and Communication EngineeringUniversity of Electronic Science and Technology of ChinaChengduChina
| | | | - Nikos Makris
- Department of PsychiatryMassachusetts General Hospital, Harvard Medical SchoolBostonMassachusettsUSA
- Department of NeurologyMassachusetts General Hospital, Harvard Medical SchoolBostonMassachusettsUSA
- Department of RadiologyMassachusetts General Hospital, Harvard Medical SchoolBostonMassachusettsUSA
- Department of Psychiatry, Brigham and Women's HospitalHarvard Medical SchoolBostonMassachusettsUSA
| | - Yogesh Rathi
- Department of Radiology, Brigham and Women's HospitalHarvard Medical SchoolBostonMassachusettsUSA
- Department of Psychiatry, Brigham and Women's HospitalHarvard Medical SchoolBostonMassachusettsUSA
| | - Erik Meijering
- Biomedical Image Computing Group, School of Computer Science and EngineeringUniversity of New South Wales (UNSW)SydneyNew South WalesAustralia
| | - Yang Song
- Biomedical Image Computing Group, School of Computer Science and EngineeringUniversity of New South Wales (UNSW)SydneyNew South WalesAustralia
| | - Lauren J. O'Donnell
- Department of Radiology, Brigham and Women's HospitalHarvard Medical SchoolBostonMassachusettsUSA
| |
Collapse
|
13
|
Dhaygude AD, Ameta GK, Khan IR, Singh PP, Maaliw RR, Lakshmaiya N, Shabaz M, Khan MA, Hussein HS, Alshazly H. Knowledge‐based deep learning system for classifying Alzheimer's disease for multi‐task learning. CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY 2024; 9:805-820. [DOI: 10.1049/cit2.12291] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Accepted: 06/21/2023] [Indexed: 08/25/2024] Open
Abstract
AbstractDeep learning has recently become a viable approach for classifying Alzheimer's disease (AD) in medical imaging. However, existing models struggle to efficiently extract features from medical images and may squander additional information resources for illness classification. To address these issues, a deep three‐dimensional convolutional neural network incorporating multi‐task learning and attention mechanisms is proposed. An upgraded primary C3D network is utilised to create rougher low‐level feature maps. It introduces a new convolution block that focuses on the structural aspects of the magnetic resonance imaging image and another block that extracts attention weights unique to certain pixel positions in the feature map and multiplies them with the feature map output. Then, several fully connected layers are used to achieve multi‐task learning, generating three outputs, including the primary classification task. The other two outputs employ backpropagation during training to improve the primary classification job. Experimental findings show that the authors’ proposed method outperforms current approaches for classifying AD, achieving enhanced classification accuracy and other indicators on the Alzheimer's disease Neuroimaging Initiative dataset. The authors demonstrate promise for future disease classification studies.
Collapse
Affiliation(s)
| | - Gaurav Kumar Ameta
- Department of Computer Science & Engineering Parul Institute of Technology Parul University Vadodara Gujarat India
| | | | | | - Renato R. Maaliw
- College of Engineering Southern Luzon State University Lucban Quezon Philippines
| | - Natrayan Lakshmaiya
- Department of Mechanical Engineering Saveetha School of Engineering SIMATS Chennai Tamil Nadu India
| | - Mohammad Shabaz
- Model Institute of Engineering and Technology Jammu J&K India
| | - Muhammad Attique Khan
- Department of Computer Science HITEC University Taxila Pakistan
- Department of Computer Science and Mathematics Lebanese American University Beirut Lebanon
| | - Hany S. Hussein
- Electrical Engineering Department College of Engineering King Khalid University Abha Saudi Arabia
- Electrical Engineering Department Aswan University Aswan Egypt
| | - Hammam Alshazly
- Faculty of Computers and Information South Valley University Qena Egypt
| |
Collapse
|
14
|
Hasan F, Muhtar MS, Wu D, Chen PY, Hsu MH, Nguyen PA, Chen TJ, Chiu HY. Web-based artificial intelligence to predict cognitive impairment following stroke: A multicenter study. J Stroke Cerebrovasc Dis 2024; 33:107826. [PMID: 38908612 DOI: 10.1016/j.jstrokecerebrovasdis.2024.107826] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 06/05/2024] [Accepted: 06/18/2024] [Indexed: 06/24/2024] Open
Abstract
BACKGROUND AND PURPOSE Post-stroke cognitive impairment (PSCI) is highly prevalent in modern society. However, there is limited study implying an accurate and explainable machine learning model to predict PSCI. The aim of this study is to develop and validate a web-based artificial intelligence (AI) tool for predicting PSCI. METHODS The retrospective cohort study design was conducted to develop and validate a web-based prediction model. Adults who experienced a stroke between January 1, 2004, and September 30, 2017, were enrolled, and patients with PSCI were followed up from the stroke index date until their last follow-up. The model's performance metrics, including accuracy, area under the curve (AUC), recall, precision, and F1 score, were compared. RESULTS A total of 3209 stroke patients were included in the study. The model demonstrated an accuracy of 0.8793, AUC of 0.9200, recall of 0.6332, precision of 0.9664, and F1 score of 0.7651. In the external validation phase, the accuracy improved to 0.9039, AUC to 0.9094, recall to 0.7457, precision to 0.9168, and F1 score to 0.8224. The final model can be accessed at https://psci-calculator.my.id/. CONCLUSION Our results are able to produce a user-friendly interface that is useful for health practitioners to perform early prediction on PSCI. These findings also suggest that the provided AI model is reliable and can serve as a roadmap for future studies using AI models in a clinical setting.
Collapse
Affiliation(s)
- Faizul Hasan
- Faculty of Nursing, Chulalongkorn University, Boromarajonani Srisataphat Building, 12th Floor, Rama1 Road, Wang Mai, Pathum Wan, Bangkok 10330, Thailand; School of Nursing, College of Nursing, Taipei Medical University, No. 250, Wuxing St., Xinyi Dist., Taipei City 110, Taiwan
| | | | - Dean Wu
- Research Center of Sleep Medicine, College of Medicine, Taipei Medical University 110, Taipei City, Taiwan; Department of Neurology, School of Medicine, College of Medicine, Taipei Medical University, Taipei 110, Taiwan; Department of Neurology, Shuang-Ho Hospital, New Taipei City 23561, Taiwan
| | - Pin-Yuan Chen
- Department of Neurosurgery, Chang Gung Memorial Hospital, Keelung City 204, Taiwan; School of Medicine, College of Medicine, Chang Gung University, Taoyuan City 333, Taiwan
| | - Min-Huei Hsu
- Graduate Institute of Data Science, Taipei Medical University, Taipei City 110, Taiwan
| | - Phung Anh Nguyen
- Graduate Institute of Data Science, Taipei Medical University, Taipei City 110, Taiwan
| | - Ting-Jhen Chen
- Faculty of Nursing, Chulalongkorn University, Boromarajonani Srisataphat Building, 12th Floor, Rama1 Road, Wang Mai, Pathum Wan, Bangkok 10330, Thailand; School of Nursing, Faculty of Science, Medicine and Health, University of Wollongong, Northfields Ave, Wollongong, NSW 2522, Australia
| | - Hsiao-Yean Chiu
- School of Nursing, College of Nursing, Taipei Medical University, No. 250, Wuxing St., Xinyi Dist., Taipei City 110, Taiwan; Research Center of Sleep Medicine, College of Medicine, Taipei Medical University 110, Taipei City, Taiwan; Department of Nursing, Taipei Medical University Hospital, Taipei City 110, Taiwan.
| |
Collapse
|
15
|
Kaur A, Mittal M, Bhatti JS, Thareja S, Singh S. A systematic literature review on the significance of deep learning and machine learning in predicting Alzheimer's disease. Artif Intell Med 2024; 154:102928. [PMID: 39029377 DOI: 10.1016/j.artmed.2024.102928] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Revised: 04/15/2024] [Accepted: 06/27/2024] [Indexed: 07/21/2024]
Abstract
BACKGROUND Alzheimer's disease (AD) is the most prevalent cause of dementia, characterized by a steady decline in mental, behavioral, and social abilities and impairs a person's capacity for independent functioning. It is a fatal neurodegenerative disease primarily affecting older adults. OBJECTIVES The purpose of this literature review is to investigate various AD detection techniques, datasets, input modalities, algorithms, libraries, and performance evaluation metrics used to determine which model or strategy may provide superior performance. METHOD The initial search yielded 807 papers, but only 100 research articles were chosen after applying the inclusion-exclusion criteria. This SLR analyzed research items published between January 2019 and December 2022. The ACM, Elsevier, IEEE Xplore Digital Library, PubMed, Springer and Taylor & Francis were systematically searched. The current study considers articles that used Magnetic Resonance Imaging (MRI), Positron Emission Tomography (PET), APOe4 genotype, Diffusion Tensor Imaging (DTI) and Cerebrospinal Fluid (CSF) biomarkers. The study was performed following Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) guidelines. CONCLUSION According to the literature survey, most studies (n = 76) used the DL strategy. The datasets used by studies were primarily derived from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database. The majority of studies (n = 73) used single-modality neuroimaging data, while the remaining used multi-modal input data. In a multi-modality approach, the combination of MRI and PET scans is commonly preferred. Also, Regarding the algorithm used, Convolution Neural Network (CNN) showed the highest accuracy, 100 %, in classifying AD vs. CN subjects whereas the SVM was the most common ML algorithm, with a maximum accuracy of 99.82 %.
Collapse
Affiliation(s)
- Arshdeep Kaur
- Dept. of Computer Science & Technology, Central University of Punjab, Bathinda, India
| | - Meenakshi Mittal
- Dept. of Computer Science & Technology, Central University of Punjab, Bathinda, India
| | - Jasvinder Singh Bhatti
- Dept. of Human Genetics and Molecular Medicine, Central University of Punjab, Bathinda, India
| | - Suresh Thareja
- Dept. of Pharmaceutical Sciences and Natural Products, Central University of Punjab, Bathinda, India
| | - Satwinder Singh
- Dept. of Computer Science & Technology, Central University of Punjab, Bathinda, India.
| |
Collapse
|
16
|
White A, Saranti M, d'Avila Garcez A, Hope TMH, Price CJ, Bowman H. Predicting recovery following stroke: Deep learning, multimodal data and feature selection using explainable AI. Neuroimage Clin 2024; 43:103638. [PMID: 39002223 PMCID: PMC11299565 DOI: 10.1016/j.nicl.2024.103638] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2024] [Revised: 04/22/2024] [Accepted: 06/29/2024] [Indexed: 07/15/2024]
Abstract
Machine learning offers great potential for automated prediction of post-stroke symptoms and their response to rehabilitation. Major challenges for this endeavour include the very high dimensionality of neuroimaging data, the relatively small size of the datasets available for learning and interpreting the predictive features, as well as, how to effectively combine neuroimaging and tabular data (e.g. demographic information and clinical characteristics). This paper evaluates several solutions based on two strategies. The first is to use 2D images that summarise MRI scans. The second is to select key features that improve classification accuracy. Additionally, we introduce the novel approach of training a convolutional neural network (CNN) on images that combine regions-of-interests (ROIs) extracted from MRIs, with symbolic representations of tabular data. We evaluate a series of CNN architectures (both 2D and a 3D) that are trained on different representations of MRI and tabular data, to predict whether a composite measure of post-stroke spoken picture description ability is in the aphasic or non-aphasic range. MRI and tabular data were acquired from 758 English speaking stroke survivors who participated in the PLORAS study. Each participant was assigned to one of five different groups that were matched for initial severity of symptoms, recovery time, left lesion size and the months or years post-stroke that spoken description scores were collected. Training and validation were carried out on the first four groups. The fifth (lock-box/test set) group was used to test how well model accuracy generalises to new (unseen) data. The classification accuracy for a baseline logistic regression was 0.678 based on lesion size alone, rising to 0.757 and 0.813 when initial symptom severity and recovery time were successively added. The highest classification accuracy (0.854), area under the curve (0.899) and F1 score (0.901) were observed when 8 regions of interest were extracted from each MRI scan and combined with lesion size, initial severity and recovery time in a 2D Residual Neural Network (ResNet). This was also the best model when data were limited to the 286 participants with moderate or severe initial aphasia (with area under curve = 0.865), a group that would be considered more difficult to classify. Our findings demonstrate how imaging and tabular data can be combined to achieve high post-stroke classification accuracy, even when the dataset is small in machine learning terms. We conclude by proposing how the current models could be improved to achieve even higher levels of accuracy using images from hospital scanners.
Collapse
Affiliation(s)
- Adam White
- Department of Computer Science, City, University of London, UK
| | | | | | - Thomas M H Hope
- Wellcome Centre for Human Neuroimaging, University College London, UK
| | - Cathy J Price
- Wellcome Centre for Human Neuroimaging, University College London, UK
| | - Howard Bowman
- School of Psychology, University of Birmingham, UK; School of Computer Science, University of Birmingham, UK
| |
Collapse
|
17
|
Malik I, Iqbal A, Gu YH, Al-antari MA. Deep Learning for Alzheimer's Disease Prediction: A Comprehensive Review. Diagnostics (Basel) 2024; 14:1281. [PMID: 38928696 PMCID: PMC11202897 DOI: 10.3390/diagnostics14121281] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2024] [Revised: 06/10/2024] [Accepted: 06/13/2024] [Indexed: 06/28/2024] Open
Abstract
Alzheimer's disease (AD) is a neurological disorder that significantly impairs cognitive function, leading to memory loss and eventually death. AD progresses through three stages: early stage, mild cognitive impairment (MCI) (middle stage), and dementia. Early diagnosis of Alzheimer's disease is crucial and can improve survival rates among patients. Traditional methods for diagnosing AD through regular checkups and manual examinations are challenging. Advances in computer-aided diagnosis systems (CADs) have led to the development of various artificial intelligence and deep learning-based methods for rapid AD detection. This survey aims to explore the different modalities, feature extraction methods, datasets, machine learning techniques, and validation methods used in AD detection. We reviewed 116 relevant papers from repositories including Elsevier (45), IEEE (25), Springer (19), Wiley (6), PLOS One (5), MDPI (3), World Scientific (3), Frontiers (3), PeerJ (2), Hindawi (2), IO Press (1), and other multiple sources (2). The review is presented in tables for ease of reference, allowing readers to quickly grasp the key findings of each study. Additionally, this review addresses the challenges in the current literature and emphasizes the importance of interpretability and explainability in understanding deep learning model predictions. The primary goal is to assess existing techniques for AD identification and highlight obstacles to guide future research.
Collapse
Affiliation(s)
- Isra Malik
- Department of Computer Science, COMSATS University Islamabad, Wah Campus, Wah Cantt 44000, Pakistan
| | - Ahmed Iqbal
- Department of Computer Science, Sir Syed Case Institute of Technology, Islamabad 45230, Pakistan
| | - Yeong Hyeon Gu
- Department of Artificial Intelligence and Data Science, College of AI Convergence, Daeyang AI Center, Sejong University, Seoul 05006, Republic of Korea
| | - Mugahed A. Al-antari
- Department of Artificial Intelligence and Data Science, College of AI Convergence, Daeyang AI Center, Sejong University, Seoul 05006, Republic of Korea
| |
Collapse
|
18
|
Mehmood A, Shahid F, Khan R, Ibrahim MM, Zheng Z. Utilizing Siamese 4D-AlzNet and Transfer Learning to Identify Stages of Alzheimer's Disease. Neuroscience 2024; 545:69-85. [PMID: 38492797 DOI: 10.1016/j.neuroscience.2024.03.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Revised: 03/05/2024] [Accepted: 03/10/2024] [Indexed: 03/18/2024]
Abstract
Alzheimer's disease (AD) is the general form of dementia, leading to a progressive neurological disorder characterized by memory loss due to brain cell damage. Artificial Intelligence (AI) assists in the early identification and prediction of AD patients, determining future risks and benefits for radiologists and doctors to save time and cost. Since deep learning (DL) approaches work well with massive datasets and have recently become helpful for AD detection, there remains an area for improvement in automating detection performance. Present approaches somehow addressed the challenges of limited annotated data samples for binary classification. This contrasts with prior state-of-the-art techniques, which were constrained by their incapacity to capture abstract-level information. In this paper, we proposed a Siamese 4D-AlzNet model comprised of four parallel convolutional neural network (CNN) streams (Five CNN layer blocks) and customized transfer learning models (Frozen VGG-19, Frozen VGG-16, and customized AlexNet). Siamese 4D-AlzNet was vertically and horizontally stored, and the spatial features were passed to the final layer for classification. For experiments, T1-weighted MRI images comprised of four distinct subject classes, normal control (NC), mild cognitive impairment (MCI), late mild cognitive impairment (LMCI), and AD, have been employed. Our proposed models achieved outstanding accuracy, with a remarkable 95.05% accuracy distinguishing between normal and AD subjects. The performance across remaining binary class pairs consistently exceeded 90%. We thoroughly compared our model with the latest methods using the same dataset as our reference. Our proposed model improved NC-AD and MCI-AD classification accuracy by 2% 7%.
Collapse
Affiliation(s)
- Atif Mehmood
- School of Computer Science and Technology, Zhejiang Normal University, Jinhua 321004, China; Zhejiang Institute of Photoelectronics & Zhejiang Institute for Advanced Light Source, Zhejiang Normal University, Jinhua, Zhejiang 321004, China
| | - Farah Shahid
- School of Computer Science and Technology, Zhejiang Normal University, Jinhua 321004, China; Zhejiang Institute of Photoelectronics & Zhejiang Institute for Advanced Light Source, Zhejiang Normal University, Jinhua, Zhejiang 321004, China.
| | - Rizwan Khan
- School of Computer Science and Technology, Zhejiang Normal University, Jinhua 321004, China
| | - Mostafa M Ibrahim
- Department of Electrical Engineering, Faculty of Engineering, Minia University, Minia 61519, Egypt
| | - Zhonglong Zheng
- School of Computer Science and Technology, Zhejiang Normal University, Jinhua 321004, China.
| |
Collapse
|
19
|
Han K, Li G, Fang Z, Yang F. Multi-Template Meta-Information Regularized Network for Alzheimer's Disease Diagnosis Using Structural MRI. IEEE TRANSACTIONS ON MEDICAL IMAGING 2024; 43:1664-1676. [PMID: 38109240 DOI: 10.1109/tmi.2023.3344384] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2023]
Abstract
Structural magnetic resonance imaging (sMRI) has been widely applied in computer-aided Alzheimer's disease (AD) diagnosis, owing to its capabilities in providing detailed brain morphometric patterns and anatomical features in vivo. Although previous works have validated the effectiveness of incorporating metadata (e.g., age, gender, and educational years) for sMRI-based AD diagnosis, existing methods solely paid attention to metadata-associated correlation to AD (e.g., gender bias in AD prevalence) or confounding effects (e.g., the issue of normal aging and metadata-related heterogeneity). Hence, it is difficult to fully excavate the influence of metadata on AD diagnosis. To address these issues, we constructed a novel Multi-template Meta-information Regularized Network (MMRN) for AD diagnosis. Specifically, considering diagnostic variation resulting from different spatial transformations onto different brain templates, we first regarded different transformations as data augmentation for self-supervised learning after template selection. Since the confounding effects may arise from excessive attention to meta-information owing to its correlation with AD, we then designed the modules of weakly supervised meta-information learning and mutual information minimization to learn and disentangle meta-information from learned class-related representations, which accounts for meta-information regularization for disease diagnosis. We have evaluated our proposed MMRN on two public multi-center cohorts, including the Alzheimer's Disease Neuroimaging Initiative (ADNI) with 1,950 subjects and the National Alzheimer's Coordinating Center (NACC) with 1,163 subjects. The experimental results have shown that our proposed method outperformed the state-of-the-art approaches in both tasks of AD diagnosis, mild cognitive impairment (MCI) conversion prediction, and normal control (NC) vs. MCI vs. AD classification.
Collapse
|
20
|
Jomeiri A, Navin AH, Shamsi M. Longitudinal MRI analysis using a hybrid DenseNet-BiLSTM method for Alzheimer's disease prediction. Behav Brain Res 2024; 463:114900. [PMID: 38341100 DOI: 10.1016/j.bbr.2024.114900] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2023] [Revised: 12/16/2023] [Accepted: 02/06/2024] [Indexed: 02/12/2024]
Abstract
Alzheimer's disease is a progressive neurological disorder characterized by brain atrophy and cell death, leading to cognitive decline and impaired functioning. Previous research has primarily focused on using cross-sectional data for Alzheimer's disease identification, but analyzing longitudinal sequential MR images is crucial for improved diagnostic accuracy and understanding disease progression. However, existing deep learning models face challenges in learning spatial and temporal features from such data. To address these challenges, this study presents a novel hybrid DenseNet-BiLSTM method for Alzheimer's disease prediction using longitudinal MRI analysis. The proposed framework combines Convolutional DenseNet for spatial information extraction and joined BiLSTM layers for capturing temporal characteristics and relationships between longitudinal images at different time points. This approach overcomes issues like overfitting, vanishing gradients, and incomplete patient data. We evaluated the model on 684 longitudinal MRI images from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database, including normal controls, individuals with mild cognitive impairment, and Alzheimer's disease patients. The results demonstrate high classification accuracy, with 95.28% for AD/CN, 88.19% for NC/MCI, 83.51% for sMCI/pMCI, and 92.14% for MCI/AD. These findings highlight the substantial improvement in Alzheimer's disease diagnosis achieved through the utilization of longitudinal MRI images. The contributions of this study lie in both the deep learning and medical domains. In the deep learning domain, our hybrid framework effectively learns spatial and temporal features from longitudinal data, addressing the challenges associated with multi-dimensional and sequential time series data. In the medical domain, our study emphasizes the importance of analyzing baseline and longitudinal MR images for accurate diagnosis and understanding disease progression.
Collapse
Affiliation(s)
- Alireza Jomeiri
- Department of Computer Engineering, Qom Branch, Islamic Azad University, Qom, Iran
| | - Ahmad Habibizad Navin
- Department of Computer Engineering, Tabriz Branch, Islamic Azad University, Tabriz, Iran.
| | - Mahboubeh Shamsi
- Department of Engineering, Qom University of Technology, Qom, Iran
| |
Collapse
|
21
|
Chen Z, Liu Y, Zhang Y, Zhu J, Li Q, Wu X. Shared Manifold Regularized Joint Feature Selection for Joint Classification and Regression in Alzheimer's Disease Diagnosis. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2024; 33:2730-2745. [PMID: 38578858 DOI: 10.1109/tip.2024.3382600] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/07/2024]
Abstract
In Alzheimer's disease (AD) diagnosis, joint feature selection for predicting disease labels (classification) and estimating cognitive scores (regression) with neuroimaging data has received increasing attention. In this paper, we propose a model named Shared Manifold regularized Joint Feature Selection (SMJFS) that performs classification and regression in a unified framework for AD diagnosis. For classification, unlike the existing works that build least squares regression models which are insufficient in the ability of extracting discriminative information for classification, we design an objective function that integrates linear discriminant analysis and subspace sparsity regularization for acquiring an informative feature subset. Furthermore, the local data relationships are learned according to the samples' transformed distances to exploit the local data structure adaptively. For regression, in contrast to previous works that overlook the correlations among cognitive scores, we learn a latent score space to capture the correlations and employ the latent space to design a regression model with l2,1 -norm regularization, facilitating the feature selection in regression task. Moreover, the missing cognitive scores can be recovered in the latent space for increasing the number of available training samples. Meanwhile, to capture the correlations between the two tasks and describe the local relationships between samples, we construct an adaptive shared graph to guide the subspace learning in classification and the latent cognitive score learning in regression simultaneously. An efficient iterative optimization algorithm is proposed to solve the optimization problem. Extensive experiments on three datasets validate the discriminability of the features selected by SMJFS.
Collapse
|
22
|
Wang R, Gao L, Zhang X, Han J. SVFR: A novel slice-to-volume feature representation framework using deep neural networks and a clustering model for the diagnosis of Alzheimer's disease. Heliyon 2024; 10:e23008. [PMID: 38148809 PMCID: PMC10750062 DOI: 10.1016/j.heliyon.2023.e23008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 09/30/2023] [Accepted: 11/23/2023] [Indexed: 12/28/2023] Open
Abstract
Deep neural networks (DNNs) have been effective in classifying structural magnetic resonance imaging (sMRI) images for Alzheimer's disease (AD) diagnosis. In this study, we propose a novel two-phase slice-to-volume feature representation (SVFR) framework for AD diagnosis. Specifically, we design a slice-level feature extractor to automatically select informative slice images and extract their slice-level features, by combining DNN and clustering models. Furthermore, we propose a joint volume-level feature generator and classifier to hierarchically aggregate the slice-level features into volume-level features and to classify images, by devising a spatial pyramid set pooling module and a fusion module. Experimental results demonstrate the superior performance of the proposed SVFR, surpassing the majority of the state-of-the-art methods and achieving comparable results to the best-performing approach. Experimental results also showcase the efficacy of the slice-level feature extractor in the selection of informative slice images, as well as the effectiveness of the volume-level feature generator and classifier in the integration of slice-level features for image classification. The source code for this study is publicly available at https://github.com/gll89/SVFR.
Collapse
Affiliation(s)
- Rubing Wang
- Faculty of Electrical Engineering and Computer Science, Zhejiang Engineering Research Center of Advanced Mass Spectrometry and Clinical Application, Ningbo University, Ningbo, China
| | - Linlin Gao
- School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, China
- Faculty of Electrical Engineering and Computer Science, Zhejiang Engineering Research Center of Advanced Mass Spectrometry and Clinical Application, Ningbo University, Ningbo, China
| | | | - Jinming Han
- Department of Neurology, Xuanwu Hospital, Capital Medical University, Beijing, China
| | - the Alzheimer's Disease Neuroimaging Initiative
- School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, China
- Faculty of Electrical Engineering and Computer Science, Zhejiang Engineering Research Center of Advanced Mass Spectrometry and Clinical Application, Ningbo University, Ningbo, China
- Ningbo Medical Center Lihuili Hospital, Ningbo, China
- Department of Neurology, Xuanwu Hospital, Capital Medical University, Beijing, China
| |
Collapse
|
23
|
Tang X, Zhang C, Guo R, Yang X, Qian X. A Causality-Aware Graph Convolutional Network Framework for Rigidity Assessment in Parkinsonians. IEEE TRANSACTIONS ON MEDICAL IMAGING 2024; 43:229-240. [PMID: 37432810 DOI: 10.1109/tmi.2023.3294182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/13/2023]
Abstract
Rigidity is one of the common motor disorders in Parkinson's disease (PD), which lead to life quality deterioration. The widely-used rating-scale-based approach for rigidity assessment still depends on the availability of experienced neurologists and is limited by rating subjectivity. Given the recent successful applications of quantitative susceptibility mapping (QSM) in auxiliary PD diagnosis, automated assessment of PD rigidity can be essentially achieved through QSM analysis. However, a major challenge is the performance instability due to the confounding factors (e.g., noise and distribution shift) which conceal the truly-causal features. Therefore, we propose a causality-aware graph convolutional network (GCN) framework, where causal feature selection is combined with causal invariance to ensure that causality-informed model decisions are reached. Firstly, a GCN model that integrates causal feature selection is systematically constructed at three graph levels: node, structure, and representation. In this model, a causal diagram is learned to extract a subgraph with truly-causal information. Secondly, a non-causal perturbation strategy is developed along with an invariance constraint to ensure the stability of the assessment results under different distributions, and thus avoid spurious correlations caused by distribution shifts. The superiority of the proposed method is shown by extensive experiments and the clinical value is revealed by the direct relevance of selected brain regions to rigidity in PD. Besides, its extensibility is verified on other two tasks: PD bradykinesia and mental state for Alzheimer's disease. Overall, we provide a clinically-potential tool for automated and stable assessment of PD rigidity. Our source code will be available at https://github.com/SJTUBME-QianLab/Causality-Aware-Rigidity.
Collapse
|
24
|
Dai Y, Hsu YC, Fernandes BS, Zhang K, Li X, Enduru N, Liu A, Manuel AM, Jiang X, Zhao Z. Disentangling Accelerated Cognitive Decline from the Normal Aging Process and Unraveling Its Genetic Components: A Neuroimaging-Based Deep Learning Approach. J Alzheimers Dis 2024; 97:1807-1827. [PMID: 38306043 PMCID: PMC11649026 DOI: 10.3233/jad-231020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2024]
Abstract
Background The progressive cognitive decline, an integral component of Alzheimer's disease (AD), unfolds in tandem with the natural aging process. Neuroimaging features have demonstrated the capacity to distinguish cognitive decline changes stemming from typical brain aging and AD between different chronological points. Objective To disentangle the normal aging effect from the AD-related accelerated cognitive decline and unravel its genetic components using a neuroimaging-based deep learning approach. Methods We developed a deep-learning framework based on a dual-loss Siamese ResNet network to extract fine-grained information from the longitudinal structural magnetic resonance imaging (MRI) data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) study. We then conducted genome-wide association studies (GWAS) and post-GWAS analyses to reveal the genetic basis of AD-related accelerated cognitive decline. Results We used our model to process data from 1,313 individuals, training it on 414 cognitively normal people and predicting cognitive assessment for all participants. In our analysis of accelerated cognitive decline GWAS, we identified two genome-wide significant loci: APOE locus (chromosome 19 p13.32) and rs144614292 (chromosome 11 p15.1). Variant rs144614292 (G > T) has not been reported in previous AD GWA studies. It is within the intronic region of NELL1, which is expressed in neurons and plays a role in controlling cell growth and differentiation. The cell-type-specific enrichment analysis and functional enrichment of GWAS signals highlighted the microglia and immune-response pathways. Conclusions Our deep learning model effectively extracted relevant neuroimaging features and predicted individual cognitive decline. We reported a novel variant (rs144614292) within the NELL1 gene.
Collapse
Affiliation(s)
- Yulin Dai
- Center for Precision Health, McWilliams School of
Biomedical Informatics, The University of Texas Health Science Center at Houston,
Houston, TX, USA
| | - Yu-Chun Hsu
- Center for Secure Artificial Intelligence for Healthcare,
School of Biomedical Informatics, The University of Texas Health Science Center at
Houston, Houston, TX, USA
| | - Brisa S. Fernandes
- Center for Precision Health, McWilliams School of
Biomedical Informatics, The University of Texas Health Science Center at Houston,
Houston, TX, USA
| | - Kai Zhang
- Center for Secure Artificial Intelligence for Healthcare,
School of Biomedical Informatics, The University of Texas Health Science Center at
Houston, Houston, TX, USA
| | - Xiaoyang Li
- Center for Precision Health, McWilliams School of
Biomedical Informatics, The University of Texas Health Science Center at Houston,
Houston, TX, USA
- Department of Biostatistics and Data Science, School of
Public Health, The University of Texas Health Science Center at Houston, Houston,
TX, USA
| | - Nitesh Enduru
- Center for Precision Health, McWilliams School of
Biomedical Informatics, The University of Texas Health Science Center at Houston,
Houston, TX, USA
- Department of Epidemiology, Human Genetics and
Environmental Sciences, School of Public Health, The University of Texas Health
Science Center at Houston, Houston, TX, USA
| | - Andi Liu
- Center for Precision Health, McWilliams School of
Biomedical Informatics, The University of Texas Health Science Center at Houston,
Houston, TX, USA
- Department of Epidemiology, Human Genetics and
Environmental Sciences, School of Public Health, The University of Texas Health
Science Center at Houston, Houston, TX, USA
| | - Astrid M. Manuel
- Center for Precision Health, McWilliams School of
Biomedical Informatics, The University of Texas Health Science Center at Houston,
Houston, TX, USA
| | - Xiaoqian Jiang
- Center for Secure Artificial Intelligence for Healthcare,
School of Biomedical Informatics, The University of Texas Health Science Center at
Houston, Houston, TX, USA
| | - Zhongming Zhao
- Center for Precision Health, McWilliams School of
Biomedical Informatics, The University of Texas Health Science Center at Houston,
Houston, TX, USA
- Department of Epidemiology, Human Genetics and
Environmental Sciences, School of Public Health, The University of Texas Health
Science Center at Houston, Houston, TX, USA
- Department of Biomedical Informatics, Vanderbilt University
Medical enter, Nashville, TN, USA
| |
Collapse
|
25
|
Sun K, Wang Q, Shen D. Joint Cross-Attention Network With Deep Modality Prior for Fast MRI Reconstruction. IEEE TRANSACTIONS ON MEDICAL IMAGING 2024; 43:558-569. [PMID: 37695966 DOI: 10.1109/tmi.2023.3314008] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/13/2023]
Abstract
Current deep learning-based reconstruction models for accelerated multi-coil magnetic resonance imaging (MRI) mainly focus on subsampled k-space data of single modality using convolutional neural network (CNN). Although dual-domain information and data consistency constraint are commonly adopted in fast MRI reconstruction, the performance of existing models is still limited mainly by three factors: inaccurate estimation of coil sensitivity, inadequate utilization of structural prior, and inductive bias of CNN. To tackle these challenges, we propose an unrolling-based joint Cross-Attention Network, dubbed as jCAN, using deep guidance of the already acquired intra-subject data. Particularly, to improve the performance of coil sensitivity estimation, we simultaneously optimize the latent MR image and sensitivity map (SM). Besides, we introduce Gating layer and Gaussian layer into SM estimation to alleviate the "defocus" and "over-coupling" effects and further ameliorate the SM estimation. To enhance the representation ability of the proposed model, we deploy Vision Transformer (ViT) and CNN in the image and k-space domains, respectively. Moreover, we exploit pre-acquired intra-subject scan as reference modality to guide the reconstruction of subsampled target modality by resorting to the self- and cross-attention scheme. Experimental results on public knee and in-house brain datasets demonstrate that the proposed jCAN outperforms the state-of-the-art methods by a large margin in terms of SSIM and PSNR for different acceleration factors and sampling masks. Our code is publicly available at https://github.com/sunkg/jCAN.
Collapse
|
26
|
Nemali A, Vockert N, Berron D, Maas A, Bernal J, Yakupov R, Peters O, Gref D, Cosma N, Preis L, Priller J, Spruth E, Altenstein S, Lohse A, Fliessbach K, Kimmich O, Vogt I, Wiltfang J, Hansen N, Bartels C, Schott BH, Maier F, Meiberth D, Glanz W, Incesoy E, Butryn M, Buerger K, Janowitz D, Pernecky R, Rauchmann B, Burow L, Teipel S, Kilimann I, Göerß D, Dyrba M, Laske C, Munk M, Sanzenbacher C, Müller S, Spottke A, Roy N, Heneka M, Brosseron F, Roeske S, Dobisch L, Ramirez A, Ewers M, Dechent P, Scheffler K, Kleineidam L, Wolfsgruber S, Wagner M, Jessen F, Duzel E, Ziegler G. Gaussian Process-based prediction of memory performance and biomarker status in ageing and Alzheimer's disease-A systematic model evaluation. Med Image Anal 2023; 90:102913. [PMID: 37660483 DOI: 10.1016/j.media.2023.102913] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 06/28/2023] [Accepted: 07/25/2023] [Indexed: 09/05/2023]
Abstract
Neuroimaging markers based on Magnetic Resonance Imaging (MRI) combined with various other measures (such as genetic covariates, biomarkers, vascular risk factors, neuropsychological tests etc.) might provide useful predictions of clinical outcomes during the progression towards Alzheimer's disease (AD). The use of multiple features in predictive frameworks for clinical outcomes has become increasingly prevalent in AD research. However, many studies do not focus on systematically and accurately evaluating combinations of multiple input features. Hence, the aim of the present work is to explore and assess optimal combinations of various features for MR-based prediction of (1) cognitive status and (2) biomarker positivity with a multi-kernel learning Gaussian process framework. The explored features and parameters included (A) combinations of brain tissues, modulation, smoothing, and image resolution; (B) incorporating demographics & clinical covariates; (C) the impact of the size of the training data set; (D) the influence of dimensionality reduction and the choice of kernel types. The approach was tested in a large German cohort including 959 subjects from the multicentric longitudinal study of cognitive impairment and dementia (DELCODE). Our evaluation suggests the best prediction of memory performance was obtained for a combination of neuroimaging markers, demographics, genetic information (ApoE4) and CSF biomarkers explaining 57% of outcome variance in out-of-sample predictions. The highest performance for Aβ42/40 status classification was achieved for a combination of demographics, ApoE4, and a memory score while usage of structural MRI further improved the classification of individual patient's pTau status.
Collapse
Affiliation(s)
- A Nemali
- Institute of Cognitive Neurology and Dementia Research, Otto-von-Guericke University Magdeburg, Germany; German Center for Neurodegenerative Diseases (DZNE), Magdeburg, Germany.
| | - N Vockert
- German Center for Neurodegenerative Diseases (DZNE), Magdeburg, Germany
| | - D Berron
- German Center for Neurodegenerative Diseases (DZNE), Magdeburg, Germany
| | - A Maas
- German Center for Neurodegenerative Diseases (DZNE), Magdeburg, Germany
| | - J Bernal
- German Center for Neurodegenerative Diseases (DZNE), Magdeburg, Germany
| | - R Yakupov
- German Center for Neurodegenerative Diseases (DZNE), Magdeburg, Germany
| | - O Peters
- German Center for Neurodegenerative Diseases (DZNE), Berlin, Germany; Charité-Universitätsmedizin Berlin, Campus Benjamin Franklin, Department of Psychiatry, Hindenburgdamm 30, 12203, Berlin, Germany
| | - D Gref
- Charité-Universitätsmedizin Berlin, Campus Benjamin Franklin, Department of Psychiatry, Hindenburgdamm 30, 12203, Berlin, Germany
| | - N Cosma
- Charité-Universitätsmedizin Berlin, Campus Benjamin Franklin, Department of Psychiatry, Hindenburgdamm 30, 12203, Berlin, Germany
| | - L Preis
- Charité-Universitätsmedizin Berlin, Campus Benjamin Franklin, Department of Psychiatry, Hindenburgdamm 30, 12203, Berlin, Germany
| | - J Priller
- German Center for Neurodegenerative Diseases (DZNE), Berlin, Germany; Department of Psychiatry and Psychotherapy, Charité, Charitéplatz 1, 10117 Berlin, Germany; School of Medicine, Technical University of Munich; Department of Psychiatry and Psychotherapy, Munich, Germany; University of Edinburgh and UK DRI, Edinburgh, UK
| | - E Spruth
- German Center for Neurodegenerative Diseases (DZNE), Berlin, Germany; Department of Psychiatry and Psychotherapy, Charité, Charitéplatz 1, 10117 Berlin, Germany
| | - S Altenstein
- German Center for Neurodegenerative Diseases (DZNE), Berlin, Germany; Department of Psychiatry and Psychotherapy, Charité, Charitéplatz 1, 10117 Berlin, Germany
| | - A Lohse
- Department of Psychiatry and Psychotherapy, Charité, Charitéplatz 1, 10117 Berlin, Germany
| | - K Fliessbach
- German Center for Neurodegenerative Diseases (DZNE), Bonn, Venusberg-Campus 1, 53127 Bonn, Germany; University of Bonn Medical Center, Department of Neurodegenerative Disease and Geriatric Psychiatry/Psychiatry, Venusberg-Campus 1, 53127 Bonn, Germany
| | - O Kimmich
- German Center for Neurodegenerative Diseases (DZNE), Bonn, Venusberg-Campus 1, 53127 Bonn, Germany
| | - I Vogt
- German Center for Neurodegenerative Diseases (DZNE), Bonn, Venusberg-Campus 1, 53127 Bonn, Germany
| | - J Wiltfang
- German Center for Neurodegenerative Diseases (DZNE), Goettingen, Germany; Department of Psychiatry and Psychotherapy, University Medical Center Goettingen, University of Goettingen, Von-Siebold-Str. 5, 37075 Goettingen, Germany; Neurosciences and Signaling Group, Institute of Biomedicine (iBiMED), Department of Medical Sciences, University of Aveiro, Aveiro, Portugal
| | - N Hansen
- Department of Psychiatry and Psychotherapy, University Medical Center Goettingen, University of Goettingen, Von-Siebold-Str. 5, 37075 Goettingen, Germany
| | - C Bartels
- Department of Psychiatry and Psychotherapy, University Medical Center Goettingen, University of Goettingen, Von-Siebold-Str. 5, 37075 Goettingen, Germany
| | - B H Schott
- Leibniz Institute for Neurobiology, Magdeburg, Germany; German Center for Neurodegenerative Diseases (DZNE), Goettingen, Germany; Department of Psychiatry and Psychotherapy, University Medical Center Goettingen, University of Goettingen, Von-Siebold-Str. 5, 37075 Goettingen, Germany
| | - F Maier
- Department of Psychiatry, University of Cologne, Medical Faculty, Kerpener Strasse 62, 50924 Cologne, Germany
| | - D Meiberth
- Department of Psychiatry, University of Cologne, Medical Faculty, Kerpener Strasse 62, 50924 Cologne, Germany
| | - W Glanz
- Institute of Cognitive Neurology and Dementia Research, Otto-von-Guericke University Magdeburg, Germany
| | - E Incesoy
- Institute of Cognitive Neurology and Dementia Research, Otto-von-Guericke University Magdeburg, Germany; German Center for Neurodegenerative Diseases (DZNE), Magdeburg, Germany
| | - M Butryn
- German Center for Neurodegenerative Diseases (DZNE), Magdeburg, Germany
| | - K Buerger
- German Center for Neurodegenerative Diseases (DZNE, Munich), Feodor-Lynen-Strasse 17, 81377 Munich, Germany; Institute for Stroke and Dementia Research (ISD), University Hospital, LMU Munich, Feodor-Lynen-Strasse 17, 81377 Munich, Germany
| | - D Janowitz
- Institute for Stroke and Dementia Research (ISD), University Hospital, LMU Munich, Feodor-Lynen-Strasse 17, 81377 Munich, Germany
| | - R Pernecky
- German Center for Neurodegenerative Diseases (DZNE, Munich), Feodor-Lynen-Strasse 17, 81377 Munich, Germany; Department of Psychiatry and Psychotherapy, University Hospital, LMU Munich, Munich, Germany; Munich Cluster for Systems Neurology (SyNergy) Munich, Munich, Germany; Ageing Epidemiology Research Unit (AGE), School of Public Health, Imperial College London, London, UK
| | - B Rauchmann
- Department of Psychiatry and Psychotherapy, University Hospital, LMU Munich, Munich, Germany
| | - L Burow
- Department of Psychiatry and Psychotherapy, University Hospital, LMU Munich, Munich, Germany
| | - S Teipel
- German Center for Neurodegenerative Diseases (DZNE), Rostock, Germany; Department of Psychosomatic Medicine, Rostock University Medical Center, Gehlsheimer Str. 20, 18147 Rostock, Germany
| | - I Kilimann
- German Center for Neurodegenerative Diseases (DZNE), Rostock, Germany; Department of Psychosomatic Medicine, Rostock University Medical Center, Gehlsheimer Str. 20, 18147 Rostock, Germany
| | - D Göerß
- Department of Psychosomatic Medicine, Rostock University Medical Center, Gehlsheimer Str. 20, 18147 Rostock, Germany
| | - M Dyrba
- German Center for Neurodegenerative Diseases (DZNE), Rostock, Germany
| | - C Laske
- German Center for Neurodegenerative Diseases (DZNE), Tübingen, Germany; Section for Dementia Research, Hertie Institute for Clinical Brain Research and Department of Psychiatry and Psychotherapy, University of Tübingen, Tübingen, Germany
| | - M Munk
- German Center for Neurodegenerative Diseases (DZNE), Tübingen, Germany; Department of Psychiatry and Psychotherapy, University of Tuebingen, Tuebingen, Germany
| | - C Sanzenbacher
- German Center for Neurodegenerative Diseases (DZNE), Tübingen, Germany
| | - S Müller
- Department of Psychiatry and Psychotherapy, University of Tuebingen, Tuebingen, Germany
| | - A Spottke
- German Center for Neurodegenerative Diseases (DZNE), Bonn, Venusberg-Campus 1, 53127 Bonn, Germany; Department of Neurology, University of Bonn, Venusberg-Campus 1, 53127 Bonn, Germany
| | - N Roy
- German Center for Neurodegenerative Diseases (DZNE), Bonn, Venusberg-Campus 1, 53127 Bonn, Germany
| | - M Heneka
- German Center for Neurodegenerative Diseases (DZNE), Bonn, Venusberg-Campus 1, 53127 Bonn, Germany; Department of Psychiatry and Psychotherapy, University of Bonn, Venusberg-Campus 1, 53127 Bonn, Germany
| | - F Brosseron
- German Center for Neurodegenerative Diseases (DZNE), Bonn, Venusberg-Campus 1, 53127 Bonn, Germany
| | - S Roeske
- German Center for Neurodegenerative Diseases (DZNE), Bonn, Venusberg-Campus 1, 53127 Bonn, Germany
| | - L Dobisch
- German Center for Neurodegenerative Diseases (DZNE), Magdeburg, Germany
| | - A Ramirez
- German Center for Neurodegenerative Diseases (DZNE), Bonn, Venusberg-Campus 1, 53127 Bonn, Germany; Department of Neurology, University of Bonn, Venusberg-Campus 1, 53127 Bonn, Germany; Excellence Cluster on Cellular Stress Responses in Aging-Associated Diseases (CECAD), University of Cologne, Joseph-Stelzmann-Strasse 26, 50931 Köln, Germany; Division of Neurogenetics and Molecular Psychiatry, Department of Psychiatry and Psychotherapy, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany; Department of Psychiatry & Glenn Biggs Institute for Alzheimer's and Neurodegenerative Diseases, San Antonio, TX, USA
| | - M Ewers
- German Center for Neurodegenerative Diseases (DZNE, Munich), Feodor-Lynen-Strasse 17, 81377 Munich, Germany; Institute for Stroke and Dementia Research (ISD), University Hospital, LMU Munich, Feodor-Lynen-Strasse 17, 81377 Munich, Germany
| | - P Dechent
- MR-Research in Neurosciences, Department of Cognitive Neurology, Georg-August-University Goettingen, Germany
| | - K Scheffler
- Department for Biomedical Magnetic Resonance, University of Tübingen, 72076 Tübingen, Germany
| | - L Kleineidam
- University of Bonn Medical Center, Department of Neurodegenerative Disease and Geriatric Psychiatry/Psychiatry, Venusberg-Campus 1, 53127 Bonn, Germany
| | - S Wolfsgruber
- German Center for Neurodegenerative Diseases (DZNE), Bonn, Venusberg-Campus 1, 53127 Bonn, Germany; University of Bonn Medical Center, Department of Neurodegenerative Disease and Geriatric Psychiatry/Psychiatry, Venusberg-Campus 1, 53127 Bonn, Germany
| | - M Wagner
- German Center for Neurodegenerative Diseases (DZNE), Bonn, Venusberg-Campus 1, 53127 Bonn, Germany; University of Bonn Medical Center, Department of Neurodegenerative Disease and Geriatric Psychiatry/Psychiatry, Venusberg-Campus 1, 53127 Bonn, Germany
| | - F Jessen
- German Center for Neurodegenerative Diseases (DZNE), Bonn, Venusberg-Campus 1, 53127 Bonn, Germany; Department of Psychiatry, University of Cologne, Medical Faculty, Kerpener Strasse 62, 50924 Cologne, Germany; Excellence Cluster on Cellular Stress Responses in Aging-Associated Diseases (CECAD), University of Cologne, Joseph-Stelzmann-Strasse 26, 50931 Köln, Germany
| | - E Duzel
- Institute of Cognitive Neurology and Dementia Research, Otto-von-Guericke University Magdeburg, Germany; German Center for Neurodegenerative Diseases (DZNE), Magdeburg, Germany
| | - G Ziegler
- Institute of Cognitive Neurology and Dementia Research, Otto-von-Guericke University Magdeburg, Germany; German Center for Neurodegenerative Diseases (DZNE), Magdeburg, Germany
| |
Collapse
|
27
|
Płotka S, Grzeszczyk MK, Brawura-Biskupski-Samaha R, Gutaj P, Lipa M, Trzciński T, Išgum I, Sánchez CI, Sitek A. BabyNet++: Fetal birth weight prediction using biometry multimodal data acquired less than 24 hours before delivery. Comput Biol Med 2023; 167:107602. [PMID: 37925906 DOI: 10.1016/j.compbiomed.2023.107602] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2023] [Revised: 09/12/2023] [Accepted: 10/17/2023] [Indexed: 11/07/2023]
Abstract
Accurate prediction of fetal weight at birth is essential for effective perinatal care, particularly in the context of antenatal management, which involves determining the timing and mode of delivery. The current standard of care involves performing a prenatal ultrasound 24 hours prior to delivery. However, this task presents challenges as it requires acquiring high-quality images, which becomes difficult during advanced pregnancy due to the lack of amniotic fluid. In this paper, we present a novel method that automatically predicts fetal birth weight by using fetal ultrasound video scans and clinical data. Our proposed method is based on a Transformer-based approach that combines a Residual Transformer Module with a Dynamic Affine Feature Map Transform. This method leverages tabular clinical data to evaluate 2D+t spatio-temporal features in fetal ultrasound video scans. Development and evaluation were carried out on a clinical set comprising 582 2D fetal ultrasound videos and clinical records of pregnancies from 194 patients performed less than 24 hours before delivery. Our results show that our method outperforms several state-of-the-art automatic methods and estimates fetal birth weight with an accuracy comparable to human experts. Hence, automatic measurements obtained by our method can reduce the risk of errors inherent in manual measurements. Observer studies suggest that our approach may be used as an aid for less experienced clinicians to predict fetal birth weight before delivery, optimizing perinatal care regardless of the available expertise.
Collapse
Affiliation(s)
- Szymon Płotka
- Sano Centre for Computational Medicine, Cracow, Poland; Informatics Institute, University of Amsterdam, Amsterdam, The Netherlands; Department of Biomedical Engineering and Physics, Amsterdam University Medical Center, Amsterdam, The Netherlands.
| | | | | | - Paweł Gutaj
- Department of Reproduction, Poznan University of Medical Sciences, Poznan, Poznan, Poland
| | - Michał Lipa
- First Department of Obstetrics and Gynecology, Medical University of Warsaw, Warsaw, Poland
| | - Tomasz Trzciński
- Institute of Computer Science, Warsaw University of Technology, Warsaw, Poland
| | - Ivana Išgum
- Informatics Institute, University of Amsterdam, Amsterdam, The Netherlands; Department of Biomedical Engineering and Physics, Amsterdam University Medical Center, Amsterdam, The Netherlands; Department of Radiology and Nuclear Medicine, Amsterdam University Medical Center, location University of Amsterdam, Amsterdam, The Netherlands
| | - Clara I Sánchez
- Informatics Institute, University of Amsterdam, Amsterdam, The Netherlands; Department of Biomedical Engineering and Physics, Amsterdam University Medical Center, Amsterdam, The Netherlands
| | - Arkadiusz Sitek
- Center for Advanced Medical Computing and Simulation, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
28
|
Chai H, Yin Z, Ding Y, Liu L, Fang B, Liao Q. A Model-Agnostic Approach to Mitigate Gradient Interference for Multi-Task Learning. IEEE TRANSACTIONS ON CYBERNETICS 2023; 53:7810-7823. [PMID: 37015580 DOI: 10.1109/tcyb.2022.3223377] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Multitask learning (MTL) is a powerful technique for jointly learning multiple tasks. However, it is difficult to achieve a tradeoff between tasks during iterative training, as some tasks may compete with each other. Existing methods manually design specific network models to mitigate task conflicts, but they require considerable manual effort and prior knowledge about task relationships to tune the model so as to obtain the best performance for each task. Moreover, few works have offered formal descriptions of task conflicts and theoretical explanations for the cause of task conflict problems. In this article, we provide a formal description of task conflicts that are caused by the gradient interference problem of tasks. To alleviate this issue, we propose a novel model-agnostic approach to mitigate gradient interference (MAMG) by designing a gradient clipping rule that directly modifies the interfering components on the gradient interfering direction. Specifically, MAMG is model-agnostic and thus it can be applied to a large number of multitask models. We also theoretically prove the convergence of MAMG and its superiority to existing MTL methods. We evaluate our method on a variety of real-world large datasets, and extensive experimental results confirm that MAMG can outperform some state-of-the-art algorithms on different types of tasks and can be easily applied to various methods.
Collapse
|
29
|
Hong Y, Cornea E, Girault JB, Bagonis M, Foster M, Kim SH, Prieto JC, Chen H, Gao W, Styner MA, Gilmore JH. Structural and functional connectome relationships in early childhood. Dev Cogn Neurosci 2023; 64:101314. [PMID: 37898019 PMCID: PMC10630618 DOI: 10.1016/j.dcn.2023.101314] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 09/27/2023] [Accepted: 10/12/2023] [Indexed: 10/30/2023] Open
Abstract
There is strong evidence that the functional connectome is highly related to the white matter connectome in older children and adults, though little is known about structure-function relationships in early childhood. We investigated the development of cortical structure-function coupling in children longitudinally scanned at 1, 2, 4, and 6 years of age (N = 360) and in a comparison sample of adults (N = 89). We also applied a novel graph convolutional neural network-based deep learning model with a new loss function to better capture inter-subject heterogeneity and predict an individual's functional connectivity from the corresponding structural connectivity. We found regional patterns of structure-function coupling in early childhood that were consistent with adult patterns. In addition, our deep learning model improved the prediction of individual functional connectivity from its structural counterpart compared to existing models.
Collapse
Affiliation(s)
- Yoonmi Hong
- Department of Psychiatry, University of North Carolina at Chapel Hill, United States of America.
| | - Emil Cornea
- Department of Psychiatry, University of North Carolina at Chapel Hill, United States of America
| | - Jessica B Girault
- Department of Psychiatry, University of North Carolina at Chapel Hill, United States of America; Carolina Institute for Developmental Disabilities, University of North Carolina at Chapel Hill, United States of America
| | - Maria Bagonis
- Department of Psychiatry, University of North Carolina at Chapel Hill, United States of America
| | - Mark Foster
- Department of Psychiatry, University of North Carolina at Chapel Hill, United States of America
| | - Sun Hyung Kim
- Department of Psychiatry, University of North Carolina at Chapel Hill, United States of America
| | - Juan Carlos Prieto
- Department of Psychiatry, University of North Carolina at Chapel Hill, United States of America
| | - Haitao Chen
- Biomedical Imaging Research Institute (BIRI), Department of Biomedical Sciences and Imaging, Cedars-Sinai Medical Center, United States of America
| | - Wei Gao
- Biomedical Imaging Research Institute (BIRI), Department of Biomedical Sciences and Imaging, Cedars-Sinai Medical Center, United States of America
| | - Martin A Styner
- Department of Psychiatry, University of North Carolina at Chapel Hill, United States of America; Department of Computer Science, University of North Carolina at Chapel Hill, United States of America
| | - John H Gilmore
- Department of Psychiatry, University of North Carolina at Chapel Hill, United States of America
| |
Collapse
|
30
|
Liu M, Zhang J, Wang Y, Zhou Y, Xie F, Guo Q, Shi F, Zhang H, Wang Q, Shen D. A common spectrum underlying brain disorders across lifespan revealed by deep learning on brain networks. iScience 2023; 26:108244. [PMID: 38026184 PMCID: PMC10651682 DOI: 10.1016/j.isci.2023.108244] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Revised: 09/26/2023] [Accepted: 10/16/2023] [Indexed: 12/01/2023] Open
Abstract
Brain disorders in the early and late life of humans potentially share pathological alterations in brain functions. However, the key neuroimaging evidence remains unrevealed for elucidating such commonness and the relationships among these disorders. To explore this puzzle, we build a restricted single-branch deep learning model, using multi-site functional magnetic resonance imaging data (N = 4,410, 6 sites), for classifying 5 different early- and late-life brain disorders from healthy controls (cognitively unimpaired). Our model achieves 62.6 ± 1.9% overall classification accuracy and thus supports us in detecting a set of commonly affected functional subnetworks, including default mode, executive control, visual, and limbic networks. In the deep-layer representation of data, we observe young and aging patients with disorders are continuously distributed, which is in line with the clinical concept of the "spectrum of disorders." The relationships among brain disorders from the revealed spectrum promote the understanding of disorder comorbidities and time associations in the lifespan.
Collapse
Affiliation(s)
- Mianxin Liu
- School of Biomedical Engineering, State Key Laboratory of Advanced Medical Materials and Devices, ShanghaiTech University, Shanghai 201210, China
- Shanghai Artificial Intelligence Laboratory, Shanghai 200232, China
| | - Jingyang Zhang
- School of Biomedical Engineering, State Key Laboratory of Advanced Medical Materials and Devices, ShanghaiTech University, Shanghai 201210, China
| | - Yao Wang
- Department of Radiology, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200001, China
| | - Yan Zhou
- Department of Radiology, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200001, China
| | - Fang Xie
- PET Center, Huashan Hospital, Fudan University, Shanghai 200040, China
| | - Qihao Guo
- Department of Gerontology, Shanghai Jiao Tong University Affiliated Sixth People’s Hospital, Shanghai 200233, China
| | - Feng Shi
- Department of Research and Development, Shanghai United Imaging Intelligence Co., Ltd, Shanghai 200232, China
| | - Han Zhang
- School of Biomedical Engineering, State Key Laboratory of Advanced Medical Materials and Devices, ShanghaiTech University, Shanghai 201210, China
| | - Qian Wang
- School of Biomedical Engineering, State Key Laboratory of Advanced Medical Materials and Devices, ShanghaiTech University, Shanghai 201210, China
| | - Dinggang Shen
- School of Biomedical Engineering, State Key Laboratory of Advanced Medical Materials and Devices, ShanghaiTech University, Shanghai 201210, China
- Department of Research and Development, Shanghai United Imaging Intelligence Co., Ltd, Shanghai 200232, China
- Shanghai Clinical Research and Trial Center, Shanghai 201210, China
| |
Collapse
|
31
|
Shanmugavadivel K, Sathishkumar VE, Cho J, Subramanian M. Advancements in computer-assisted diagnosis of Alzheimer's disease: A comprehensive survey of neuroimaging methods and AI techniques for early detection. Ageing Res Rev 2023; 91:102072. [PMID: 37709055 DOI: 10.1016/j.arr.2023.102072] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Revised: 09/05/2023] [Accepted: 09/10/2023] [Indexed: 09/16/2023]
Abstract
Alzheimer's Disease (AD) is a brain disorder that causes the brain to shrink and eventually causes brain cells to die. This neurological condition progressively hampers cognitive and memory functions, along with the ability to carry out fundamental tasks over time. From the symptoms it is very difficult to detect during its early stage. It has become necessary to develop a computer assisted diagnostic models for the early AD detection. This survey work, discussed about a review of 110 published AD detection methods and techniques from the year 2011 to till-date. This study lies in its comprehensive exploration of AD detection methods using a range of artificial intelligence (AI) techniques and neuroimaging modalities. By collecting and analysing 50 papers related to AD diagnosis datasets, the study provides a comprehensive understanding of the diversity of input types, subjects, and classes used in AD research. Summarizing 60 papers on methodologies gives researchers a succinct overview of various approaches that contribute to enhancing detection accuracy. From the review, data are acquired and pre-processed form multiple modalities of neuroimaging. This paper mainly focused on review of different datasets used, various feature extraction methods, parameters used in neuro images. To diagnosis the Alzheimer's disease, the existing methods utilized three most common artificial intelligence techniques such as machine learning, deep learning, and transfer learning. We conclude this survey work by providing future perspectives for AD diagnosis at early stage.
Collapse
Affiliation(s)
| | - V E Sathishkumar
- Department of Software Engineering, Jeonbuk National University, Jeonju-si, Jeollabuk-do 54896, Republic of Korea
| | - Jaehyuk Cho
- Department of Software Engineering, Jeonbuk National University, Jeonju-si, Jeollabuk-do 54896, Republic of Korea.
| | | |
Collapse
|
32
|
Qiang YR, Zhang SW, Li JN, Li Y, Zhou QY. Diagnosis of Alzheimer's disease by joining dual attention CNN and MLP based on structural MRIs, clinical and genetic data. Artif Intell Med 2023; 145:102678. [PMID: 37925204 DOI: 10.1016/j.artmed.2023.102678] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Revised: 07/12/2023] [Accepted: 10/03/2023] [Indexed: 11/06/2023]
Abstract
Alzheimer's disease (AD) is an irreversible central nervous degenerative disease, while mild cognitive impairment (MCI) is a precursor state of AD. Accurate early diagnosis of AD is conducive to the prevention and early intervention treatment of AD. Although some computational methods have been developed for AD diagnosis, most employ only neuroimaging, ignoring other data (e.g., genetic, clinical) that may have potential disease information. In addition, the results of some methods lack interpretability. In this work, we proposed a novel method (called DANMLP) of joining dual attention convolutional neural network (CNN) and multilayer perceptron (MLP) for computer-aided AD diagnosis by integrating multi-modality data of the structural magnetic resonance imaging (sMRI), clinical data (i.e., demographics, neuropsychology), and APOE genetic data. Our DANMLP consists of four primary components: (1) the Patch-CNN for extracting the image characteristics from each local patch, (2) the position self-attention block for capturing the dependencies between features within a patch, (3) the channel self-attention block for capturing dependencies of inter-patch features, (4) two MLP networks for extracting the clinical features and outputting the AD classification results, respectively. Compared with other state-of-the-art methods in the 5CV test, DANMLP achieves 93% and 82.4% classification accuracy for the AD vs. MCI and MCI vs. NC tasks on the ADNI database, which is 0.2%∼15.2% and 3.4%∼26.8% higher than that of other five methods, respectively. The individualized visualization of focal areas can also help clinicians in the early diagnosis of AD. These results indicate that DANMLP can be effectively used for diagnosing AD and MCI patients.
Collapse
Affiliation(s)
- Yan-Rui Qiang
- Key Laboratory of Information Fusion Technology, School of Automation, Northwestern Polytechnical University, Xi'an, 710072, China
| | - Shao-Wu Zhang
- Key Laboratory of Information Fusion Technology, School of Automation, Northwestern Polytechnical University, Xi'an, 710072, China.
| | - Jia-Ni Li
- Key Laboratory of Information Fusion Technology, School of Automation, Northwestern Polytechnical University, Xi'an, 710072, China
| | - Yan Li
- Key Laboratory of Information Fusion Technology, School of Automation, Northwestern Polytechnical University, Xi'an, 710072, China
| | - Qin-Yi Zhou
- Key Laboratory of Information Fusion Technology, School of Automation, Northwestern Polytechnical University, Xi'an, 710072, China
| |
Collapse
|
33
|
Dai Y, Yu-Chun H, Fernandes BS, Zhang K, Xiaoyang L, Enduru N, Liu A, Manuel AM, Jiang X, Zhao Z. Disentangling accelerated cognitive decline from the normal aging process and unraveling its genetic components: A neuroimaging-based deep learning approach. RESEARCH SQUARE 2023:rs.3.rs-3328861. [PMID: 37720047 PMCID: PMC10503860 DOI: 10.21203/rs.3.rs-3328861/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/19/2023]
Abstract
Background The progressive cognitive decline that is an integral component of AD unfolds in tandem with the natural aging process. Neuroimaging features have demonstrated the capacity to distinguish cognitive decline changes stemming from typical brain aging and Alzheimer's disease between different chronological points. Methods We developed a deep-learning framework based on dual-loss Siamese ResNet network to extract fine-grained information from the longitudinal structural magnetic resonance imaging (MRI) data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) study. We then conducted genome-wide association studies (GWAS) and post-GWAS analyses to reveal the genetic basis of AD-related accelerated cognitive decline. Results We used our model to process data from 1,313 individuals, training it on 414 cognitively normal people and predicting cognitive assessment for all participants. In our analysis of accelerated cognitive decline GWAS, we identified two genome-wide significant loci: APOE locus (chromosome 19 p13.32) and rs144614292 (chromosome 11 p15.1). Variant rs144614292 (G>T) has not been reported in previous AD GWA studies. It is within the intronic region of NELL1, which is expressed in neuron and plays a role in controlling cell growth and differentiation. In addition, MUC7 and PROL1/OPRPNon chromosome 4 were significant at the gene level. The cell-type-specific enrichment analysis and functional enrichment of GWAS signals highlighted the microglia and immune-response pathways. Furthermore, we found that the cognitive decline slope GWAS was positively correlated with previous AD GWAS. Conclusion Our deep learning model was demonstrated effective on extracting relevant neuroimaging features and predicting individual cognitive decline. We reported a novel variant (rs144614292) within the NELL1 gene. Our approach has the potential to disentangle accelerated cognitive decline from the normal aging process and to determine its related genetic factors, leveraging opportunities for early intervention.
Collapse
Affiliation(s)
- Yulin Dai
- The University of Texas Health Science Center at Houston
| | - Hsu Yu-Chun
- The University of Texas Health Science Center at Houston
| | | | - Kai Zhang
- The University of Texas Health Science Center at Houston
| | - Li Xiaoyang
- The University of Texas Health Science Center at Houston
| | - Nitesh Enduru
- The University of Texas Health Science Center at Houston
| | - Andi Liu
- The University of Texas Health Science Center at Houston
| | | | - Xiaoqian Jiang
- The University of Texas Health Science Center at Houston
| | - Zhongming Zhao
- The University of Texas Health Science Center at Houston
| |
Collapse
|
34
|
Cao G, Zhang M, Wang Y, Zhang J, Han Y, Xu X, Huang J, Kang G. End-to-end automatic pathology localization for Alzheimer's disease diagnosis using structural MRI. Comput Biol Med 2023; 163:107110. [PMID: 37321102 DOI: 10.1016/j.compbiomed.2023.107110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Revised: 05/18/2023] [Accepted: 05/30/2023] [Indexed: 06/17/2023]
Abstract
Structural magnetic resonance imaging (sMRI) is an essential part of the clinical assessment of patients at risk of Alzheimer dementia. One key challenge in sMRI-based computer-aided dementia diagnosis is to localize local pathological regions for discriminative feature learning. Existing solutions predominantly depend on generating saliency maps for pathology localization and handle the localization task independently of the dementia diagnosis task, leading to a complex multi-stage training pipeline that is hard to optimize with weakly-supervised sMRI-level annotations. In this work, we aim to simplify the pathology localization task and construct an end-to-end automatic localization framework (AutoLoc) for Alzheimer's disease diagnosis. To this end, we first present an efficient pathology localization paradigm that directly predicts the coordinate of the most disease-related region in each sMRI slice. Then, we approximate the non-differentiable patch-cropping operation with the bilinear interpolation technique, which eliminates the barrier to gradient backpropagation and thus enables the joint optimization of localization and diagnosis tasks. Extensive experiments on commonly used ADNI and AIBL datasets demonstrate the superiority of our method. Especially, we achieve 93.38% and 81.12% accuracy on Alzheimer's disease classification and mild cognitive impairment conversion prediction tasks, respectively. Several important brain regions, such as rostral hippocampus and globus pallidus, are identified to be highly associated with Alzheimer's disease.
Collapse
Affiliation(s)
- Gongpeng Cao
- Key Laboratory of Universal Wireless Communications, Ministry of Education, Beijing University of Posts and Telecommunications, No. 10 Xitucheng Road, Haidian District, Beijing, 100876, China
| | - Manli Zhang
- Key Laboratory of Universal Wireless Communications, Ministry of Education, Beijing University of Posts and Telecommunications, No. 10 Xitucheng Road, Haidian District, Beijing, 100876, China
| | - Yiping Wang
- Key Laboratory of Universal Wireless Communications, Ministry of Education, Beijing University of Posts and Telecommunications, No. 10 Xitucheng Road, Haidian District, Beijing, 100876, China
| | - Jing Zhang
- Key Laboratory of Universal Wireless Communications, Ministry of Education, Beijing University of Posts and Telecommunications, No. 10 Xitucheng Road, Haidian District, Beijing, 100876, China
| | - Ying Han
- Department of Neurology, Xuanwu Hospital of Capital Medical University, No. 45 Changchun Street, Xicheng District, Beijing, 100053, China
| | - Xin Xu
- Department of Neurosurgery, Chinese PLA General Hospital, No. 28 Fuxing Road, Haidian District, Beijing, 100853, China
| | - Jinguo Huang
- Key Laboratory of Universal Wireless Communications, Ministry of Education, Beijing University of Posts and Telecommunications, No. 10 Xitucheng Road, Haidian District, Beijing, 100876, China.
| | - Guixia Kang
- Key Laboratory of Universal Wireless Communications, Ministry of Education, Beijing University of Posts and Telecommunications, No. 10 Xitucheng Road, Haidian District, Beijing, 100876, China.
| |
Collapse
|
35
|
Wen J, Li Y, Fang M, Zhu L, Feng DD, Li P. Fine-Grained and Multiple Classification for Alzheimer's Disease With Wavelet Convolution Unit Network. IEEE Trans Biomed Eng 2023; 70:2592-2603. [PMID: 37030751 DOI: 10.1109/tbme.2023.3256042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/19/2023]
Abstract
In this article, we propose a novel wavelet convolution unit for the image-oriented neural network to integrate wavelet analysis with a vanilla convolution operator to extract deep abstract features more efficiently. On one hand, in order to acquire non-local receptive fields and avoid information loss, we define a new convolution operation by composing a traditional convolution function and approximate and detailed representations after single-scale wavelet decomposition of source images. On the other hand, multi-scale wavelet decomposition is introduced to obtain more comprehensive multi-scale feature information. Then, we fuse all these cross-scale features to improve the problem of inaccurate localization of singular points. Given the novel wavelet convolution unit, we further design a network based on it for fine-grained Alzheimer's disease classifications (i.e., Alzheimer's disease, Normal controls, early mild cognitive impairment, late mild cognitive impairment). Up to now, only a few methods have studied one or several fine-grained classifications, and even fewer methods can achieve both fine-grained and multi-class classifications. We adopt the novel network and diffuse tensor images to achieve fine-grained classifications, which achieved state-of-the-art accuracy for all eight kinds of fine-grained classifications, up to 97.30%, 95.78%, 95.00%, 94.00%, 97.89%, 95.71%, 95.07%, 93.79%. In order to build a reference standard for Alzheimer's disease classifications, we actually implemented all twelve coarse-grained and fine-grained classifications. The results show that the proposed method achieves solidly high accuracy for them. Its classification ability greatly exceeds any kind of existing Alzheimer's disease classification method.
Collapse
|
36
|
Li K, Yang G, Chang S, Yao J, He C, Lu F, Wang X, Wang Z. Comprehensive assessment of the anterior segment in refraction corrected OCT based on multitask learning. BIOMEDICAL OPTICS EXPRESS 2023; 14:3968-3987. [PMID: 37799701 PMCID: PMC10549746 DOI: 10.1364/boe.493065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 06/28/2023] [Accepted: 06/28/2023] [Indexed: 10/07/2023]
Abstract
Anterior segment diseases are among the leading causes of irreversible blindness. However, a method capable of recognizing all important anterior segment structures for clinical diagnosis is lacking. By sharing the knowledge learned from each task, we proposed a fully automated multitask deep learning method that allows for simultaneous segmentation and quantification of all major anterior segment structures, including the iris, lens, cornea, as well as implantable collamer lens (ICL) and intraocular lens (IOL), and meanwhile for landmark detection of scleral spur and iris root in anterior segment OCT (AS-OCT) images. In addition, we proposed a refraction correction method to correct for the true geometry of the anterior segment distorted by light refraction during OCT imaging. 1251 AS-OCT images from 180 patients were collected and were used to train and test the model. Experiments demonstrated that our proposed network was superior to state-of-the-art segmentation and landmark detection methods, and close agreement was achieved between manually and automatically computed clinical parameters associated with anterior chamber, pupil, iris, ICL, and IOL. Finally, as an example, we demonstrated how our proposed method can be applied to facilitate the clinical evaluation of cataract surgery.
Collapse
Affiliation(s)
- Kaiwen Li
- School of Electronic Science and Engineering,
University of Electronic Science and Technology of China, Chengdu, Sichuan 610054, China
| | - Guangqian Yang
- School of Electronic Science and Engineering,
University of Electronic Science and Technology of China, Chengdu, Sichuan 610054, China
| | - Shuimiao Chang
- Department of Cataract, Shanxi Eye Hospital Affiliated to Shanxi Medical University, Taiyuan, Shanxi 030001, China
| | - Jinhan Yao
- Department of Cataract, Shanxi Eye Hospital Affiliated to Shanxi Medical University, Taiyuan, Shanxi 030001, China
| | - Chong He
- School of Medicine, University of Electronic Science and Technology of China, Chengdu, Sichuan 610054, China
| | - Fang Lu
- School of Medicine, University of Electronic Science and Technology of China, Chengdu, Sichuan 610054, China
| | - Xiaogang Wang
- Department of Cataract, Shanxi Eye Hospital Affiliated to Shanxi Medical University, Taiyuan, Shanxi 030001, China
| | - Zhao Wang
- School of Electronic Science and Engineering,
University of Electronic Science and Technology of China, Chengdu, Sichuan 610054, China
| |
Collapse
|
37
|
Zhang J, He X, Liu Y, Cai Q, Chen H, Qing L. Multi-modal cross-attention network for Alzheimer's disease diagnosis with multi-modality data. Comput Biol Med 2023; 162:107050. [PMID: 37269680 DOI: 10.1016/j.compbiomed.2023.107050] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2022] [Revised: 04/26/2023] [Accepted: 05/03/2023] [Indexed: 06/05/2023]
Abstract
Alzheimer's disease (AD) is a neurodegenerative disorder, the most common cause of dementia, so the accurate diagnosis of AD and its prodromal stage mild cognitive impairment (MCI) is significant. Recent studies have demonstrated that multiple neuroimaging and biological measures contain complementary information for diagnosis. Many existing multi-modal models based on deep learning simply concatenate each modality's features despite substantial differences in representation spaces. In this paper, we propose a novel multi-modal cross-attention AD diagnosis (MCAD) framework to learn the interaction between modalities for better playing their complementary roles for AD diagnosis with multi-modal data including structural magnetic resonance imaging (sMRI), fluorodeoxyglucose-positron emission tomography (FDG-PET) and cerebrospinal fluid (CSF) biomarkers. Specifically, the imaging and non-imaging representations are learned by the image encoder based on cascaded dilated convolutions and CSF encoder, respectively. Then, a multi-modal interaction module is introduced, which takes advantage of cross-modal attention to integrate imaging and non-imaging information and reinforce relationships between these modalities. Moreover, an extensive objective function is designed to reduce the discrepancy between modalities for effectively fusing the features of multi-modal data, which could further improve the diagnosis performance. We evaluate the effectiveness of our proposed method on the ADNI dataset, and the extensive experiments demonstrate that our MCAD achieves superior performance for multiple AD-related classification tasks, compared to several competing methods. Also, we investigate the importance of cross-attention and the contribution of each modality to the diagnostics performance. The experimental results demonstrate that combining multi-modality data via cross-attention is helpful for accurate AD diagnosis.
Collapse
Affiliation(s)
- Jin Zhang
- College of Electronics and Information Engineering, Sichuan University, Chengdu, Sichuan, 610065, China
| | - Xiaohai He
- College of Electronics and Information Engineering, Sichuan University, Chengdu, Sichuan, 610065, China.
| | - Yan Liu
- Department of Neurology, The Affiliated Hospital of Southwest Jiaotong University, The Third People's Hospital of Chengdu, Chengdu, Sichuan, 610031, China
| | - Qingyan Cai
- Department of Geriatric Medicine, The Fourth People's Hospital of Chengdu, Chengdu, Sichuan, 610036, China
| | - Honggang Chen
- College of Electronics and Information Engineering, Sichuan University, Chengdu, Sichuan, 610065, China
| | - Linbo Qing
- College of Electronics and Information Engineering, Sichuan University, Chengdu, Sichuan, 610065, China
| |
Collapse
|
38
|
Gao X, Cai H, Liu M. A Hybrid Multi-Scale Attention Convolution and Aging Transformer Network for Alzheimer's Disease Diagnosis. IEEE J Biomed Health Inform 2023; 27:3292-3301. [PMID: 37104100 DOI: 10.1109/jbhi.2023.3270937] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/28/2023]
Abstract
Deep neural networks have been successfully investigated in the computational analysis of structural magnetic resonance imaging (sMRI) data for the diagnosis of dementia, such as Alzheimer's disease (AD). The disease-related changes in sMRI may be different in local brain regions, which have variant structures but with some correlations. In addition, aging increases the risk of dementia. However, it is still challenging to capture the local variations and long-range correlations of different brain regions and make use of the age information for disease diagnosis. To address these problems, we propose a hybrid network with multi-scale attention convolution and aging transformer for AD diagnosis. First, to capture the local variations, a multi-scale attention convolution is proposed to learn the feature maps with multi-scale kernels, which are adaptively aggregated by an attention module. Then, to model the long-range correlations of brain regions, a pyramid non-local block is used on the high-level features to learn more powerful features. Finally, we propose an aging transformer subnetwork to embed the age information into image features and capture the dependencies between subjects at different ages. The proposed method can learn not only the subject-specific rich features but also the inter-subject age correlations in an end-to-end framework. Our method is evaluated with T1-weighted sMRI scans from a large cohort of subjects on the Alzheimer's Disease Neuroimaging Initiative (ADNI) database. Experimental results demonstrate that our method has achieved promising performance for AD-related diagnosis.
Collapse
|
39
|
李 昕, 李 振, 刘 毅, 苏 芮, 徐 永, 景 军, 尹 立. [Research on mild cognitive impairment diagnosis based on Bayesian optimized long-short-term neural network model]. SHENG WU YI XUE GONG CHENG XUE ZA ZHI = JOURNAL OF BIOMEDICAL ENGINEERING = SHENGWU YIXUE GONGCHENGXUE ZAZHI 2023; 40:450-457. [PMID: 37380383 PMCID: PMC10307618 DOI: 10.7507/1001-5515.202205005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Revised: 04/16/2023] [Indexed: 06/30/2023]
Abstract
The recurrent neural network architecture improves the processing ability of time-series data. However, issues such as exploding gradients and poor feature extraction limit its application in the automatic diagnosis of mild cognitive impairment (MCI). This paper proposed a research approach for building an MCI diagnostic model using a Bayesian-optimized bidirectional long short-term memory network (BO-BiLSTM) to address this problem. The diagnostic model was based on a Bayesian algorithm and combined prior distribution and posterior probability results to optimize the BO-BiLSTM network hyperparameters. It also used multiple feature quantities that fully reflected the cognitive state of the MCI brain, such as power spectral density, fuzzy entropy, and multifractal spectrum, as the input of the diagnostic model to achieve automatic MCI diagnosis. The results showed that the feature-fused Bayesian-optimized BiLSTM network model achieved an MCI diagnostic accuracy of 98.64% and effectively completed the diagnostic assessment of MCI. In conclusion, based on this optimization, the long short-term neural network model has achieved automatic diagnostic assessment of MCI, providing a new diagnostic model for intelligent diagnosis of MCI.
Collapse
Affiliation(s)
- 昕 李
- 燕山大学 电气工程学院 生物医学工程研究所(河北秦皇岛 066004)School of Electrical Engineering, Yanshan University, Qinhuangdao, Hebei 066004, P. R. China
- 河北省测试计量技术及仪器重点实验室(河北秦皇岛 066004)Measurement Technology and Instrumentation Key Lab of Hebei Province, Qinhuangdao, Hebei 066004, P. R. China
| | - 振阳 李
- 燕山大学 电气工程学院 生物医学工程研究所(河北秦皇岛 066004)School of Electrical Engineering, Yanshan University, Qinhuangdao, Hebei 066004, P. R. China
- 河北省测试计量技术及仪器重点实验室(河北秦皇岛 066004)Measurement Technology and Instrumentation Key Lab of Hebei Province, Qinhuangdao, Hebei 066004, P. R. China
| | - 毅 刘
- 燕山大学 电气工程学院 生物医学工程研究所(河北秦皇岛 066004)School of Electrical Engineering, Yanshan University, Qinhuangdao, Hebei 066004, P. R. China
- 河北省测试计量技术及仪器重点实验室(河北秦皇岛 066004)Measurement Technology and Instrumentation Key Lab of Hebei Province, Qinhuangdao, Hebei 066004, P. R. China
| | - 芮 苏
- 燕山大学 电气工程学院 生物医学工程研究所(河北秦皇岛 066004)School of Electrical Engineering, Yanshan University, Qinhuangdao, Hebei 066004, P. R. China
- 河北省测试计量技术及仪器重点实验室(河北秦皇岛 066004)Measurement Technology and Instrumentation Key Lab of Hebei Province, Qinhuangdao, Hebei 066004, P. R. China
| | - 永红 徐
- 燕山大学 电气工程学院 生物医学工程研究所(河北秦皇岛 066004)School of Electrical Engineering, Yanshan University, Qinhuangdao, Hebei 066004, P. R. China
- 河北省测试计量技术及仪器重点实验室(河北秦皇岛 066004)Measurement Technology and Instrumentation Key Lab of Hebei Province, Qinhuangdao, Hebei 066004, P. R. China
| | - 军 景
- 燕山大学 电气工程学院 生物医学工程研究所(河北秦皇岛 066004)School of Electrical Engineering, Yanshan University, Qinhuangdao, Hebei 066004, P. R. China
- 河北省测试计量技术及仪器重点实验室(河北秦皇岛 066004)Measurement Technology and Instrumentation Key Lab of Hebei Province, Qinhuangdao, Hebei 066004, P. R. China
| | - 立勇 尹
- 燕山大学 电气工程学院 生物医学工程研究所(河北秦皇岛 066004)School of Electrical Engineering, Yanshan University, Qinhuangdao, Hebei 066004, P. R. China
| |
Collapse
|
40
|
Eskofier BM, Klucken J. Predictive Models for Health Deterioration: Understanding Disease Pathways for Personalized Medicine. Annu Rev Biomed Eng 2023; 25:131-156. [PMID: 36854259 DOI: 10.1146/annurev-bioeng-110220-030247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/02/2023]
Abstract
Artificial intelligence (AI) and machine learning (ML) methods are currently widely employed in medicine and healthcare. A PubMed search returns more than 100,000 articles on these topics published between 2018 and 2022 alone. Notwithstanding several recent reviews in various subfields of AI and ML in medicine, we have yet to see a comprehensive review around the methods' use in longitudinal analysis and prediction of an individual patient's health status within a personalized disease pathway. This review seeks to fill that gap. After an overview of the AI and ML methods employed in this field and of specific medical applications of models of this type, the review discusses the strengths and limitations of current studies and looks ahead to future strands of research in this field. We aim to enable interested readers to gain a detailed impression of the research currently available and accordingly plan future work around predictive models for deterioration in health status.
Collapse
Affiliation(s)
- Bjoern M Eskofier
- Machine Learning and Data Analytics Lab, Department of Artificial Intelligence in Biomedical Engineering, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany;
| | - Jochen Klucken
- Digital Medicine Group, Luxembourg Centre for Systems Biomedicine, Université du Luxembourg, Belvaux, Luxembourg
- Digital Medicine Group, Department of Precision Health, Luxembourg Institute of Health, Strassen, Luxembourg
- Centre Hospitalier de Luxembourg, Luxembourg City, Luxembourg
| |
Collapse
|
41
|
Pallawi S, Singh DK. Study of Alzheimer’s disease brain impairment and methods for its early diagnosis: a comprehensive survey. INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL 2023; 12:7. [DOI: 10.1007/s13735-023-00271-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 02/10/2023] [Accepted: 02/15/2023] [Indexed: 01/03/2025]
|
42
|
Eslami M, Tabarestani S, Adjouadi M. A unique color-coded visualization system with multimodal information fusion and deep learning in a longitudinal study of Alzheimer's disease. Artif Intell Med 2023; 140:102543. [PMID: 37210151 PMCID: PMC10204620 DOI: 10.1016/j.artmed.2023.102543] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Revised: 03/28/2023] [Accepted: 04/02/2023] [Indexed: 05/22/2023]
Abstract
PURPOSE Automated diagnosis and prognosis of Alzheimer's Disease remain a challenging problem that machine learning (ML) techniques have attempted to resolve in the last decade. This study introduces a first-of-its-kind color-coded visualization mechanism driven by an integrated ML model to predict disease trajectory in a 2-year longitudinal study. The main aim of this study is to help capture visually in 2D and 3D renderings the diagnosis and prognosis of AD, therefore augmenting our understanding of the processes of multiclass classification and regression analysis. METHOD The proposed method, Machine Learning for Visualizing AD (ML4VisAD), is designed to predict disease progression through a visual output. This newly developed model takes baseline measurements as input to generate a color-coded visual image that reflects disease progression at different time points. The architecture of the network relies on convolutional neural networks. With 1123 subjects selected from the ADNI QT-PAD dataset, we use a 10-fold cross-validation process to evaluate the method. Multimodal inputs* include neuroimaging data (MRI, PET), neuropsychological test scores (excluding MMSE, CDR-SB, and ADAS to avoid bias), cerebrospinal fluid (CSF) biomarkers with measures of amyloid beta (ABETA), phosphorylated tau protein (PTAU), total tau protein (TAU), and risk factors that include age, gender, years of education, and ApoE4 gene. FINDINGS/RESULTS Based on subjective scores reached by three raters, the results showed an accuracy of 0.82 ± 0.03 for a 3-way classification and 0.68 ± 0.05 for a 5-way classification. The visual renderings were generated in 0.08 msec for a 23 × 23 output image and in 0.17 ms for a 45 × 45 output image. Through visualization, this study (1) demonstrates that the ML visual output augments the prospects for a more accurate diagnosis and (2) highlights why multiclass classification and regression analysis are incredibly challenging. An online survey was conducted to gauge this visualization platform's merits and obtain valuable feedback from users. All implementation codes are shared online on GitHub. CONCLUSION This approach makes it possible to visualize the many nuances that lead to a specific classification or prediction in the disease trajectory, all in context to multimodal measurements taken at baseline. This ML model can serve as a multiclass classification and prediction model while reinforcing the diagnosis and prognosis capabilities by including a visualization platform.
Collapse
Affiliation(s)
- Mohammad Eslami
- Harvard Ophthalmology AI lab, Schepens Eye Research Institute of Massachusetts Eye and Ear, Harvard Medical School, Boston, MA, USA; Center for Advanced Technology and Education, Florida International University, Miami, FL, United States.
| | - Solale Tabarestani
- Center for Advanced Technology and Education, Florida International University, Miami, FL, United States.
| | - Malek Adjouadi
- Center for Advanced Technology and Education, Florida International University, Miami, FL, United States.
| |
Collapse
|
43
|
Hoang GM, Kim UH, Kim JG. Vision transformers for the prediction of mild cognitive impairment to Alzheimer's disease progression using mid-sagittal sMRI. Front Aging Neurosci 2023; 15:1102869. [PMID: 37122374 PMCID: PMC10133493 DOI: 10.3389/fnagi.2023.1102869] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2022] [Accepted: 03/22/2023] [Indexed: 05/02/2023] Open
Abstract
Background Alzheimer's disease (AD) is one of the most common causes of neurodegenerative disease affecting over 50 million people worldwide. However, most AD diagnosis occurs in the moderate to late stage, which means that the optimal time for treatment has already passed. Mild cognitive impairment (MCI) is an intermediate state between cognitively normal people and AD patients. Therefore, the accurate prediction in the conversion process of MCI to AD may allow patients to start preventive intervention to slow the progression of the disease. Nowadays, neuroimaging techniques have been developed and are used to determine AD-related structural biomarkers. Deep learning approaches have rapidly become a key methodology applied to these techniques to find biomarkers. Methods In this study, we aimed to investigate an MCI-to-AD prediction method using Vision Transformers (ViT) to structural magnetic resonance images (sMRI). The Alzheimer's Disease Neuroimaging Initiative (ADNI) database containing 598 MCI subjects was used to predict MCI subjects' progression to AD. There are three main objectives in our study: (i) to propose an MRI-based Vision Transformers approach for MCI to AD progression classification, (ii) to evaluate the performance of different ViT architectures to obtain the most advisable one, and (iii) to visualize the brain region mostly affect the prediction of deep learning approach to MCI progression. Results Our method achieved state-of-the-art classification performance in terms of accuracy (83.27%), specificity (85.07%), and sensitivity (81.48%) compared with a set of conventional methods. Next, we visualized the brain regions that mostly contribute to the prediction of MCI progression for interpretability of the proposed model. The discriminative pathological locations include the thalamus, medial frontal, and occipital-corroborating the reliability of our model. Conclusion In conclusion, our methods provide an effective and accurate technique for the prediction of MCI conversion to AD. The results obtained in this study outperform previous reports using the ADNI collection, and it suggests that sMRI-based ViT could be efficiently applied with a considerable potential benefit for AD patient management. The brain regions mostly contributing to prediction, in conjunction with the identified anatomical features, will support the building of a robust solution for other neurodegenerative diseases in future.
Collapse
Affiliation(s)
- Gia Minh Hoang
- Department of Biomedical Science and Engineering, Gwangju Institute of Science and Technology, Gwangju, Republic of Korea
| | - Ue-Hwan Kim
- AI Graduate School, Gwangju Institute of Science and Technology, Gwangju, Republic of Korea
| | - Jae Gwan Kim
- Department of Biomedical Science and Engineering, Gwangju Institute of Science and Technology, Gwangju, Republic of Korea
| |
Collapse
|
44
|
El-Sappagh S, Alonso-Moral JM, Abuhmed T, Ali F, Bugarín-Diz A. Trustworthy artificial intelligence in Alzheimer’s disease: state of the art, opportunities, and challenges. Artif Intell Rev 2023. [DOI: 10.1007/s10462-023-10415-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/28/2023]
|
45
|
Lin GH, Lee SC, Yu YT, Huang CY. Machine learning-based brief version of the Caregiver-Teacher Report Form for preschoolers. RESEARCH IN DEVELOPMENTAL DISABILITIES 2023; 134:104437. [PMID: 36706597 DOI: 10.1016/j.ridd.2023.104437] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/06/2022] [Revised: 01/09/2023] [Accepted: 01/12/2023] [Indexed: 06/18/2023]
Abstract
BACKGROUND The Caregiver-Teacher Report Form of the Child Behavior Checklist for Ages 1½-5 (C-TRF) is a widely used checklist to identify emotional and behavioral problems in preschoolers. However, the 100-item C-TRF restricts its utility. AIMS This study aimed to develop a machine learning-based short-form of the C-TRF (C-TRF-ML). METHODS AND PROCEDURES Three steps were executed. First, we split the data into three datasets in a ratio of 3:1:1 for training, validation, and cross-validation, respectively. Second, we selected a shortened item set and trained a scoring algorithm using joint learning for classification and regression using the training dataset. Then, we evaluated the similarity of scores between the C-TRF-ML and the C-TRF by r-squared and weighted kappa values using the validation dataset. Third, we cross-validated the C-TRF-ML by calculating the r-squared and weighted kappa values using the cross-validation dataset. OUTCOMES AND RESULTS Data of 363 children were analyzed. Thirty-six items of the C-TRF were retained. The r-squared values of C-TRF-ML scores were 0.86-0.96 in the cross-validation dataset. Weighted kappa values of the syndrome/problem grading were 0.72-0.94 in the cross-validation dataset. CONCLUSIONS AND IMPLICATIONS The C-TRF-ML had about 60 % fewer items than the C-TRF but yielded comparable scores with the C-TRF.
Collapse
Affiliation(s)
- Gong-Hong Lin
- International Ph.D. Program in Gerontology and Long-Term Care, College of Nursing, Taipei Medical University, Taipei, Taiwan
| | - Shih-Chieh Lee
- Department of Occupational Therapy, College of Medicine, National Cheng Kung University, Tainan City, Taiwan; School of Occupational Therapy, College of Medicine, National Taiwan University, Taipei city, Taiwan; Institute of Long-Term Care, MacKay Medical College, New Taipei City, Taiwan
| | - Yen-Ting Yu
- Department of Occupational Therapy, College of Medicine, National Cheng Kung University, Tainan City, Taiwan; School of Occupational Therapy, College of Medicine, National Taiwan University, Taipei city, Taiwan
| | - Chien-Yu Huang
- School of Occupational Therapy, College of Medicine, National Taiwan University, Taipei city, Taiwan; Department of Physical Medicine and Rehabilitation, National Taiwan University Hospital, Taipei city, Taiwan.
| |
Collapse
|
46
|
Zhao Y, Wang X, Che T, Bao G, Li S. Multi-task deep learning for medical image computing and analysis: A review. Comput Biol Med 2023; 153:106496. [PMID: 36634599 DOI: 10.1016/j.compbiomed.2022.106496] [Citation(s) in RCA: 34] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Revised: 12/06/2022] [Accepted: 12/27/2022] [Indexed: 12/29/2022]
Abstract
The renaissance of deep learning has provided promising solutions to various tasks. While conventional deep learning models are constructed for a single specific task, multi-task deep learning (MTDL) that is capable to simultaneously accomplish at least two tasks has attracted research attention. MTDL is a joint learning paradigm that harnesses the inherent correlation of multiple related tasks to achieve reciprocal benefits in improving performance, enhancing generalizability, and reducing the overall computational cost. This review focuses on the advanced applications of MTDL for medical image computing and analysis. We first summarize four popular MTDL network architectures (i.e., cascaded, parallel, interacted, and hybrid). Then, we review the representative MTDL-based networks for eight application areas, including the brain, eye, chest, cardiac, abdomen, musculoskeletal, pathology, and other human body regions. While MTDL-based medical image processing has been flourishing and demonstrating outstanding performance in many tasks, in the meanwhile, there are performance gaps in some tasks, and accordingly we perceive the open challenges and the perspective trends. For instance, in the 2018 Ischemic Stroke Lesion Segmentation challenge, the reported top dice score of 0.51 and top recall of 0.55 achieved by the cascaded MTDL model indicate further research efforts in high demand to escalate the performance of current models.
Collapse
Affiliation(s)
- Yan Zhao
- Beijing Advanced Innovation Center for Biomedical Engineering, School of Biological Science and Medical Engineering, Beihang University, Beijing, 100083, China
| | - Xiuying Wang
- School of Computer Science, The University of Sydney, Sydney, NSW, 2008, Australia.
| | - Tongtong Che
- Beijing Advanced Innovation Center for Biomedical Engineering, School of Biological Science and Medical Engineering, Beihang University, Beijing, 100083, China
| | - Guoqing Bao
- School of Computer Science, The University of Sydney, Sydney, NSW, 2008, Australia
| | - Shuyu Li
- State Key Laboratory of Cognitive Neuroscience and Learning, Beijing Normal University, Beijing, 100875, China.
| |
Collapse
|
47
|
Liu Y, Li H, Luo T, Zhang C, Xiao Z, Wei Y, Gao Y, Shi F, Shan F, Shen D. Structural Attention Graph Neural Network for Diagnosis and Prediction of COVID-19 Severity. IEEE TRANSACTIONS ON MEDICAL IMAGING 2023; 42:557-567. [PMID: 36459600 DOI: 10.1109/tmi.2022.3226575] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
With rapid worldwide spread of Coronavirus Disease 2019 (COVID-19), jointly identifying severe COVID-19 cases from mild ones and predicting the conversion time (from mild to severe) is essential to optimize the workflow and reduce the clinician's workload. In this study, we propose a novel framework for COVID-19 diagnosis, termed as Structural Attention Graph Neural Network (SAGNN), which can combine the multi-source information including features extracted from chest CT, latent lung structural distribution, and non-imaging patient information to conduct diagnosis of COVID-19 severity and predict the conversion time from mild to severe. Specifically, we first construct a graph to incorporate structural information of the lung and adopt graph attention network to iteratively update representations of lung segments. To distinguish different infection degrees of left and right lungs, we further introduce a structural attention mechanism. Finally, we introduce demographic information and develop a multi-task learning framework to jointly perform both tasks of classification and regression. Experiments are conducted on a real dataset with 1687 chest CT scans, which includes 1328 mild cases and 359 severe cases. Experimental results show that our method achieves the best classification (e.g., 86.86% in terms of Area Under Curve) and regression (e.g., 0.58 in terms of Correlation Coefficient) performance, compared with other comparison methods.
Collapse
|
48
|
Lin CT, Ghosh S, Hinkley LB, Dale CL, Souza ACS, Sabes JH, Hess CP, Adams ME, Cheung SW, Nagarajan SS. Multi-tasking deep network for tinnitus classification and severity prediction from multimodal structural MR images. J Neural Eng 2023; 20. [PMID: 36595270 DOI: 10.1088/1741-2552/acab33] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Accepted: 12/13/2022] [Indexed: 12/15/2022]
Abstract
Objective:Subjective tinnitus is an auditory phantom perceptual disorder without an objective biomarker. Fast and efficient diagnostic tools will advance clinical practice by detecting or confirming the condition, tracking change in severity, and monitoring treatment response. Motivated by evidence of subtle anatomical, morphological, or functional information in magnetic resonance images of the brain, we examine data-driven machine learning methods for joint tinnitus classification (tinnitus or no tinnitus) and tinnitus severity prediction.Approach:We propose a deep multi-task multimodal framework for tinnitus classification and severity prediction using structural MRI (sMRI) data. To leverage complementary information multimodal neuroimaging data, we integrate two modalities of three-dimensional sMRI-T1 weighted (T1w) and T2 weighted (T2w) images. To explore the key components in the MR images that drove task performance, we segment both T1w and T2w images into three different components-cerebrospinal fluid, grey matter and white matter, and evaluate performance of each segmented image.Main results:Results demonstrate that our multimodal framework capitalizes on the information across both modalities (T1w and T2w) for the joint task of tinnitus classification and severity prediction.Significance:Our model outperforms existing learning-based and conventional methods in terms of accuracy, sensitivity, specificity, and negative predictive value.
Collapse
Affiliation(s)
- Chieh-Te Lin
- Department of Radiology and Biomedical Imaging, University of California San Francisco, 513 Parnassus Ave, San Francisco, CA 94143, United States of America
| | - Sanjay Ghosh
- Department of Radiology and Biomedical Imaging, University of California San Francisco, 513 Parnassus Ave, San Francisco, CA 94143, United States of America
| | - Leighton B Hinkley
- Department of Radiology and Biomedical Imaging, University of California San Francisco, 513 Parnassus Ave, San Francisco, CA 94143, United States of America
| | - Corby L Dale
- Department of Radiology and Biomedical Imaging, University of California San Francisco, 513 Parnassus Ave, San Francisco, CA 94143, United States of America
| | - Ana C S Souza
- Department of Telecommunication and Mechatronics Engineering, Federal University of Sao Joao del-Rei, Praca Frei Orlando, 170, Sao Joao del Rei 36307, MG, Brazil
| | - Jennifer H Sabes
- Department of Otolaryngology-Head and Neck Surgery, University of California San Francisco, 2380 Sutter St., San Francisco, CA 94115, United States of America
| | - Christopher P Hess
- Department of Radiology and Biomedical Imaging, University of California San Francisco, 513 Parnassus Ave, San Francisco, CA 94143, United States of America
| | - Meredith E Adams
- Department of Otolaryngology-Head and Neck Surgery, University of Minnesota, Phillips Wangensteen Building, 516 Delaware St., Minneapolis, MN 55455, United States of America
| | - Steven W Cheung
- Department of Otolaryngology-Head and Neck Surgery, University of California San Francisco, 2380 Sutter St., San Francisco, CA 94115, United States of America.,Surgical Services, Veterans Affairs, 4150 Clement St., San Francisco, CA 94121, United States of America
| | - Srikantan S Nagarajan
- Department of Radiology and Biomedical Imaging, University of California San Francisco, 513 Parnassus Ave, San Francisco, CA 94143, United States of America.,Department of Otolaryngology-Head and Neck Surgery, University of California San Francisco, 2380 Sutter St., San Francisco, CA 94115, United States of America.,Surgical Services, Veterans Affairs, 4150 Clement St., San Francisco, CA 94121, United States of America
| |
Collapse
|
49
|
Pang C, Zhang Y, Xue Z, Bao J, Keong Li B, Liu Y, Liu Y, Sheng M, Peng B, Dai Y. Improving model robustness via enhanced feature representation and sample distribution based on cascaded classifiers for computer-aided diagnosis of brain disease. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2022.104047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
50
|
Hao J, Shen T, Zhu X, Liu Y, Behera A, Zhang D, Chen B, Liu J, Zhang J, Zhao Y. Retinal Structure Detection in OCTA Image via Voting-Based Multitask Learning. IEEE TRANSACTIONS ON MEDICAL IMAGING 2022; 41:3969-3980. [PMID: 36044489 DOI: 10.1109/tmi.2022.3202183] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Automated detection of retinal structures, such as retinal vessels (RV), the foveal avascular zone (FAZ), and retinal vascular junctions (RVJ), are of great importance for understanding diseases of the eye and clinical decision-making. In this paper, we propose a novel Voting-based Adaptive Feature Fusion multi-task network (VAFF-Net) for joint segmentation, detection, and classification of RV, FAZ, and RVJ in optical coherence tomography angiography (OCTA). A task-specific voting gate module is proposed to adaptively extract and fuse different features for specific tasks at two levels: features at different spatial positions from a single encoder, and features from multiple encoders. In particular, since the complexity of the microvasculature in OCTA images makes simultaneous precise localization and classification of retinal vascular junctions into bifurcation/crossing a challenging task, we specifically design a task head by combining the heatmap regression and grid classification. We take advantage of three different en face angiograms from various retinal layers, rather than following existing methods that use only a single en face. We carry out extensive experiments on three OCTA datasets acquired using different imaging devices, and the results demonstrate that the proposed method performs on the whole better than either the state-of-the-art single-purpose methods or existing multi-task learning solutions. We also demonstrate that our multi-task learning method generalizes across other imaging modalities, such as color fundus photography, and may potentially be used as a general multi-task learning tool. We also construct three datasets for multiple structure detection, and part of these datasets with the source code and evaluation benchmark have been released for public access.
Collapse
|