1
|
Yu M, Fang Y, Liu Y, Bozoki AC, Xiao S, Yue L, Liu M. Hybrid multi-modality multi-task learning for forecasting progression trajectories in subjective cognitive decline. Neural Netw 2025; 186:107263. [PMID: 39985974 DOI: 10.1016/j.neunet.2025.107263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2024] [Revised: 12/11/2024] [Accepted: 02/07/2025] [Indexed: 02/24/2025]
Abstract
While numerous studies strive to exploit the complementary potential of MRI and PET using learning-based methods, the effective fusion of the two modalities remains a tricky problem due to their inherently distinctive properties. In addition, current studies often face the problem of small sample sizes and missing PET data due to factors such as patient withdrawal or low image quality. To this end, we propose a hybrid multi-modality multi-task learning (HM2L) framework with cross-domain knowledge transfer for forecasting trajectories of SCD progression. Our HM2L comprises (1) missing PET imputation, (2) multi-modality feature extraction for MRI and PET feature learning with a novel softmax-triplet constraint, (3) attention-based multi-modality fusion of MRI and PET features, and (4) multi-task prediction of category labels and clinical scores such as Mini-Mental State Examination (MMSE) and Geriatric Depression Scale (GDS). To handle problems with small sample sizes, a transfer learning strategy is developed to enable knowledge transfer from a relatively large scale dataset with MRI and PET from 795 subjects to two small-scale SCD cohorts with a total of 136 subjects. Experimental results indicate HM2L surpasses several state-of-the-art methods in jointly predicting category labels and clinical scores of subjective cognitive decline. Results show that the MMSE scores of SCD subjects who develop mild cognitive impairment during the 2-year/7-year follow-up are significantly lower than those of subjects who remain stable, while there exists a complex relationship between SCD progression with GDS.
Collapse
Affiliation(s)
- Minhui Yu
- Department of Radiology and BRIC, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA; Joint Department of Biomedical Engineering, University of North Carolina at Chapel Hill and North Carolina State University, Chapel Hill, NC 27599, USA
| | - Yuqi Fang
- Department of Radiology and BRIC, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Yunbi Liu
- School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen 518172, China
| | - Andrea C Bozoki
- Department of Neurology, University of North Carolina at Chapel Hill, NC 27599, USA
| | - Shifu Xiao
- Department of Geriatric Psychiatry, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai 200240, China
| | - Ling Yue
- Department of Geriatric Psychiatry, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai 200240, China.
| | - Mingxia Liu
- Department of Radiology and BRIC, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.
| |
Collapse
|
2
|
Huang J, Chen C, Vong CM, Cheung YM. Broad Multitask Learning System With Group Sparse Regularization. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:8265-8278. [PMID: 38949943 DOI: 10.1109/tnnls.2024.3416191] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/03/2024]
Abstract
The broad learning system (BLS) featuring lightweight, incremental extension, and strong generalization capabilities has been successful in its applications. Despite these advantages, BLS struggles in multitask learning (MTL) scenarios with its limited ability to simultaneously unravel multiple complex tasks where existing BLS models cannot adequately capture and leverage essential information across tasks, decreasing their effectiveness and efficacy in MTL scenarios. To address these limitations, we proposed an innovative MTL framework explicitly designed for BLS, named group sparse regularization for broad multitask learning system using related task-wise (BMtLS-RG). This framework combines a task-related BLS learning mechanism with a group sparse optimization strategy, significantly boosting BLS's ability to generalize in MTL environments. The task-related learning component harnesses task correlations to enable shared learning and optimize parameters efficiently. Meanwhile, the group sparse optimization approach helps minimize the effects of irrelevant or noisy data, thus enhancing the robustness and stability of BLS in navigating complex learning scenarios. To address the varied requirements of MTL challenges, we presented two additional variants of BMtLS-RG: BMtLS-RG with sharing parameters of feature mapped nodes (BMtLS-RGf), which integrates a shared feature mapping layer, and BMtLS-RGf and enhanced nodes (BMtLS-RGfe), which further includes an enhanced node layer atop the shared feature mapping structure. These adaptations provide customized solutions tailored to the diverse landscape of MTL problems. We compared BMtLS-RG with state-of-the-art (SOTA) MTL and BLS algorithms through comprehensive experimental evaluation across multiple practical MTL and UCI datasets. BMtLS-RG outperformed SOTA methods in 97.81% of classification tasks and achieved optimal performance in 96.00% of regression tasks, demonstrating its superior accuracy and robustness. Furthermore, BMtLS-RG exhibited satisfactory training efficiency, outperforming existing MTL algorithms by 8.04-42.85 times.
Collapse
|
3
|
Sheng X, Cai H, Nie Y, He S, Cheung YM, Chen J. Modality-Aware Discriminative Fusion Network for Integrated Analysis of Brain Imaging Genomics. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:8577-8591. [PMID: 39178069 DOI: 10.1109/tnnls.2024.3439530] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/25/2024]
Abstract
Mild cognitive impairment (MCI) represents an early stage of Alzheimer's disease (AD), characterized by subtle clinical symptoms that pose challenges for accurate diagnosis. The quest for the identification of MCI individuals has highlighted the importance of comprehending the underlying mechanisms of disease causation. Integrated analysis of brain imaging and genomics offers a promising avenue for predicting MCI risk before clinical symptom onset. However, most existing methods face challenges in: 1) mining the brain network-specific topological structure and addressing the single nucleotide polymorphisms (SNPs)-related noise contamination and 2) extracting the discriminative properties of brain imaging genomics, resulting in limited accuracy for MCI diagnosis. To this end, a modality-aware discriminative fusion network (MA-DFN) is proposed to integrate the complementary information from brain imaging genomics to diagnose MCI. Specifically, we first design two modality-specific feature extraction modules: the graph convolutional network with edge-augmented self-attention module (GCN-EASA) and the deep adversarial denoising autoencoder module (DAD-AE), to capture the topological structure of brain networks and the intrinsic distribution of SNPs. Subsequently, a discriminative-enhanced fusion network with correlation regularization module (DFN-CorrReg) is employed to enhance inter-modal consistency and between-class discrimination in brain imaging and genomics. Compared to other state-of-the-art approaches, MA-DFN not only exhibits superior performance in stratifying cognitive normal (CN) and MCI individuals but also identifies disease-related brain regions and risk SNPs locus, which hold potential as putative biomarkers for MCI diagnosis.
Collapse
|
4
|
Sun Y, Shao S, Huang J, Shi H, Yan L, Lu Y, Liu P, Jiang Y, Qiao J, Zhang L. Development and validation of a deep learning model based on cascade mask regional convolutional neural network to noninvasively and accurately identify human round spermatids. J Adv Res 2025:S2090-1232(25)00219-X. [PMID: 40185275 DOI: 10.1016/j.jare.2025.03.059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2024] [Revised: 03/31/2025] [Accepted: 03/31/2025] [Indexed: 04/07/2025] Open
Abstract
INTRODUCTION The difficulty of identifying human round spermatids (hRSs) has impeded applications of the human round spermatid injection (ROSI) technique. RSs can be accurately screened through flow cytometric analysis utilizing the Hoechst fluorescence profile reflecting DNA, but this method is not suitable for isolating hRSs due to the toxicity associated with Hoechst staining. OBJECTIVE To evaluate the capacity of a deep learning model grounded in a cascade mask region-based convolutional neural network (R-CNN) for the noninvasive and accurate identification of hRSs. METHODS In this study, we presented the development and validation of a deep learning model for identifying hRSs through the analysis of 3457 optical light microscope images of sorted hRSs obtained via flow cytometric analysis. The model's accuracy and specificity were evaluated by calculating the mean average precision (mAP). Furthermore, a double-blind experiment was conducted to access the reliability of the proposed model in accurately identifying hRSs. It detected the expression of protamine (PRM1) and/or peanut lectin (PNA), which are established markers for RSs. RESULTS Our deep learning-based model demonstrated a high precision, achieving a mAP of over 0.80 for isolating hRSs in test datasets. The expression of PRM1 and/or PNA was observed in all cells noninvasively selected by our AI model during an independent double-blind test. This phenomenon confirmed the accuracy and effectiveness of the proposed model. The model's capability for noninvasive and accurate isolation of hRSs among spermatogenic cells highlighted its robustness and generalizability for clinical applications. CONCLUSION The deep learning AI model based on a cascade R-CNN has the ability to accurately identify hRSs among spermatogenic cells. The application of this noninvasive method, which requires no additional procedures in clinical practice, is able to facilitate the widespread implementation of ROSI technique. Therefore, it can provide patients with spermatogenic arrest the opportunity to become biological fathers.
Collapse
Affiliation(s)
- Yujiao Sun
- Institute of Genetics and Development Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Shihao Shao
- School of Basic Medical Sciences Peking University, Beijing 100101, China
| | - Jiangwei Huang
- Institute of Genetics and Development Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Hao Shi
- National Center for Protein Sciences at Peking University, School of Life Sciences, Peking University, Beijing 100191, China
| | - Liying Yan
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing 100191, China; National Clinical Research Center for Obstetrics and Gynecology, Beijing 100191, China; Key Laboratory of Assisted Reproduction (Peking University), Ministry of Education, Beijing 100191, China; Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproductive Technology, Beijing 100191, China; National Clinical Key Specialty Construction Program, Beijing 100191, China
| | - Yongjie Lu
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing 100191, China; National Clinical Research Center for Obstetrics and Gynecology, Beijing 100191, China; Key Laboratory of Assisted Reproduction (Peking University), Ministry of Education, Beijing 100191, China; Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproductive Technology, Beijing 100191, China; National Clinical Key Specialty Construction Program, Beijing 100191, China
| | - Ping Liu
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing 100191, China; National Clinical Research Center for Obstetrics and Gynecology, Beijing 100191, China; Key Laboratory of Assisted Reproduction (Peking University), Ministry of Education, Beijing 100191, China; Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproductive Technology, Beijing 100191, China
| | - Yuqiang Jiang
- Institute of Genetics and Development Biology, Chinese Academy of Sciences, Beijing 100101, China.
| | - Jie Qiao
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing 100191, China; National Clinical Research Center for Obstetrics and Gynecology, Beijing 100191, China; Key Laboratory of Assisted Reproduction (Peking University), Ministry of Education, Beijing 100191, China; Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproductive Technology, Beijing 100191, China; National Clinical Key Specialty Construction Program, Beijing 100191, China; Beijing Advanced Innovation Center for Genomics, Beijing 100191, China; Peking-Tsinghua Center for Life Sciences, Peking University, Beijing 100191, China.
| | - Li Zhang
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing 100191, China; National Clinical Research Center for Obstetrics and Gynecology, Beijing 100191, China; Key Laboratory of Assisted Reproduction (Peking University), Ministry of Education, Beijing 100191, China; Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproductive Technology, Beijing 100191, China; National Clinical Key Specialty Construction Program, Beijing 100191, China.
| |
Collapse
|
5
|
Xu J, Dong A, Yang Y, Jin S, Zeng J, Xu Z, Jiang W, Zhang L, Dong J, Wang B. VSNet: Vessel Structure-aware Network for hepatic and portal vein segmentation. Med Image Anal 2025; 101:103458. [PMID: 39913966 DOI: 10.1016/j.media.2025.103458] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2024] [Revised: 11/13/2024] [Accepted: 01/07/2025] [Indexed: 03/05/2025]
Abstract
Identifying and segmenting hepatic and portal veins (two predominant vascular systems in the liver, from CT scans) play a crucial role for clinicians in preoperative planning for treatment strategies. However, existing segmentation models often struggle to capture fine details of minor veins. In this article, we introduce Vessel Structure-aware Network (VSNet), a multi-task learning model with vessel-growing decoder, to address the challenge. VSNet excels at accurate segmentation by capturing the topological features of both minor veins while preserving correct connectivity from minor vessels to trucks. We also build and publish the largest dataset (303 cases) for hepatic and portal vessel segmentation. Through comprehensive experiments, we demonstrate that VSNet achieves the best Dice for hepatic vein of 0.824 and portal vein of 0.807 on our proposed dataset, significantly outperforming other popular segmentation models. The source code and dataset are publicly available at https://github.com/XXYZB/VSNet.
Collapse
Affiliation(s)
- Jichen Xu
- National Key Laboratory of Human-Machine Hybrid Augmented Intelligence, National Engineering Research Center for Visual Information and Applications, and Institute of Artificial Intelligence and Robotics, Xi'an Jiaotong University, Xi'an, China
| | - Anqi Dong
- Division of Decision and Control Systems and Department of Mathematics, KTH Royal Institute of Technology, SE-100 44 Stockholm, Sweden
| | - Yang Yang
- School of Automation, Northwestern Polytechnical University, Xi'an, China
| | - Shuo Jin
- Beijing Tsinghua Changgung Hospital, School of Clinical Medicine, Tsinghua University, Beijing, China
| | - Jianping Zeng
- Beijing Tsinghua Changgung Hospital, School of Clinical Medicine, Tsinghua University, Beijing, China
| | - Zhengqing Xu
- Jingzhen Medical Technology Ltd., China; Matrix Medical Technology Ltd., China
| | - Wei Jiang
- Research Center of Artificial Intelligence of Shangluo, Shangluo University, Shangluo, China
| | - Liang Zhang
- School of Computer Science and Technology, Xidian University, Xi'an, China
| | - Jiahong Dong
- Beijing Tsinghua Changgung Hospital, School of Clinical Medicine, Tsinghua University, Beijing, China
| | - Bo Wang
- Jingzhen Medical Technology Ltd., China; Matrix Medical Technology Ltd., China; Institute of Medical Equipment Science and Engineering, Huazhong University of Science and Technology, Wuhan, China.
| |
Collapse
|
6
|
Zhang X, Xiao Z, Wu X, Chen Y, Zhao J, Hu Y, Liu J. Pyramid Pixel Context Adaption Network for Medical Image Classification With Supervised Contrastive Learning. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:6802-6815. [PMID: 38829749 DOI: 10.1109/tnnls.2024.3399164] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2024]
Abstract
Spatial attention (SA) mechanism has been widely incorporated into deep neural networks (DNNs), significantly lifting the performance in computer vision tasks via long-range dependency modeling. However, it may perform poorly in medical image analysis. Unfortunately, the existing efforts are often unaware that long-range dependency modeling has limitations in highlighting subtle lesion regions. To overcome this limitation, we propose a practical yet lightweight architectural unit, pyramid pixel context adaption (PPCA) module, which exploits multiscale pixel context information to recalibrate pixel position in a pixel-independent manner dynamically. PPCA first applies a well-designed cross-channel pyramid pooling (CCPP) to aggregate multiscale pixel context information, then eliminates the inconsistency among them by the well-designed pixel normalization (PN), and finally estimates per pixel attention weight via a pixel context integration. By embedding PPCA into a DNN with negligible overhead, the PPCA network (PPCANet) is developed for medical image classification. In addition, we introduce supervised contrastive learning to enhance feature representation by exploiting the potential of label information via supervised contrastive loss (CL). The extensive experiments on six medical image datasets show that the PPCANet outperforms state-of-the-art (SOTA) attention-based networks and recent DNNs. We also provide visual analysis and ablation study to explain the behavior of PPCANet in the decision-making process.
Collapse
|
7
|
Bi XA, Chen K, Jiang S, Luo S, Zhou W, Xing Z, Xu L, Liu Z, Liu T. Community Graph Convolution Neural Network for Alzheimer's Disease Classification and Pathogenetic Factors Identification. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:1959-1973. [PMID: 37204952 DOI: 10.1109/tnnls.2023.3269446] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
As a complex neural network system, the brain regions and genes collaborate to effectively store and transmit information. We abstract the collaboration correlations as the brain region gene community network (BG-CN) and present a new deep learning approach, such as the community graph convolutional neural network (Com-GCN), for investigating the transmission of information within and between communities. The results can be used for diagnosing and extracting causal factors for Alzheimer's disease (AD). First, an affinity aggregation model for BG-CN is developed to describe intercommunity and intracommunity information transmission. Second, we design the Com-GCN architecture with intercommunity convolution and intracommunity convolution operations based on the affinity aggregation model. Through sufficient experimental validation on the AD neuroimaging initiative (ADNI) dataset, the design of Com-GCN matches the physiological mechanism better and improves the interpretability and classification performance. Furthermore, Com-GCN can identify lesioned brain regions and disease-causing genes, which may assist precision medicine and drug design in AD and serve as a valuable reference for other neurological disorders.
Collapse
|
8
|
Sun X, Yao F, Ding C. Modeling High-Order Relationships: Brain-Inspired Hypergraph-Induced Multimodal-Multitask Framework for Semantic Comprehension. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:12142-12156. [PMID: 37028292 DOI: 10.1109/tnnls.2023.3252359] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Semantic comprehension aims to reasonably reproduce people's real intentions or thoughts, e.g., sentiment, humor, sarcasm, motivation, and offensiveness, from multiple modalities. It can be instantiated as a multimodal-oriented multitask classification issue and applied to scenarios, such as online public opinion supervision and political stance analysis. Previous methods generally employ multimodal learning alone to deal with varied modalities or solely exploit multitask learning to solve various tasks, a few to unify both into an integrated framework. Moreover, multimodal-multitask cooperative learning could inevitably encounter the challenges of modeling high-order relationships, i.e., intramodal, intermodal, and intertask relationships. Related research of brain sciences proves that the human brain possesses multimodal perception and multitask cognition for semantic comprehension via decomposing, associating, and synthesizing processes. Thus, establishing a brain-inspired semantic comprehension framework to bridge the gap between multimodal and multitask learning becomes the primary motivation of this work. Motivated by the superiority of the hypergraph in modeling high-order relations, in this article, we propose a hypergraph-induced multimodal-multitask (HIMM) network for semantic comprehension. HIMM incorporates monomodal, multimodal, and multitask hypergraph networks to, respectively, mimic the decomposing, associating, and synthesizing processes to tackle the intramodal, intermodal, and intertask relationships accordingly. Furthermore, temporal and spatial hypergraph constructions are designed to model the relationships in the modality with sequential and spatial structures, respectively. Also, we elaborate a hypergraph alternative updating algorithm to ensure that vertices aggregate to update hyperedges and hyperedges converge to update their connected vertices. Experiments on the dataset with two modalities and five tasks verify the effectiveness of HIMM on semantic comprehension.
Collapse
|
9
|
Liu R, Huang ZA, Hu Y, Zhu Z, Wong KC, Tan KC. Spatial-Temporal Co-Attention Learning for Diagnosis of Mental Disorders From Resting-State fMRI Data. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:10591-10605. [PMID: 37027556 DOI: 10.1109/tnnls.2023.3243000] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Neuroimaging techniques have been widely adopted to detect the neurological brain structures and functions of the nervous system. As an effective noninvasive neuroimaging technique, functional magnetic resonance imaging (fMRI) has been extensively used in computer-aided diagnosis (CAD) of mental disorders, e.g., autism spectrum disorder (ASD) and attention deficit/hyperactivity disorder (ADHD). In this study, we propose a spatial-temporal co-attention learning (STCAL) model for diagnosing ASD and ADHD from fMRI data. In particular, a guided co-attention (GCA) module is developed to model the intermodal interactions of spatial and temporal signal patterns. A novel sliding cluster attention module is designed to address global feature dependency of self-attention mechanism in fMRI time series. Comprehensive experimental results demonstrate that our STCAL model can achieve competitive accuracies of 73.0 ± 4.5%, 72.0 ± 3.8%, and 72.5 ± 4.2% on the ABIDE I, ABIDE II, and ADHD-200 datasets, respectively. Moreover, the potential for feature pruning based on the co-attention scores is validated by the simulation experiment. The clinical interpretation analysis of STCAL can allow medical professionals to concentrate on the discriminative regions of interest and key time frames from fMRI data.
Collapse
|
10
|
Wang Y, Liu S, Spiteri AG, Huynh ALH, Chu C, Masters CL, Goudey B, Pan Y, Jin L. Understanding machine learning applications in dementia research and clinical practice: a review for biomedical scientists and clinicians. Alzheimers Res Ther 2024; 16:175. [PMID: 39085973 PMCID: PMC11293066 DOI: 10.1186/s13195-024-01540-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2024] [Accepted: 07/21/2024] [Indexed: 08/02/2024]
Abstract
Several (inter)national longitudinal dementia observational datasets encompassing demographic information, neuroimaging, biomarkers, neuropsychological evaluations, and muti-omics data, have ushered in a new era of potential for integrating machine learning (ML) into dementia research and clinical practice. ML, with its proficiency in handling multi-modal and high-dimensional data, has emerged as an innovative technique to facilitate early diagnosis, differential diagnosis, and to predict onset and progression of mild cognitive impairment and dementia. In this review, we evaluate current and potential applications of ML, including its history in dementia research, how it compares to traditional statistics, the types of datasets it uses and the general workflow. Moreover, we identify the technical barriers and challenges of ML implementations in clinical practice. Overall, this review provides a comprehensive understanding of ML with non-technical explanations for broader accessibility to biomedical scientists and clinicians.
Collapse
Affiliation(s)
- Yihan Wang
- The Florey Institute of Neuroscience and Mental Health, 30 Royal Parade, Parkville, VIC, 3052, Australia
- Florey Department of Neuroscience and Mental Health, The University of Melbourne, 30 Royal Parade, Parkville, VIC, 3052, Australia
| | - Shu Liu
- The Florey Institute of Neuroscience and Mental Health, 30 Royal Parade, Parkville, VIC, 3052, Australia
- Florey Department of Neuroscience and Mental Health, The University of Melbourne, 30 Royal Parade, Parkville, VIC, 3052, Australia
- The ARC Training Centre in Cognitive Computing for Medical Technologies, The University of Melbourne, Carlton, VIC, 3010, Australia
| | - Alanna G Spiteri
- The Florey Institute of Neuroscience and Mental Health, 30 Royal Parade, Parkville, VIC, 3052, Australia
| | - Andrew Liem Hieu Huynh
- Department of Aged Care, Austin Health, Heidelberg, VIC, 3084, Australia
- Department of Medicine, Austin Health, University of Melbourne, Heidelberg, VIC, 3084, Australia
| | - Chenyin Chu
- The Florey Institute of Neuroscience and Mental Health, 30 Royal Parade, Parkville, VIC, 3052, Australia
- Florey Department of Neuroscience and Mental Health, The University of Melbourne, 30 Royal Parade, Parkville, VIC, 3052, Australia
| | - Colin L Masters
- The Florey Institute of Neuroscience and Mental Health, 30 Royal Parade, Parkville, VIC, 3052, Australia
| | - Benjamin Goudey
- Florey Department of Neuroscience and Mental Health, The University of Melbourne, 30 Royal Parade, Parkville, VIC, 3052, Australia
- The ARC Training Centre in Cognitive Computing for Medical Technologies, The University of Melbourne, Carlton, VIC, 3010, Australia
| | - Yijun Pan
- The Florey Institute of Neuroscience and Mental Health, 30 Royal Parade, Parkville, VIC, 3052, Australia.
- Florey Department of Neuroscience and Mental Health, The University of Melbourne, 30 Royal Parade, Parkville, VIC, 3052, Australia.
| | - Liang Jin
- The Florey Institute of Neuroscience and Mental Health, 30 Royal Parade, Parkville, VIC, 3052, Australia
- Florey Department of Neuroscience and Mental Health, The University of Melbourne, 30 Royal Parade, Parkville, VIC, 3052, Australia
| |
Collapse
|
11
|
Zhang K, Chen C, Yuan C, Chen S, Wang X, He X. PatchNet: Maximize the Exploration of Congeneric Semantics for Weakly Supervised Semantic Segmentation. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:10984-10995. [PMID: 37314912 DOI: 10.1109/tnnls.2023.3246109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
With the increase in the number of image data and the lack of corresponding labels, weakly supervised learning has drawn a lot of attention recently in computer vision tasks, especially in the fine-grained semantic segmentation problem. To alleviate human efforts from expensive pixel-by-pixel annotations, our method focuses on weakly supervised semantic segmentation (WSSS) with image-level labels, which are much easier to obtain. As a considerable gap exists between pixel-level segmentation and image-level labels, how to reflect the image-level semantic information on each pixel is an important question. To explore the congeneric semantic regions from the same class to the maximum, we construct the patch-level semantic augmentation network (PatchNet) based on the self-detected patches from different images that contain the same class labels. Patches can frame the objects as much as possible and include as little background as possible. The patch-level semantic augmentation network that is established with patches as the nodes can maximize the mutual learning of similar objects. We regard the embedding vectors of patches as nodes and use a transformer-based complementary learning module to construct weighted edges according to the embedding similarity between different nodes. Moreover, to better supplement semantic information, we propose softcomplementary loss functions matched with the whole network structure. We conduct experiments on the popular PASCAL VOC 2012 and MS COCO 2014 benchmarks, and our model yields the state-of-the-art performance.
Collapse
|
12
|
Huang ZA, Liu R, Zhu Z, Tan KC. Multitask Learning for Joint Diagnosis of Multiple Mental Disorders in Resting-State fMRI. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:8161-8175. [PMID: 36459608 DOI: 10.1109/tnnls.2022.3225179] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
Facing the increasing worldwide prevalence of mental disorders, the symptom-based diagnostic criteria struggle to address the urgent public health concern due to the global shortfall in well-qualified professionals. Thanks to the recent advances in neuroimaging techniques, functional magnetic resonance imaging (fMRI) has surfaced as a new solution to characterize neuropathological biomarkers for detecting functional connectivity (FC) anomalies in mental disorders. However, the existing computer-aided diagnosis models for fMRI analysis suffer from unstable performance on large datasets. To address this issue, we propose an efficient multitask learning (MTL) framework for joint diagnosis of multiple mental disorders using resting-state fMRI data. A novel multiobjective evolutionary clustering algorithm is presented to group regions of interests (ROIs) into different clusters for FC pattern analysis. On the optimal clustering solution, the multicluster multigate mixture-of-expert model is used for the final classification by capturing the highly consistent feature patterns among related diagnostic tasks. Extensive simulation experiments demonstrate that the performance of the proposed framework is superior to that of the other state-of-the-art methods. Moreover, the potential for practical application of the framework is also validated in terms of limited computational resources, real-time analysis, and insufficient training data. The proposed model can identify the remarkable interpretative biomarkers associated with specific mental disorders for clinical interpretation analysis.
Collapse
|
13
|
Liu R, Huang ZA, Hu Y, Zhu Z, Wong KC, Tan KC. Attention-Like Multimodality Fusion With Data Augmentation for Diagnosis of Mental Disorders Using MRI. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:7627-7641. [PMID: 36374900 DOI: 10.1109/tnnls.2022.3219551] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
The globally rising prevalence of mental disorders leads to shortfalls in timely diagnosis and therapy to reduce patients' suffering. Facing such an urgent public health problem, professional efforts based on symptom criteria are seriously overstretched. Recently, the successful applications of computer-aided diagnosis approaches have provided timely opportunities to relieve the tension in healthcare services. Particularly, multimodal representation learning gains increasing attention thanks to the high temporal and spatial resolution information extracted from neuroimaging fusion. In this work, we propose an efficient multimodality fusion framework to identify multiple mental disorders based on the combination of functional and structural magnetic resonance imaging. A multioutput conditional generative adversarial network (GAN) is developed to address the scarcity of multimodal data for augmentation. Based on the augmented training data, the multiheaded gating fusion model is proposed for classification by extracting the complementary features across different modalities. The experiments demonstrate that the proposed model can achieve robust accuracies of 75.1 ± 1.5 %, 72.9 ± 1.1 %, and 87.2 ± 1.5 % for autism spectrum disorder (ASD), attention deficit/hyperactivity disorder, and schizophrenia, respectively. In addition, the interpretability of our model is expected to enable the identification of remarkable neuropathology diagnostic biomarkers, leading to well-informed therapeutic decisions.
Collapse
|
14
|
Tang X, Zhang C, Guo R, Yang X, Qian X. A Causality-Aware Graph Convolutional Network Framework for Rigidity Assessment in Parkinsonians. IEEE TRANSACTIONS ON MEDICAL IMAGING 2024; 43:229-240. [PMID: 37432810 DOI: 10.1109/tmi.2023.3294182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/13/2023]
Abstract
Rigidity is one of the common motor disorders in Parkinson's disease (PD), which lead to life quality deterioration. The widely-used rating-scale-based approach for rigidity assessment still depends on the availability of experienced neurologists and is limited by rating subjectivity. Given the recent successful applications of quantitative susceptibility mapping (QSM) in auxiliary PD diagnosis, automated assessment of PD rigidity can be essentially achieved through QSM analysis. However, a major challenge is the performance instability due to the confounding factors (e.g., noise and distribution shift) which conceal the truly-causal features. Therefore, we propose a causality-aware graph convolutional network (GCN) framework, where causal feature selection is combined with causal invariance to ensure that causality-informed model decisions are reached. Firstly, a GCN model that integrates causal feature selection is systematically constructed at three graph levels: node, structure, and representation. In this model, a causal diagram is learned to extract a subgraph with truly-causal information. Secondly, a non-causal perturbation strategy is developed along with an invariance constraint to ensure the stability of the assessment results under different distributions, and thus avoid spurious correlations caused by distribution shifts. The superiority of the proposed method is shown by extensive experiments and the clinical value is revealed by the direct relevance of selected brain regions to rigidity in PD. Besides, its extensibility is verified on other two tasks: PD bradykinesia and mental state for Alzheimer's disease. Overall, we provide a clinically-potential tool for automated and stable assessment of PD rigidity. Our source code will be available at https://github.com/SJTUBME-QianLab/Causality-Aware-Rigidity.
Collapse
|
15
|
Sakharova T, Mao S, Osadchuk M. Updated Models of Alzheimer's Disease with Deep Neural Networks. J Alzheimers Dis 2024; 100:685-697. [PMID: 38905045 DOI: 10.3233/jad-240183] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/23/2024]
Abstract
Background In recent years, researchers have focused on developing precise models for the progression of Alzheimer's disease (AD) using deep neural networks. Forecasting the progression of AD through the analysis of time series data represents a promising approach. Objective The primary objective of this research is to formulate an effective methodology for forecasting the progression of AD through the integration of multi-task learning techniques and the analysis of pertinent medical data. Methods This study primarily utilized volumetric measurements obtained through magnetic resonance imaging (MRI), trajectories of cognitive assessments, and clinical status indicators. The research encompassed 150 patients diagnosed with AD who underwent examination between 2020 and 2022 in Beijing, China. A multi-task learning approach was employed to train forecasting models using MRI data, trajectories of cognitive assessments, and clinical status. Correlation analysis was conducted at various time points. Results At the baseline, a robust correlation was observed among the forecasting tasks: 0.75 for volumetric MRI measurements, 0.62 for trajectories of cognitive assessment, and 0.48 for clinical status. The implementation of a multi-task learning framework enhanced performance by 12.7% for imputing missing values and 14.8% for prediction accuracy. Conclusions The findings of our study, indicate that multi-task learning can effectively predict the progression of AD. However, it is important to note that the study's generalizability may be limited due to the restricted dataset and the specific population under examination. These conclusions represent a significant stride toward more precise diagnosis and treatment of this neurological disorder.
Collapse
Affiliation(s)
- Tatyana Sakharova
- Department of Biology and General Genetics, I.M. Sechenov First Moscow State Medical University, Moscow, Russia
| | - Siqi Mao
- Department of Statistics, Florida State University, Tallahassee, FL, USA
| | - Mikhail Osadchuk
- Department of Polyclinic Therapy, I.M. Sechenov First Moscow State Medical University, Moscow, Russia
| |
Collapse
|
16
|
Zhao K, Chen P, Alexander-Bloch A, Wei Y, Dyrba M, Yang F, Kang X, Wang D, Fan D, Ye S, Tang Y, Yao H, Zhou B, Lu J, Yu C, Wang P, Liao Z, Chen Y, Huang L, Zhang X, Han Y, Li S, Liu Y. A neuroimaging biomarker for Individual Brain-Related Abnormalities In Neurodegeneration (IBRAIN): a cross-sectional study. EClinicalMedicine 2023; 65:102276. [PMID: 37954904 PMCID: PMC10632687 DOI: 10.1016/j.eclinm.2023.102276] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/19/2023] [Revised: 09/27/2023] [Accepted: 09/29/2023] [Indexed: 11/14/2023] Open
Abstract
Background Alzheimer's disease (AD) is a prevalent neurodegenerative disorder that poses a worldwide public health challenge. A neuroimaging biomarker would significantly improve early diagnosis and intervention, ultimately enhancing the quality of life for affected individuals and reducing the burden on healthcare systems. Methods Cross-sectional and longitudinal data (10,099 participants with 13,380 scans) from 12 independent datasets were used in the present study (this study was performed between September 1, 2021 and February 15, 2023). The Individual Brain-Related Abnormalities In Neurodegeneration (IBRAIN) score was developed via integrated regional- and network-based measures under an ensemble machine learning model based on structural MRI data. We systematically assessed whether IBRAIN could be a neuroimaging biomarker for AD. Findings IBRAIN accurately differentiated individuals with AD from NCs (AUC = 0.92) and other neurodegenerative diseases, including Frontotemporal dementia (FTD), Parkinson's disease (PD), Vascular dementia (VaD) and Amyotrophic Lateral Sclerosis (ALS) (AUC = 0.92). IBRAIN was significantly correlated to clinical measures and gene expression, enriched in immune process and protein metabolism. The IBRAIN score exhibited a significant ability to reveal the distinct progression of prodromal AD (i.e., Mild cognitive impairment, MCI) (Hazard Ratio (HR) = 6.52 [95% CI: 4.42∼9.62], p < 1 × 10-16), which offers similar powerful performance with Cerebrospinal Fluid (CSF) Aβ (HR = 3.78 [95% CI: 2.63∼5.43], p = 2.13 × 10-14) and CSF Tau (HR = 3.77 [95% CI: 2.64∼5.39], p = 9.53 × 10-15) based on the COX and Log-rank test. Notably, the IBRAIN shows comparable sensitivity (beta = -0.70, p < 1 × 10-16) in capturing longitudinal changes in individuals with conversion to AD than CSF Aβ (beta = -0.26, p = 4.40 × 10-9) and CSF Tau (beta = 0.12, p = 1.02 × 10-5). Interpretation Our findings suggested that IBRAIN is a biologically relevant, specific, and sensitive neuroimaging biomarker that can serve as a clinical measure to uncover prodromal AD progression. It has strong potential for application in future clinical practice and treatment trials. Funding Science and Technology Innovation 2030 Major Projects, the National Natural Science Foundation of China, Beijing Natural Science Funds, the Fundamental Research Funds for the CentralUniversity, and the Startup Funds for Talents at Beijing Normal University.
Collapse
Affiliation(s)
- Kun Zhao
- School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing, China
| | - Pindong Chen
- School of Artificial Intelligence, University of Chinese Academy of Sciences & Brainnetome Centre, Chinese Academy of Sciences, Beijing, China
| | - Aaron Alexander-Bloch
- Department of Psychiatry, University of Pennsylvania, Philadelphia, USA
- Department of Child and Adolescent Psychiatry and Behavioral Science, Children's Hospital of Philadelphia, Philadelphia, USA
| | - Yongbin Wei
- School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing, China
| | - Martin Dyrba
- German Centre for Neurodegenerative Diseases (DZNE), Rostock, Germany
| | - Fan Yang
- CAS Key Laboratory of Molecular Imaging, Institute of Automation, Beijing, China
| | - Xiaopeng Kang
- School of Artificial Intelligence, University of Chinese Academy of Sciences & Brainnetome Centre, Chinese Academy of Sciences, Beijing, China
| | - Dawei Wang
- Department of Radiology, Qilu Hospital of Shandong University, Ji'nan, China
| | - Dongsheng Fan
- Department of Neurology, Peking University Third Hospital, Beijing, China
- Key Laboratory for Neuroscience, National Health Commission/Ministry of Education, Peking University, Beijing, China
- Beijing Key Laboratory of Biomarker and Translational Research in Neurodegenerative Diseases, Beijing, China
| | - Shan Ye
- Department of Neurology, Peking University Third Hospital, Beijing, China
- Key Laboratory for Neuroscience, National Health Commission/Ministry of Education, Peking University, Beijing, China
- Beijing Key Laboratory of Biomarker and Translational Research in Neurodegenerative Diseases, Beijing, China
| | - Yi Tang
- Department of Neurology, Xuanwu Hospital of Capital Medical University, Beijing, China
| | - Hongxiang Yao
- Department of Radiology, The Second Medical Centre, National Clinical Research Centre for Geriatric Diseases, Chinese PLA General Hospital, Beijing, China
| | - Bo Zhou
- Department of Neurology, The Second Medical Centre, National Clinical Research Centre for Geriatric Diseases, Chinese PLA General Hospital, Beijing, China
| | - Jie Lu
- Department of Radiology, Xuanwu Hospital of Capital Medical University, Beijing, China
| | - Chunshui Yu
- Department of Radiology, Tianjin Medical University General Hospital, Tianjin, China
| | - Pan Wang
- Department of Neurology, Tianjin Huanhu Hospital, Tianjin, China
| | - Zhengluan Liao
- Department of Psychiatry, People's Hospital of Hangzhou Medical College, Zhejiang Provincial People's Hospital, Hangzhou, China
| | - Yan Chen
- Department of Psychiatry, People's Hospital of Hangzhou Medical College, Zhejiang Provincial People's Hospital, Hangzhou, China
| | - Longjian Huang
- Affiliated Hospital of Youjiang Medical University for Nationalities, Baise, China
| | - Xi Zhang
- Department of Neurology, The Second Medical Centre, National Clinical Research Centre for Geriatric Diseases, Chinese PLA General Hospital, Beijing, China
| | - Ying Han
- Department of Neurology, Xuanwu Hospital of Capital Medical University, Beijing, China
- National Clinical Research Centre for Geriatric Disorders, Beijing, China
- Centre of Alzheimer's Disease, Beijing Institute for Brain Disorders, Beijing, China
| | - Shuyu Li
- State Key Laboratory of Cognitive Neuroscience and Learning, Beijing Normal University, Beijing, China
| | - Yong Liu
- School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing, China
- School of Artificial Intelligence, University of Chinese Academy of Sciences & Brainnetome Centre, Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
17
|
Lu M, Du Z, Zhao J, Jiang L, Liu R, Zhang M, Xu T, Wei J, Wang W, Xu L, Guo H, Chen C, Yu X, Tan Z, Fang J, Zou Y. Neuroimaging mechanisms of acupuncture on functional reorganization for post-stroke motor improvement: a machine learning-based functional magnetic resonance imaging study. Front Neurosci 2023; 17:1143239. [PMID: 37274194 PMCID: PMC10235506 DOI: 10.3389/fnins.2023.1143239] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Accepted: 05/03/2023] [Indexed: 06/06/2023] Open
Abstract
Objective Motor recovery is crucial in stroke rehabilitation, and acupuncture can influence recovery. Neuroimaging and machine learning approaches provide new research directions to explore the brain functional reorganization and acupuncture mechanisms after stroke. We applied machine learning to predict the classification of the minimal clinically important differences (MCID) for motor improvement and identify the neuroimaging features, in order to explore brain functional reorganization and acupuncture mechanisms for motor recovery after stroke. Methods In this study, 49 patients with unilateral motor pathway injury (basal ganglia and/or corona radiata) after ischemic stroke were included and evaluated the motor function by Fugl-Meyer Assessment scores (FMA) at baseline and at 2-week follow-up sessions. Patients were divided by the difference between the twice FMA scores into one group showing minimal clinically important difference (MCID group, n = 28) and the other group with no minimal clinically important difference (N-MCID, n = 21). Machine learning was performed by PRoNTo software to predict the classification of the patients and identify the feature brain regions of interest (ROIs). In addition, a matched group of healthy controls (HC, n = 26) was enrolled. Patients and HC underwent magnetic resonance imaging examination in the resting state and in the acupuncture state (acupuncture at the Yanglingquan point on one side) to compare the differences in brain functional connectivity (FC) and acupuncture effects. Results Through machine learning, we obtained a balance accuracy rate of 75.51% and eight feature ROIs. Compared to HC, we found that the stroke patients with lower FC between these feature ROIs with other brain regions, while patients in the MCID group exhibited a wider range of lower FC. When acupuncture was applied to Yanglingquan (GB 34), the abnormal FC of patients was decreased, with different targets of effects in different groups. Conclusion Feature ROIs identified by machine learning can predict the classification of stroke patients with different motor improvements, and the FC between these ROIs with other brain regions is decreased. Acupuncture can modulate the bilateral cerebral hemispheres to restore abnormal FC via different targets, thereby promoting motor recovery after stroke. Clinical trial registration https://www.chictr.org.cn/showproj.html?proj=37359, ChiCTR1900022220.
Collapse
Affiliation(s)
- Mengxin Lu
- Department of Neurology, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Zhongming Du
- Department of Acupuncture, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Jiping Zhao
- Department of Acupuncture, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Lan Jiang
- Department of Chinese Medicine, Peking Union Medical College Hospital, Beijing, China
| | - Ruoyi Liu
- Department of Neurology, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Muzhao Zhang
- Department of Neurology, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Tianjiao Xu
- Department of Neurology, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Jingpei Wei
- Department of Neurology, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Wei Wang
- Department of Neurology, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Lingling Xu
- Department of Neurology, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Haijiao Guo
- Department of Neurology, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Chen Chen
- Department of Neurology, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Xin Yu
- Department of Neurology, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Zhongjian Tan
- Department of Radiology, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Jiliang Fang
- Department of Radiology, Guang’anmen Hospital, China Academy of Chinese Medical Sciences, Beijing, China
| | - Yihuai Zou
- Department of Neurology, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| |
Collapse
|
18
|
Liu F, Wang H, Liang SN, Jin Z, Wei S, Li X. MPS-FFA: A multiplane and multiscale feature fusion attention network for Alzheimer's disease prediction with structural MRI. Comput Biol Med 2023; 157:106790. [PMID: 36958239 DOI: 10.1016/j.compbiomed.2023.106790] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2022] [Revised: 02/13/2023] [Accepted: 03/11/2023] [Indexed: 03/17/2023]
Abstract
Structural magnetic resonance imaging (sMRI) is a popular technique that is widely applied in Alzheimer's disease (AD) diagnosis. However, only a few structural atrophy areas in sMRI scans are highly associated with AD. The degree of atrophy in patients' brain tissues and the distribution of lesion areas differ among patients. Therefore, a key challenge in sMRI-based AD diagnosis is identifying discriminating atrophy features. Hence, we propose a multiplane and multiscale feature-level fusion attention (MPS-FFA) model. The model has three components, (1) A feature encoder uses a multiscale feature extractor with hybrid attention layers to simultaneously capture and fuse multiple pathological features in the sagittal, coronal, and axial planes. (2) A global attention classifier combines clinical scores and two global attention layers to evaluate the feature impact scores and balance the relative contributions of different feature blocks. (3) A feature similarity discriminator minimizes the feature similarities among heterogeneous labels to enhance the ability of the network to discriminate atrophy features. The MPS-FFA model provides improved interpretability for identifying discriminating features using feature visualization. The experimental results on the baseline sMRI scans from two databases confirm the effectiveness (e.g., accuracy and generalizability) of our method in locating pathological locations. The source code is available at https://github.com/LiuFei-AHU/MPSFFA.
Collapse
Affiliation(s)
- Fei Liu
- Anhui Provincial International Joint Research Center for Advanced Technology in Medical Imaging, Anhui University, Hefei, China; School of Computer Science and Technology, Anhui University, Hefei, China
| | - Huabin Wang
- Anhui Provincial International Joint Research Center for Advanced Technology in Medical Imaging, Anhui University, Hefei, China; School of Computer Science and Technology, Anhui University, Hefei, China.
| | - Shiuan-Ni Liang
- School of Engineering, Monash University Malaysia, Kuala Lumpur, Malaysia
| | - Zhe Jin
- Anhui Provincial International Joint Research Center for Advanced Technology in Medical Imaging, Anhui University, Hefei, China
| | - Shicheng Wei
- Anhui Provincial International Joint Research Center for Advanced Technology in Medical Imaging, Anhui University, Hefei, China
| | - Xuejun Li
- Anhui Provincial International Joint Research Center for Advanced Technology in Medical Imaging, Anhui University, Hefei, China; School of Computer Science and Technology, Anhui University, Hefei, China
| | | |
Collapse
|
19
|
Coll L, Pareto D, Carbonell-Mirabent P, Cobo-Calvo Á, Arrambide G, Vidal-Jordana Á, Comabella M, Castilló J, Rodríguez-Acevedo B, Zabalza A, Galán I, Midaglia L, Nos C, Salerno A, Auger C, Alberich M, Río J, Sastre-Garriga J, Oliver A, Montalban X, Rovira À, Tintoré M, Lladó X, Tur C. Deciphering multiple sclerosis disability with deep learning attention maps on clinical MRI. Neuroimage Clin 2023; 38:103376. [PMID: 36940621 PMCID: PMC10034138 DOI: 10.1016/j.nicl.2023.103376] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Revised: 03/09/2023] [Accepted: 03/09/2023] [Indexed: 03/17/2023]
Abstract
The application of convolutional neural networks (CNNs) to MRI data has emerged as a promising approach to achieving unprecedented levels of accuracy when predicting the course of neurological conditions, including multiple sclerosis, by means of extracting image features not detectable through conventional methods. Additionally, the study of CNN-derived attention maps, which indicate the most relevant anatomical features for CNN-based decisions, has the potential to uncover key disease mechanisms leading to disability accumulation. From a cohort of patients prospectively followed up after a first demyelinating attack, we selected those with T1-weighted and T2-FLAIR brain MRI sequences available for image analysis and a clinical assessment performed within the following six months (N = 319). Patients were divided into two groups according to expanded disability status scale (EDSS) score: ≥3.0 and < 3.0. A 3D-CNN model predicted the class using whole-brain MRI scans as input. A comparison with a logistic regression (LR) model using volumetric measurements as explanatory variables and a validation of the CNN model on an independent dataset with similar characteristics (N = 440) were also performed. The layer-wise relevance propagation method was used to obtain individual attention maps. The CNN model achieved a mean accuracy of 79% and proved to be superior to the equivalent LR-model (77%). Additionally, the model was successfully validated in the independent external cohort without any re-training (accuracy = 71%). Attention-map analyses revealed the predominant role of frontotemporal cortex and cerebellum for CNN decisions, suggesting that the mechanisms leading to disability accrual exceed the mere presence of brain lesions or atrophy and probably involve how damage is distributed in the central nervous system.
Collapse
Affiliation(s)
- Llucia Coll
- Multiple Sclerosis Centre of Catalonia (Cemcat), Hospital Universitari Vall d'Hebron, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Deborah Pareto
- Section of Neuroradiology, Department of Radiology (IDI), Vall d'Hebron University Hospital, Spain, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Pere Carbonell-Mirabent
- Multiple Sclerosis Centre of Catalonia (Cemcat), Hospital Universitari Vall d'Hebron, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Álvaro Cobo-Calvo
- Multiple Sclerosis Centre of Catalonia (Cemcat), Hospital Universitari Vall d'Hebron, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Georgina Arrambide
- Multiple Sclerosis Centre of Catalonia (Cemcat), Hospital Universitari Vall d'Hebron, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Ángela Vidal-Jordana
- Multiple Sclerosis Centre of Catalonia (Cemcat), Hospital Universitari Vall d'Hebron, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Manuel Comabella
- Multiple Sclerosis Centre of Catalonia (Cemcat), Hospital Universitari Vall d'Hebron, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Joaquín Castilló
- Multiple Sclerosis Centre of Catalonia (Cemcat), Hospital Universitari Vall d'Hebron, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Breogán Rodríguez-Acevedo
- Multiple Sclerosis Centre of Catalonia (Cemcat), Hospital Universitari Vall d'Hebron, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Ana Zabalza
- Multiple Sclerosis Centre of Catalonia (Cemcat), Hospital Universitari Vall d'Hebron, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Ingrid Galán
- Multiple Sclerosis Centre of Catalonia (Cemcat), Hospital Universitari Vall d'Hebron, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Luciana Midaglia
- Multiple Sclerosis Centre of Catalonia (Cemcat), Hospital Universitari Vall d'Hebron, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Carlos Nos
- Multiple Sclerosis Centre of Catalonia (Cemcat), Hospital Universitari Vall d'Hebron, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Annalaura Salerno
- Section of Neuroradiology, Department of Radiology (IDI), Vall d'Hebron University Hospital, Spain, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Cristina Auger
- Section of Neuroradiology, Department of Radiology (IDI), Vall d'Hebron University Hospital, Spain, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Manel Alberich
- Section of Neuroradiology, Department of Radiology (IDI), Vall d'Hebron University Hospital, Spain, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Jordi Río
- Multiple Sclerosis Centre of Catalonia (Cemcat), Hospital Universitari Vall d'Hebron, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Jaume Sastre-Garriga
- Multiple Sclerosis Centre of Catalonia (Cemcat), Hospital Universitari Vall d'Hebron, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Arnau Oliver
- Research institute of Computer Vision and Robotics, University of Girona, Girona, Spain
| | - Xavier Montalban
- Multiple Sclerosis Centre of Catalonia (Cemcat), Hospital Universitari Vall d'Hebron, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Àlex Rovira
- Section of Neuroradiology, Department of Radiology (IDI), Vall d'Hebron University Hospital, Spain, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Mar Tintoré
- Multiple Sclerosis Centre of Catalonia (Cemcat), Hospital Universitari Vall d'Hebron, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Xavier Lladó
- Research institute of Computer Vision and Robotics, University of Girona, Girona, Spain
| | - Carmen Tur
- Multiple Sclerosis Centre of Catalonia (Cemcat), Hospital Universitari Vall d'Hebron, Universitat Autònoma de Barcelona, Barcelona, Spain.
| |
Collapse
|
20
|
Zhao Y, Wang X, Che T, Bao G, Li S. Multi-task deep learning for medical image computing and analysis: A review. Comput Biol Med 2023; 153:106496. [PMID: 36634599 DOI: 10.1016/j.compbiomed.2022.106496] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Revised: 12/06/2022] [Accepted: 12/27/2022] [Indexed: 12/29/2022]
Abstract
The renaissance of deep learning has provided promising solutions to various tasks. While conventional deep learning models are constructed for a single specific task, multi-task deep learning (MTDL) that is capable to simultaneously accomplish at least two tasks has attracted research attention. MTDL is a joint learning paradigm that harnesses the inherent correlation of multiple related tasks to achieve reciprocal benefits in improving performance, enhancing generalizability, and reducing the overall computational cost. This review focuses on the advanced applications of MTDL for medical image computing and analysis. We first summarize four popular MTDL network architectures (i.e., cascaded, parallel, interacted, and hybrid). Then, we review the representative MTDL-based networks for eight application areas, including the brain, eye, chest, cardiac, abdomen, musculoskeletal, pathology, and other human body regions. While MTDL-based medical image processing has been flourishing and demonstrating outstanding performance in many tasks, in the meanwhile, there are performance gaps in some tasks, and accordingly we perceive the open challenges and the perspective trends. For instance, in the 2018 Ischemic Stroke Lesion Segmentation challenge, the reported top dice score of 0.51 and top recall of 0.55 achieved by the cascaded MTDL model indicate further research efforts in high demand to escalate the performance of current models.
Collapse
Affiliation(s)
- Yan Zhao
- Beijing Advanced Innovation Center for Biomedical Engineering, School of Biological Science and Medical Engineering, Beihang University, Beijing, 100083, China
| | - Xiuying Wang
- School of Computer Science, The University of Sydney, Sydney, NSW, 2008, Australia.
| | - Tongtong Che
- Beijing Advanced Innovation Center for Biomedical Engineering, School of Biological Science and Medical Engineering, Beihang University, Beijing, 100083, China
| | - Guoqing Bao
- School of Computer Science, The University of Sydney, Sydney, NSW, 2008, Australia
| | - Shuyu Li
- State Key Laboratory of Cognitive Neuroscience and Learning, Beijing Normal University, Beijing, 100875, China.
| |
Collapse
|
21
|
Chun MY, Park CJ, Kim J, Jeong JH, Jang H, Kim K, Seo SW. Prediction of conversion to dementia using interpretable machine learning in patients with amnestic mild cognitive impairment. Front Aging Neurosci 2022; 14:898940. [PMID: 35992586 PMCID: PMC9389270 DOI: 10.3389/fnagi.2022.898940] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Accepted: 07/18/2022] [Indexed: 11/13/2022] Open
Abstract
Purpose Amnestic mild cognitive impairment (aMCI) is a transitional state between normal aging and Alzheimer's disease (AD). However, not all aMCI patients are observed to convert to AD dementia. Therefore, developing a predictive algorithm for the conversion of aMCI to AD dementia is important. Parametric methods, such as logistic regression, have been developed; however, it is difficult to reflect complex patterns, such as non-linear relationships and interactions between variables. Therefore, this study aimed to improve the predictive power of aMCI patients' conversion to dementia by using an interpretable machine learning (IML) algorithm and to identify the factors that increase the risk of individual conversion to dementia in each patient. Methods We prospectively recruited 705 patients with aMCI who had been followed-up for at least 3 years after undergoing baseline neuropsychological tests at the Samsung Medical Center between 2007 and 2019. We used neuropsychological tests and apolipoprotein E (APOE) genotype data to develop a predictive algorithm. The model-building and validation datasets were composed of data of 565 and 140 patients, respectively. For global interpretation, four algorithms (logistic regression, random forest, support vector machine, and extreme gradient boosting) were compared. For local interpretation, individual conditional expectations (ICE) and SHapley Additive exPlanations (SHAP) were used to analyze individual patients. Results Among the four algorithms, the extreme gradient boost model showed the best performance, with an area under the receiver operating characteristic curve of 0.852 and an accuracy of 0.807. Variables, such as age, education, the scores of visuospatial and memory domains, the sum of boxes of the Clinical Dementia Rating scale, Mini-Mental State Examination, and APOE genotype were important features for creating the algorithm. Through ICE and SHAP analyses, it was also possible to interpret which variables acted as strong factors for each patient. Conclusion We were able to propose a predictive algorithm for each aMCI individual's conversion to dementia using the IML technique. This algorithm is expected to be useful in clinical practice and the research field, as it can suggest conversion with high accuracy and identify the degree of influence of risk factors for each patient.
Collapse
Affiliation(s)
- Min Young Chun
- Department of Neurology, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
| | - Chae Jung Park
- Department of Digital Health, Samsung Advanced Institute for Health Sciences and Technology, Sungkyunkwan University, Seoul, South Korea
- Alzheimer’s Disease Convergence Research Center, Samsung Medical Center, Seoul, South Korea
| | - Jonghyuk Kim
- Department of Digital Health, Samsung Advanced Institute for Health Sciences and Technology, Sungkyunkwan University, Seoul, South Korea
| | - Jee Hyang Jeong
- Department of Neurology, Ewha Womans University Seoul Hospital, Ewha Womans University College of Medicine, Seoul, South Korea
| | - Hyemin Jang
- Department of Neurology, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
- Alzheimer’s Disease Convergence Research Center, Samsung Medical Center, Seoul, South Korea
| | - Kyunga Kim
- Department of Digital Health, Samsung Advanced Institute for Health Sciences and Technology, Sungkyunkwan University, Seoul, South Korea
- Biomedical Statistics Center, Data Science Research Institute, Research Institute for Future Medicine, Samsung Medical Center, Seoul, South Korea
| | - Sang Won Seo
- Department of Neurology, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, South Korea
- Department of Digital Health, Samsung Advanced Institute for Health Sciences and Technology, Sungkyunkwan University, Seoul, South Korea
- Alzheimer’s Disease Convergence Research Center, Samsung Medical Center, Seoul, South Korea
- Department of Health Sciences and Technology, Samsung Advanced Institute for Health Sciences and Technology, Sungkyunkwan University, Seoul, South Korea
- Department of Intelligent Precision Healthcare Convergence, Sungkyunkwan University, Suwon, South Korea
| |
Collapse
|
22
|
Chen L, Qiao H, Zhu F. Alzheimer's Disease Diagnosis With Brain Structural MRI Using Multiview-Slice Attention and 3D Convolution Neural Network. Front Aging Neurosci 2022; 14:871706. [PMID: 35557839 PMCID: PMC9088013 DOI: 10.3389/fnagi.2022.871706] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2022] [Accepted: 03/17/2022] [Indexed: 01/01/2023] Open
Abstract
Numerous artificial intelligence (AI) based approaches have been proposed for automatic Alzheimer's disease (AD) prediction with brain structural magnetic resonance imaging (sMRI). Previous studies extract features from the whole brain or individual slices separately, ignoring the properties of multi-view slices and feature complementarity. For this reason, we present a novel AD diagnosis model based on the multiview-slice attention and 3D convolution neural network (3D-CNN). Specifically, we begin by extracting the local slice-level characteristic in various dimensions using multiple sub-networks. Then we proposed a slice-level attention mechanism to emphasize specific 2D-slices to exclude the redundancy features. After that, a 3D-CNN was employed to capture the global subject-level structural changes. Finally, all these 2D and 3D features were fused to obtain more discriminative representations. We conduct the experiments on 1,451 subjects from ADNI-1 and ADNI-2 datasets. Experimental results showed the superiority of our model over the state-of-the-art approaches regarding dementia classification. Specifically, our model achieves accuracy values of 91.1 and 80.1% on ADNI-1 for AD diagnosis and mild cognitive impairment (MCI) convention prediction, respectively.
Collapse
Affiliation(s)
- Lin Chen
- Chongqing Key Laboratory of Big Data and Intelligent Computing, Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing, China
| | - Hezhe Qiao
- Chongqing Key Laboratory of Big Data and Intelligent Computing, Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Fan Zhu
- Chongqing Key Laboratory of Big Data and Intelligent Computing, Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing, China
| |
Collapse
|
23
|
Bento M, Fantini I, Park J, Rittner L, Frayne R. Deep Learning in Large and Multi-Site Structural Brain MR Imaging Datasets. Front Neuroinform 2022; 15:805669. [PMID: 35126080 PMCID: PMC8811356 DOI: 10.3389/fninf.2021.805669] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2021] [Accepted: 12/27/2021] [Indexed: 12/22/2022] Open
Abstract
Large, multi-site, heterogeneous brain imaging datasets are increasingly required for the training, validation, and testing of advanced deep learning (DL)-based automated tools, including structural magnetic resonance (MR) image-based diagnostic and treatment monitoring approaches. When assembling a number of smaller datasets to form a larger dataset, understanding the underlying variability between different acquisition and processing protocols across the aggregated dataset (termed “batch effects”) is critical. The presence of variation in the training dataset is important as it more closely reflects the true underlying data distribution and, thus, may enhance the overall generalizability of the tool. However, the impact of batch effects must be carefully evaluated in order to avoid undesirable effects that, for example, may reduce performance measures. Batch effects can result from many sources, including differences in acquisition equipment, imaging technique and parameters, as well as applied processing methodologies. Their impact, both beneficial and adversarial, must be considered when developing tools to ensure that their outputs are related to the proposed clinical or research question (i.e., actual disease-related or pathological changes) and are not simply due to the peculiarities of underlying batch effects in the aggregated dataset. We reviewed applications of DL in structural brain MR imaging that aggregated images from neuroimaging datasets, typically acquired at multiple sites. We examined datasets containing both healthy control participants and patients that were acquired using varying acquisition protocols. First, we discussed issues around Data Access and enumerated the key characteristics of some commonly used publicly available brain datasets. Then we reviewed methods for correcting batch effects by exploring the two main classes of approaches: Data Harmonization that uses data standardization, quality control protocols or other similar algorithms and procedures to explicitly understand and minimize unwanted batch effects; and Domain Adaptation that develops DL tools that implicitly handle the batch effects by using approaches to achieve reliable and robust results. In this narrative review, we highlighted the advantages and disadvantages of both classes of DL approaches, and described key challenges to be addressed in future studies.
Collapse
Affiliation(s)
- Mariana Bento
- Electrical and Software Engineering, Schulich School of Engineering, University of Calgary, Calgary, AB, Canada
- Hotchkiss Brain Institute, University of Calgary, Calgary, AB, Canada
- Calgary Image Processing and Analysis Centre, Foothills Medical Centre, Calgary, AB, Canada
- *Correspondence: Mariana Bento
| | - Irene Fantini
- School of Electrical and Computer Engineering, University of Campinas, Campinas, Brazil
| | - Justin Park
- Hotchkiss Brain Institute, University of Calgary, Calgary, AB, Canada
- Calgary Image Processing and Analysis Centre, Foothills Medical Centre, Calgary, AB, Canada
- Radiology and Clinical Neurosciences, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Leticia Rittner
- School of Electrical and Computer Engineering, University of Campinas, Campinas, Brazil
| | - Richard Frayne
- Hotchkiss Brain Institute, University of Calgary, Calgary, AB, Canada
- Calgary Image Processing and Analysis Centre, Foothills Medical Centre, Calgary, AB, Canada
- Radiology and Clinical Neurosciences, Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
- Seaman Family MR Research Centre, Foothills Medical Centre, Calgary, AB, Canada
| |
Collapse
|
24
|
Zhang Y, Wang Q, Xue M, Pang B, Yang M, Zhang Z, Niu W. Identifying factors associated with central obesity in school students using artificial intelligence techniques. Front Pediatr 2022; 10:1060270. [PMID: 36533227 PMCID: PMC9748186 DOI: 10.3389/fped.2022.1060270] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/03/2022] [Accepted: 11/14/2022] [Indexed: 12/03/2022] Open
Abstract
OBJECTIVES We, in a large survey of school students from Beijing, aimed to identify the minimal number of promising factors associated with central obesity and the optimal machine-learning algorithm. METHODS Using a cluster sampling strategy, this cross-sectional survey was conducted in Beijing in early 2022 among students 6-14 years of age. Information was gleaned via online questionnaires and analyzed by the PyCharm and Python. RESULTS Data from 11,308 children were abstracted for analysis, and 3,970 of children had central obesity. Light gradient boosting machine (LGBM) outperformed the other 10 models. The accuracy, precision, recall, F1 score, area under the receiver operating characteristic of LGBM were 0.769982, 0.688312, 0.612323, 0.648098, and 0.825352, respectively. After a comprehensive evaluation, the minimal set involving top 6 important variables that can predict central obesity with descent performance was ascertained, including father's body mass index (BMI), mother's BMI, picky for foods, outdoor activity, screen, and sex. Validation using the deep-learning model indicated that prediction performance between variables in the minimal set and in the whole set was comparable. CONCLUSIONS We have identified and validated a minimal set of six important factors that can decently predict the risk of central obesity when using the optimal LGBM model relative to the whole set.
Collapse
Affiliation(s)
- Yicheng Zhang
- Graduate School, Beijing University of Chinese Medicine, Beijing, China.,Department of Pediatrics, China-Japan Friendship Hospital, Beijing, China
| | - Qiong Wang
- Graduate School, Beijing University of Chinese Medicine, Beijing, China.,Department of Pediatrics, China-Japan Friendship Hospital, Beijing, China
| | - Mei Xue
- Graduate School, Beijing University of Chinese Medicine, Beijing, China.,Department of Pediatrics, China-Japan Friendship Hospital, Beijing, China
| | - Bo Pang
- Graduate School, Beijing University of Chinese Medicine, Beijing, China.,Department of Pediatrics, China-Japan Friendship Hospital, Beijing, China
| | - Min Yang
- Graduate School, Beijing University of Chinese Medicine, Beijing, China.,Department of Pediatrics, China-Japan Friendship Hospital, Beijing, China
| | - Zhixin Zhang
- Department of Pediatrics, China-Japan Friendship Hospital, Beijing, China.,International Medical Services, China-Japan Friendship Hospital, Beijing, China
| | - Wenquan Niu
- Institute of Clinical Medical Sciences, China-Japan Friendship Hospital, Beijing, China
| |
Collapse
|
25
|
Xue M, Wang Q, Zhang Y, Pang B, Yang M, Deng X, Zhang Z, Niu W. Factors Associated With Lower Respiratory Tract Infection Among Chinese Students Aged 6-14 Years. Front Pediatr 2022; 10:911591. [PMID: 35783299 PMCID: PMC9243225 DOI: 10.3389/fped.2022.911591] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/02/2022] [Accepted: 05/23/2022] [Indexed: 12/03/2022] Open
Abstract
AIMS We employed machine-learning methods to explore data from a large survey on students, with the goal of identifying and validating a thrifty panel of important factors associated with lower respiratory tract infection (LRTI). METHODS Cross-sectional cluster sampling was performed for a survey of students aged 6-14 years who attended primary or junior high school in Beijing within January, 2022. Data were collected via electronic questionnaires. Statistical analyses were completed using the PyCharm (Edition 2018.1 x64) and Python (Version 3.7.6). RESULTS Data from 11,308 students (5,527 girls and 5,781 boys) were analyzed, and 909 of them had LRTI with the prevalence of 8.01%. After a comprehensive evaluation, the Gaussian naive Bayes (gNB) algorithm outperformed the other machine-learning algorithms. The gNB algorithm had accuracy of 0.856, precision of 0.140, recall of 0.165, F1 score of 0.151, and area under the receiver operating characteristic curve (AUROC) of 0.652. Using the optimal gNB algorithm, top five important factors, including age, rhinitis, sitting time, dental caries, and food or drug allergy, had decent prediction performance. In addition, the top five factors had prediction performance comparable to all factors modeled. For example, under the sequential deep-learning model, the accuracy and loss were separately gauged at 92.26 and 25.62% when incorporating the top five factors, and 92.22 and 25.52% when incorporating all factors. CONCLUSIONS Our findings showed the top five important factors modeled by gNB algorithm can sufficiently represent all involved factors in predicting LRTI risk among Chinese students aged 6-14 years.
Collapse
Affiliation(s)
- Mei Xue
- Graduate School, Beijing University of Chinese Medicine, Beijing, China.,Department of Pediatrics, China-Japan Friendship Hospital, Beijing, China
| | - Qiong Wang
- Graduate School, Beijing University of Chinese Medicine, Beijing, China.,Department of Pediatrics, China-Japan Friendship Hospital, Beijing, China
| | - Yicheng Zhang
- Graduate School, Beijing University of Chinese Medicine, Beijing, China.,Department of Pediatrics, China-Japan Friendship Hospital, Beijing, China
| | - Bo Pang
- Graduate School, Beijing University of Chinese Medicine, Beijing, China.,Department of Pediatrics, China-Japan Friendship Hospital, Beijing, China
| | - Min Yang
- Graduate School, Beijing University of Chinese Medicine, Beijing, China.,Department of Pediatrics, China-Japan Friendship Hospital, Beijing, China
| | - Xiangling Deng
- Graduate School, Beijing University of Chinese Medicine, Beijing, China.,Department of Pediatrics, China-Japan Friendship Hospital, Beijing, China
| | - Zhixin Zhang
- Department of Pediatrics, China-Japan Friendship Hospital, Beijing, China.,International Medical Services, China-Japan Friendship Hospital, Beijing, China
| | - Wenquan Niu
- Institute of Clinical Medical Sciences, China-Japan Friendship Hospital, Beijing, China
| |
Collapse
|
26
|
Multi-view prediction of Alzheimer's disease progression with end-to-end integrated framework. J Biomed Inform 2021; 125:103978. [PMID: 34922021 DOI: 10.1016/j.jbi.2021.103978] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2021] [Revised: 12/05/2021] [Accepted: 12/11/2021] [Indexed: 11/21/2022]
Abstract
Alzheimer's disease is a common neurodegenerative brain disease that affects the elderly population worldwide. Its early automatic detection is vital for early intervention and treatment. A common solution is to perform future cognitive score prediction based on the baseline brain structural magnetic resonance image (MRI), which can directly infer the potential severity of disease. Recently, several studies have modelled disease progression by predicting the future brain MRI that can provide visual information of brain changes over time. Nevertheless, no studies explore the intra correlation of these two solutions, and it is unknown whether the predicted MRI can assist the prediction of cognitive score. Here, instead of independent prediction, we aim to predict disease progression in multi-view, i.e., predicting subject-specific changes of cognitive score and MRI volume concurrently. To achieve this, we propose an end-to-end integrated framework, where a regression model and a generative adversarial network are integrated together and then jointly optimized. Three integration strategies are exploited to unify these two models. Moreover, considering that some brain regions, such as hippocampus and middle temporal gyrus, could change significantly during the disease progression, a region-of-interest (ROI) mask and a ROI loss are introduced into the integrated framework to leverage this anatomical prior knowledge. Experimental results on the longitudinal Alzheimer's Disease Neuroimaging Initiative dataset demonstrated that the integrated framework outperformed the independent regression model for cognitive score prediction. And its performance can be further improved with the ROI loss for both cognitive score and MRI prediction.
Collapse
|
27
|
Zhang Z, Gao L, Jin G, Guo L, Yao Y, Dong L, Han J. THAN: task-driven hierarchical attention network for the diagnosis of mild cognitive impairment and Alzheimer's disease. Quant Imaging Med Surg 2021; 11:3338-3354. [PMID: 34249658 PMCID: PMC8249997 DOI: 10.21037/qims-21-91] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Accepted: 04/26/2021] [Indexed: 11/06/2022]
Abstract
BACKGROUND To assist doctors to diagnose mild cognitive impairment (MCI) and Alzheimer's disease (AD) early and accurately, convolutional neural networks based on structural magnetic resonance imaging (sMRI) images have been developed and shown excellent performance. However, they are still limited in their capacity in extracting discriminative features because of large sMRI image volumes yet small lesion regions and the small number of sMRI images. METHODS We proposed a task-driven hierarchical attention network (THAN) taking advantage of the merits of patch-based and attention-based convolutional neural networks for MCI and AD diagnosis. THAN consists of an information sub-network and a hierarchical attention sub-network. In the information sub-network, an information map extractor, a patch-assistant module, and a mutual-boosting loss function are designed to generate a task-driven information map, which automatically highlights disease-related regions and their importance for final classification. In the hierarchical attention sub-network, a visual attention module and a semantic attention module are devised based on the information map to extract discriminative features for disease diagnosis. RESULTS Extensive experiments were conducted for four classification tasks: MCI versus (vs.) normal controls (NC), AD vs. NC, AD vs. MCI, and AD vs. MCI vs. NC. Results demonstrated that THAN attained the accuracy of 81.6% for MCI vs. NC, 93.5% for AD vs. NC, 80.8% for AD vs. MCI, and 62.9% for AD vs. MCI vs. NC. It outperformed advanced attention-based and patch-based methods. Moreover, information maps generated by the information sub-network could highlight the potential biomarkers of MCI and AD, such as the hippocampus and ventricles. Furthermore, when the visual and semantic attention modules were combined, the performance of the four tasks was highly improved. CONCLUSIONS The information sub-network can automatically highlight the disease-related regions. The hierarchical attention sub-network can extract discriminative visual and semantic features. Through the two sub-networks, THAN fully exploits the visual and semantic features of disease-related regions and meanwhile considers global features of sMRI images, which finally facilitate the diagnosis of MCI and AD.
Collapse
Affiliation(s)
- Zhehao Zhang
- Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, China
| | - Linlin Gao
- Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, China
| | - Guang Jin
- Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, China
| | - Lijun Guo
- Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, China
| | - Yudong Yao
- Research Institute for Medical and Biological Engineering, Ningbo University, Ningbo, China
| | - Li Dong
- Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, China
| | - Jinming Han
- Department of Clinical Neuroscience, Karolinska Institutet, Stockholm, Sweden
| | - the Alzheimer’s Disease NeuroImaging Initiative
- Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo, China
- Research Institute for Medical and Biological Engineering, Ningbo University, Ningbo, China
- Department of Clinical Neuroscience, Karolinska Institutet, Stockholm, Sweden
| |
Collapse
|
28
|
Hu X, Wang L, Yang X, Zhou X, Xue W, Cao Y, Liu S, Huang Y, Guo S, Shang N, Ni D, Gu N. Joint Landmark and Structure Learning for Automatic Evaluation of Developmental Dysplasia of the Hip. IEEE J Biomed Health Inform 2021; 26:345-358. [PMID: 34101608 DOI: 10.1109/jbhi.2021.3087494] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
The ultrasound (US) screening of the infant hip is vital for early diagnosis of developmental dysplasia of the hip (DDH). The US diagnosis of DDH refers to measuring alpha and beta angles that quantify hip joint development. These two angles are calculated from key anatomical landmarks and structures of the hip. However, this measurement process is not trivial for sonographers and usually requires a thorough understanding of complex anatomical structures. In this study, we propose a multi-task framework to learn the relationships among landmarks and structures jointly and automatically evaluate DDH. Our multi-task networks are equipped with three novel modules. Firstly, we adopt Mask R-CNN as the basic framework to detect and segment key anatomical structures and add one landmark detection branch to form a new multi-task framework. Secondly, we propose a novel shape similarity loss to refine the incomplete anatomical structure prediction robustly and accurately. Thirdly, we further incorporate the landmark-structure consistent prior to ensure the consistency of the bony rim estimated from the segmented structure and the detected landmark. In our experiments, 1,231 US images of the infant hip from 632 patients are collected, of which 247 images from 126 patients are tested. The average errors in alpha and beta angles are 2.221 and 2.899. About 93% and 85% estimates of alpha and beta angles have errors less than 5 degrees, respectively. Experimental results demonstrate that the proposed method can accurately and robustly realize the automatic evaluation of DDH, showing great potential for clinical application.
Collapse
|