1
|
Irfan M, Haq IU, Malik KM, Muhammad K. One-shot learning for generalization in medical image classification across modalities. Comput Med Imaging Graph 2025; 122:102507. [PMID: 40049026 DOI: 10.1016/j.compmedimag.2025.102507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Revised: 01/22/2025] [Accepted: 01/29/2025] [Indexed: 03/24/2025]
Abstract
Generalizability is one of the biggest challenges hindering the advancement of medical sensing technologies across multiple imaging modalities. This issue is further impaired when the imaging data is limited in scope or of poor quality. To tackle this, we propose a generalized and robust, lightweight one-shot learning method for medical image classification across various imaging modalities, including X-ray, microscopic, and CT scans. Our model introduces a collaborative one-shot training (COST) approach, incorporating both meta-learning and metric-learning. This approach allows for effective training on only one image per class. To ensure generalization with fewer epochs, we employ gradient generalization at dense and fully connected layers, utilizing a lightweight Siamese network with triplet loss and shared parameters. The proposed model was evaluated on 12 medical image datasets from MedMNIST2D, achieving an average accuracy of 91.5 % and area under the curve (AUC) of 0.89, outperforming state-of-the-art models such as ResNet-50 and AutoML by over 10 % on certain datasets. Further, in the OCTMNIST dataset, our model achieved an AUC of 0.91 compared to ResNet-50's 0.77. Ablation studies further validate the superiority of our approach, with the COST method showing significant improvement in convergence speed and accuracy when compared to traditional one-shot learning setups. Additionally, our model's lightweight architecture requires only 0.15 million parameters, making it well-suited for deployment on resource-constrained devices.
Collapse
Affiliation(s)
- Muhammad Irfan
- SMILES LAB, College of Innovation & Technology, University of Michigan-Flint, Flint, MI 48502, USA
| | - Ijaz Ul Haq
- SMILES LAB, College of Innovation & Technology, University of Michigan-Flint, Flint, MI 48502, USA
| | - Khalid Mahmood Malik
- SMILES LAB, College of Innovation & Technology, University of Michigan-Flint, Flint, MI 48502, USA.
| | - Khan Muhammad
- VIS2KNOW Lab, Department of Applied Artificial Intelligence, School of Convergence, College of Computing and Informatics, Sungkyunkwan University, Seoul 03063, South Korea.
| |
Collapse
|
2
|
Moghaddam AH, Kerdabadi MN, Zhong C, Yao Z. Meta-Learning on Augmented Gene Expression Profiles for Enhanced Lung Cancer Detection. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2025; 2024:828-837. [PMID: 40417531 PMCID: PMC12099339] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 05/27/2025]
Abstract
Gene expression profiles obtained through DNA microarray have proven successful in providing critical information for cancer detection classifiers. However, the limited number of samples in these datasets poses a challenge to employ complex methodologies such as deep neural networks for sophisticated analysis. To address this "small data" dilemma, Meta-Learning has been introduced as a solution to enhance the optimization of machine learning models by utilizing similar datasets, thereby facilitating a quicker adaptation to target datasets without the requirement of sufficient samples. In this study, we present a meta-learning-based approach for predicting lung cancer from gene expression profiles. We apply this framework to well-established deep learning methodologies and employ four distinct datasets for the meta-learning tasks, where one as the target dataset and the rest as source datasets. Our approach is evaluated against both traditional and deep learning methodologies, and the results show the superior performance of meta-learning on augmented source data compared to the baselines trained on single datasets. Moreover, we conduct the comparative analysis between meta-learning and transfer learning methodologies to highlight the efficiency of the proposed approach in addressing the challenges associated with limited sample sizes. Finally, we incorporate the explainability study to illustrate the distinctiveness of decisions made by meta-learning.
Collapse
Affiliation(s)
| | | | | | - Zijun Yao
- University of Kansas, Lawrence, KS, USA
| |
Collapse
|
3
|
Hama T, Alsaleh MM, Allery F, Choi JW, Tomlinson C, Wu H, Lai A, Pontikos N, Thygesen JH. Enhancing Patient Outcome Prediction Through Deep Learning With Sequential Diagnosis Codes From Structured Electronic Health Record Data: Systematic Review. J Med Internet Res 2025; 27:e57358. [PMID: 40100249 PMCID: PMC11962322 DOI: 10.2196/57358] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2024] [Revised: 12/14/2024] [Accepted: 02/18/2025] [Indexed: 03/20/2025] Open
Abstract
BACKGROUND The use of structured electronic health records in health care systems has grown rapidly. These systems collect huge amounts of patient information, including diagnosis codes representing temporal medical history. Sequential diagnostic information has proven valuable for predicting patient outcomes. However, the extent to which these types of data have been incorporated into deep learning (DL) models has not been examined. OBJECTIVE This systematic review aims to describe the use of sequential diagnostic data in DL models, specifically to understand how these data are integrated, whether sample size improves performance, and whether the identified models are generalizable. METHODS Relevant studies published up to May 15, 2023, were identified using 4 databases: PubMed, Embase, IEEE Xplore, and Web of Science. We included all studies using DL algorithms trained on sequential diagnosis codes to predict patient outcomes. We excluded review articles and non-peer-reviewed papers. We evaluated the following aspects in the included papers: DL techniques, characteristics of the dataset, prediction tasks, performance evaluation, generalizability, and explainability. We also assessed the risk of bias and applicability of the studies using the Prediction Model Study Risk of Bias Assessment Tool (PROBAST). We used the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) checklist to report our findings. RESULTS Of the 740 identified papers, 84 (11.4%) met the eligibility criteria. Publications in this area increased yearly. Recurrent neural networks (and their derivatives; 47/84, 56%) and transformers (22/84, 26%) were the most commonly used architectures in DL-based models. Most studies (45/84, 54%) presented their input features as sequences of visit embeddings. Medications (38/84, 45%) were the most common additional feature. Of the 128 predictive outcome tasks, the most frequent was next-visit diagnosis (n=30, 23%), followed by heart failure (n=18, 14%) and mortality (n=17, 13%). Only 7 (8%) of the 84 studies evaluated their models in terms of generalizability. A positive correlation was observed between training sample size and model performance (area under the receiver operating characteristic curve; P=.02). However, 59 (70%) of the 84 studies had a high risk of bias. CONCLUSIONS The application of DL for advanced modeling of sequential medical codes has demonstrated remarkable promise in predicting patient outcomes. The main limitation of this study was the heterogeneity of methods and outcomes. However, our analysis found that using multiple types of features, integrating time intervals, and including larger sample sizes were generally related to an improved predictive performance. This review also highlights that very few studies (7/84, 8%) reported on challenges related to generalizability and less than half (38/84, 45%) of the studies reported on challenges related to explainability. Addressing these shortcomings will be instrumental in unlocking the full potential of DL for enhancing health care outcomes and patient care. TRIAL REGISTRATION PROSPERO CRD42018112161; https://tinyurl.com/yc6h9rwu.
Collapse
Affiliation(s)
- Tuankasfee Hama
- Institute of Health Informatics, University College London, London, United Kingdom
| | - Mohanad M Alsaleh
- Institute of Health Informatics, University College London, London, United Kingdom
- Department of Health Informatics, College of Applied Medical Sciences, Qassim University, Buraydah, Saudi Arabia
| | - Freya Allery
- Institute of Health Informatics, University College London, London, United Kingdom
| | - Jung Won Choi
- Institute of Health Informatics, University College London, London, United Kingdom
| | | | - Honghan Wu
- Institute of Health Informatics, University College London, London, United Kingdom
| | - Alvina Lai
- Institute of Health Informatics, University College London, London, United Kingdom
| | - Nikolas Pontikos
- UCL Institute of Ophthalmology, University College London, London, United Kingdom
| | - Johan H Thygesen
- Institute of Health Informatics, University College London, London, United Kingdom
| |
Collapse
|
4
|
Brieghel C, Werling M, Frederiksen CM, Parviz M, Lacoppidan T, Faitova T, Teglgaard RS, Vainer N, da Cunha-Bang C, Rotbain EC, Agius R, Niemann CU. The Danish Lymphoid Cancer Research (DALY-CARE) Data Resource: The Basis for Developing Data-Driven Hematology. Clin Epidemiol 2025; 17:131-145. [PMID: 39996155 PMCID: PMC11849980 DOI: 10.2147/clep.s479672] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2024] [Accepted: 12/19/2024] [Indexed: 02/26/2025] Open
Abstract
Background Lymphoid-lineage cancers (LC; International Classification of Diseases, 10th edition [ICD10] C81.x-C90.x, C91.1-C91.9, C95.1, C95.7, C95.9, D47.2, D47.9B, and E85.8A) share many epidemiological and clinical features, which favor meta-learning when developing medical artificial intelligence (mAI). However, access to large, shared datasets is largely missing and limits mAI research. Aim Creating a large-scale data repository for patients with LC to develop data-driven hematology. Methods We gathered electronic health data and created open-source processing pipelines to create a comprehensive data resource for Danish LC Research (DALY-CARE) approved for epidemiological, molecular, and data-driven research. Results We included all Danish adults registered with LC diagnoses since 2002 (n=65,774) and combined 10 nationwide registers, electronic health records (EHR), and laboratory data on a high-powered cloud-computer to develop a secure research environment. Among other, data include treatments (ie 21,750 cytoreductive treatment plans, 21.3M outpatient prescriptions, and 12.7M in-hospital administrations), biochemical analyses (77.3M), comorbidity (14.8M ICD10 codes), pathology codes (4.5M), treatment procedures (8.3M), surgical procedures (1.0M), radiological examinations (3.3M), vital signs (18.3M values), and survival data. We herein describe the data infrastructure and exemplify how DALY-CARE has been used for molecular studies, real-world evidence to evaluate the efficacy of care, and mAI deployed directly into EHR systems. Conclusion The DALY-CARE data resource allows for the development of near real-time decision-support tools and extrapolation of clinical trial results to clinical practice, thereby improving care for patients with LC while facilitating streamlining of health data infrastructure across cohorts and medical specialties.
Collapse
Affiliation(s)
- Christian Brieghel
- Department of Hematology, Copenhagen University Hospital – Rigshospitalet, Copenhagen, Denmark
- Danish Cancer Institute, Copenhagen, Denmark
| | - Mikkel Werling
- Department of Hematology, Copenhagen University Hospital – Rigshospitalet, Copenhagen, Denmark
- Danish Cancer Institute, Copenhagen, Denmark
| | - Casper Møller Frederiksen
- Department of Hematology, Copenhagen University Hospital – Rigshospitalet, Copenhagen, Denmark
- Danish Cancer Institute, Copenhagen, Denmark
| | - Mehdi Parviz
- Department of Hematology, Copenhagen University Hospital – Rigshospitalet, Copenhagen, Denmark
- Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark
| | - Thomas Lacoppidan
- Department of Hematology, Copenhagen University Hospital – Rigshospitalet, Copenhagen, Denmark
| | - Tereza Faitova
- Department of Hematology, Copenhagen University Hospital – Rigshospitalet, Copenhagen, Denmark
- Danish Cancer Institute, Copenhagen, Denmark
| | - Rebecca Svanberg Teglgaard
- Department of Hematology, Copenhagen University Hospital – Rigshospitalet, Copenhagen, Denmark
- Department of Clinical Immunology, Copenhagen University Hospital – Rigshospitalet, Copenhagen, Denmark
| | - Noomi Vainer
- Department of Hematology, Copenhagen University Hospital – Rigshospitalet, Copenhagen, Denmark
- Danish Cancer Institute, Copenhagen, Denmark
| | - Caspar da Cunha-Bang
- Department of Hematology, Copenhagen University Hospital – Rigshospitalet, Copenhagen, Denmark
- Danish Cancer Institute, Copenhagen, Denmark
| | - Emelie Curovic Rotbain
- Department of Hematology, Copenhagen University Hospital – Rigshospitalet, Copenhagen, Denmark
- Danish Cancer Institute, Copenhagen, Denmark
| | - Rudi Agius
- Department of Hematology, Copenhagen University Hospital – Rigshospitalet, Copenhagen, Denmark
| | - Carsten Utoft Niemann
- Department of Hematology, Copenhagen University Hospital – Rigshospitalet, Copenhagen, Denmark
- Danish Cancer Institute, Copenhagen, Denmark
- Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
5
|
Li C, Denison T, Zhu T. A Survey of Few-Shot Learning for Biomedical Time Series. IEEE Rev Biomed Eng 2025; 18:192-210. [PMID: 39504299 DOI: 10.1109/rbme.2024.3492381] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2024]
Abstract
Advancements in wearable sensor technologies and the digitization of medical records have contributed to the unprecedented ubiquity of biomedical time series data. Data-driven models have tremendous potential to assist clinical diagnosis and improve patient care by improving long-term monitoring capabilities, facilitating early disease detection and intervention, as well as promoting personalized healthcare delivery. However, accessing extensively labeled datasets to train data-hungry deep learning models encounters many barriers, such as long-tail distribution of rare diseases, cost of annotation, privacy and security concerns, data-sharing regulations, and ethical considerations. An emerging approach to overcome the scarcity of labeled data is to augment AI methods with human-like capabilities to leverage past experiences to learn new tasks with limited examples, called few-shot learning. This survey provides a comprehensive review and comparison of few-shot learning methods for biomedical time series applications. The clinical benefits and limitations of such methods are discussed in relation to traditional data-driven approaches. This paper aims to provide insights into the current landscape of few-shot learning for biomedical time series and its implications for future research and applications.
Collapse
|
6
|
Agius R, Riis-Jensen AC, Wimmer B, da Cunha-Bang C, Murray DD, Poulsen CB, Bertelsen MB, Schwartz B, Lundgren JD, Langberg H, Niemann CU. Deployment and validation of the CLL treatment infection model adjoined to an EHR system. NPJ Digit Med 2024; 7:147. [PMID: 38839920 PMCID: PMC11153589 DOI: 10.1038/s41746-024-01132-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Accepted: 05/08/2024] [Indexed: 06/07/2024] Open
Abstract
Research algorithms are seldom externally validated or integrated into clinical practice, leaving unknown challenges in deployment. In such efforts, one needs to address challenges related to data harmonization, the performance of an algorithm in unforeseen missingness, automation and monitoring of predictions, and legal frameworks. We here describe the deployment of a high-dimensional data-driven decision support model into an EHR and derive practical guidelines informed by this deployment that includes the necessary processes, stakeholders and design requirements for a successful deployment. For this, we describe our deployment of the chronic lymphocytic leukemia (CLL) treatment infection model (CLL-TIM) as a stand-alone platform adjoined to an EPIC-based Danish Electronic Health Record (EHR), with the presentation of personalized predictions in a clinical context. CLL-TIM is an 84-variable data-driven prognostic model utilizing 7-year medical patient records and predicts the 2-year risk composite outcome of infection and/or treatment post-CLL diagnosis. As an independent validation cohort for this deployment, we used a retrospective population-based cohort of patients diagnosed with CLL from 2018 onwards (n = 1480). Unexpectedly high levels of missingness for key CLL-TIM variables were exhibited upon deployment. High dimensionality, with the handling of missingness, and predictive confidence were critical design elements that enabled trustworthy predictions and thus serves as a priority for prognostic models seeking deployment in new EHRs. Our setup for deployment, including automation and monitoring into EHR that meets Medical Device Regulations, may be used as step-by-step guidelines for others aiming at designing and deploying research algorithms into clinical practice.
Collapse
Affiliation(s)
- Rudi Agius
- Department of Hematology, Rigshospitalet, Copenhagen University Hospital, Copenhagen, Denmark
| | | | - Bettina Wimmer
- SP Sundhedsdata, The Data Unit, Capital Region of Denmark, Copenhagen, Denmark
| | - Caspar da Cunha-Bang
- Department of Hematology, Rigshospitalet, Copenhagen University Hospital, Copenhagen, Denmark
| | - Daniel Dawson Murray
- Center of Excellence for Health, Immunity, and Infections (CHIP), Rigshospitalet, Copenhagen University Hospital, Copenhagen, Denmark
| | | | | | - Berit Schwartz
- Rigshospitalets Innoovationscenter, Copenhagen University Hospital Rigshopsitalet, Copenhagen, Denmark
| | - Jens Dilling Lundgren
- Center of Excellence for Health, Immunity, and Infections (CHIP), Rigshospitalet, Copenhagen University Hospital, Copenhagen, Denmark
| | - Henning Langberg
- Rigshospitalets Innoovationscenter, Copenhagen University Hospital Rigshopsitalet, Copenhagen, Denmark
| | - Carsten Utoft Niemann
- Department of Hematology, Rigshospitalet, Copenhagen University Hospital, Copenhagen, Denmark.
- Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
7
|
Stewart R, Chaturvedi J, Roberts A. Natural language processing - relevance to patient outcomes and real-world evidence. Expert Rev Pharmacoecon Outcomes Res 2024; 24:5-9. [PMID: 37874661 DOI: 10.1080/14737167.2023.2275670] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Accepted: 10/23/2023] [Indexed: 10/26/2023]
Affiliation(s)
- Robert Stewart
- King's College London, Institute of Psychiatry, Psychology and Neuroscience, London, UK
- South London and Maudsley NHS Foundation Trust, London, UK
| | - Jaya Chaturvedi
- King's College London, Institute of Psychiatry, Psychology and Neuroscience, London, UK
| | - Angus Roberts
- King's College London, Institute of Psychiatry, Psychology and Neuroscience, London, UK
| |
Collapse
|
8
|
Cui W, Akrami H, Zhao G, Joshi AA, Leahy RM. Meta Transfer of Self-Supervised Knowledge: Foundation Model in Action for Post-Traumatic Epilepsy Prediction. ARXIV 2023:arXiv:2312.14204v1. [PMID: 38196751 PMCID: PMC10775348] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 01/11/2024]
Abstract
Despite the impressive advancements achieved using deep-learning for functional brain activity analysis, the heterogeneity of functional patterns and scarcity of imaging data still pose challenges in tasks such as prediction of future onset of Post-Traumatic Epilepsy (PTE) from data acquired shortly after traumatic brain injury (TBI). Foundation models pre-trained on separate large-scale datasets can improve the performance from scarce and heterogeneous datasets. For functional Magnetic Resonance Imaging (fMRI), while data may be abundantly available from healthy controls, clinical data is often scarce, limiting the ability of foundation models to identify clinically-relevant features. We overcome this limitation by introducing a novel training strategy for our foundation model by integrating meta-learning with self-supervised learning to improve the generalization from normal to clinical features. In this way we enable generalization to other downstream clinical tasks, in our case prediction of PTE. To achieve this, we perform self-supervised training on the control dataset to focus on inherent features that are not limited to a particular supervised task while applying meta-learning, which strongly improves the model's generalizability using bi-level optimization. Through experiments on neurological disorder classification tasks, we demonstrate that the proposed strategy significantly improves task performance on small-scale clinical datasets. To explore the generalizability of the foundation model in downstream applications, we then apply the model to an unseen TBI dataset for prediction of PTE using zero-shot learning. Results further demonstrated the enhanced generalizability of our foundation model.
Collapse
Affiliation(s)
- Wenhui Cui
- Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles 90089, United States
| | - Haleh Akrami
- Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles 90089, United States
| | - Ganning Zhao
- Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles 90089, United States
| | - Anand A. Joshi
- Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles 90089, United States
| | - Richard M. Leahy
- Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles 90089, United States
| |
Collapse
|
9
|
Zhou F, Qi X, Zhang K, Trajcevski G, Zhong T. MetaGeo: A General Framework for Social User Geolocation Identification With Few-Shot Learning. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2023; 34:8950-8964. [PMID: 35259118 DOI: 10.1109/tnnls.2022.3154204] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Identifying the geolocation of social media users is an important problem in a wide range of applications, spanning from disease outbreaks, emergency detection, local event recommendation, to fake news localization, online marketing planning, and even crime control and prevention. Researchers have attempted to propose various models by combining different sources of information, including text, social relation, and contextual data, which indeed has achieved promising results. However, existing approaches still suffer from certain constraints, such as: 1) a very few samples are available and 2) prediction models are not easy to be generalized for users from new regions-which are challenges that motivate our study. In this article, we propose a general framework for identifying user geolocation-MetaGeo, which is a meta-learning-based approach, learning the prior distribution of the geolocation task in order to quickly adapt the prediction toward users from new locations. Different from typical meta-learning settings that only learn a new concept from few-shot samples, MetaGeo improves the geolocation prediction with conventional settings by ensembling numerous mini-tasks. In addition, MetaGeo incorporates probabilistic inference to alleviate two issues inherent in training with few samples: location uncertainty and task ambiguity. To demonstrate the effectiveness of MetaGeo, we conduct extensive experimental evaluations on three real-world datasets and compare the performance with several state-of-the-art benchmark models. The results demonstrate the superiority of MetaGeo in both the settings where the predicted locations/regions are known or have not been seen during training.
Collapse
|
10
|
Chen B, Han Y, Yan L. A Few-shot learning approach for Monkeypox recognition from a cross-domain perspective. J Biomed Inform 2023; 144:104449. [PMID: 37488025 DOI: 10.1016/j.jbi.2023.104449] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2023] [Revised: 06/09/2023] [Accepted: 07/16/2023] [Indexed: 07/26/2023]
Abstract
Monkeypox is a zoonotic infectious skin disease initially endemic in Africa only. However, some countries are now beginning to report cases of apparent community transmission. In Computer Aided Diagnosis, deep learning has gained substantial improvement over traditional methods. Commonly, training a supervised deep model requires a large number of labeled samples. However, the collection and annotation of new disease images such as human monkeypox are time-consuming and expensive. Thus, we introduce a few-shot learning based approach for the recognition of human monkeypox in images. It requires merely a small number of training samples. In particular, it is a novel framework built with a normal backbone and auxiliary backbones. They are co-trained with Self-supervised Learning and Cross-domain Adaption techniques. The self-supervision penalty is used to help the auxiliary backbones effectively learn priors from source domain. The combined features across different domains are unified through a power transform layer. Extensive experiments are conducted on a task of recognizing chickenpox, measles, and human monkeypox diseases in a three-way few-shot manner. The results demonstrate that our method outperforms mainstream few-shot learning algorithms such as meta-learning based and fine-tuning based methods.
Collapse
Affiliation(s)
- Bolin Chen
- School of Statistics, Xi'an University of Finance and Economics, Xi'an, 710100, PR China
| | - Yu Han
- School of Statistics, Xi'an University of Finance and Economics, Xi'an, 710100, PR China
| | - Lin Yan
- School of Statistics, Xi'an University of Finance and Economics, Xi'an, 710100, PR China.
| |
Collapse
|
11
|
Amirahmadi A, Ohlsson M, Etminani K. Deep learning prediction models based on EHR trajectories: A systematic review. J Biomed Inform 2023; 144:104430. [PMID: 37380061 DOI: 10.1016/j.jbi.2023.104430] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Revised: 06/08/2023] [Accepted: 06/17/2023] [Indexed: 06/30/2023]
Abstract
BACKGROUND Electronic health records (EHRs) are generated at an ever-increasing rate. EHR trajectories, the temporal aspect of health records, facilitate predicting patients' future health-related risks. It enables healthcare systems to increase the quality of care through early identification and primary prevention. Deep learning techniques have shown great capacity for analyzing complex data and have been successful for prediction tasks using complex EHR trajectories. This systematic review aims to analyze recent studies to identify challenges, knowledge gaps, and ongoing research directions. METHODS For this systematic review, we searched Scopus, PubMed, IEEE Xplore, and ACM databases from Jan 2016 to April 2022 using search terms centered around EHR, deep learning, and trajectories. Then the selected papers were analyzed according to publication characteristics, objectives, and their solutions regarding existing challenges, such as the model's capacity to deal with intricate data dependencies, data insufficiency, and explainability. RESULTS After removing duplicates and out-of-scope papers, 63 papers were selected, which showed rapid growth in the number of research in recent years. Predicting all diseases in the next visit and the onset of cardiovascular diseases were the most common targets. Different contextual and non-contextual representation learning methods are employed to retrieve important information from the sequence of EHR trajectories. Recurrent neural networks and the time-aware attention mechanism for modeling long-term dependencies, self-attentions, convolutional neural networks, graphs for representing inner visit relations, and attention scores for explainability were frequently used among the reviewed publications. CONCLUSIONS This systematic review demonstrated how recent breakthroughs in deep learning methods have facilitated the modeling of EHR trajectories. Research on improving the ability of graph neural networks, attention mechanisms, and cross-modal learning to analyze intricate dependencies among EHRs has shown good progress. There is a need to increase the number of publicly available EHR trajectory datasets to allow for easier comparison among different models. Also, very few developed models can handle all aspects of EHR trajectory data.
Collapse
Affiliation(s)
- Ali Amirahmadi
- Center for Applied Intelligent Systems Research, Halmstad University, Sweden.
| | - Mattias Ohlsson
- Center for Applied Intelligent Systems Research, Halmstad University, Sweden; Computational Biology & Biological Physics, Department of Astronomy and Theoretical Physics, Lund University, Sweden
| | - Kobra Etminani
- Center for Applied Intelligent Systems Research, Halmstad University, Sweden
| |
Collapse
|
12
|
Zhang L, Khera R, Mortazavi BJ. Clinical Risk Prediction Models with Meta-Learning Prototypes of Patient Heterogeneity. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2023; 2023:1-4. [PMID: 38083199 PMCID: PMC11007255 DOI: 10.1109/embc40787.2023.10340765] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2023]
Abstract
Hospitalized patients sometimes have complex health conditions, such as multiple diseases, underlying diseases, and complications. The heterogeneous patient conditions may have various representations. A generalized model ignores the differences among heterogeneous patients, and personalized models, even with transfer learning, are still limited to the small amount of training data and the repeated training process. Meta-learning provides a solution for training similar patients based on few-shot learning; however, cannot address common cross-domain patients. Inspired by prototypical networks [1], we proposed a meta-prototype for Electronic Health Records (EHR), a meta-learning-based model with flexible prototypes representing the heterogeneity in patients. We apply this technique to cardiovascular diseases in MIMIC-III and compare it against a set of benchmark models, and demonstrate its ability to address heterogeneous patient health conditions and improve the model performances from 1.2% to 11.9% on different metrics and prediction tasks.Clinical relevance- Developing an adaptive EHR risk prediction model for outcomes-driven phenotyping of heterogeneous patient health conditions.
Collapse
|
13
|
Pang Y, Kukull W, Sano M, Albin RL, Shen C, Zhou J, Dodge HH. Predicting Progression from Normal to MCI and from MCI to AD Using Clinical Variables in the National Alzheimer's Coordinating Center Uniform Data Set Version 3: Application of Machine Learning Models and a Probability Calculator. J Prev Alzheimers Dis 2023; 10:301-313. [PMID: 36946457 PMCID: PMC10033942 DOI: 10.14283/jpad.2023.10] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
Clinical trials are increasingly focused on pre-manifest and early Alzheimer's disease (AD). Accurately predicting clinical progressions from normal to MCI or from MCI to dementia/AD versus non-progression is challenging. Accurate identification of symptomatic progressors is important to avoid unnecessary treatment and improve trial efficiency. Due to large inter-individual variability, biomarker positivity and comorbidity information are often insufficient to identify those destined to have symptomatic progressions. Using only clinical variables, we aimed to predict clinical progressions, estimating probabilities of progressions with a small set of variables selected by machine learning approaches. This work updates our previous work that was applied to the National Alzheimer's Coordinating Center (NACC) Uniform Data Set Version 2 (V2), by using the most recent version (V3) with additional analyses. We generated a user-friendly conversion probability calculator which can be used for effectively pre-screening trial participants.
Collapse
Affiliation(s)
- Y Pang
- Hiroko H. Dodge, Department of Neurology, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts, USA,
| | | | | | | | | | | | | |
Collapse
|
14
|
Lin S, Wang M, Shi C, Xu Z, Chen L, Gao Q, Chen J. MR-KPA: medication recommendation by combining knowledge-enhanced pre-training with a deep adversarial network. BMC Bioinformatics 2022; 23:552. [PMID: 36536291 PMCID: PMC9762031 DOI: 10.1186/s12859-022-05102-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Accepted: 12/06/2022] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND Medication recommendation based on electronic medical record (EMR) is a research hot spot in smart healthcare. For developing computational medication recommendation methods based on EMR, an important challenge is the lack of a large number of longitudinal EMR data with time correlation. Faced with this challenge, this paper proposes a new EMR-based medication recommendation model called MR-KPA, which combines knowledge-enhanced pre-training with the deep adversarial network to improve medication recommendation from both feature representation and the fine-tuning process. Firstly, a knowledge-enhanced pre-training visit model is proposed to realize domain knowledge-based external feature fusion and pre-training-based internal feature mining for improving the feature representation. Secondly, a medication recommendation model based on the deep adversarial network is developed to optimize the fine-tuning process of pre-training visit model and alleviate over-fitting of model caused by the task gap between pre-training and recommendation. RESULT The experimental results on EMRs from medical and health institutions in Hainan Province, China show that the proposed MR-KPA model can effectively improve the accuracy of medication recommendation on small-scale longitudinal EMR data compared with existing representative methods. CONCLUSION The advantages of the proposed MR-KPA are mainly attributed to knowledge enhancement based on ontology embedding, the pre-training visit model and adversarial training. Each of these three optimizations is very effective for improving the capability of medication recommendation on small-scale longitudinal EMR data, and the pre-training visit model has the most significant improvement effect. These three optimizations are also complementary, and their integration makes the proposed MR-KPA model achieve the best recommendation effect.
Collapse
Affiliation(s)
- Shaofu Lin
- grid.28703.3e0000 0000 9040 3743Faculty of Information Technology, Beijing University of Technology, Beijing, China
| | - Mengzhen Wang
- grid.28703.3e0000 0000 9040 3743Faculty of Information Technology, Beijing University of Technology, Beijing, China
| | - Chengyu Shi
- grid.28703.3e0000 0000 9040 3743Faculty of Information Technology, Beijing University of Technology, Beijing, China
| | - Zhe Xu
- grid.28703.3e0000 0000 9040 3743Faculty of Information Technology, Beijing University of Technology, Beijing, China
| | - Lihong Chen
- grid.28703.3e0000 0000 9040 3743Faculty of Information Technology, Beijing University of Technology, Beijing, China ,grid.28703.3e0000 0000 9040 3743Beijing International Collaboration Base on Brain Informatics and Wisdom Services, Beijing University of Technology, Beijing, China
| | - Qingcai Gao
- grid.28703.3e0000 0000 9040 3743Faculty of Information Technology, Beijing University of Technology, Beijing, China ,grid.28703.3e0000 0000 9040 3743Beijing International Collaboration Base on Brain Informatics and Wisdom Services, Beijing University of Technology, Beijing, China
| | - Jianhui Chen
- grid.28703.3e0000 0000 9040 3743Faculty of Information Technology, Beijing University of Technology, Beijing, China ,grid.28703.3e0000 0000 9040 3743Beijing International Collaboration Base on Brain Informatics and Wisdom Services, Beijing University of Technology, Beijing, China ,grid.28703.3e0000 0000 9040 3743Beijing Key Laboratory of MRI and Brain Informatics, Beijing University of Technology, Beijing, China ,grid.419897.a0000 0004 0369 313XEngineering Research Center of Intelligent Perception and Autonomous Control, Ministry of Education, Beijing, China ,grid.419897.a0000 0004 0369 313XEngineering Research Center of Digital Community, Ministry of Education, Beijing, China
| |
Collapse
|
15
|
Zeng J, Gensheimer MF, Rubin DL, Athey S, Shachter RD. Uncovering interpretable potential confounders in electronic medical records. Nat Commun 2022; 13:1014. [PMID: 35197467 PMCID: PMC8866497 DOI: 10.1038/s41467-022-28546-8] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Accepted: 01/28/2022] [Indexed: 12/25/2022] Open
Abstract
Randomized clinical trials (RCT) are the gold standard for informing treatment decisions. Observational studies are often plagued by selection bias, and expert-selected covariates may insufficiently adjust for confounding. We explore how unstructured clinical text can be used to reduce selection bias and improve medical practice. We develop a framework based on natural language processing to uncover interpretable potential confounders from text. We validate our method by comparing the estimated hazard ratio (HR) with and without the confounders against established RCTs. We apply our method to four cohorts built from localized prostate and lung cancer datasets from the Stanford Cancer Institute and show that our method shifts the HR estimate towards the RCT results. The uncovered terms can also be interpreted by oncologists for clinical insights. We present this proof-of-concept study to enable more credible causal inference using observational data, uncover meaningful insights from clinical text, and inform high-stakes medical decisions.
Collapse
Affiliation(s)
- Jiaming Zeng
- Department of Management Science and Engineering, Stanford University, Stanford, CA, 94305, USA.
| | - Michael F Gensheimer
- Department of Radiation Oncology, Stanford School of Medicine, Stanford, CA, 94305, USA
| | - Daniel L Rubin
- Department of Biomedical Data Science, Radiology, and Medicine, Stanford School of Medicine, Stanford, CA, 94305, USA
| | - Susan Athey
- Graduate School of Business, Stanford University, Stanford, CA, 94305, USA
| | - Ross D Shachter
- Department of Management Science and Engineering, Stanford University, Stanford, CA, 94305, USA
| |
Collapse
|
16
|
Meta Learning and the AI Learning Process. Artif Intell Med 2022. [DOI: 10.1007/978-3-030-64573-1_327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
17
|
Sun X, Guo W, Shen J. Toward attention-based learning to predict the risk of brain degeneration with multimodal medical data. Front Neurosci 2022; 16:1043626. [PMID: 36741058 PMCID: PMC9889549 DOI: 10.3389/fnins.2022.1043626] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Accepted: 12/12/2022] [Indexed: 01/20/2023] Open
Abstract
Introduction Brain degeneration is commonly caused by some chronic diseases, such as Alzheimer's disease (AD) and diabetes mellitus (DM). The risk prediction of brain degeneration aims to forecast the situation of disease progression of patients in the near future based on their historical health records. It is beneficial for patients to make an accurate clinical diagnosis and early prevention of disease. Current risk predictions of brain degeneration mainly rely on single-modality medical data, such as Electronic Health Records (EHR) or magnetic resonance imaging (MRI). However, only leveraging EHR or MRI data for the pertinent and accurate prediction is insufficient because of single-modality information (e.g., pixel or volume information of image data or clinical context information of non-image data). Methods Several deep learning-based methods have used multimodal data to predict the risks of specified diseases. However, most of them simply integrate different modalities in an early, intermediate, or late fusion structure and do not care about the intra-modal and intermodal dependencies. A lack of these dependencies would lead to sub-optimal prediction performance. Thus, we propose an encoder-decoder framework for better risk prediction of brain degeneration by using MRI and EHR. An encoder module is one of the key components and mainly focuses on feature extraction of input data. Specifically, we introduce an encoder module, which integrates intra-modal and inter-modal dependencies with the spatial-temporal attention and cross-attention mechanism. The corresponding decoder module is another key component and mainly parses the features from the encoder. In the decoder module, a disease-oriented module is used to extract the most relevant disease representation features. We take advantage of a multi-head attention module followed by a fully connected layer to produce the predicted results. Results As different types of AD and DM influence the nature and severity of brain degeneration, we evaluate the proposed method for three-class prediction of AD and three-class prediction of DM. Our results show that the proposed method with integrated MRI and EHR data achieves an accuracy of 0.859 and 0.899 for the risk prediction of AD and DM, respectively. Discussion The prediction performance is significantly better than the benchmarks, including MRI-only, EHR-only, and state-of-the-art multimodal fusion methods.
Collapse
Affiliation(s)
- Xiaofei Sun
- Department of Electrical and Electronic Engineering, The University of Hong Kong, Hong Kong, Hong Kong SAR, China
| | - Weiwei Guo
- EchoX Technology Limited, Hong Kong, Hong Kong SAR, China
| | - Jing Shen
- Department of Radiology, Affiliated Zhongshan Hospital of Dalian University, Dalian, Liaoning, China
- *Correspondence: Jing Shen,
| |
Collapse
|
18
|
Crema C, Attardi G, Sartiano D, Redolfi A. Natural language processing in clinical neuroscience and psychiatry: A review. Front Psychiatry 2022; 13:946387. [PMID: 36186874 PMCID: PMC9515453 DOI: 10.3389/fpsyt.2022.946387] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Accepted: 08/22/2022] [Indexed: 11/13/2022] Open
Abstract
Natural language processing (NLP) is rapidly becoming an important topic in the medical community. The ability to automatically analyze any type of medical document could be the key factor to fully exploit the data it contains. Cutting-edge artificial intelligence (AI) architectures, particularly machine learning and deep learning, have begun to be applied to this topic and have yielded promising results. We conducted a literature search for 1,024 papers that used NLP technology in neuroscience and psychiatry from 2010 to early 2022. After a selection process, 115 papers were evaluated. Each publication was classified into one of three categories: information extraction, classification, and data inference. Automated understanding of clinical reports in electronic health records has the potential to improve healthcare delivery. Overall, the performance of NLP applications is high, with an average F1-score and AUC above 85%. We also derived a composite measure in the form of Z-scores to better compare the performance of NLP models and their different classes as a whole. No statistical differences were found in the unbiased comparison. Strong asymmetry between English and non-English models, difficulty in obtaining high-quality annotated data, and train biases causing low generalizability are the main limitations. This review suggests that NLP could be an effective tool to help clinicians gain insights from medical reports, clinical research forms, and more, making NLP an effective tool to improve the quality of healthcare services.
Collapse
Affiliation(s)
- Claudio Crema
- Laboratory of Neuroinformatics, IRCCS Istituto Centro San Giovanni di Dio Fatebenefratelli, Brescia, Italy
| | | | - Daniele Sartiano
- Istituto di Informatica e Telematica, Consiglio Nazionale delle Ricerche, Pisa, Italy
| | - Alberto Redolfi
- Laboratory of Neuroinformatics, IRCCS Istituto Centro San Giovanni di Dio Fatebenefratelli, Brescia, Italy
| |
Collapse
|
19
|
Xie F, Yuan H, Ning Y, Ong MEH, Feng M, Hsu W, Chakraborty B, Liu N. Deep learning for temporal data representation in electronic health records: A systematic review of challenges and methodologies. J Biomed Inform 2021; 126:103980. [PMID: 34974189 DOI: 10.1016/j.jbi.2021.103980] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2021] [Revised: 11/07/2021] [Accepted: 12/20/2021] [Indexed: 12/21/2022]
Abstract
OBJECTIVE Temporal electronic health records (EHRs) contain a wealth of information for secondary uses, such as clinical events prediction and chronic disease management. However, challenges exist for temporal data representation. We therefore sought to identify these challenges and evaluate novel methodologies for addressing them through a systematic examination of deep learning solutions. METHODS We searched five databases (PubMed, Embase, the Institute of Electrical and Electronics Engineers [IEEE] Xplore Digital Library, the Association for Computing Machinery [ACM] Digital Library, and Web of Science) complemented with hand-searching in several prestigious computer science conference proceedings. We sought articles that reported deep learning methodologies on temporal data representation in structured EHR data from January 1, 2010, to August 30, 2020. We summarized and analyzed the selected articles from three perspectives: nature of time series, methodology, and model implementation. RESULTS We included 98 articles related to temporal data representation using deep learning. Four major challenges were identified, including data irregularity, heterogeneity, sparsity, and model opacity. We then studied how deep learning techniques were applied to address these challenges. Finally, we discuss some open challenges arising from deep learning. CONCLUSION Temporal EHR data present several major challenges for clinical prediction modeling and data utilization. To some extent, current deep learning solutions can address these challenges. Future studies may consider designing comprehensive and integrated solutions. Moreover, researchers should incorporate clinical domain knowledge into study designs and enhance model interpretability to facilitate clinical implementation.
Collapse
Affiliation(s)
- Feng Xie
- Programme in Health Services and Systems Research, Duke-NUS Medical School, Singapore; Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore
| | - Han Yuan
- Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore
| | - Yilin Ning
- Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore
| | - Marcus Eng Hock Ong
- Programme in Health Services and Systems Research, Duke-NUS Medical School, Singapore; Department of Emergency Medicine, Singapore General Hospital, Singapore
| | - Mengling Feng
- Saw Swee Hock School of Public Health, National University of Singapore, Singapore
| | - Wynne Hsu
- School of Computing, National University of Singapore, Singapore; Institute of Data Science, National University of Singapore, Singapore
| | - Bibhas Chakraborty
- Programme in Health Services and Systems Research, Duke-NUS Medical School, Singapore; Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore; Department of Statistics and Data Science, National University of Singapore, Singapore; Department of Biostatistics and Bioinformatics, Duke University, Durham, NC, United States
| | - Nan Liu
- Programme in Health Services and Systems Research, Duke-NUS Medical School, Singapore; Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore; Institute of Data Science, National University of Singapore, Singapore; SingHealth AI Health Program, Singapore Health Services, Singapore.
| |
Collapse
|
20
|
An Y, Tang K, Wang J. Time-Aware Multi-Type Data Fusion Representation Learning Framework for Risk Prediction of Cardiovascular Diseases. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; PP:1-1. [PMID: 34618675 DOI: 10.1109/tcbb.2021.3118418] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Predicting the future risk of cardiovascular diseases from the historical Electronic Health Records (EHRs) is a significant research task in personalized healthcare fields. In recent years, many deep neural network-based methods have emerged, which model patient disease progression by capturing the temporal patterns in sequential visit data. However, existing methods usually cannot effectively integrate the features of heterogeneous clinical data, and do not fully consider the impact of patients age and irregular time interval between consecutive medical records on the patients disease development. To address these challenges, we propose a Time-Aware Multi-type Data fUsion Representation learning framework (TAMDUR) for CVDs risk prediction. In this framework, we design a time-aware decay function, which is based on the patients age and the elapsed time between visits, to model the disease progression pattern. A parallel combination of Bi LSTM and CNN is constructed to respectively learn the temporal and non-temporal features from various types of clinical data. Finally, a multi-type data fusion representation layer based on self-attention is utilized to integrate various features and their correlations to obtain the final patient representation. We evaluate our model on a real medical dataset, and the experimental results demonstrate that TAMDUR outperforms the state-of-the-art approaches.
Collapse
|
21
|
Chen J, Chen Y, Li J, Wang J, Lin Z, Nandi AK. Stroke Risk Prediction with Hybrid Deep Transfer Learning Framework. IEEE J Biomed Health Inform 2021; 26:411-422. [PMID: 34115602 DOI: 10.1109/jbhi.2021.3088750] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Stroke has become a leading cause of death and long-term disability in the world, and there is no effective treatment.Deep learning-based approaches have the potential to outperform existing stroke risk prediction models, they rely on large well-labeled data. Due to the strict privacy protection policy in health-care systems, stroke data is usually distributed among different hospitals in small pieces. In addition, the positive and negative instances of such data are extremely imbalanced. Transfer learning solves small data issue by exploiting the knowledge of a correlated domain, especially when multiple source are available.In this work, we propose a novel Hybrid Deep Transfer Learning-based Stroke Risk Prediction (HDTL-SRP) scheme to exploit the knowledge structure from multiple correlated sources (i.e.,external stroke data, chronic diseases data, such as hypertension and diabetes). The proposed framework has been extensively tested in synthetic and real-world scenarios, and it outperforms the state-of-the-art stroke risk prediction models. It also shows the potential of real-world deployment among multiple hospitals aided with 5G/B5G infrastructures.
Collapse
|
22
|
Sun Z, Yin H, Chen H, Chen T, Cui L, Yang F. Disease Prediction via Graph Neural Networks. IEEE J Biomed Health Inform 2021; 25:818-826. [PMID: 32749976 DOI: 10.1109/jbhi.2020.3004143] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
With the increasingly available electronic medical records (EMRs), disease prediction has recently gained immense research attention, where an accurate classifier needs to be trained to map the input prediction signals (e.g., symptoms, patient demographics, etc.) to the estimated diseases for each patient. However, existing machine learning-based solutions heavily rely on abundant manually labeled EMR training data to ensure satisfactory prediction results, impeding their performance in the existence of rare diseases that are subject to severe data scarcity. For each rare disease, the limited EMR data can hardly offer sufficient information for a model to correctly distinguish its identity from other diseases with similar clinical symptoms. Furthermore, most existing disease prediction approaches are based on the sequential EMRs collected for every patient and are unable to handle new patients without historical EMRs, reducing their real-life practicality. In this paper, we introduce an innovative model based on Graph Neural Networks (GNNs) for disease prediction, which utilizes external knowledge bases to augment the insufficient EMR data, and learns highly representative node embeddings for patients, diseases and symptoms from the medical concept graph and patient record graph respectively constructed from the medical knowledge base and EMRs. By aggregating information from directly connected neighbor nodes, the proposed neural graph encoder can effectively generate embeddings that capture knowledge from both data sources, and is able to inductively infer the embeddings for a new patient based on the symptoms reported in her/his EMRs to allow for accurate prediction on both general diseases and rare diseases. Extensive experiments on a real-world EMR dataset have demonstrated the state-of-the-art performance of our proposed model.
Collapse
|
23
|
Kristjanpoller W, Michell K, Minutolo MC. A causal framework to determine the effectiveness of dynamic quarantine policy to mitigate COVID-19. Appl Soft Comput 2021; 104:107241. [PMID: 33679272 PMCID: PMC7920818 DOI: 10.1016/j.asoc.2021.107241] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2020] [Revised: 02/18/2021] [Accepted: 02/24/2021] [Indexed: 12/19/2022]
Abstract
Since the start of the pandemic caused by the novel coronavirus, COVID-19, more than 106 million people have been infected and global deaths have surpassed 2.4 million. In Chile, the government restricted the activities and movement of people, organizations, and companies, under the concept of dynamic quarantine across municipalities for a predefined period of time. Chile is an interesting context to study because reports to have a higher quantity of infections per million people as well as a higher number of polymerize chain reaction (PCR) tests per million people. The higher testing rate means that Chile has good measurement of the contagious compared to other countries. Further, the heterogeneity of the social, economic, and demographic variables collected of each Chilean municipality provides a robust set of control data to better explain the contagious rate for each city. In this paper, we propose a framework to determine the effectiveness of the dynamic quarantine policy by analyzing different causal models (meta-learners and causal forest) including a time series pattern related to effective reproductive number. Additionally, we test the ability of the proposed framework to understand and explain the spread over benchmark traditional models and to interpret the Shapley Additive Explanations (SHAP) plots. The conclusions derived from the proposed framework provide important scientific information for government policymakers in disease control strategies, not only to analyze COVID-19 but to have a better model to determine social interventions for future outbreaks.
Collapse
Affiliation(s)
- Werner Kristjanpoller
- Departamento de Industrias, Universidad Técnica Federico Santa María, Av. España 1680, Valparaíso, Chile
| | - Kevin Michell
- Departamento de Industrias, Universidad Técnica Federico Santa María, Av. España 1680, Valparaíso, Chile
| | - Marcel C Minutolo
- Robert Morris University, 6001 University Blvd Moon Township, PA 15108, United States of America
| |
Collapse
|
24
|
Si Y, Du J, Li Z, Jiang X, Miller T, Wang F, Jim Zheng W, Roberts K. Deep representation learning of patient data from Electronic Health Records (EHR): A systematic review. J Biomed Inform 2021; 115:103671. [PMID: 33387683 PMCID: PMC11290708 DOI: 10.1016/j.jbi.2020.103671] [Citation(s) in RCA: 59] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2020] [Revised: 10/23/2020] [Accepted: 12/23/2020] [Indexed: 12/22/2022]
Abstract
OBJECTIVES Patient representation learning refers to learning a dense mathematical representation of a patient that encodes meaningful information from Electronic Health Records (EHRs). This is generally performed using advanced deep learning methods. This study presents a systematic review of this field and provides both qualitative and quantitative analyses from a methodological perspective. METHODS We identified studies developing patient representations from EHRs with deep learning methods from MEDLINE, EMBASE, Scopus, the Association for Computing Machinery (ACM) Digital Library, and the Institute of Electrical and Electronics Engineers (IEEE) Xplore Digital Library. After screening 363 articles, 49 papers were included for a comprehensive data collection. RESULTS Publications developing patient representations almost doubled each year from 2015 until 2019. We noticed a typical workflow starting with feeding raw data, applying deep learning models, and ending with clinical outcome predictions as evaluations of the learned representations. Specifically, learning representations from structured EHR data was dominant (37 out of 49 studies). Recurrent Neural Networks were widely applied as the deep learning architecture (Long short-term memory: 13 studies, Gated recurrent unit: 11 studies). Learning was mainly performed in a supervised manner (30 studies) optimized with cross-entropy loss. Disease prediction was the most common application and evaluation (31 studies). Benchmark datasets were mostly unavailable (28 studies) due to privacy concerns of EHR data, and code availability was assured in 20 studies. DISCUSSION & CONCLUSION The existing predictive models mainly focus on the prediction of single diseases, rather than considering the complex mechanisms of patients from a holistic review. We show the importance and feasibility of learning comprehensive representations of patient EHR data through a systematic review. Advances in patient representation learning techniques will be essential for powering patient-level EHR analyses. Future work will still be devoted to leveraging the richness and potential of available EHR data. Reproducibility and transparency of reported results will hopefully improve. Knowledge distillation and advanced learning techniques will be exploited to assist the capability of learning patient representation further.
Collapse
Affiliation(s)
- Yuqi Si
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, TX, USA
| | - Jingcheng Du
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, TX, USA
| | - Zhao Li
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, TX, USA
| | - Xiaoqian Jiang
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, TX, USA
| | - Timothy Miller
- Computational Health Informatics Program (CHIP), Boston Children's Hospital and Harvard Medical School, MA, USA
| | - Fei Wang
- Department of Population Health Sciences. Weill Cornell Medicine, Cornell University, NY, USA
| | - W Jim Zheng
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, TX, USA
| | - Kirk Roberts
- School of Biomedical Informatics, The University of Texas Health Science Center at Houston, TX, USA.
| |
Collapse
|
25
|
Liu L, Liu Z, Wu H, Wang Z, Shen J, Song Y, Zhang M. Multi-task Learning via Adaptation to Similar Tasks for Mortality Prediction of Diverse Rare Diseases. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2021; 2020:763-772. [PMID: 33936451 PMCID: PMC8075548] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
The mortality prediction of diverse rare diseases using electronic health record (EHR) data is a crucial task for intelligent healthcare. However, data insufficiency and the clinical diversity of rare diseases make it hard for deep learning models to be trained. Mortality prediction for these patients with different diseases can be viewed as a multi-task learning problem with insufficient data but a large number of tasks. On the other hand, insufficient training data makes it difficult to train task-specific modules in multi-task learning models. To address the challenges of data insufficiency and task diversity, we propose an initialization-sharing multi-task learning method (Ada-SiT). Ada-Sit can learn the parameter initialization and dynamically measure the tasks' similarities, used for fast adaptation. We use Ada-SiT to train long short-term memory networks (LSTM) based prediction models on longitudinal EHR data. The experimental results demonstrate that the proposed model is effective for mortality prediction of diverse rare diseases.
Collapse
Affiliation(s)
- Luchen Liu
- Department of Computer Science, Peking University, Beijing, China
| | - Zequn Liu
- Department of Computer Science, Peking University, Beijing, China
| | - Haoxian Wu
- Department of Computer Science, Peking University, Beijing, China
| | - Zichang Wang
- Department of Computer Science, Peking University, Beijing, China
| | - Jianhao Shen
- Department of Computer Science, Peking University, Beijing, China
| | - Yipiing Song
- Department of Computer Science, Peking University, Beijing, China
| | - Ming Zhang
- Department of Computer Science, Peking University, Beijing, China
| |
Collapse
|
26
|
Meta Learning and the AI Learning Process. Artif Intell Med 2021. [DOI: 10.1007/978-3-030-58080-3_327-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
27
|
Tjandra D, Migrino RQ, Giordani B, Wiens J. Cohort discovery and risk stratification for Alzheimer's disease: an electronic health record-based approach. ALZHEIMER'S & DEMENTIA (NEW YORK, N. Y.) 2020; 6:e12035. [PMID: 32548236 PMCID: PMC7293993 DOI: 10.1002/trc2.12035] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Accepted: 04/18/2020] [Indexed: 11/17/2022]
Abstract
BACKGROUND We sought to leverage data routinely collected in electronic health records (EHRs), with the goal of developing patient risk stratification tools for predicting risk of developing Alzheimer's disease (AD). METHOD Using EHR data from the University of Michigan (UM) hospitals and consensus-based diagnoses from the Michigan Alzheimer's Disease Research Center, we developed and validated a cohort discovery tool for identifying patients with AD. Applied to all UM patients, these labels were used to train an EHR-based machine learning model for predicting AD onset within 10 years. RESULTS Applied to a test cohort of 1697 UM patients, the model achieved an area under the receiver operating characteristics curve of 0.70 (95% confidence interval = 0.63-0.77). Important predictive factors included cardiovascular factors and laboratory blood testing. CONCLUSION Routinely collected EHR data can be used to predict AD onset with modest accuracy. Mining routinely collected data could shed light on early indicators of AD appearance and progression.
Collapse
Affiliation(s)
- Donna Tjandra
- Department of Electrical Engineering and Computer ScienceUniversity of MichiganAnn ArborMichiganUSA
| | - Raymond Q. Migrino
- Phoenix Veterans Affairs Health Care SystemPhoenixArizonaUSA
- Department of MedicineUniversity of Arizona College of Medicine‐PhoenixPhoenixArizonaUSA
| | - Bruno Giordani
- Department of Psychiatry, Neuropsychology ProgramUniversity of Michigan Ann ArborAnn ArborMichiganUSA
| | - Jenna Wiens
- Department of Electrical Engineering and Computer ScienceUniversity of MichiganAnn ArborMichiganUSA
| |
Collapse
|