Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Landi I, Glicksberg BS, Lee HC, Cherng S, Landi G, Danieletto M, Dudley JT, Furlanello C, Miotto R. Deep representation learning of electronic health records to unlock patient stratification at scale. NPJ Digit Med 2020;3:96. [PMID: 32699826 PMCID: PMC7367859 DOI: 10.1038/s41746-020-0301-z] [Citation(s) in RCA: 55] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2020] [Accepted: 06/17/2020] [Indexed: 12/15/2022] Open

For:	Landi I, Glicksberg BS, Lee HC, Cherng S, Landi G, Danieletto M, Dudley JT, Furlanello C, Miotto R. Deep representation learning of electronic health records to unlock patient stratification at scale. NPJ Digit Med 2020;3:96. [PMID: 32699826 PMCID: PMC7367859 DOI: 10.1038/s41746-020-0301-z] [Citation(s) in RCA: 55] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2020] [Accepted: 06/17/2020] [Indexed: 12/15/2022] Open

Number

Cited by Other Article(s)

Heumos L, Ehmele P, Treis T, Upmeier Zu Belzen J, Roellin E, May L, Namsaraeva A, Horlava N, Shitov VA, Zhang X, Zappia L, Knoll R, Lang NJ, Hetzel L, Virshup I, Sikkema L, Curion F, Eils R, Schiller HB, Hilgendorff A, Theis FJ. An open-source framework for end-to-end analysis of electronic health record data. Nat Med 2024:10.1038/s41591-024-03214-0. [PMID: 39266748 DOI: 10.1038/s41591-024-03214-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Accepted: 07/25/2024] [Indexed: 09/14/2024]

Affiliation(s)

Lukas Heumos Institute of Computational Biology, Helmholtz Munich, Munich, Germany Institute of Lung Health and Immunity and Comprehensive Pneumology Center with the CPC-M bioArchive; Helmholtz Zentrum Munich; member of the German Center for Lung Research (DZL), Munich, Germany TUM School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany
Philipp Ehmele Institute of Computational Biology, Helmholtz Munich, Munich, Germany
Tim Treis Institute of Computational Biology, Helmholtz Munich, Munich, Germany TUM School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany
Julius Upmeier Zu Belzen Health Data Science Unit, Heidelberg University and BioQuant, Heidelberg, Germany
Eljas Roellin Institute of Computational Biology, Helmholtz Munich, Munich, Germany Department of Mathematics, School of Computation, Information and Technology, Technical University of Munich, Munich, Germany
Lilly May Institute of Computational Biology, Helmholtz Munich, Munich, Germany Department of Mathematics, School of Computation, Information and Technology, Technical University of Munich, Munich, Germany
Altana Namsaraeva Institute of Computational Biology, Helmholtz Munich, Munich, Germany Konrad Zuse School of Excellence in Learning and Intelligent Systems (ELIZA), Darmstadt, Germany
Nastassya Horlava Institute of Computational Biology, Helmholtz Munich, Munich, Germany TUM School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany
Vladimir A Shitov Institute of Computational Biology, Helmholtz Munich, Munich, Germany TUM School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany
Xinyue Zhang Institute of Computational Biology, Helmholtz Munich, Munich, Germany
Luke Zappia Institute of Computational Biology, Helmholtz Munich, Munich, Germany Department of Mathematics, School of Computation, Information and Technology, Technical University of Munich, Munich, Germany
Rainer Knoll Systems Medicine, Deutsches Zentrum für Neurodegenerative Erkrankungen (DZNE), Bonn, Germany
Niklas J Lang Institute of Lung Health and Immunity and Comprehensive Pneumology Center with the CPC-M bioArchive; Helmholtz Zentrum Munich; member of the German Center for Lung Research (DZL), Munich, Germany
Leon Hetzel Institute of Computational Biology, Helmholtz Munich, Munich, Germany Department of Mathematics, School of Computation, Information and Technology, Technical University of Munich, Munich, Germany
Isaac Virshup Institute of Computational Biology, Helmholtz Munich, Munich, Germany
Lisa Sikkema Institute of Computational Biology, Helmholtz Munich, Munich, Germany TUM School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany
Fabiola Curion Institute of Computational Biology, Helmholtz Munich, Munich, Germany Department of Mathematics, School of Computation, Information and Technology, Technical University of Munich, Munich, Germany
Roland Eils Health Data Science Unit, Heidelberg University and BioQuant, Heidelberg, Germany Center for Digital Health, Berlin Institute of Health (BIH) at Charité - Universitätsmedizin Berlin, Berlin, Germany
Herbert B Schiller Institute of Lung Health and Immunity and Comprehensive Pneumology Center with the CPC-M bioArchive; Helmholtz Zentrum Munich; member of the German Center for Lung Research (DZL), Munich, Germany Research Unit, Precision Regenerative Medicine (PRM), Helmholtz Munich, Munich, Germany
Anne Hilgendorff Institute of Lung Health and Immunity and Comprehensive Pneumology Center with the CPC-M bioArchive; Helmholtz Zentrum Munich; member of the German Center for Lung Research (DZL), Munich, Germany Center for Comprehensive Developmental Care (CDeCLMU) at the Social Pediatric Center, Dr. von Hauner Children's Hospital, LMU Hospital, Ludwig Maximilian University, Munich, Germany
Fabian J Theis Institute of Computational Biology, Helmholtz Munich, Munich, Germany. TUM School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany. Department of Mathematics, School of Computation, Information and Technology, Technical University of Munich, Munich, Germany.

Collapse

Naumova K, Devos A, Karimireddy SP, Jaggi M, Hartley MA. MyThisYourThat for interpretable identification of systematic bias in federated learning for biomedical images. NPJ Digit Med 2024;7:238. [PMID: 39242810 PMCID: PMC11379706 DOI: 10.1038/s41746-024-01226-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Accepted: 08/14/2024] [Indexed: 09/09/2024] Open

Pavia G, Branda F, Ciccozzi A, Romano C, Locci C, Azzena I, Pascale N, Marascio N, Quirino A, Matera G, Giovanetti M, Casu M, Sanna D, Ceccarelli G, Ciccozzi M, Scarpa F. Integrating Digital Health Solutions with Immunization Strategies: Improving Immunization Coverage and Monitoring in the Post-COVID-19 Era. Vaccines (Basel) 2024;12:847. [PMID: 39203973 PMCID: PMC11359052 DOI: 10.3390/vaccines12080847] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2024] [Revised: 07/22/2024] [Accepted: 07/26/2024] [Indexed: 09/03/2024] Open

Affiliation(s)

Grazia Pavia Unit of Clinical Microbiology, Department of Health Sciences, “Magna Græcia” University of Catanzaro—“Renato Dulbecco” Teaching Hospital, 88100 Catanzaro, Italy; (G.P.); (N.M.); (A.Q.); (G.M.)
Francesco Branda Unit of Medical Statistics and Molecular Epidemiology, Università Campus Bio-Medico di Roma, 00128 Rome, Italy; (C.R.); (M.C.)
Alessandra Ciccozzi Department of Biomedical Sciences, University of Sassari, 07100 Sassari, Italy; (A.C.); (C.L.); (D.S.); (F.S.)
Chiara Romano Unit of Medical Statistics and Molecular Epidemiology, Università Campus Bio-Medico di Roma, 00128 Rome, Italy; (C.R.); (M.C.)
Chiara Locci Department of Biomedical Sciences, University of Sassari, 07100 Sassari, Italy; (A.C.); (C.L.); (D.S.); (F.S.) Department of Veterinary Medicine, University of Sassari, 07100 Sassari, Italy; (I.A.); (N.P.); (M.C.)
Ilenia Azzena Department of Veterinary Medicine, University of Sassari, 07100 Sassari, Italy; (I.A.); (N.P.); (M.C.)
Noemi Pascale Department of Veterinary Medicine, University of Sassari, 07100 Sassari, Italy; (I.A.); (N.P.); (M.C.)
Nadia Marascio Unit of Clinical Microbiology, Department of Health Sciences, “Magna Græcia” University of Catanzaro—“Renato Dulbecco” Teaching Hospital, 88100 Catanzaro, Italy; (G.P.); (N.M.); (A.Q.); (G.M.)
Angela Quirino Unit of Clinical Microbiology, Department of Health Sciences, “Magna Græcia” University of Catanzaro—“Renato Dulbecco” Teaching Hospital, 88100 Catanzaro, Italy; (G.P.); (N.M.); (A.Q.); (G.M.)
Giovanni Matera Unit of Clinical Microbiology, Department of Health Sciences, “Magna Græcia” University of Catanzaro—“Renato Dulbecco” Teaching Hospital, 88100 Catanzaro, Italy; (G.P.); (N.M.); (A.Q.); (G.M.)
Marta Giovanetti Department of Sciences and Technologies for Sustainable Development and One Health, Università Campus Bio-Medico di Roma, 00128 Rome, Italy; Instituto René Rachou, Fundação Oswaldo Cruz, Belo Horizonte 30190-002, Minas Gerais, Brazil Climate Amplified Diseases and Epidemics (CLIMADE), Brasilia 70070-130, Goias, Brazil
Marco Casu Department of Veterinary Medicine, University of Sassari, 07100 Sassari, Italy; (I.A.); (N.P.); (M.C.)
Daria Sanna Department of Biomedical Sciences, University of Sassari, 07100 Sassari, Italy; (A.C.); (C.L.); (D.S.); (F.S.)
Giancarlo Ceccarelli Department of Public Health and Infectious Diseases, University Hospital Policlinico Umberto I, Sapienza University of Rome, 00161 Rome, Italy;
Massimo Ciccozzi Unit of Medical Statistics and Molecular Epidemiology, Università Campus Bio-Medico di Roma, 00128 Rome, Italy; (C.R.); (M.C.)
Fabio Scarpa Department of Biomedical Sciences, University of Sassari, 07100 Sassari, Italy; (A.C.); (C.L.); (D.S.); (F.S.)

Collapse

Meng W, Xu J, Huang Y, Wang C, Song Q, Ma A, Song L, Bian J, Ma Q, Yin R. Autoencoder to Identify Sex-Specific Sub-phenotypes in Alzheimer's Disease Progression Using Longitudinal Electronic Health Records. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.07.07.24310055. [PMID: 39040206 PMCID: PMC11261930 DOI: 10.1101/2024.07.07.24310055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/24/2024]

Tang AS, Woldemariam SR, Miramontes S, Norgeot B, Oskotsky TT, Sirota M. Harnessing EHR data for health research. Nat Med 2024;30:1847-1855. [PMID: 38965433 DOI: 10.1038/s41591-024-03074-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Accepted: 05/17/2024] [Indexed: 07/06/2024]

Wang R, Kuo PC, Chen LC, Seastedt KP, Gichoya JW, Celi LA. Drop the shortcuts: image augmentation improves fairness and decreases AI detection of race and other demographics from medical images. EBioMedicine 2024;102:105047. [PMID: 38471396 PMCID: PMC10945176 DOI: 10.1016/j.ebiom.2024.105047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Revised: 02/15/2024] [Accepted: 02/21/2024] [Indexed: 03/14/2024] Open

Abstract

BACKGROUND

It has been shown that AI models can learn race on medical images, leading to algorithmic bias. Our aim in this study was to enhance the fairness of medical image models by eliminating bias related to race, age, and sex. We hypothesise models may be learning demographics via shortcut learning and combat this using image augmentation.

METHODS

This study included 44,953 patients who identified as Asian, Black, or White (mean age, 60.68 years ±18.21; 23,499 women) for a total of 194,359 chest X-rays (CXRs) from MIMIC-CXR database. The included CheXpert images comprised 45,095 patients (mean age 63.10 years ±18.14; 20,437 women) for a total of 134,300 CXRs were used for external validation. We also collected 1195 3D brain magnetic resonance imaging (MRI) data from the ADNI database, which included 273 participants with an average age of 76.97 years ±14.22, and 142 females. DL models were trained on either non-augmented or augmented images and assessed using disparity metrics. The features learned by the models were analysed using task transfer experiments and model visualisation techniques.

FINDINGS

In the detection of radiological findings, training a model using augmented CXR images was shown to reduce disparities in error rate among racial groups (-5.45%), age groups (-13.94%), and sex (-22.22%). For AD detection, the model trained with augmented MRI images was shown 53.11% and 31.01% reduction of disparities in error rate among age and sex groups, respectively. Image augmentation led to a reduction in the model's ability to identify demographic attributes and resulted in the model trained for clinical purposes incorporating fewer demographic features.

INTERPRETATION

The model trained using the augmented images was less likely to be influenced by demographic information in detecting image labels. These results demonstrate that the proposed augmentation scheme could enhance the fairness of interpretations by DL models when dealing with data from patients with different demographic backgrounds.

FUNDING

National Science and Technology Council (Taiwan), National Institutes of Health.

Collapse

AlSaad R, Malluhi Q, Abd-Alrazaq A, Boughorbel S. Temporal self-attention for risk prediction from electronic health records using non-stationary kernel approximation. Artif Intell Med 2024;149:102802. [PMID: 38462292 DOI: 10.1016/j.artmed.2024.102802] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Revised: 09/27/2023] [Accepted: 02/03/2024] [Indexed: 03/12/2024]

Abstract

Effective modeling of patient representation from electronic health records (EHRs) is increasingly becoming a vital research topic. Yet, modeling the non-stationarity in EHR data has received less attention. Most existing studies follow a strong assumption of stationarity in patient representation from EHRs. However, in practice, a patient's visits are irregularly spaced over a relatively long period of time, and disease progression patterns exhibit non-stationarity. Furthermore, the time gaps between patient visits often encapsulate significant domain knowledge, potentially revealing undiscovered patterns that characterize specific medical conditions. To address these challenges, we introduce a new method which combines the self-attention mechanism with non-stationary kernel approximation to capture both contextual information and temporal relationships between patient visits in EHRs. To assess the effectiveness of our proposed approach, we use two real-world EHR datasets, comprising a total of 76,925 patients, for the task of predicting the next diagnosis code for a patient, given their EHR history. The first dataset is a general EHR cohort and consists of 11,451 patients with a total of 3,485 unique diagnosis codes. The second dataset is a disease-specific cohort that includes 65,474 pregnant patients and encompasses a total of 9,782 unique diagnosis codes. Our experimental evaluation involved nine prediction models, categorized into three distinct groups. Group 1 comprises the baselines: original self-attention with positional encoding model, RETAIN model, and LSTM model. Group 2 includes models employing self-attention with stationary kernel approximations, specifically incorporating three variations of Bochner's feature maps. Lastly, Group 3 consists of models utilizing self-attention with non-stationary kernel approximations, including quadratic, cubic, and bi-quadratic polynomials. The experimental results demonstrate that non-stationary kernels significantly outperformed baseline methods for NDCG@10 and Hit@10 metrics in both datasets. The performance boost was more substantial in dataset 1 for the NDCG@10 metric. On the other hand, stationary Kernels showed significant but smaller gains over baselines and were nearly as effective as Non-stationary Kernels for Hit@10 in dataset 2. These findings robustly validate the efficacy of employing non-stationary kernels for temporal modeling of EHR data, and emphasize the importance of modeling non-stationary temporal information in healthcare prediction tasks.

Collapse

Lu HY, Ding X, Hirst JE, Yang Y, Yang J, Mackillop L, Clifton DA. Digital Health and Machine Learning Technologies for Blood Glucose Monitoring and Management of Gestational Diabetes. IEEE Rev Biomed Eng 2024;17:98-117. [PMID: 37022834 PMCID: PMC7615520 DOI: 10.1109/rbme.2023.3242261] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/10/2023]

Li Z, Yan C, Zhang X, Gharibi G, Yin Z, Jiang X, Malin BA. Split Learning for Distributed Collaborative Training of Deep Learning Models in Health Informatics. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2024;2023:1047-1056. [PMID: 38222326 PMCID: PMC10785879] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 01/16/2024]

Xu J, Yin R, Huang Y, Gao H, Wu Y, Guo J, Smith GE, DeKosky ST, Wang F, Guo Y, Bian J. Identification of Outcome-Oriented Progression Subtypes from Mild Cognitive Impairment to Alzheimer's Disease Using Electronic Health Records. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2024;2023:764-773. [PMID: 38222396 PMCID: PMC10785946] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 01/16/2024]

Su L, Liu S, Long Y, Chen C, Chen K, Chen M, Chen Y, Cheng Y, Cui Y, Ding Q, Ding R, Duan M, Gao T, Gu X, He H, He J, Hu B, Hu C, Huang R, Huang X, Jiang H, Jiang J, Lan Y, Li J, Li L, Li L, Li W, Li Y, Lin J, Luo X, Lyu F, Mao Z, Miao H, Shang X, Shang X, Shang Y, Shen Y, Shi Y, Sun Q, Sun W, Tang Z, Wang B, Wang H, Wang H, Wang L, Wang L, Wang S, Wang Z, Wang Z, Wei D, Wu J, Wu Q, Xing X, Yang J, Yang X, Yu J, Yu W, Yu Y, Yuan H, Zhai Q, Zhang H, Zhang L, Zhang M, Zhang Z, Zhao C, Zheng R, Zhong L, Zhou F, Zhu W. Chinese experts' consensus on the application of intensive care big data. Front Med (Lausanne) 2024;10:1174429. [PMID: 38264049 PMCID: PMC10804886 DOI: 10.3389/fmed.2023.1174429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2023] [Accepted: 11/09/2023] [Indexed: 01/25/2024] Open

Affiliation(s)

Longxiang Su Department of Critical Care Medicine, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing, China
Shengjun Liu Department of Critical Care Medicine, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing, China
Yun Long Department of Critical Care Medicine, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing, China
Chaodong Chen Department of Surgical Intensive Critical Unit, Beijing Chao-yang Hospital, Capital Medical University, Beijing, China
Kai Chen Department of Critical Care Medicine, Fujian Provincial Key Laboratory of Critical Care Medicine, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital, Fujian Provincial Center for Critical Care Medicine, Fuzhou, Fujian, China
Ming Chen Department of Critical Care Medicine, Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Nanjing, Jiangsu, China
Yaolong Chen Evidence-based Medicine Center, School of Basic Medical Sciences, Lanzhou University, Lanzhou, China
Yisong Cheng Department of Critical Care Medicine, West China Hospital of Sichuan University, Chengdu, China
Yating Cui Department of Critical Care Medicine, The First Medical Center, Chinese PLA General Hospital, Beijing, China
Qi Ding Department of Surgical Intensive Critical Unit, Beijing Chao-yang Hospital, Capital Medical University, Beijing, China
Renyu Ding Department of Intensive Care Unit, The First Hospital of China Medical University, Shenyang, Liaoning, China
Meili Duan Department of Critical Care Medicine, Beijing Friendship Hospital, Capital Medical University, Beijing, China
Tao Gao Department of Critical Care Medicine, Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Nanjing, Jiangsu, China
Xiaohua Gu Department of Critical Care Medicine, Northern Jiangsu People’s Hospital; Clinical Medical College, Yangzhou University, Yangzhou, China
Hongli He Intensive Care Unit, Sichuan Academy of Medical Sciences & Sichuan Provincial People’s Hospital, School of Medicine of University of Electronic Science and Technology, Chengdu, China
Jiawei He Department of Critical Care Medicine, Beijing Friendship Hospital, Capital Medical University, Beijing, China
Bo Hu Department of Critical Care Medicine, Zhongnan Hospital of Wuhan University, Wuhan, Hubei, China
Chang Hu Department of Critical Care Medicine, Zhongnan Hospital of Wuhan University, Wuhan, Hubei, China
Rui Huang Department of Critical Care Medicine, The Second Affiliated Hospital of Harbin Medical University, Harbin, Heilongjiang, China
Xiaobo Huang Intensive Care Unit, Sichuan Academy of Medical Sciences & Sichuan Provincial People’s Hospital, School of Medicine of University of Electronic Science and Technology, Chengdu, China
Huizhen Jiang Department of Information Center, Peking Union Medical College Hospital, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing, China
Jing Jiang Department of Critical Care Medicine, Chongqing General Hospital, Chongqing, China
Yunping Lan Intensive Care Unit, Sichuan Academy of Medical Sciences & Sichuan Provincial People’s Hospital, School of Medicine of University of Electronic Science and Technology, Chengdu, China
Jun Li Department of Critical Care Medicine, Fujian Provincial Key Laboratory of Critical Care Medicine, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital, Fujian Provincial Center for Critical Care Medicine, Fuzhou, Fujian, China
Linfeng Li Medical Data Research Institute, Chongqing Medical University, Chongqing, China
Lu Li Department of Critical Care Medicine, Zhongnan Hospital of Wuhan University, Wuhan, Hubei, China
Wenxiong Li Department of Surgical Intensive Critical Unit, Beijing Chao-yang Hospital, Capital Medical University, Beijing, China
Yongzai Li Information Network Center, QiLu Hospital, ShanDong University, Jinan, China
Jin Lin Department of Critical Care Medicine, Beijing Friendship Hospital, Capital Medical University, Beijing, China
Xufei Luo Evidence-based Medicine Center, School of Basic Medical Sciences, Lanzhou University, Lanzhou, China
Feng Lyu Department of Computer Science and Engineering, Central South University, Changsha, China
Zhi Mao Department of Critical Care Medicine, The First Medical Center, Chinese PLA General Hospital, Beijing, China
He Miao Department of Intensive Care Unit, The First Hospital of China Medical University, Shenyang, Liaoning, China
Xiaopu Shang Department of Information Management, Beijing Jiaotong University, Beijing, China
Xiuling Shang Department of Critical Care Medicine, Fujian Provincial Key Laboratory of Critical Care Medicine, Shengli Clinical Medical College of Fujian Medical University, Fujian Provincial Hospital, Fujian Provincial Center for Critical Care Medicine, Fuzhou, Fujian, China
You Shang Department of Critical Care Medicine, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
Yuwen Shen Intensive Care Unit of Cardiovascular Surgery Department, Qilu Hospital of Shandong University, Jinan, China
Yinghuan Shi National Institute of Healthcare Data Science, Nanjing University, Nanjing, China
Qihang Sun British Chinese Society of Health Informatics, Beijing, China
Weijun Sun Faculty of Automation, Guangdong University of Technology, Guangzhou, China
Zhiyun Tang Department of Intensive Care Unit, Zhejiang Provincial People’s Hospital, Affiliated People’s Hospital, Emergency and Intensive Care Unit Center, Hangzhou Medical College, Hangzhou, Zhejiang, China
Bo Wang Department of Critical Care Medicine, West China Hospital of Sichuan University, Chengdu, China
Haijun Wang Department of Intensive Care Unit, National Cancer Center/National Clinical Research Center, Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Hongliang Wang Department of Critical Care Medicine, The Second Affiliated Hospital of Harbin Medical University, Harbin, Heilongjiang, China
Li Wang Department of Epidemiology and Biostatistics, Institute of Basic Medical Sciences Chinese Academy of Medical Sciences; School of Basic Medicine Peking Union Medical College, Beijing, China
Luhao Wang Department of Critical Care Medicine, Sun Yat-Sen University First Affiliated Hospital, Guangzhou, China
Sicong Wang Department of Critical Care Medicine, The Second Affiliated Hospital of Harbin Medical University, Harbin, Heilongjiang, China
Zhanwen Wang Intensive Care Unit, XiangYa Hospital, Central South University, Changsha, China National Clinical Research Center for Geriatric Disorders, Xiang Ya Hospital, Central South University, Changsha, China Hunan Provincial Clinical Research Center for Critical Care Medicine, Xiang Ya Hospital, Central South University, Changsha, China
Zhong Wang Department of Intensive Care Unit, The First Hospital of China Medical University, Shenyang, Liaoning, China
Dong Wei National Institute of Healthcare Data Science, Nanjing University, Nanjing, China
Jianfeng Wu Intensive Care Unit, XiangYa Hospital, Central South University, Changsha, China
Qin Wu Department of Critical Care Medicine, West China Hospital of Sichuan University, Chengdu, China
Xuezhong Xing Department of Epidemiology and Biostatistics, Institute of Basic Medical Sciences Chinese Academy of Medical Sciences; School of Basic Medicine Peking Union Medical College, Beijing, China
Jin Yang Department of Critical Care Medicine, Chongqing General Hospital, Chongqing, China
Xianghong Yang Department of Intensive Care Unit, National Cancer Center/National Clinical Research Center, Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Jiangquan Yu Department of Critical Care Medicine, Northern Jiangsu People’s Hospital; Clinical Medical College, Yangzhou University, Yangzhou, China
Wenkui Yu Department of Critical Care Medicine, Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Nanjing, Jiangsu, China
Yuan Yu Intensive Care Unit of Cardiovascular Surgery Department, Qilu Hospital of Shandong University, Jinan, China
Hao Yuan Department of Critical Care Medicine, Sun Yat-Sen University First Affiliated Hospital, Guangzhou, China
Qian Zhai National Institute of Healthcare Data Science, Nanjing University, Nanjing, China
Hao Zhang Department of Intensive Care Unit, National Cancer Center/National Clinical Research Center, Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Lina Zhang Intensive Care Unit, XiangYa Hospital, Central South University, Changsha, China National Clinical Research Center for Geriatric Disorders, Xiang Ya Hospital, Central South University, Changsha, China Hunan Provincial Clinical Research Center for Critical Care Medicine, Xiang Ya Hospital, Central South University, Changsha, China
Meng Zhang Department of Critical Care Medicine, Chongqing General Hospital, Chongqing, China
Zhongheng Zhang Department of Emergency Medicine, Key Laboratory of Precision Medicine in Diagnosis and Monitoring Research of Zhejiang Province, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, China
Chunguang Zhao Intensive Care Unit, XiangYa Hospital, Central South University, Changsha, China National Clinical Research Center for Geriatric Disorders, Xiang Ya Hospital, Central South University, Changsha, China Hunan Provincial Clinical Research Center for Critical Care Medicine, Xiang Ya Hospital, Central South University, Changsha, China
Ruiqiang Zheng Department of Critical Care Medicine, Northern Jiangsu People’s Hospital; Clinical Medical College, Yangzhou University, Yangzhou, China
Lei Zhong Department of Intensive Care Unit, National Cancer Center/National Clinical Research Center, Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
Feihu Zhou Department of Critical Care Medicine, The First Medical Center, Chinese PLA General Hospital, Beijing, China
Weiguo Zhu Department of General Medicine, Peking Union Medical College Hospital, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing, China

Collapse

Ho M, Levy TJ, Koulas I, Founta K, Coppa K, Hirsch JS, Davidson KW, Spyropoulos AC, Zanos TP. Longitudinal dynamic clinical phenotypes of in-hospital COVID-19 patients across three dominant virus variants in New York. Int J Med Inform 2024;181:105286. [PMID: 37956643 PMCID: PMC10843635 DOI: 10.1016/j.ijmedinf.2023.105286] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Revised: 10/20/2023] [Accepted: 11/03/2023] [Indexed: 11/15/2023]

Abstract

BACKGROUND

COVID-19 is a challenging disease to characterize given its wide-ranging heterogeneous symptomatology. Several studies have attempted to extract clinical phenotypes but often relied on data from small patient cohorts, usually limited to only one viral variant and utilizing a static snapshot of patient data.

OBJECTIVE

This study aimed to identify clinical phenotypes of hospitalized COVID-19 patients and investigate their longitudinal dynamics throughout the pandemic, with the goal to relate these phenotypes to clinical outcomes and treatment strategies.

METHODS

We utilized routinely collected demographic and clinical data throughout the hospitalization of 38,077 patients admitted between 3/2020 to 5/2022, in 12 New York hospitals. Uniform Manifold Approximation and Projection and agglomerative hierarchical clustering were used to derive the clusters, followed by exploratory data analysis to compare the prevalence of comorbidities and treatments per cluster.

RESULTS

4 distinct clinical phenotypes remained robust in multi-site validation and were associated with different mortality rates. The temporal progression of these phenotypes throughout the COVID-19 pandemic demonstrated increased variability across the waves of the three dominant viral variants (alpha, delta, omicron). Longitudinal analysis evaluating changes in clinical phenotypes of each patient throughout the course of a 4-week hospital stay exemplified the dynamic nature of the disease progression. Factors such as sex, race/ethnicity and specific treatment modalities revealed significant and clinically relevant differences between the observed phenotypes.

CONCLUSIONS

Our proposed methodology has the potential of enabling clinicians and policy makers to draw evidence-based conclusions for guiding treatment modalities in a dynamic fashion.

Collapse

Affiliation(s)

Matthew Ho Institute of Health Systems Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY 11030; Institute of Bioelectronic Medicine, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY 11030; Donald and Barbara Zucker School of Medicine at Hofstra/Northwell, Northwell Health, Hempstead, NY 11549
Todd J Levy Institute of Health Systems Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY 11030; Institute of Bioelectronic Medicine, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY 11030
Ioannis Koulas Institute of Health Systems Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY 11030
Kyriaki Founta Donald and Barbara Zucker School of Medicine at Hofstra/Northwell, Northwell Health, Hempstead, NY 11549
Kevin Coppa Department of Clinical Digital Solutions, Northwell Health, New Hyde Park, NY 11042
Jamie S Hirsch Institute of Health Systems Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY 11030; Donald and Barbara Zucker School of Medicine at Hofstra/Northwell, Northwell Health, Hempstead, NY 11549; Department of Clinical Digital Solutions, Northwell Health, New Hyde Park, NY 11042
Karina W Davidson Institute of Health Systems Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY 11030; Donald and Barbara Zucker School of Medicine at Hofstra/Northwell, Northwell Health, Hempstead, NY 11549
Alex C Spyropoulos Institute of Health Systems Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY 11030; Donald and Barbara Zucker School of Medicine at Hofstra/Northwell, Northwell Health, Hempstead, NY 11549
Theodoros P Zanos Institute of Health Systems Science, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY 11030; Institute of Bioelectronic Medicine, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY 11030; Donald and Barbara Zucker School of Medicine at Hofstra/Northwell, Northwell Health, Hempstead, NY 11549.

Collapse

Papanastasiou G, Yang G, Fotiadis DI, Dikaios N, Wang C, Huda A, Sobolevsky L, Raasch J, Perez E, Sidhu G, Palumbo D. Large-scale deep learning analysis to identify adult patients at risk for combined and common variable immunodeficiencies. COMMUNICATIONS MEDICINE 2023;3:189. [PMID: 38123736 PMCID: PMC10733406 DOI: 10.1038/s43856-023-00412-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Accepted: 11/21/2023] [Indexed: 12/23/2023] Open

Lhoste VPF, Zhou B, Mishra A, Bennett JE, Filippi S, Asaria P, Gregg EW, Danaei G, Ezzati M. Cardiometabolic and renal phenotypes and transitions in the United States population. NATURE CARDIOVASCULAR RESEARCH 2023;3:46-59. [PMID: 38314318 PMCID: PMC7615595 DOI: 10.1038/s44161-023-00391-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Accepted: 11/13/2023] [Indexed: 02/06/2024]

Affiliation(s)

Victor P. F. Lhoste Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, UK MRC Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
Bin Zhou Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, UK MRC Centre for Environment and Health, School of Public Health, Imperial College London, London, UK Abdul Latif Jameel Institute for Disease and Emergency Analytics, Imperial College London, London, UK
Anu Mishra Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, UK MRC Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
James E. Bennett Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, UK MRC Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
Sarah Filippi Department of Mathematics, Imperial College London, London, UK
Perviz Asaria Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, UK MRC Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
Edward W. Gregg Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, UK MRC Centre for Environment and Health, School of Public Health, Imperial College London, London, UK Abdul Latif Jameel Institute for Disease and Emergency Analytics, Imperial College London, London, UK School of Population Health, Royal College of Surgeons in Ireland, Dublin, Ireland
Goodarz Danaei Department of Global Health and Population, Harvard T.H. Chan School of Public Health, Boston, MA, USA Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Majid Ezzati Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, UK MRC Centre for Environment and Health, School of Public Health, Imperial College London, London, UK Abdul Latif Jameel Institute for Disease and Emergency Analytics, Imperial College London, London, UK Regional Institute for Population Studies, University of Ghana, Accra, Ghana

Collapse

Lanotte F, O’Brien MK, Jayaraman A. AI in Rehabilitation Medicine: Opportunities and Challenges. Ann Rehabil Med 2023;47:444-458. [PMID: 38093518 PMCID: PMC10767220 DOI: 10.5535/arm.23131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Accepted: 11/23/2023] [Indexed: 01/03/2024] Open

Sivarajkumar S, Huang Y, Wang Y. Fair patient model: Mitigating bias in the patient representation learned from the electronic health records. J Biomed Inform 2023;148:104544. [PMID: 37995843 PMCID: PMC10850918 DOI: 10.1016/j.jbi.2023.104544] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Revised: 10/02/2023] [Accepted: 11/10/2023] [Indexed: 11/25/2023]

Keszthelyi D, Gaudet-Blavignac C, Bjelogrlic M, Lovis C. Patient Information Summarization in Clinical Settings: Scoping Review. JMIR Med Inform 2023;11:e44639. [PMID: 38015588 DOI: 10.2196/44639] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 03/15/2023] [Accepted: 07/25/2023] [Indexed: 11/29/2023] Open

Abstract

BACKGROUND

Information overflow, a common problem in the present clinical environment, can be mitigated by summarizing clinical data. Although there are several solutions for clinical summarization, there is a lack of a complete overview of the research relevant to this field.

OBJECTIVE

This study aims to identify state-of-the-art solutions for clinical summarization, to analyze their capabilities, and to identify their properties.

METHODS

A scoping review of articles published between 2005 and 2022 was conducted. With a clinical focus, PubMed and Web of Science were queried to find an initial set of reports, later extended by articles found through a chain of citations. The included reports were analyzed to answer the questions of where, what, and how medical information is summarized; whether summarization conserves temporality, uncertainty, and medical pertinence; and how the propositions are evaluated and deployed. To answer how information is summarized, methods were compared through a new framework "collect-synthesize-communicate" referring to information gathering from data, its synthesis, and communication to the end user.

RESULTS

Overall, 128 articles were included, representing various medical fields. Exclusively structured data were used as input in 46.1% (59/128) of papers, text in 41.4% (53/128) of articles, and both in 10.2% (13/128) of papers. Using the proposed framework, 42.2% (54/128) of the records contributed to information collection, 27.3% (35/128) contributed to information synthesis, and 46.1% (59/128) presented solutions for summary communication. Numerous summarization approaches have been presented, including extractive (n=13) and abstractive summarization (n=19); topic modeling (n=5); summary specification (n=11); concept and relation extraction (n=30); visual design considerations (n=59); and complete pipelines (n=7) using information extraction, synthesis, and communication. Graphical displays (n=53), short texts (n=41), static reports (n=7), and problem-oriented views (n=7) were the most common types in terms of summary communication. Although temporality and uncertainty information were usually not conserved in most studies (74/128, 57.8% and 113/128, 88.3%, respectively), some studies presented solutions to treat this information. Overall, 115 (89.8%) articles showed results of an evaluation, and methods included evaluations with human participants (median 15, IQR 24 participants): measurements in experiments with human participants (n=31), real situations (n=8), and usability studies (n=28). Methods without human involvement included intrinsic evaluation (n=24), performance on a proxy (n=10), or domain-specific tasks (n=11). Overall, 11 (8.6%) reports described a system deployed in clinical settings.

CONCLUSIONS

The scientific literature contains many propositions for summarizing patient information but reports very few comparisons of these proposals. This work proposes to compare these algorithms through how they conserve essential aspects of clinical information and through the "collect-synthesize-communicate" framework. We found that current propositions usually address these 3 steps only partially. Moreover, they conserve and use temporality, uncertainty, and pertinent medical aspects to varying extents, and solutions are often preliminary.

Collapse

Misra S, Wagner R, Ozkan B, Schön M, Sevilla-Gonzalez M, Prystupa K, Wang CC, Kreienkamp RJ, Cromer SJ, Rooney MR, Duan D, Thuesen ACB, Wallace AS, Leong A, Deutsch AJ, Andersen MK, Billings LK, Eckel RH, Sheu WHH, Hansen T, Stefan N, Goodarzi MO, Ray D, Selvin E, Florez JC, Meigs JB, Udler MS. Precision subclassification of type 2 diabetes: a systematic review. COMMUNICATIONS MEDICINE 2023;3:138. [PMID: 37798471 PMCID: PMC10556101 DOI: 10.1038/s43856-023-00360-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Accepted: 09/15/2023] [Indexed: 10/07/2023] Open

Affiliation(s)

Shivani Misra Department of Metabolism, Digestion and Reproduction, Imperial College London, London, UK. Department of Diabetes and Endocrinology, Imperial College Healthcare NHS Trust, London, UK.
Robert Wagner Department of Endocrinology and Diabetology, University Hospital Düsseldorf, Heinrich Heine University Düsseldorf, Moorenstr. 5, 40225, Düsseldorf, Germany Institute for Clinical Diabetology, German Diabetes Center, Leibniz Center for Diabetes Research at Heinrich Heine University Düsseldorf, Auf'm Hennekamp 65, 40225, Düsseldorf, Germany German Center for Diabetes Research (DZD), Ingolstädter Landstraße 1, 85764, Neuherberg, Germany
Bige Ozkan Welch Center for Prevention, Epidemiology, and Clinical Research, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA Ciccarone Center for the Prevention of Cardiovascular Disease, Johns Hopkins School of Medicine, Baltimore, MD, USA
Martin Schön Institute for Clinical Diabetology, German Diabetes Center, Leibniz Center for Diabetes Research at Heinrich Heine University Düsseldorf, Auf'm Hennekamp 65, 40225, Düsseldorf, Germany German Center for Diabetes Research (DZD), Ingolstädter Landstraße 1, 85764, Neuherberg, Germany Institute of Experimental Endocrinology, Biomedical Research Center, Slovak Academy of Sciences, Bratislava, Slovakia
Magdalena Sevilla-Gonzalez Clinical and Translational Epidemiology Unit, Massachusetts General Hospital, Boston, MA, USA Programs in Metabolism and Medical & Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, USA Department of Medicine, Harvard Medical School, Boston, MA, USA
Katsiaryna Prystupa Institute for Clinical Diabetology, German Diabetes Center, Leibniz Center for Diabetes Research at Heinrich Heine University Düsseldorf, Auf'm Hennekamp 65, 40225, Düsseldorf, Germany German Center for Diabetes Research (DZD), Ingolstädter Landstraße 1, 85764, Neuherberg, Germany
Caroline C Wang Welch Center for Prevention, Epidemiology, and Clinical Research, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
Raymond J Kreienkamp Programs in Metabolism and Medical & Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, USA Diabetes Unit, Division of Endocrinology, Massachusetts General Hospital, Boston, MA, USA Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA Department of Pediatrics, Division of Endocrinology, Boston Children's Hospital, Boston, MA, USA
Sara J Cromer Programs in Metabolism and Medical & Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, USA Department of Medicine, Harvard Medical School, Boston, MA, USA Diabetes Unit, Division of Endocrinology, Massachusetts General Hospital, Boston, MA, USA Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
Mary R Rooney Welch Center for Prevention, Epidemiology, and Clinical Research, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
Daisy Duan Division of Endocrinology, Diabetes and Metabolism, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Anne Cathrine Baun Thuesen Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
Amelia S Wallace Welch Center for Prevention, Epidemiology, and Clinical Research, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
Aaron Leong Programs in Metabolism and Medical & Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, USA Department of Medicine, Harvard Medical School, Boston, MA, USA Diabetes Unit, Division of Endocrinology, Massachusetts General Hospital, Boston, MA, USA Division of General Internal Medicine, Massachusetts General Hospital, 100 Cambridge St 16th Floor, Boston, MA, USA
Aaron J Deutsch Programs in Metabolism and Medical & Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, USA Department of Medicine, Harvard Medical School, Boston, MA, USA Diabetes Unit, Division of Endocrinology, Massachusetts General Hospital, Boston, MA, USA Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
Mette K Andersen Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
Liana K Billings Division of Endocrinology, Diabetes and Metabolism, NorthShore University Health System, Skokie, IL, USA Department of Medicine, Pritzker School of Medicine, University of Chicago, Chicago, IL, USA
Robert H Eckel Division of Endocrinology, Metabolism and Diabetes, University of Colorado School of Medicine, Aurora, CO, USA
Wayne Huey-Herng Sheu Institute of Molecular and Genomic Medicine, National Health Research Institute, Miaoli County, Taiwan, ROC Division of Endocrinology and Metabolism, Taichung Veterans General Hospital, Taichung, Taiwan, ROC Division of Endocrinology and Metabolism, Taipei Veterans General Hospital, Taipei, Taiwan, ROC
Torben Hansen Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
Norbert Stefan German Center for Diabetes Research (DZD), Ingolstädter Landstraße 1, 85764, Neuherberg, Germany University Hospital of Tübingen, Tübingen, Germany Institute of Diabetes Research and Metabolic Diseases (IDM), Helmholtz Center Munich, Neuherberg, Germany
Mark O Goodarzi Division of Endocrinology, Diabetes and Metabolism, Department of Medicine, Cedars-Sinai Medical Center, Los Angeles, CA, USA
Debashree Ray Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
Elizabeth Selvin Welch Center for Prevention, Epidemiology, and Clinical Research, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
Jose C Florez Programs in Metabolism and Medical & Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, USA Department of Medicine, Harvard Medical School, Boston, MA, USA Diabetes Unit, Division of Endocrinology, Massachusetts General Hospital, Boston, MA, USA Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
James B Meigs Programs in Metabolism and Medical & Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, USA Department of Medicine, Harvard Medical School, Boston, MA, USA Division of General Internal Medicine, Massachusetts General Hospital, 100 Cambridge St 16th Floor, Boston, MA, USA
Miriam S Udler Programs in Metabolism and Medical & Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, USA Department of Medicine, Harvard Medical School, Boston, MA, USA Diabetes Unit, Division of Endocrinology, Massachusetts General Hospital, Boston, MA, USA Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA

Collapse

Pellegrini C, Navab N, Kazi A. Unsupervised pre-training of graph transformers on patient population graphs. Med Image Anal 2023;89:102895. [PMID: 37473609 DOI: 10.1016/j.media.2023.102895] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Revised: 07/06/2023] [Accepted: 07/07/2023] [Indexed: 07/22/2023]

Liu P, Wang Z, Liu N, Peres MA. A scoping review of the clinical application of machine learning in data-driven population segmentation analysis. J Am Med Inform Assoc 2023;30:1573-1582. [PMID: 37369006 PMCID: PMC10436153 DOI: 10.1093/jamia/ocad111] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Revised: 06/08/2023] [Accepted: 06/16/2023] [Indexed: 06/29/2023] Open

Xu J, Yin R, Huang Y, Gao H, Wu Y, Guo J, Smith GE, DeKosky ST, Wang F, Guo Y, Bian J. Identification of Outcome-Oriented Progression Subtypes from Mild Cognitive Impairment to Alzheimer's Disease Using Electronic Health Records. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.07.27.23293270. [PMID: 37577594 PMCID: PMC10418300 DOI: 10.1101/2023.07.27.23293270] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/15/2023]

van der Haar D, Moustafa A, Warren SL, Alashwal H, van Zyl T. An Alzheimer's disease category progression sub-grouping analysis using manifold learning on ADNI. Sci Rep 2023;13:10483. [PMID: 37380746 DOI: 10.1038/s41598-023-37569-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Accepted: 06/23/2023] [Indexed: 06/30/2023] Open

Boussina A, Wardi G, Shashikumar SP, Malhotra A, Zheng K, Nemati S. Representation Learning and Spectral Clustering for the Development and External Validation of Dynamic Sepsis Phenotypes: Observational Cohort Study. J Med Internet Res 2023;25:e45614. [PMID: 37351927 PMCID: PMC10337434 DOI: 10.2196/45614] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2023] [Revised: 02/28/2023] [Accepted: 05/22/2023] [Indexed: 06/24/2023] Open

Abstract

BACKGROUND

Recent attempts at clinical phenotyping for sepsis have shown promise in identifying groups of patients with distinct treatment responses. Nonetheless, the replicability and actionability of these phenotypes remain an issue because the patient trajectory is a function of both the patient's physiological state and the interventions they receive.

OBJECTIVE

We aimed to develop a novel approach for deriving clinical phenotypes using unsupervised learning and transition modeling.

METHODS

Forty commonly used clinical variables from the electronic health record were used as inputs to a feed-forward neural network trained to predict the onset of sepsis. Using spectral clustering on the representations from this network, we derived and validated consistent phenotypes across a diverse cohort of patients with sepsis. We modeled phenotype dynamics as a Markov decision process with transitions as a function of the patient's current state and the interventions they received.

RESULTS

Four consistent and distinct phenotypes were derived from over 11,500 adult patients who were admitted from the University of California, San Diego emergency department (ED) with sepsis between January 1, 2016, and January 31, 2020. Over 2000 adult patients admitted from the University of California, Irvine ED with sepsis between November 4, 2017, and August 4, 2022, were involved in the external validation. We demonstrate that sepsis phenotypes are not static and evolve in response to physiological factors and based on interventions. We show that roughly 45% of patients change phenotype membership within the first 6 hours of ED arrival. We observed consistent trends in patient dynamics as a function of interventions including early administration of antibiotics.

CONCLUSIONS

We derived and describe 4 sepsis phenotypes present within 6 hours of triage in the ED. We observe that the administration of a 30 mL/kg fluid bolus may be associated with worse outcomes in certain phenotypes, whereas prompt antimicrobial therapy is associated with improved outcomes.

Collapse

Wang M, Sushil M, Miao BY, Butte AJ. Bottom-up and top-down paradigms of artificial intelligence research approaches to healthcare data science using growing real-world big data. J Am Med Inform Assoc 2023;30:1323-1332. [PMID: 37187158 PMCID: PMC10280344 DOI: 10.1093/jamia/ocad085] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Revised: 04/03/2023] [Accepted: 05/04/2023] [Indexed: 05/17/2023] Open

Xu R, Ali MK, Ho JC, Yang C. Hypergraph Transformers for EHR-based Clinical Predictions. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE PROCEEDINGS. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE 2023;2023:582-591. [PMID: 37350881 PMCID: PMC10283128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/24/2023]

Penrod N, Okeh C, Velez Edwards DR, Barnhart K, Senapati S, Verma SS. Leveraging electronic health record data for endometriosis research. Front Digit Health 2023;5:1150687. [PMID: 37342866 PMCID: PMC10278662 DOI: 10.3389/fdgth.2023.1150687] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Accepted: 05/10/2023] [Indexed: 06/23/2023] Open

Soman K, Nelson CA, Cerono G, Goldman SM, Baranzini SE, Brown EG. Early detection of Parkinson's disease through enriching the electronic health record using a biomedical knowledge graph. Front Med (Lausanne) 2023;10:1081087. [PMID: 37250641 PMCID: PMC10217780 DOI: 10.3389/fmed.2023.1081087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Accepted: 04/18/2023] [Indexed: 05/31/2023] Open

Tariq A, Tang S, Sakhi H, Celi LA, Newsome JM, Rubin DL, Trivedi H, Gichoya JW, Banerjee I. Fusion of imaging and non-imaging data for disease trajectory prediction for coronavirus disease 2019 patients. J Med Imaging (Bellingham) 2023;10:034004. [PMID: 37388280 PMCID: PMC10306115 DOI: 10.1117/1.jmi.10.3.034004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Revised: 06/07/2023] [Accepted: 06/13/2023] [Indexed: 07/01/2023] Open

Abstract

Purpose

Our study investigates whether graph-based fusion of imaging data with non-imaging electronic health records (EHR) data can improve the prediction of the disease trajectories for patients with coronavirus disease 2019 (COVID-19) beyond the prediction performance of only imaging or non-imaging EHR data.

Approach

We present a fusion framework for fine-grained clinical outcome prediction [discharge, intensive care unit (ICU) admission, or death] that fuses imaging and non-imaging information using a similarity-based graph structure. Node features are represented by image embedding, and edges are encoded with clinical or demographic similarity.

Results

Experiments on data collected from the Emory Healthcare Network indicate that our fusion modeling scheme performs consistently better than predictive models developed using only imaging or non-imaging features, with area under the receiver operating characteristics curve of 0.76, 0.90, and 0.75 for discharge from hospital, mortality, and ICU admission, respectively. External validation was performed on data collected from the Mayo Clinic. Our scheme highlights known biases in the model prediction, such as bias against patients with alcohol abuse history and bias based on insurance status.

Conclusions

Our study signifies the importance of the fusion of multiple data modalities for the accurate prediction of clinical trajectories. The proposed graph structure can model relationships between patients based on non-imaging EHR data, and graph convolutional networks can fuse this relationship information with imaging data to effectively predict future disease trajectory more effectively than models employing only imaging or non-imaging data. Our graph-based fusion modeling frameworks can be easily extended to other prediction tasks to efficiently combine imaging data with non-imaging clinical data.

Collapse

Misra S, Wagner R, Ozkan B, Schön M, Sevilla-Gonzalez M, Prystupa K, Wang CC, Kreienkamp RJ, Cromer SJ, Rooney MR, Duan D, Thuesen ACB, Wallace AS, Leong A, Deutsch AJ, Andersen MK, Billings LK, Eckel RH, Sheu WHH, Hansen T, Stefan N, Goodarzi MO, Ray D, Selvin E, Florez JC, Meigs JB, Udler MS. Systematic review of precision subclassification of type 2 diabetes. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.04.19.23288577. [PMID: 37131632 PMCID: PMC10153304 DOI: 10.1101/2023.04.19.23288577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]

Alfalahi H, Dias SB, Khandoker AH, Chaudhuri KR, Hadjileontiadis LJ. A scoping review of neurodegenerative manifestations in explainable digital phenotyping. NPJ Parkinsons Dis 2023;9:49. [PMID: 36997573 PMCID: PMC10063633 DOI: 10.1038/s41531-023-00494-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Accepted: 03/16/2023] [Indexed: 04/03/2023] Open

Chen A, Lu R, Han R, Huang R, Qin G, Wen J, Li Q, Zhang Z, Jiang W. Building Practical Risk Prediction Models for Nasopharyngeal Carcinoma Screening with Patient Graph Analysis and Machine Learning. Cancer Epidemiol Biomarkers Prev 2023;32:274-280. [PMID: 36480263 DOI: 10.1158/1055-9965.epi-22-0792] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2022] [Revised: 09/07/2022] [Accepted: 12/06/2022] [Indexed: 12/13/2022] Open

Li F, Wu P, Ong HH, Peterson JF, Wei WQ, Zhao J. Evaluating and mitigating bias in machine learning models for cardiovascular disease prediction. J Biomed Inform 2023;138:104294. [PMID: 36706849 PMCID: PMC11104322 DOI: 10.1016/j.jbi.2023.104294] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Revised: 01/16/2023] [Accepted: 01/21/2023] [Indexed: 01/26/2023]

Abstract

OBJECTIVE

The study aims to investigate whether machine learning-based predictive models for cardiovascular disease (CVD) risk assessment show equivalent performance across demographic groups (such as race and gender) and if bias mitigation methods can reduce any bias present in the models. This is important as systematic bias may be introduced when collecting and preprocessing health data, which could affect the performance of the models on certain demographic sub-cohorts. The study is to investigate this using electronic health records data and various machine learning models.

METHODS

The study used large de-identified Electronic Health Records data from Vanderbilt University Medical Center. Machine learning (ML) algorithms including logistic regression, random forest, gradient-boosting trees, and long short-term memory were applied to build multiple predictive models. Model bias and fairness were evaluated using equal opportunity difference (EOD, 0 indicates fairness) and disparate impact (DI, 1 indicates fairness). In our study, we also evaluated the fairness of a non-ML baseline model, the American Heart Association (AHA) Pooled Cohort Risk Equations (PCEs). Moreover, we compared the performance of three different de-biasing methods: removing protected attributes (e.g., race and gender), resampling the imbalanced training dataset by sample size, and resampling by the proportion of people with CVD outcomes.

RESULTS

The study cohort included 109,490 individuals (mean [SD] age 47.4 [14.7] years; 64.5% female; 86.3% White; 13.7% Black). The experimental results suggested that most ML models had smaller EOD and DI than PCEs. For ML models, the mean EOD ranged from -0.001 to 0.018 and the mean DI ranged from 1.037 to 1.094 across race groups. There was a larger EOD and DI across gender groups, with EOD ranging from 0.131 to 0.136 and DI ranging from 1.535 to 1.587. For debiasing methods, removing protected attributes didn't significantly reduced the bias for most ML models. Resampling by sample size also didn't consistently decrease bias. Resampling by case proportion reduced the EOD and DI for gender groups but slightly reduced accuracy in many cases.

CONCLUSIONS

Among the VUMC cohort, both PCEs and ML models were biased against women, suggesting the need to investigate and correct gender disparities in CVD risk prediction. Resampling by proportion reduced the bias for gender groups but not for race groups.

Collapse

Explaining predictive factors in patient pathways using autoencoders. PLoS One 2022;17:e0277135. [DOI: 10.1371/journal.pone.0277135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Accepted: 10/20/2022] [Indexed: 11/12/2022] Open

Carruthers R, Straw I, Ruffle JK, Herron D, Nelson A, Bzdok D, Fernandez-Reyes D, Rees G, Nachev P. Representational ethical model calibration. NPJ Digit Med 2022;5:170. [PMID: 36333390 PMCID: PMC9636204 DOI: 10.1038/s41746-022-00716-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Accepted: 10/19/2022] [Indexed: 11/06/2022] Open

Chen A, Chen DO. Simulation of a machine learning enabled learning health system for risk prediction using synthetic patient data. Sci Rep 2022;12:17917. [PMID: 36289292 PMCID: PMC9606301 DOI: 10.1038/s41598-022-23011-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Accepted: 10/21/2022] [Indexed: 01/20/2023] Open

Zou Y, Pesaranghader A, Song Z, Verma A, Buckeridge DL, Li Y. Modeling electronic health record data using an end-to-end knowledge-graph-informed topic model. Sci Rep 2022;12:17868. [PMID: 36284225 PMCID: PMC9596500 DOI: 10.1038/s41598-022-22956-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Accepted: 10/21/2022] [Indexed: 01/20/2023] Open

Dileep G, Gianchandani Gyani SG. Artificial Intelligence in Breast Cancer Screening and Diagnosis. Cureus 2022;14:e30318. [PMID: 36381716 PMCID: PMC9650950 DOI: 10.7759/cureus.30318] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Accepted: 10/15/2022] [Indexed: 11/05/2022] Open

Zucco AG, Agius R, Svanberg R, Moestrup KS, Marandi RZ, MacPherson CR, Lundgren J, Ostrowski SR, Niemann CU. Personalized survival probabilities for SARS-CoV-2 positive patients by explainable machine learning. Sci Rep 2022;12:13879. [PMID: 35974050 PMCID: PMC9380679 DOI: 10.1038/s41598-022-17953-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 08/03/2022] [Indexed: 01/08/2023] Open

Preparing for the next pandemic via transfer learning from existing diseases with hierarchical multi-modal BERT: a study on COVID-19 outcome prediction. Sci Rep 2022;12:10748. [PMID: 35750878 PMCID: PMC9232529 DOI: 10.1038/s41598-022-13072-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2021] [Accepted: 05/20/2022] [Indexed: 11/14/2022] Open

Abstract

Developing prediction models for emerging infectious diseases from relatively small numbers of cases is a critical need for improving pandemic preparedness. Using COVID-19 as an exemplar, we propose a transfer learning methodology for developing predictive models from multi-modal electronic healthcare records by leveraging information from more prevalent diseases with shared clinical characteristics. Our novel hierarchical, multi-modal model (\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\textsc {TransMED}}$$\end{document}TRANSMED) integrates baseline risk factors from the natural language processing of clinical notes at admission, time-series measurements of biomarkers obtained from laboratory tests, and discrete diagnostic, procedure and drug codes. We demonstrate the alignment of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\textsc {TransMED}}$$\end{document}TRANSMED’s predictions with well-established clinical knowledge about COVID-19 through univariate and multivariate risk factor driven sub-cohort analysis. \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\textsc {TransMED}}$$\end{document}TRANSMED’s superior performance over state-of-the-art methods shows that leveraging patient data across modalities and transferring prior knowledge from similar disorders is critical for accurate prediction of patient outcomes, and this approach may serve as an important tool in the early response to future pandemics.

Collapse

Maurits MP, Korsunsky I, Raychaudhuri S, Murphy SN, Smoller JW, Weiss ST, Petukhova LM, Weng C, Wei WQ, Huizinga TWJ, Reinders MJT, Karlson EW, van den Akker EB, Knevel R. A framework for employing longitudinally collected multicenter electronic health records to stratify heterogeneous patient populations on disease history. J Am Med Inform Assoc 2022;29:761-769. [PMID: 35139533 PMCID: PMC9122640 DOI: 10.1093/jamia/ocac008] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2021] [Revised: 11/24/2021] [Accepted: 01/27/2022] [Indexed: 11/23/2022] Open

Abstract

OBJECTIVE

To facilitate patient disease subset and risk factor identification by constructing a pipeline which is generalizable, provides easily interpretable results, and allows replication by overcoming electronic health records (EHRs) batch effects.

MATERIAL AND METHODS

We used 1872 billing codes in EHRs of 102 880 patients from 12 healthcare systems. Using tools borrowed from single-cell omics, we mitigated center-specific batch effects and performed clustering to identify patients with highly similar medical history patterns across the various centers. Our visualization method (PheSpec) depicts the phenotypic profile of clusters, applies a novel filtering of noninformative codes (Ranked Scope Pervasion), and indicates the most distinguishing features.

RESULTS

We observed 114 clinically meaningful profiles, for example, linking prostate hyperplasia with cancer and diabetes with cardiovascular problems and grouping pediatric developmental disorders. Our framework identified disease subsets, exemplified by 6 "other headache" clusters, where phenotypic profiles suggested different underlying mechanisms: migraine, convulsion, injury, eye problems, joint pain, and pituitary gland disorders. Phenotypic patterns replicated well, with high correlations of ≥0.75 to an average of 6 (2-8) of the 12 different cohorts, demonstrating the consistency with which our method discovers disease history profiles.

DISCUSSION

Costly clinical research ventures should be based on solid hypotheses. We repurpose methods from single-cell omics to build these hypotheses from observational EHR data, distilling useful information from complex data.

CONCLUSION

We establish a generalizable pipeline for the identification and replication of clinically meaningful (sub)phenotypes from widely available high-dimensional billing codes. This approach overcomes datatype problems and produces comprehensive visualizations of validation-ready phenotypes.

Collapse

Affiliation(s)

Marc P Maurits Department of Rheumatology, Leiden University Medical Center, Leiden, The Netherlands Leiden Computational Biology Center, Leiden University Medical Center, Leiden, The Netherlands
Ilya Korsunsky Center for Data Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
Soumya Raychaudhuri Center for Data Sciences, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
Shawn N Murphy Research Information Science and Computing, Mass General Brigham, Boston, MA, USA
Jordan W Smoller Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston, MA, USA Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
Scott T Weiss Channing Division of Network Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
Lynn M Petukhova Lynn M. Petukhova, Department of Dermatology at NewYork-Presbyterian/Columbia University Medical Center (CUMC)
Chunhua Weng Chunhua Weng, Biomedical Informatics - Columbia University
Wei-Qi Wei Wei-Qi Wei, Biomedical Informatics in the School of Medicine at Vanderbilt University Wei
Thomas W J Huizinga Department of Rheumatology, Leiden University Medical Center, Leiden, The Netherlands
Marcel J T Reinders Leiden Computational Biology Center, Leiden University Medical Center, Leiden, The Netherlands The Delft Bioinformatics Lab, Delft University of Technology, Delft, The Netherlands
Elizabeth W Karlson Division of Rheumatology, Inflammation and Immunity, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
Erik B van den Akker Leiden Computational Biology Center, Leiden University Medical Center, Leiden, The Netherlands Section of Molecular Epidemiology, Leiden University Medical Center, Leiden, The Netherlands
Rachel Knevel Department of Rheumatology, Leiden University Medical Center, Leiden, The Netherlands Division of Rheumatology, Inflammation and Immunity, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA

Collapse

Zeng X, Linwood SL, Liu C. Pretrained transformer framework on pediatric claims data for population specific tasks. Sci Rep 2022;12:3651. [PMID: 35256645 PMCID: PMC8901645 DOI: 10.1038/s41598-022-07545-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2021] [Accepted: 01/28/2022] [Indexed: 11/09/2022] Open

Predictive structured-unstructured interactions in EHR models: A case study of suicide prediction. NPJ Digit Med 2022;5:15. [PMID: 35087182 PMCID: PMC8795240 DOI: 10.1038/s41746-022-00558-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Accepted: 12/13/2021] [Indexed: 11/20/2022] Open

Xie F, Yuan H, Ning Y, Ong MEH, Feng M, Hsu W, Chakraborty B, Liu N. Deep learning for temporal data representation in electronic health records: A systematic review of challenges and methodologies. J Biomed Inform 2021;126:103980. [PMID: 34974189 DOI: 10.1016/j.jbi.2021.103980] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2021] [Revised: 11/07/2021] [Accepted: 12/20/2021] [Indexed: 12/21/2022]

Alexander N, Alexander DC, Barkhof F, Denaxas S. Identifying and evaluating clinical subtypes of Alzheimer's disease in care electronic health records using unsupervised machine learning. BMC Med Inform Decis Mak 2021;21:343. [PMID: 34879829 PMCID: PMC8653614 DOI: 10.1186/s12911-021-01693-6] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2021] [Accepted: 11/15/2021] [Indexed: 02/02/2023] Open

Abstract

BACKGROUND

Alzheimer's disease (AD) is a highly heterogeneous disease with diverse trajectories and outcomes observed in clinical populations. Understanding this heterogeneity can enable better treatment, prognosis and disease management. Studies to date have mainly used imaging or cognition data and have been limited in terms of data breadth and sample size. Here we examine the clinical heterogeneity of Alzheimer's disease patients using electronic health records (EHR) to identify and characterise disease subgroups using multiple clustering methods, identifying clusters which are clinically actionable.

METHODS

We identified AD patients in primary care EHR from the Clinical Practice Research Datalink (CPRD) using a previously validated rule-based phenotyping algorithm. We extracted and included a range of comorbidities, symptoms and demographic features as patient features. We evaluated four different clustering methods (k-means, kernel k-means, affinity propagation and latent class analysis) to cluster Alzheimer's disease patients. We compared clusters on clinically relevant outcomes and evaluated each method using measures of cluster structure, stability, efficiency of outcome prediction and replicability in external data sets.

RESULTS

We identified 7,913 AD patients, with a mean age of 82 and 66.2% female. We included 21 features in our analysis. We observed 5, 2, 5 and 6 clusters in k-means, kernel k-means, affinity propagation and latent class analysis respectively. K-means was found to produce the most consistent results based on four evaluative measures. We discovered a consistent cluster found in three of the four methods composed of predominantly female, younger disease onset (43% between ages 42-73) diagnosed with depression and anxiety, with a quicker rate of progression compared to the average across other clusters.

CONCLUSION

Each clustering approach produced substantially different clusters and K-Means performed the best out of the four methods based on the four evaluative criteria. However, the consistent appearance of one particular cluster across three of the four methods potentially suggests the presence of a distinct disease subtype that merits further exploration. Our study underlines the variability of the results obtained from different clustering approaches and the importance of systematically evaluating different approaches for identifying disease subtypes in complex EHR.

Collapse

Chen J, Sun L, Yu K, Batmanghelich K. Extracting Disease-Relevant Features with Adversarial Regularization. PROCEEDINGS. IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE 2021;2021:3464-3471. [PMID: 35198261 PMCID: PMC8863436 DOI: 10.1109/bibm52615.2021.9669878] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]

Searle T, Ibrahim Z, Teo J, Dobson R. Estimating redundancy in clinical text. J Biomed Inform 2021;124:103938. [PMID: 34695581 DOI: 10.1016/j.jbi.2021.103938] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Revised: 08/19/2021] [Accepted: 10/17/2021] [Indexed: 12/15/2022]

Lee S, Kang S, Eun Y, Won HH, Kim H, Cha HS, Koh EM, Lee J. A cluster analysis of patients with axial spondyloarthritis using tumour necrosis factor alpha inhibitors based on clinical characteristics. Arthritis Res Ther 2021;23:284. [PMID: 34782006 PMCID: PMC8591959 DOI: 10.1186/s13075-021-02647-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2021] [Accepted: 10/12/2021] [Indexed: 11/24/2022] Open

Wanyan T, Honarvar H, Jaladanki SK, Zang C, Naik N, Somani S, De Freitas JK, Paranjpe I, Vaid A, Zhang J, Miotto R, Wang Z, Nadkarni GN, Zitnik M, Azad A, Wang F, Ding Y, Glicksberg BS. Contrastive Learning Improves Critical Event Prediction in COVID-19 Patients. PATTERNS 2021;2:100389. [PMID: 34723227 PMCID: PMC8542449 DOI: 10.1016/j.patter.2021.100389] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/11/2021] [Revised: 09/12/2021] [Accepted: 10/21/2021] [Indexed: 12/30/2022]

Affiliation(s)

Tingyi Wanyan Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, USA.,School of Informatics, Computing, and Engineering, Indiana University, Bloomington, IN, USA
Hossein Honarvar Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Suraj K Jaladanki Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Chengxi Zang Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, USA
Nidhi Naik Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Sulaiman Somani Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Jessica K De Freitas Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, USA.,Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Ishan Paranjpe Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Akhil Vaid Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Jing Zhang Renmin University of China, Beijing, China
Riccardo Miotto Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Zhangyang Wang Department of Electrical and Computer Engineering, University of Texas at Austin, Austin, TX, USA
Girish N Nadkarni Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, USA.,Division of Nephrology, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA.,The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY
Marinka Zitnik Department of Biomedical Informatics, Harvard University, USA
Ariful Azad School of Informatics, Computing, and Engineering, Indiana University, Bloomington, IN, USA
Fei Wang Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, USA
Ying Ding Dell Medical School, University of Texas at Austin, Austin, TX, USA.,School of Informatics, University of Texas at Austin, Austin, TX, USA
Benjamin S Glicksberg Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, USA.,Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA

Collapse

Peng J, Jury EC, Dönnes P, Ciurtin C. Machine Learning Techniques for Personalised Medicine Approaches in Immune-Mediated Chronic Inflammatory Diseases: Applications and Challenges. Front Pharmacol 2021;12:720694. [PMID: 34658859 PMCID: PMC8514674 DOI: 10.3389/fphar.2021.720694] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Accepted: 09/14/2021] [Indexed: 12/12/2022] Open

Ziletti A, Berns C, Treichel O, Weber T, Liang J, Kammerath S, Schwaerzler M, Virayah J, Ruau D, Ma X, Mattern A. Discovering Key Topics From Short, Real-World Medical Inquiries via Natural Language Processing. FRONTIERS IN COMPUTER SCIENCE 2021. [DOI: 10.3389/fcomp.2021.672867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open