1
|
Paolo D, Greco C, Cortellini A, Ramella S, Soda P, Bria A, Sicilia R. Hierarchical embedding attention for overall survival prediction in lung cancer from unstructured EHRs. BMC Med Inform Decis Mak 2025; 25:169. [PMID: 40251623 PMCID: PMC12007135 DOI: 10.1186/s12911-025-02998-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2024] [Accepted: 04/07/2025] [Indexed: 04/20/2025] Open
Abstract
The automated processing of Electronic Health Records (EHRs) poses a significant challenge due to their unstructured nature, rich in valuable, yet disorganized information. Natural Language Processing (NLP), particularly Named Entity Recognition (NER), has been instrumental in extracting structured information from EHR data. However, existing literature primarly focuses on extracting handcrafted clinical features through NLP and NER methods without delving into their learned representations. In this work, we explore the untapped potential of these representations by considering their contextual richness and entity-specific information. Our proposed methodology extracts representations generated by a transformer-based NER model on EHRs data, combines them using a hierarchical attention mechanism, and employs the obtained enriched representation as input for a clinical prediction model. Specifically, this study addresses Overall Survival (OS) in Non-Small Cell Lung Cancer (NSCLC) using unstructured EHRs data collected from an Italian clinical centre encompassing 838 records from 231 lung cancer patients. Whilst our study is applied on EHRs written in Italian, it serves as use case to prove the effectiveness of extracting and employing high level textual representations that capture relevant information as named entities. Our methodology is interpretable because the hierarchical attention mechanism highlights the information in EHRs that the model considers the most crucial during the decision-making process. We validated this interpretability by measuring the agreement of domain experts on the importance assigned by the hierarchical attention mechanism to EHRs information through a questionnaire. Results demonstrate the effectiveness of our method, showcasing statistically significant improvements over traditional manually extracted clinical features.
Collapse
Affiliation(s)
- Domenico Paolo
- Unit of Computer Systems & Bioinformatics, Department of Engineering, University Campus Bio-Medico di Roma, Roma, Italy
| | - Carlo Greco
- Research Unit of Radiation Oncology, Department of Medicine and Surgery, University Campus Bio-Medico di Roma, Roma, Italy
- Operative Research Unit of Radiation Oncology, Fondazione Policlinico Universitario Campus Bio-Medico, Roma, Italy
| | - Alessio Cortellini
- Operative Research Unit of Medical Oncology, Fondazione Policlinico Universitario Campus Bio-Medico, Roma, Italy
| | - Sara Ramella
- Research Unit of Radiation Oncology, Department of Medicine and Surgery, University Campus Bio-Medico di Roma, Roma, Italy
- Operative Research Unit of Radiation Oncology, Fondazione Policlinico Universitario Campus Bio-Medico, Roma, Italy
| | - Paolo Soda
- Unit of Computer Systems & Bioinformatics, Department of Engineering, University Campus Bio-Medico di Roma, Roma, Italy.
- Department of Diagnostics and Intervention, Radiation Physics, Umeå University, Umeå, Sweden.
| | - Alessandro Bria
- Department of Electrical and Information Engineering, University of Cassino and Southern Latium, Cassino, Italy
| | - Rosa Sicilia
- Unit of Computer Systems & Bioinformatics, Department of Engineering, University Campus Bio-Medico di Roma, Roma, Italy
| |
Collapse
|
2
|
Francesconi A, di Biase L, Cappetta D, Rebecchi F, Soda P, Sicilia R, Guarrasi V. Class balancing diversity multimodal ensemble for Alzheimer's disease diagnosis and early detection. Comput Med Imaging Graph 2025; 123:102529. [PMID: 40147216 DOI: 10.1016/j.compmedimag.2025.102529] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2024] [Revised: 03/07/2025] [Accepted: 03/13/2025] [Indexed: 03/29/2025]
Abstract
Alzheimer's disease (AD) poses significant global health challenges due to its increasing prevalence and associated societal costs. Early detection and diagnosis of AD are critical for delaying progression and improving patient outcomes. Traditional diagnostic methods and single-modality data often fall short in identifying early-stage AD and distinguishing it from Mild Cognitive Impairment (MCI). This study addresses these challenges by introducing a novel approach: multImodal enseMble via class BALancing diversity for iMbalancEd Data (IMBALMED). IMBALMED integrates multimodal data from the Alzheimer's Disease Neuroimaging Initiative database, including clinical assessments, neuroimaging phenotypes, biospecimen, and subject characteristics data. It employs a new ensemble of model classifiers, designed specifically for this framework, which combines eight distinct families of learning paradigms trained with diverse class balancing techniques to overcome class imbalance and enhance model accuracy. We evaluate IMBALMED on two diagnostic tasks (binary and ternary classification) and four binary early detection tasks (at 12, 24, 36, and 48 months), comparing its performance with state-of-the-art algorithms and an unbalanced dataset method. To further validate the proposed model and ensure genuine generalization to real-world scenarios, we conducted an external validation experiment using data from the most recent phase of the ADNI dataset. IMBALMED demonstrates superior diagnostic accuracy and predictive performance in both binary and ternary classification tasks, significantly improving early detection of MCI at a 48-month time point and showing excellent generalizability in the 12-month task during external validation. The method shows improved classification performance and robustness, offering a promising solution for early detection and management of AD.
Collapse
Affiliation(s)
- Arianna Francesconi
- Unit of Computer Systems and Bioinformatics, Department of Engineering, Università Campus Bio-Medico di Roma, Rome, Italy.
| | - Lazzaro di Biase
- Operative Research Unit of Neurology, Fondazione Policlinico Universitario Campus Bio-Medico, Rome, Italy.
| | - Donato Cappetta
- Eustema S.p.A., Research and Development Centre, Naples, Italy.
| | - Fabio Rebecchi
- Eustema S.p.A., Research and Development Centre, Naples, Italy.
| | - Paolo Soda
- Unit of Computer Systems and Bioinformatics, Department of Engineering, Università Campus Bio-Medico di Roma, Rome, Italy; Department of Diagnostic and Intervention, Radiation Physics, Biomedical Engineering, Umeå University, Umeå, Sweden.
| | - Rosa Sicilia
- Unit of Computer Systems and Bioinformatics, Department of Engineering, Università Campus Bio-Medico di Roma, Rome, Italy.
| | - Valerio Guarrasi
- Unit of Computer Systems and Bioinformatics, Department of Engineering, Università Campus Bio-Medico di Roma, Rome, Italy.
| |
Collapse
|
3
|
Di Teodoro G, Siciliano F, Guarrasi V, Vandamme AM, Ghisetti V, Sönnerborg A, Zazzi M, Silvestri F, Palagi L. A graph neural network-based model with out-of-distribution robustness for enhancing antiretroviral therapy outcome prediction for HIV-1. Comput Med Imaging Graph 2025; 120:102484. [PMID: 39808870 DOI: 10.1016/j.compmedimag.2024.102484] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2024] [Revised: 11/16/2024] [Accepted: 12/23/2024] [Indexed: 01/16/2025]
Abstract
Predicting the outcome of antiretroviral therapies (ART) for HIV-1 is a pressing clinical challenge, especially when the ART includes drugs with limited effectiveness data. This scarcity of data can arise either due to the introduction of a new drug to the market or due to limited use in clinical settings, resulting in clinical dataset with highly unbalanced therapy representation. To tackle this issue, we introduce a novel joint fusion model, which combines features from a Fully Connected (FC) Neural Network and a Graph Neural Network (GNN) in a multi-modality fashion. Our model uses both tabular data about genetic sequences and a knowledge base derived from Stanford drug-resistance mutation tables, which serve as benchmark references for deducing in-vivo treatment efficacy based on the viral genetic sequence. By leveraging this knowledge base structured as a graph, the GNN component enables our model to adapt to imbalanced data distributions and account for Out-of-Distribution (OoD) drugs. We evaluated these models' robustness against OoD drugs in the test set. Our comprehensive analysis demonstrates that the proposed model consistently outperforms the FC model. These results underscore the advantage of integrating Stanford scores in the model, thereby enhancing its generalizability and robustness, but also extending its utility in contributing in more informed clinical decisions with limited data availability. The source code is available at https://github.com/federicosiciliano/graph-ood-hiv.
Collapse
Affiliation(s)
- Giulia Di Teodoro
- Sapienza University of Rome, Department of Computer Control and Management Engineering Antonio Ruberti, 00185, Rome, Italy; EuResist Network, 00152, Rome, Italy.
| | - Federico Siciliano
- Sapienza University of Rome, Department of Computer Control and Management Engineering Antonio Ruberti, 00185, Rome, Italy.
| | - Valerio Guarrasi
- Unit of Computer Systems and Bioinformatics, Department of Engineering, University Campus Bio-Medico of Rome, 00128, Rome, Italy.
| | - Anne-Mieke Vandamme
- KU Leuven, Department of Microbiology, Immunology and Transplantation, Rega Institute for Medical Research, Clinical and Epidemiological Virology, Leuven, Belgium; Center for Global Health and Tropical Medicine, Instituto de Higiene e Medicina Tropical, Universidade Nova de Lisboa, 1349-008, Lisbon, Portugal.
| | - Valeria Ghisetti
- Molecular Biology and Microbiology Unit, Amedeo di Savoia Hospital, ASL Città di Torino, 10128, Turin, Italy.
| | - Anders Sönnerborg
- Karolinska Institutet, Division of Infectious Diseases, Department of Medicine Huddinge, 14152, Stockholm, Sweden; Karolinska University Hospital, Department of Infectious Diseases, 14186, Stockholm, Sweden.
| | - Maurizio Zazzi
- Department of Medical Biotechnologies, University of Siena, 53100, Siena, Italy.
| | - Fabrizio Silvestri
- Sapienza University of Rome, Department of Computer Control and Management Engineering Antonio Ruberti, 00185, Rome, Italy.
| | - Laura Palagi
- Sapienza University of Rome, Department of Computer Control and Management Engineering Antonio Ruberti, 00185, Rome, Italy.
| |
Collapse
|
4
|
Mogensen K, Guarrasi V, Larsson J, Hansson W, Wåhlin A, Koskinen LO, Malm J, Eklund A, Soda P, Qvarlander S. An optimized ensemble search approach for classification of higher-level gait disorder using brain magnetic resonance images. Comput Biol Med 2025; 184:109457. [PMID: 39615237 DOI: 10.1016/j.compbiomed.2024.109457] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Revised: 11/18/2024] [Accepted: 11/19/2024] [Indexed: 12/22/2024]
Abstract
Higher-Level Gait Disorder (HLGD) is a type of gait disorder estimated to affect up to 6% of the older population. By definition, its symptoms originate from the higher-level nervous system, yet its association with brain morphology remains unclear. This study hypothesizes that there are patterns in brain morphology linked to HLGD. For the first time in the literature, this work investigates whether deep learning, in the form of convolutional neural networks, can capture patterns in magnetic resonance images to identify individuals affected by HLGD. To handle this new classification task, we propose setting up an ensemble of models. This leverages the benefits of combining classifiers instead of determining which network is the most suitable, developing a new architecture, or customizing an existing one. We introduce a computationally cost-effective search algorithm to find the optimal ensemble by leveraging a cost function of both traditional performance scores and the diversity among the models. Using a unique dataset from a large population-based cohort (VESPR), the ensemble identified by our algorithm demonstrated superior performance compared to single networks, other ensemble fusion techniques, and the best linear radiological measure. This emphasizes the importance of implementing diversity into the cost function. Furthermore, the results indicate significant morphological differences in brain structure between HLGD-affected individuals and controls, motivating research about which areas the networks base their classifications on, to get a better understanding of the pathophysiology of HLGD.
Collapse
Affiliation(s)
- Klara Mogensen
- Department of Diagnostics and Intervention, Biomedical Engineering and Radiation Physics, Umeå University, 90187, Sweden.
| | - Valerio Guarrasi
- Unit of Computer Systems and Bioinformatics, Department of Engineering, University Campus Bio-Medico of Rome, 00128, Italy
| | - Jenny Larsson
- Department of Clinical Sciences, Neurosciences, Umeå University, 90187, Sweden
| | - William Hansson
- Department of Clinical Sciences, Neurosciences, Umeå University, 90187, Sweden
| | - Anders Wåhlin
- Department of Diagnostics and Intervention, Biomedical Engineering and Radiation Physics, Umeå University, 90187, Sweden; Umeå Center for Functional Brain Imaging, Umeå University, 90187, Sweden; Department of Applied Physics and Electronics, Umeå University, 90187, Sweden
| | - Lars-Owe Koskinen
- Department of Clinical Sciences, Neurosciences, Umeå University, 90187, Sweden
| | - Jan Malm
- Department of Clinical Sciences, Neurosciences, Umeå University, 90187, Sweden
| | - Anders Eklund
- Department of Diagnostics and Intervention, Biomedical Engineering and Radiation Physics, Umeå University, 90187, Sweden; Umeå Center for Functional Brain Imaging, Umeå University, 90187, Sweden
| | - Paolo Soda
- Department of Diagnostics and Intervention, Biomedical Engineering and Radiation Physics, Umeå University, 90187, Sweden; Unit of Computer Systems and Bioinformatics, Department of Engineering, University Campus Bio-Medico of Rome, 00128, Italy
| | - Sara Qvarlander
- Department of Diagnostics and Intervention, Biomedical Engineering and Radiation Physics, Umeå University, 90187, Sweden
| |
Collapse
|
5
|
Cordelli E, Soda P, Citter S, Schiavon E, Salvatore C, Fazzini D, Clementi G, Cellina M, Cozzi A, Bortolotto C, Preda L, Francini L, Tortora M, Castiglioni I, Papa S, Sona D, Alì M. Machine learning predicts pulmonary Long Covid sequelae using clinical data. BMC Med Inform Decis Mak 2024; 24:359. [PMID: 39604988 PMCID: PMC11600907 DOI: 10.1186/s12911-024-02745-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2024] [Accepted: 10/25/2024] [Indexed: 11/29/2024] Open
Abstract
Long COVID is a multi-systemic disease characterized by the persistence or occurrence of many symptoms that in many cases affect the pulmonary system. These, in turn, may deteriorate the patient's quality of life making it easier to develop severe complications. Being able to predict this syndrome is therefore important as this enables early treatment. In this work, we investigated three machine learning approaches that use clinical data collected at the time of hospitalization to this goal. The first works with all the descriptors feeding a traditional shallow learner, the second exploits the benefits of an ensemble of classifiers, and the third is driven by the intrinsic multimodality of the data so that different models learn complementary information. The experiments on a new cohort of data from 152 patients show that it is possible to predict pulmonary Long Covid sequelae with an accuracy of up to 94 % . As a further contribution, this work also publicly discloses the related data repository to foster research in this field.
Collapse
Affiliation(s)
- Ermanno Cordelli
- Unit of Computer Systems and Bioinformatics, Department of Engineering, University Campus Bio-Medico of Rome, Via Alvaro del Portillo 21, Rome, 00128, Italy
| | - Paolo Soda
- Unit of Computer Systems and Bioinformatics, Department of Engineering, University Campus Bio-Medico of Rome, Via Alvaro del Portillo 21, Rome, 00128, Italy.
- Department of Diagnostics and Intervention, Radiation Physics, Biomedical Engineering, Umeå University, Universitetstorget 4, 901 87, Umeå, Sweden.
| | - Sara Citter
- Fondazione Bruno Kessler, Via Sommarive, 18, Trento, 38123, Italy
- Department of Physics, University of Trento, Via Sommarive, 14, Trento, 38123, Italy
| | - Elia Schiavon
- DeepTrace Technologies S.R.L., Via Conservatorio 17, Milan, 20122, MI, Italy
| | - Christian Salvatore
- DeepTrace Technologies S.R.L., Via Conservatorio 17, Milan, 20122, MI, Italy
- Department of Science, Technology and Society, University School for Advanced Studies IUSS Pavia, 27100, Pavia, Italy
| | - Deborah Fazzini
- Department of Diagnostic Imaging and Stereotactic Radiosurgey, Centro Diagnostico Italiano S.p.A., Via S. Saint Bon 20, Milan, 20147, Italy
| | - Greta Clementi
- Department of Diagnostic Imaging and Stereotactic Radiosurgey, Centro Diagnostico Italiano S.p.A., Via S. Saint Bon 20, Milan, 20147, Italy
| | - Michaela Cellina
- Radiology Department, ASST Fatebenefratelli Sacco, Piazza Principessa Clotilde 3, Milan, 20121, Italy
| | - Andrea Cozzi
- Imaging Institute of Southern Switzerland (IIMSI), Ente Ospedaliero Cantonale (EOC), Lugano, Switzerland
| | - Chandra Bortolotto
- Radiology Unit, Department of Clinical, Surgical, Diagnostic, and Pediatric Sciences, University of Pavia, Corso Str. Nuova, 65, Pavia, 27100, Italy
- Radiology Institute, Fondazione IRCCS Policlinico San Matteo, Viale Golgi 19, Pavia, 27100, Italy
| | - Lorenzo Preda
- Radiology Unit, Department of Clinical, Surgical, Diagnostic, and Pediatric Sciences, University of Pavia, Corso Str. Nuova, 65, Pavia, 27100, Italy
- Radiology Institute, Fondazione IRCCS Policlinico San Matteo, Viale Golgi 19, Pavia, 27100, Italy
| | - Luisa Francini
- Unit of Computer Systems and Bioinformatics, Department of Engineering, University Campus Bio-Medico of Rome, Via Alvaro del Portillo 21, Rome, 00128, Italy
| | - Matteo Tortora
- Unit of Computer Systems and Bioinformatics, Department of Engineering, University Campus Bio-Medico of Rome, Via Alvaro del Portillo 21, Rome, 00128, Italy
- Department of Naval, Electrical, Electronics and Telecommunications Engineering, University of Genova, Via all'Opera Pia 11a, Genoa, 16145, Italy
| | - Isabella Castiglioni
- Department of Physics G. Occhialini, University of Milan-Bicocca, 20133, Milan, Italy
| | - Sergio Papa
- Department of Diagnostic Imaging and Stereotactic Radiosurgey, Centro Diagnostico Italiano S.p.A., Via S. Saint Bon 20, Milan, 20147, Italy
| | - Diego Sona
- Fondazione Bruno Kessler, Via Sommarive, 18, Trento, 38123, Italy
| | - Marco Alì
- Department of Diagnostic Imaging and Stereotactic Radiosurgey, Centro Diagnostico Italiano S.p.A., Via S. Saint Bon 20, Milan, 20147, Italy
- Bracco Imaging S.p.A., Via Caduti di Marcinelle 13, Milan, 20134, Italy
| |
Collapse
|
6
|
Teoh JR, Dong J, Zuo X, Lai KW, Hasikin K, Wu X. Advancing healthcare through multimodal data fusion: a comprehensive review of techniques and applications. PeerJ Comput Sci 2024; 10:e2298. [PMID: 39650483 PMCID: PMC11623190 DOI: 10.7717/peerj-cs.2298] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2024] [Accepted: 08/09/2024] [Indexed: 12/11/2024]
Abstract
With the increasing availability of diverse healthcare data sources, such as medical images and electronic health records, there is a growing need to effectively integrate and fuse this multimodal data for comprehensive analysis and decision-making. However, despite its potential, multimodal data fusion in healthcare remains limited. This review paper provides an overview of existing literature on multimodal data fusion in healthcare, covering 69 relevant works published between 2018 and 2024. It focuses on methodologies that integrate different data types to enhance medical analysis, including techniques for integrating medical images with structured and unstructured data, combining multiple image modalities, and other features. Additionally, the paper reviews various approaches to multimodal data fusion, such as early, intermediate, and late fusion methods, and examines the challenges and limitations associated with these techniques. The potential benefits and applications of multimodal data fusion in various diseases are highlighted, illustrating specific strategies employed in healthcare artificial intelligence (AI) model development. This research synthesizes existing information to facilitate progress in using multimodal data for improved medical diagnosis and treatment planning.
Collapse
Affiliation(s)
- Jing Ru Teoh
- Department of Biomedical Engineering, University of Malaya, Kuala Lumpur, Malaysia
| | - Jian Dong
- China Electronics Standardization Institute, Beijing, China
| | - Xiaowei Zuo
- Department of Psychiatry, The Affiliated Xuzhou Oriental Hospital of Xuzhou Medical University, Xuzhou, Jiangsu, China
| | - Khin Wee Lai
- Department of Biomedical Engineering, University of Malaya, Kuala Lumpur, Malaysia
| | - Khairunnisa Hasikin
- Department of Biomedical Engineering, University of Malaya, Kuala Lumpur, Malaysia
- Faculty of Engineering, Centre of Intelligent Systems for Emerging Technology (CISET), Kuala Lumpur, Malaysia
| | - Xiang Wu
- Department of Biomedical Engineering, University of Malaya, Kuala Lumpur, Malaysia
- Institute of Medical Information Security, Xuzhou, Jiangsu, China
| |
Collapse
|
7
|
Liu L, Cai S, Wu X, Zhu H, Wang Y. Effects of Ward Noise Reduction Administration on Mental Health and Lung Function of Patients with Lung Cancer. Noise Health 2024; 26:235-241. [PMID: 38904829 PMCID: PMC11530107 DOI: 10.4103/nah.nah_98_23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Revised: 02/28/2024] [Accepted: 02/28/2024] [Indexed: 06/22/2024] Open
Abstract
OBJECTIVE This study aimed to analyze the effects of ward noise reduction administration on the lung function and mental health of patients with lung cancer. METHODS A total of 195 patients who underwent lung cancer surgery in PLA Northern Theater Command Air Force Hospital from November 2020 to November 2022 were selected to be divided into a control group (routine nursing) and an observation group (routine nursing and ward noise reduction administration) in accordance with the medical record system. The general demographic data, noise level, lung function (forced expiratory volume in 1 s (FEV1), forced vital capacity (FVC) and FEV1/FVC)), and complications of patients in the two groups were collected. Propensity score matching (PSM) was used to balance the baseline data of the two groups, and t-test and chi-square test were used to analyze the data. RESULTS After PSM was conducted, 50 patients were enrolled in each group. No statistical difference was found in the baseline data, preadministration noise levels, and FEV1, FVC, FEV1/FVC, state-anxiety inventory (S-AI), and trait anxiety inventory scale (T-AI) scores between the two groups (P > 0.05). After ward noise reduction was administered, the noise level in the observation group was lower than that in the control group (P < 0.05). The FEV1, FVC, and FEV1/FVC scores of the observation group were higher than those of the control group but were not statistically significant (P > 0.05). The S-AI and T-AI scores of the observation group were lower than those of the control group (P < 0.05). No differences were found in the complications between the two groups (P > 0.05). CONCLUSION Administering ward noise reduction in patients with lung cancer can alleviate their negative emotions, thus worthy of clinical adoption.
Collapse
Affiliation(s)
- Lina Liu
- Department of Radiation Oncology, Air Force Hospital of the PLA Northern Theater Command, Shenyang, 110042, China
| | - Shuo Cai
- Department of Nursing, PLA Northern Theater Command Air Force Hospital, Shenyang, 110042, China
| | - Xiaoyu Wu
- Department of Respiratory and Critical Care Medicine, PLA Northern Theater Command Air Force Hospital, Shenyang, 110042, China
| | - Huixin Zhu
- Department of Nursing, PLA Northern Theater Command Air Force Hospital, Shenyang, 110042, China
| | - Yu Wang
- Department of Pharmacy, PLA Northern Theater Command Air Force Hospital, Shenyang, 110042, China
| |
Collapse
|
8
|
Guarrasi V, Soda P. Multi-objective optimization determines when, which and how to fuse deep networks: An application to predict COVID-19 outcomes. Comput Biol Med 2023; 154:106625. [PMID: 36738713 PMCID: PMC9892294 DOI: 10.1016/j.compbiomed.2023.106625] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Revised: 01/18/2023] [Accepted: 01/28/2023] [Indexed: 02/05/2023]
Abstract
The COVID-19 pandemic has caused millions of cases and deaths and the AI-related scientific community, after being involved with detecting COVID-19 signs in medical images, has been now directing the efforts towards the development of methods that can predict the progression of the disease. This task is multimodal by its very nature and, recently, baseline results achieved on the publicly available AIforCOVID dataset have shown that chest X-ray scans and clinical information are useful to identify patients at risk of severe outcomes. While deep learning has shown superior performance in several medical fields, in most of the cases it considers unimodal data only. In this respect, when, which and how to fuse the different modalities is an open challenge in multimodal deep learning. To cope with these three questions here we present a novel approach optimizing the setup of a multimodal end-to-end model. It exploits Pareto multi-objective optimization working with a performance metric and the diversity score of multiple candidate unimodal neural networks to be fused. We test our method on the AIforCOVID dataset, attaining state-of-the-art results, not only outperforming the baseline performance but also being robust to external validation. Moreover, exploiting XAI algorithms we figure out a hierarchy among the modalities and we extract the features' intra-modality importance, enriching the trust on the predictions made by the model.
Collapse
Affiliation(s)
- Valerio Guarrasi
- Unit of Computer Systems and Bioinformatics, Department of Engineering, University Campus Bio-Medico of Rome, Italy; Department of Computer, Control, and Management Engineering, Sapienza University of Rome, Italy.
| | - Paolo Soda
- Unit of Computer Systems and Bioinformatics, Department of Engineering, University Campus Bio-Medico of Rome, Italy; Department of Radiation Sciences, Radiation Physics, Biomedical Engineering, Umeå, University, Umeå, Sweden.
| |
Collapse
|