1
|
Gliozzo J, Soto-Gomez M, Guarino V, Bonometti A, Cabri A, Cavalleri E, Reese J, Robinson PN, Mesiti M, Valentini G, Casiraghi E. Intrinsic-dimension analysis for guiding dimensionality reduction and data fusion in multi-omics data processing. Artif Intell Med 2025; 160:103049. [PMID: 39673960 DOI: 10.1016/j.artmed.2024.103049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Revised: 12/03/2024] [Accepted: 12/04/2024] [Indexed: 12/16/2024]
Abstract
Multi-omics data have revolutionized biomedical research by providing a comprehensive understanding of biological systems and the molecular mechanisms of disease development. However, analyzing multi-omics data is challenging due to high dimensionality and limited sample sizes, necessitating proper data-reduction pipelines to ensure reliable analyses. Additionally, its multimodal nature requires effective data-integration pipelines. While several dimensionality reduction and data fusion algorithms have been proposed, crucial aspects are often overlooked. Specifically, the choice of projection space dimension is typically heuristic and uniformly applied across all omics, neglecting the unique high dimension small sample size challenges faced by individual omics. This paper introduces a novel multi-modal dimensionality reduction pipeline tailored to individual views. By leveraging intrinsic dimensionality estimators, we assess the curse-of-dimensionality impact on each view and propose a two-step reduction strategy for significantly affected views, combining feature selection with feature extraction. Compared to traditional uniform reduction pipelines in a crucial and supervised multi-omics analysis setting, our approach shows significant improvement. Additionally, we explore three effective unsupervised multi-omics data fusion methods rooted in the main data fusion strategies to gain insights into their performance under crucial, yet overlooked, settings.
Collapse
Affiliation(s)
- Jessica Gliozzo
- AnacletoLab, Computer Science Department, Università degli Studi di Milano, Milan, Italy; European Commission, Joint Research Centre (JRC), Ispra, Italy
| | - Mauricio Soto-Gomez
- AnacletoLab, Computer Science Department, Università degli Studi di Milano, Milan, Italy
| | - Valentina Guarino
- AnacletoLab, Computer Science Department, Università degli Studi di Milano, Milan, Italy
| | - Arturo Bonometti
- Department of Biomedical Sciences, Humanitas University, Milan, Italy; Department of Pathology, IRCCS Humanitas Clinical and Research Hospital, Milan, Italy
| | - Alberto Cabri
- AnacletoLab, Computer Science Department, Università degli Studi di Milano, Milan, Italy
| | - Emanuele Cavalleri
- AnacletoLab, Computer Science Department, Università degli Studi di Milano, Milan, Italy
| | - Justin Reese
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Peter N Robinson
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Marco Mesiti
- AnacletoLab, Computer Science Department, Università degli Studi di Milano, Milan, Italy; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Giorgio Valentini
- AnacletoLab, Computer Science Department, Università degli Studi di Milano, Milan, Italy; CINI, Infolife National Laboratory, Roma, Italy
| | - Elena Casiraghi
- AnacletoLab, Computer Science Department, Università degli Studi di Milano, Milan, Italy; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA; CINI, Infolife National Laboratory, Roma, Italy; Department of Computer Science, Aalto University, Espoo, Finland.
| |
Collapse
|
2
|
Kamalakannan N, Macharla SR, Kanimozhi M, Sudhakar MS. Exponential Pixelating Integral transform with dual fractal features for enhanced chest X-ray abnormality detection. Comput Biol Med 2024; 182:109093. [PMID: 39232407 DOI: 10.1016/j.compbiomed.2024.109093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Revised: 08/25/2024] [Accepted: 08/29/2024] [Indexed: 09/06/2024]
Abstract
The heightened prevalence of respiratory disorders, particularly exacerbated by a significant upswing in fatalities due to the novel coronavirus, underscores the critical need for early detection and timely intervention. This imperative is paramount, possessing the potential to profoundly impact and safeguard numerous lives. Medically, chest radiography stands out as an essential and economically viable medical imaging approach for diagnosing and assessing the severity of diverse Respiratory Disorders. However, their detection in Chest X-Rays is a cumbersome task even for well-trained radiologists owing to low contrast issues, overlapping of the tissue structures, subjective variability, and the presence of noise. To address these issues, a novel analytical model termed Exponential Pixelating Integral is introduced for the automatic detection of infections in Chest X-Rays in this work. Initially, the presented Exponential Pixelating Integral enhances the pixel intensities to overcome the low-contrast issues that are then polar-transformed followed by their representation using the locally invariant Mandelbrot and Julia fractal geometries for effective distinction of structural features. The collated features labeled Exponential Pixelating Integral with dually characterized fractal features are then classified by the non-parametric multivariate adaptive regression splines to establish an ensemble model between each pair of classes for effective diagnosis of diverse diseases. Rigorous analysis of the proposed classification framework on large medical benchmarked datasets showcases its superiority over its peers by registering a higher classification accuracy and F1 scores ranging from 98.46 to 99.45 % and 96.53-98.10 % respectively, making it a precise and interpretable automated system for diagnosing respiratory disorders.
Collapse
Affiliation(s)
| | | | - M Kanimozhi
- School of Electrical & Electronics, Sathyabama Institute of Science and Technology, Chennai, Tamilnadu, India
| | - M S Sudhakar
- School of Electronics Engineering, Vellore Institute of Technology, Vellore, Tamilnadu, India.
| |
Collapse
|
3
|
Chan LE, Casiraghi E, Reese J, Harmon QE, Schaper K, Hegde H, Valentini G, Schmitt C, Motsinger-Reif A, Hall JE, Mungall CJ, Robinson PN, Haendel MA. Predicting nutrition and environmental factors associated with female reproductive disorders using a knowledge graph and random forests. Int J Med Inform 2024; 187:105461. [PMID: 38643701 PMCID: PMC11188727 DOI: 10.1016/j.ijmedinf.2024.105461] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Revised: 04/09/2024] [Accepted: 04/16/2024] [Indexed: 04/23/2024]
Abstract
OBJECTIVE Female reproductive disorders (FRDs) are common health conditions that may present with significant symptoms. Diet and environment are potential areas for FRD interventions. We utilized a knowledge graph (KG) method to predict factors associated with common FRDs (for example, endometriosis, ovarian cyst, and uterine fibroids). MATERIALS AND METHODS We harmonized survey data from the Personalized Environment and Genes Study (PEGS) on internal and external environmental exposures and health conditions with biomedical ontology content. We merged the harmonized data and ontologies with supplemental nutrient and agricultural chemical data to create a KG. We analyzed the KG by embedding edges and applying a random forest for edge prediction to identify variables potentially associated with FRDs. We also conducted logistic regression analysis for comparison. RESULTS Across 9765 PEGS respondents, the KG analysis resulted in 8535 significant or suggestive predicted links between FRDs and chemicals, phenotypes, and diseases. Amongst these links, 32 were exact matches when compared with the logistic regression results, including comorbidities, medications, foods, and occupational exposures. DISCUSSION Mechanistic underpinnings of predicted links documented in the literature may support some of our findings. Our KG methods are useful for predicting possible associations in large, survey-based datasets with added information on directionality and magnitude of effect from logistic regression. These results should not be construed as causal but can support hypothesis generation. CONCLUSION This investigation enabled the generation of hypotheses on a variety of potential links between FRDs and exposures. Future investigations should prospectively evaluate the variables hypothesized to impact FRDs.
Collapse
Affiliation(s)
- Lauren E Chan
- Oregon State University, College of Public Health and Human Sciences, Corvallis, OR, USA.
| | - Elena Casiraghi
- AnacletoLab, Dipartimento di Informatica, Università degli Studi di Milano, Italy; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA; European Laboratory for Learning and Intelligent Systems, ELLIS
| | - Justin Reese
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Quaker E Harmon
- National Institute of Environmental Health Sciences, Epidemiology Branch, Durham, NC, USA
| | - Kevin Schaper
- University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Harshad Hegde
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Giorgio Valentini
- AnacletoLab, Dipartimento di Informatica, Università degli Studi di Milano, Italy; European Laboratory for Learning and Intelligent Systems, ELLIS
| | - Charles Schmitt
- National Institute of Environmental Health Sciences, Office of Data Science, Durham, NC, USA
| | - Alison Motsinger-Reif
- National Institute of Environmental Health Sciences, Biostatistics & Computational Biology Branch, Durham, NC, USA
| | - Janet E Hall
- National Institute of Environmental Health Sciences, Clinical Research Branch, Durham, NC, USA
| | - Christopher J Mungall
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Peter N Robinson
- European Laboratory for Learning and Intelligent Systems, ELLIS; The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT, USA
| | - Melissa A Haendel
- University of North Carolina, Dept. of Genetics, Chapel Hill, NC, USA
| |
Collapse
|
4
|
Khalili H, Wimmer MA. Towards Improved XAI-Based Epidemiological Research into the Next Potential Pandemic. Life (Basel) 2024; 14:783. [PMID: 39063538 PMCID: PMC11278356 DOI: 10.3390/life14070783] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2024] [Revised: 06/16/2024] [Accepted: 06/19/2024] [Indexed: 07/28/2024] Open
Abstract
By applying AI techniques to a variety of pandemic-relevant data, artificial intelligence (AI) has substantially supported the control of the spread of the SARS-CoV-2 virus. Along with this, epidemiological machine learning studies of SARS-CoV-2 have been frequently published. While these models can be perceived as precise and policy-relevant to guide governments towards optimal containment policies, their black box nature can hamper building trust and relying confidently on the prescriptions proposed. This paper focuses on interpretable AI-based epidemiological models in the context of the recent SARS-CoV-2 pandemic. We systematically review existing studies, which jointly incorporate AI, SARS-CoV-2 epidemiology, and explainable AI approaches (XAI). First, we propose a conceptual framework by synthesizing the main methodological features of the existing AI pipelines of SARS-CoV-2. Upon the proposed conceptual framework and by analyzing the selected epidemiological studies, we reflect on current research gaps in epidemiological AI toolboxes and how to fill these gaps to generate enhanced policy support in the next potential pandemic.
Collapse
Affiliation(s)
- Hamed Khalili
- Research Group E-Government, Faculty of Computer Science, University of Koblenz, D-56070 Koblenz, Germany;
| | | |
Collapse
|
5
|
Tong L, Shi W, Isgut M, Zhong Y, Lais P, Gloster L, Sun J, Swain A, Giuste F, Wang MD. Integrating Multi-Omics Data With EHR for Precision Medicine Using Advanced Artificial Intelligence. IEEE Rev Biomed Eng 2024; 17:80-97. [PMID: 37824325 DOI: 10.1109/rbme.2023.3324264] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2023]
Abstract
With the recent advancement of novel biomedical technologies such as high-throughput sequencing and wearable devices, multi-modal biomedical data ranging from multi-omics molecular data to real-time continuous bio-signals are generated at an unprecedented speed and scale every day. For the first time, these multi-modal biomedical data are able to make precision medicine close to a reality. However, due to data volume and the complexity, making good use of these multi-modal biomedical data requires major effort. Researchers and clinicians are actively developing artificial intelligence (AI) approaches for data-driven knowledge discovery and causal inference using a variety of biomedical data modalities. These AI-based approaches have demonstrated promising results in various biomedical and healthcare applications. In this review paper, we summarize the state-of-the-art AI models for integrating multi-omics data and electronic health records (EHRs) for precision medicine. We discuss the challenges and opportunities in integrating multi-omics data with EHRs and future directions. We hope this review can inspire future research and developing in integrating multi-omics data with EHRs for precision medicine.
Collapse
|
6
|
Ferri P, Romero-Garcia N, Badenes R, Lora-Pablos D, Morales TG, Gómez de la Cámara A, García-Gómez JM, Sáez C. Extremely missing numerical data in Electronic Health Records for machine learning can be managed through simple imputation methods considering informative missingness: A comparative of solutions in a COVID-19 mortality case study. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2023; 242:107803. [PMID: 37703700 DOI: 10.1016/j.cmpb.2023.107803] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Revised: 08/28/2023] [Accepted: 09/05/2023] [Indexed: 09/15/2023]
Abstract
BACKGROUND AND OBJECTIVE Reusing Electronic Health Records (EHRs) for Machine Learning (ML) leads on many occasions to extremely incomplete and sparse tabular datasets, which can hinder the model development processes and limit their performance and generalization. In this study, we aimed to characterize the most effective data imputation techniques and ML models for dealing with highly missing numerical data in EHRs, in the case where only a very limited number of data are complete, as opposed to the usual case of having a reduced number of missing values. METHODS We used a case study including full blood count laboratory data, demographic and survival data in the context of COVID-19 hospital admissions and evaluated 30 processing pipelines combining imputation methods with ML classifiers. The imputation methods included missing mask, translation and encoding, mean imputation, k-nearest neighbors' imputation, Bayesian ridge regression imputation and generative adversarial imputation networks. The classifiers included k-nearest neighbors, logistic regression, random forest, gradient boosting and deep multilayer perceptron. RESULTS Our results suggest that in the presence of highly missing data, combining translation and encoding imputation-which considers informative missingness-with tree ensemble classifiers-random forest and gradient boosting-is a sensible choice when aiming to maximize performance, in terms of area under curve. CONCLUSIONS Based on our findings, we recommend the consideration of this imputer-classifier configuration when constructing models in the presence of extremely incomplete numerical data in EHR.
Collapse
Affiliation(s)
- Pablo Ferri
- Biomedical Data Science Lab, Instituto Universitario de Tecnologías de la Información y Comunicaciones, Universitat Politècnica de València, Camino de Vera s/n, Valencia 46022, Spain.
| | | | - Rafael Badenes
- Departament de Cirugia, Universitat de València, Spain; Instituto INCLIVA, Hospital Clínico Universitario de Valencia, Spain; Department Anesthesiology, Surgical-Trauma Intensive Care and Pain Clinic, Hospital Clínic Universitari, Valencia, Spain
| | - David Lora-Pablos
- Instituto de Investigación imas12, Hospital 12 de Octubre, Madrid, Spain; Facultad de Estudios Estadísticos, Universidad Complutense de Madrid, Spain
| | | | | | - Juan M García-Gómez
- Biomedical Data Science Lab, Instituto Universitario de Tecnologías de la Información y Comunicaciones, Universitat Politècnica de València, Camino de Vera s/n, Valencia 46022, Spain
| | - Carlos Sáez
- Biomedical Data Science Lab, Instituto Universitario de Tecnologías de la Información y Comunicaciones, Universitat Politècnica de València, Camino de Vera s/n, Valencia 46022, Spain
| |
Collapse
|
7
|
Champendal M, Müller H, Prior JO, Dos Reis CS. A scoping review of interpretability and explainability concerning artificial intelligence methods in medical imaging. Eur J Radiol 2023; 169:111159. [PMID: 37976760 DOI: 10.1016/j.ejrad.2023.111159] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Revised: 09/26/2023] [Accepted: 10/19/2023] [Indexed: 11/19/2023]
Abstract
PURPOSE To review eXplainable Artificial Intelligence/(XAI) methods available for medical imaging/(MI). METHOD A scoping review was conducted following the Joanna Briggs Institute's methodology. The search was performed on Pubmed, Embase, Cinhal, Web of Science, BioRxiv, MedRxiv, and Google Scholar. Studies published in French and English after 2017 were included. Keyword combinations and descriptors related to explainability, and MI modalities were employed. Two independent reviewers screened abstracts, titles and full text, resolving differences through discussion. RESULTS 228 studies met the criteria. XAI publications are increasing, targeting MRI (n = 73), radiography (n = 47), CT (n = 46). Lung (n = 82) and brain (n = 74) pathologies, Covid-19 (n = 48), Alzheimer's disease (n = 25), brain tumors (n = 15) are the main pathologies explained. Explanations are presented visually (n = 186), numerically (n = 67), rule-based (n = 11), textually (n = 11), and example-based (n = 6). Commonly explained tasks include classification (n = 89), prediction (n = 47), diagnosis (n = 39), detection (n = 29), segmentation (n = 13), and image quality improvement (n = 6). The most frequently provided explanations were local (78.1 %), 5.7 % were global, and 16.2 % combined both local and global approaches. Post-hoc approaches were predominantly employed. The used terminology varied, sometimes indistinctively using explainable (n = 207), interpretable (n = 187), understandable (n = 112), transparent (n = 61), reliable (n = 31), and intelligible (n = 3). CONCLUSION The number of XAI publications in medical imaging is increasing, primarily focusing on applying XAI techniques to MRI, CT, and radiography for classifying and predicting lung and brain pathologies. Visual and numerical output formats are predominantly used. Terminology standardisation remains a challenge, as terms like "explainable" and "interpretable" are sometimes being used indistinctively. Future XAI development should consider user needs and perspectives.
Collapse
Affiliation(s)
- Mélanie Champendal
- School of Health Sciences HESAV, HES-SO, University of Applied Sciences Western Switzerland, Lausanne, CH, Switzerland; Faculty of Biology and Medicine, University of Lausanne, Lausanne, CH, Switzerland.
| | - Henning Müller
- Informatics Institute, University of Applied Sciences Western Switzerland (HES-SO Valais) Sierre, CH, Switzerland; Medical faculty, University of Geneva, CH, Switzerland.
| | - John O Prior
- Faculty of Biology and Medicine, University of Lausanne, Lausanne, CH, Switzerland; Nuclear Medicine and Molecular Imaging Department, Lausanne University Hospital (CHUV), Lausanne, CH, Switzerland.
| | - Cláudia Sá Dos Reis
- School of Health Sciences HESAV, HES-SO, University of Applied Sciences Western Switzerland, Lausanne, CH, Switzerland.
| |
Collapse
|
8
|
Hong D, Chang H, He X, Zhan Y, Tong R, Wu X, Li G. Construction of an Early Alert System for Intradialytic Hypotension before Initiating Hemodialysis Based on Machine Learning. KIDNEY DISEASES (BASEL, SWITZERLAND) 2023; 9:433-442. [PMID: 37901708 PMCID: PMC10601920 DOI: 10.1159/000531619] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Accepted: 06/05/2023] [Indexed: 10/31/2023]
Abstract
Introduction Intradialytic hypotension (IDH) is prevalent and associated with high hospitalization and mortality rates. The purpose of this study was to explore the risk factors for IDH and use artificial intelligence to establish an early alert system before hemodialysis sessions to identify patients at high risk of IDH. Materials and Methods We obtained data on 314,534 hemodialysis sessions conducted at Sichuan Provincial People's Hospital from the renal disease treatment information system. IDH was defined as a systolic blood pressure drop ≥20 mm Hg, a mean arterial pressure drop ≥10 mm Hg during dialysis, or the occurrence of clinical hypotensive events requiring nursing intervention. After pre-processing, the data were randomly divided into training (80%) and testing (20%) sets. Four interpolation methods, three feature selection methods, and 18 machine learning algorithms were used to construct predictive models. The area under the receiver operating characteristic curve (AUC) was the main indicator for evaluating the performance of the models, while Shapley Additive ExPlanation was used to explain the contribution of each variable to the best predictive model. Results A total of 3,906 patients and 314,534 dialysis sessions were included, of which 142,237 cases showed IDH (incidence rate, 45.2%). Nineteen parameters were identified through artificial intelligence feature screening. They included age, pre-dialysis weight, dry weight, pre-dialysis blood pressure, heart rate, prescribed ultrafiltration, blood cell counts (neutrophil, lymphocyte, monocyte, eosinophil, lymphocyte, and platelet counts), hematocrit, serum calcium, creatinine, urea, glucose, and uric acid. Random forest, gradient boosting, and logistic regression were the three best models, and the AUCs were 0.812 (95% confidence interval [CI], 0.811-0.813), 0.748 (95% CI, 0.747-0.749), and 0.743 (95% CI, 0.742-0.744), respectively. Conclusion Our dialysis software-based artificial intelligence alert system can be used to predict IDH occurrence, enabling the initiation of relevant interventions.
Collapse
Affiliation(s)
- Daqing Hong
- Department of Nephrology, Sichuan Provincial People’s Hospital, University of Electronic Science and Technology of China, Chengdu, China
| | - Huan Chang
- Department of Pharmacy, Sichuan Provincial People’s Hospital, University of Electronic Science and Technology of China, Chengdu, China
| | - Xin He
- Department of Nephrology, Sichuan Provincial People’s Hospital, University of Electronic Science and Technology of China, Chengdu, China
- Department of Nephrology, Affiliated Hospital of Southwest Medical University, Luzhou, China
| | - Ya Zhan
- Department of Nephrology, Sichuan Provincial People’s Hospital, University of Electronic Science and Technology of China, Chengdu, China
- Department of Nephrology, Affiliated Hospital of Southwest Medical University, Luzhou, China
| | - Rongsheng Tong
- Department of Pharmacy, Sichuan Provincial People’s Hospital, University of Electronic Science and Technology of China, Chengdu, China
| | - Xingwei Wu
- Department of Pharmacy, Sichuan Provincial People’s Hospital, University of Electronic Science and Technology of China, Chengdu, China
| | - Guisen Li
- Department of Nephrology, Sichuan Provincial People’s Hospital, University of Electronic Science and Technology of China, Chengdu, China
| |
Collapse
|
9
|
McFadden BR, Reynolds M, Inglis TJJ. Developing machine learning systems worthy of trust for infection science: a requirement for future implementation into clinical practice. Front Digit Health 2023; 5:1260602. [PMID: 37829595 PMCID: PMC10565494 DOI: 10.3389/fdgth.2023.1260602] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 09/15/2023] [Indexed: 10/14/2023] Open
Abstract
Infection science is a discipline of healthcare which includes clinical microbiology, public health microbiology, mechanisms of microbial disease, and antimicrobial countermeasures. The importance of infection science has become more apparent in recent years during the SARS-CoV-2 (COVID-19) pandemic and subsequent highlighting of critical operational domains within infection science including the hospital, clinical laboratory, and public health environments to prevent, manage, and treat infectious diseases. However, as the global community transitions beyond the pandemic, the importance of infection science remains, with emerging infectious diseases, bloodstream infections, sepsis, and antimicrobial resistance becoming increasingly significant contributions to the burden of global disease. Machine learning (ML) is frequently applied in healthcare and medical domains, with growing interest in the application of ML techniques to problems in infection science. This has the potential to address several key aspects including improving patient outcomes, optimising workflows in the clinical laboratory, and supporting the management of public health. However, despite promising results, the implementation of ML into clinical practice and workflows is limited. Enabling the migration of ML models from the research to real world environment requires the development of trustworthy ML systems that support the requirements of users, stakeholders, and regulatory agencies. This paper will provide readers with a brief introduction to infection science, outline the principles of trustworthy ML systems, provide examples of the application of these principles in infection science, and propose future directions for moving towards the development of trustworthy ML systems in infection science.
Collapse
Affiliation(s)
- Benjamin R. McFadden
- School of Physics, Mathematics and Computing, University of Western Australia, Perth, WA, Australia
| | - Mark Reynolds
- School of Physics, Mathematics and Computing, University of Western Australia, Perth, WA, Australia
| | - Timothy J. J. Inglis
- Western Australian Country Health Service, Perth, WA, Australia
- School of Medicine, University of Western Australia, Perth, WA, Australia
- Department of Microbiology, Pathwest Laboratory Medicine, Perth, WA, Australia
| |
Collapse
|
10
|
Chan LE, Casiraghi E, Putman T, Reese J, Harmon QE, Schaper K, Hedge H, Valentini G, Schmitt C, Motsinger-Reif A, Hall JE, Mungall CJ, Robinson PN, Haendel MA. Predicting nutrition and environmental factors associated with female reproductive disorders using a knowledge graph and random forests. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.07.14.23292679. [PMID: 37502882 PMCID: PMC10371183 DOI: 10.1101/2023.07.14.23292679] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Objective Female reproductive disorders (FRDs) are common health conditions that may present with significant symptoms. Diet and environment are potential areas for FRD interventions. We utilized a knowledge graph (KG) method to predict factors associated with common FRDs (e.g., endometriosis, ovarian cyst, and uterine fibroids). Materials and Methods We harmonized survey data from the Personalized Environment and Genes Study on internal and external environmental exposures and health conditions with biomedical ontology content. We merged the harmonized data and ontologies with supplemental nutrient and agricultural chemical data to create a KG. We analyzed the KG by embedding edges and applying a random forest for edge prediction to identify variables potentially associated with FRDs. We also conducted logistic regression analysis for comparison. Results Across 9765 PEGS respondents, the KG analysis resulted in 8535 significant predicted links between FRDs and chemicals, phenotypes, and diseases. Amongst these links, 32 were exact matches when compared with the logistic regression results, including comorbidities, medications, foods, and occupational exposures. Discussion Mechanistic underpinnings of predicted links documented in the literature may support some of our findings. Our KG methods are useful for predicting possible associations in large, survey-based datasets with added information on directionality and magnitude of effect from logistic regression. These results should not be construed as causal, but can support hypothesis generation. Conclusion This investigation enabled the generation of hypotheses on a variety of potential links between FRDs and exposures. Future investigations should prospectively evaluate the variables hypothesized to impact FRDs.
Collapse
Affiliation(s)
- Lauren E Chan
- Oregon State University, College of Public Health and Human Sciences, Corvallis, OR, USA
| | - Elena Casiraghi
- AnacletoLab, Dipartimento di Informatica, Università degli Studi di Milano, Italy
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Tim Putman
- University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Justin Reese
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Quaker E Harmon
- National Institute of Environmental Health Sciences, Epidemiology Branch, Durham, NC, USA
| | - Kevin Schaper
- University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Harshad Hedge
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Giorgio Valentini
- AnacletoLab, Dipartimento di Informatica, Università degli Studi di Milano, Italy
| | - Charles Schmitt
- National Institute of Environmental Health Sciences, Office of Data Science, Durham, NC, USA
| | - Alison Motsinger-Reif
- National Institute of Environmental Health Sciences, Biostatistics & Computational Biology Branch, Durham, NC, USA
| | - Janet E Hall
- National Institute of Environmental Health Sciences, Clinical Research Branch, Durham, NC, USA
| | - Christopher J Mungall
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Peter N Robinson
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT, USA
| | | |
Collapse
|
11
|
Abbasi Habashi S, Koyuncu M, Alizadehsani R. A Survey of COVID-19 Diagnosis Using Routine Blood Tests with the Aid of Artificial Intelligence Techniques. Diagnostics (Basel) 2023; 13:1749. [PMID: 37238232 PMCID: PMC10217633 DOI: 10.3390/diagnostics13101749] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Revised: 04/19/2023] [Accepted: 04/29/2023] [Indexed: 05/28/2023] Open
Abstract
Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), causing a disease called COVID-19, is a class of acute respiratory syndrome that has considerably affected the global economy and healthcare system. This virus is diagnosed using a traditional technique known as the Reverse Transcription Polymerase Chain Reaction (RT-PCR) test. However, RT-PCR customarily outputs a lot of false-negative and incorrect results. Current works indicate that COVID-19 can also be diagnosed using imaging resolutions, including CT scans, X-rays, and blood tests. Nevertheless, X-rays and CT scans cannot always be used for patient screening because of high costs, radiation doses, and an insufficient number of devices. Therefore, there is a requirement for a less expensive and faster diagnostic model to recognize the positive and negative cases of COVID-19. Blood tests are easily performed and cost less than RT-PCR and imaging tests. Since biochemical parameters in routine blood tests vary during the COVID-19 infection, they may supply physicians with exact information about the diagnosis of COVID-19. This study reviewed some newly emerging artificial intelligence (AI)-based methods to diagnose COVID-19 using routine blood tests. We gathered information about research resources and inspected 92 articles that were carefully chosen from a variety of publishers, such as IEEE, Springer, Elsevier, and MDPI. Then, these 92 studies are classified into two tables which contain articles that use machine Learning and deep Learning models to diagnose COVID-19 while using routine blood test datasets. In these studies, for diagnosing COVID-19, Random Forest and logistic regression are the most widely used machine learning methods and the most widely used performance metrics are accuracy, sensitivity, specificity, and AUC. Finally, we conclude by discussing and analyzing these studies which use machine learning and deep learning models and routine blood test datasets for COVID-19 detection. This survey can be the starting point for a novice-/beginner-level researcher to perform on COVID-19 classification.
Collapse
Affiliation(s)
| | - Murat Koyuncu
- Department of Information Systems Engineering, Atilim University, 06830 Ankara, Turkey;
| | - Roohallah Alizadehsani
- Institute for Intelligent Systems Research and Innovation (IISRI), Deakin University, Waurn Ponds, Geelong, VIC 3216, Australia
| |
Collapse
|
12
|
Ortiz-Barrios M, Arias-Fonseca S, Ishizaka A, Barbati M, Avendaño-Collante B, Navarro-Jiménez E. Artificial intelligence and discrete-event simulation for capacity management of intensive care units during the Covid-19 pandemic: A case study. JOURNAL OF BUSINESS RESEARCH 2023; 160:113806. [PMID: 36895308 PMCID: PMC9981538 DOI: 10.1016/j.jbusres.2023.113806] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Revised: 01/18/2023] [Accepted: 02/23/2023] [Indexed: 06/18/2023]
Abstract
The Covid-19 pandemic has pushed the Intensive Care Units (ICUs) into significant operational disruptions. The rapid evolution of this disease, the bed capacity constraints, the wide variety of patient profiles, and the imbalances within health supply chains still represent a challenge for policymakers. This paper aims to use Artificial Intelligence (AI) and Discrete-Event Simulation (DES) to support ICU bed capacity management during Covid-19. The proposed approach was validated in a Spanish hospital chain where we initially identified the predictors of ICU admission in Covid-19 patients. Second, we applied Random Forest (RF) to predict ICU admission likelihood using patient data collected in the Emergency Department (ED). Finally, we included the RF outcomes in a DES model to assist decision-makers in evaluating new ICU bed configurations responding to the patient transfer expected from downstream services. The results evidenced that the median bed waiting time declined between 32.42 and 48.03 min after intervention.
Collapse
Affiliation(s)
- Miguel Ortiz-Barrios
- Department of Productivity and Innovation, Universidad de la Costa CUC, Barranquilla 080002, Colombia
| | - Sebastián Arias-Fonseca
- Department of Productivity and Innovation, Universidad de la Costa CUC, Barranquilla 080002, Colombia
| | - Alessio Ishizaka
- NEOMA Business School, 1 rue du Maréchal Juin, Mont-Saint-Aignan 76130, France
| | - Maria Barbati
- Department of Economics, University Ca' Foscari, Cannaregio 873, Fondamenta San Giobbe, 30121 Venice, Italy
| | | | | |
Collapse
|
13
|
Casiraghi E, Wong R, Hall M, Coleman B, Notaro M, Evans MD, Tronieri JS, Blau H, Laraway B, Callahan TJ, Chan LE, Bramante CT, Buse JB, Moffitt RA, Stürmer T, Johnson SG, Raymond Shao Y, Reese J, Robinson PN, Paccanaro A, Valentini G, Huling JD, Wilkins KJ. A method for comparing multiple imputation techniques: A case study on the U.S. national COVID cohort collaborative. J Biomed Inform 2023; 139:104295. [PMID: 36716983 PMCID: PMC10683778 DOI: 10.1016/j.jbi.2023.104295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2022] [Revised: 01/16/2023] [Accepted: 01/21/2023] [Indexed: 02/01/2023]
Abstract
Healthcare datasets obtained from Electronic Health Records have proven to be extremely useful for assessing associations between patients' predictors and outcomes of interest. However, these datasets often suffer from missing values in a high proportion of cases, whose removal may introduce severe bias. Several multiple imputation algorithms have been proposed to attempt to recover the missing information under an assumed missingness mechanism. Each algorithm presents strengths and weaknesses, and there is currently no consensus on which multiple imputation algorithm works best in a given scenario. Furthermore, the selection of each algorithm's parameters and data-related modeling choices are also both crucial and challenging. In this paper we propose a novel framework to numerically evaluate strategies for handling missing data in the context of statistical analysis, with a particular focus on multiple imputation techniques. We demonstrate the feasibility of our approach on a large cohort of type-2 diabetes patients provided by the National COVID Cohort Collaborative (N3C) Enclave, where we explored the influence of various patient characteristics on outcomes related to COVID-19. Our analysis included classic multiple imputation techniques as well as simple complete-case Inverse Probability Weighted models. Extensive experiments show that our approach can effectively highlight the most promising and performant missing-data handling strategy for our case study. Moreover, our methodology allowed a better understanding of the behavior of the different models and of how it changed as we modified their parameters. Our method is general and can be applied to different research fields and on datasets containing heterogeneous types.
Collapse
Affiliation(s)
- Elena Casiraghi
- AnacletoLab, Department of Computer Science "Giovanni degli Antoni", Università degli Studi di Milano, Milan, Italy; CINI, Infolife National Laboratory, Roma, Italy; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Rachel Wong
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, NY, USA
| | - Margaret Hall
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, NY, USA
| | - Ben Coleman
- The Jackson Laboratory for Genomic Medicine, Farmington, USA; Institute for Systems Genomics, University of Connecticut, Farmington, CT, USA
| | - Marco Notaro
- AnacletoLab, Department of Computer Science "Giovanni degli Antoni", Università degli Studi di Milano, Milan, Italy; CINI, Infolife National Laboratory, Roma, Italy
| | - Michael D Evans
- Biostatistical Design and Analysis Center, Clinical and Translational Science Institute, University of Minnesota, Minneapolis, MN, USA
| | - Jena S Tronieri
- Department of Psychiatry, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA, USA
| | - Hannah Blau
- The Jackson Laboratory for Genomic Medicine, Farmington, USA
| | - Bryan Laraway
- University of Colorado, Anschutz Medical Campus, Aurora, CO, USA
| | | | - Lauren E Chan
- College of Public Health and Human Sciences, Oregon State University, Corvallis, USA
| | - Carolyn T Bramante
- Division of General Internal Medicine, University of Minnesota, Minneapolis, MN, USA
| | - John B Buse
- NC Translational and Clinical Sciences Institute, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA; Division of Endocrinology, Department of Medicine, University of North Carolina School of Medicine, USA
| | - Richard A Moffitt
- Department of Biomedical Informatics, Stony Brook University, Stony Brook, NY, USA
| | - Til Stürmer
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Steven G Johnson
- Institute for Health Informatics, University of Minnesota, Minneapolis, MN, USA
| | - Yu Raymond Shao
- Harvard-MIT Division of Health Sciences and Technology (HST), 260 Longwood Ave, Boston, USA; Department of Radiation Oncology, UT Southwestern Medical Center, Dallas, USA
| | - Justin Reese
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Peter N Robinson
- The Jackson Laboratory for Genomic Medicine, Farmington, USA; Institute for Systems Genomics, University of Connecticut, Farmington, CT, USA
| | - Alberto Paccanaro
- School of Applied Mathematics (EMAp), Fundação Getúlio Vargas, Rio de Janeiro, Brazil; Department of Computer Science, Royal Holloway, University of London, Egham, UK
| | - Giorgio Valentini
- AnacletoLab, Department of Computer Science "Giovanni degli Antoni", Università degli Studi di Milano, Milan, Italy; CINI, Infolife National Laboratory, Roma, Italy
| | - Jared D Huling
- Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, MN, USA
| | - Kenneth J Wilkins
- Biostatistics Program, Office of the Director, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
14
|
Zheng HL, An SY, Qiao BJ, Guan P, Huang DS, Wu W. A data-driven interpretable ensemble framework based on tree models for forecasting the occurrence of COVID-19 in the USA. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:13648-13659. [PMID: 36131178 PMCID: PMC9492466 DOI: 10.1007/s11356-022-23132-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/12/2022] [Accepted: 09/16/2022] [Indexed: 06/15/2023]
Abstract
This prevalence of coronavirus disease 2019 (COVID-19) has become one of the most serious public health crises. Tree-based machine learning methods, with the advantages of high efficiency, and strong interpretability, have been widely used in predicting diseases. A data-driven interpretable ensemble framework based on tree models was designed to forecast daily new cases of COVID-19 in the USA and to determine the important factors related to COVID-19. Based on a hyperparametric optimization technique, we developed three machine learning algorithms based on decision trees, including random forest (RF), eXtreme Gradient Boosting (XGBoost), and Light Gradient Boosting Machine (LightGBM), and three linear ensemble models were used to integrate these outcomes for better prediction accuracy. Finally, the SHapley Additive explanation (SHAP) value was used to obtain the feature importance ranking. Our outcomes demonstrated that, among the three basic machine learners, the prediction accuracy was the following in descending order: LightGBM, XGBoost, and RF. The optimized LAD ensemble was the most precise prediction model that reduced the prediction error of the best base learner (LightGBM) by approximately 3.111%, while vaccination, wearing masks, less mobility, and government interventions had positive effects on the control and prevention of COVID-19.
Collapse
Affiliation(s)
- Hu-Li Zheng
- Department of Epidemiology, School of Public Health, China Medical University, No. 77 Puhe Road, Shenyang, Liaoning Province China
| | - Shu-Yi An
- Liaoning Provincial Center for Disease Control and Prevention, Shenyang, Liaoning China
| | - Bao-Jun Qiao
- Liaoning Provincial Center for Disease Control and Prevention, Shenyang, Liaoning China
| | - Peng Guan
- Department of Epidemiology, School of Public Health, China Medical University, No. 77 Puhe Road, Shenyang, Liaoning Province China
| | - De-Sheng Huang
- Department of Mathematics, School of Intelligent Medicine, China Medical University, Shenyang, Liaoning China
| | - Wei Wu
- Department of Epidemiology, School of Public Health, China Medical University, No. 77 Puhe Road, Shenyang, Liaoning Province China
| |
Collapse
|
15
|
Cappelletti L, Petrini A, Gliozzo J, Casiraghi E, Schubach M, Kircher M, Valentini G. Boosting tissue-specific prediction of active cis-regulatory regions through deep learning and Bayesian optimization techniques. BMC Bioinformatics 2022; 23:154. [PMID: 36510125 PMCID: PMC9743524 DOI: 10.1186/s12859-022-04582-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Accepted: 01/20/2022] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND Cis-regulatory regions (CRRs) are non-coding regions of the DNA that fine control the spatio-temporal pattern of transcription; they are involved in a wide range of pivotal processes such as the development of specific cell-lines/tissues and the dynamic cell response to physiological stimuli. Recent studies showed that genetic variants occurring in CRRs are strongly correlated with pathogenicity or deleteriousness. Considering the central role of CRRs in the regulation of physiological and pathological conditions, the correct identification of CRRs and of their tissue-specific activity status through Machine Learning methods plays a major role in dissecting the impact of genetic variants on human diseases. Unfortunately, the problem is still open, though some promising results have been already reported by (deep) machine-learning based methods that predict active promoters and enhancers in specific tissues or cell lines by encoding epigenetic or spectral features directly extracted from DNA sequences. RESULTS We present the experiments we performed to compare two Deep Neural Networks, a Feed-Forward Neural Network model working on epigenomic features, and a Convolutional Neural Network model working only on genomic sequence, targeted to the identification of enhancer- and promoter-activity in specific cell lines. While performing experiments to understand how the experimental setup influences the prediction performance of the methods, we particularly focused on (1) automatic model selection performed by Bayesian optimization and (2) exploring different data rebalancing setups for reducing negative unbalancing effects. CONCLUSIONS Results show that (1) automatic model selection by Bayesian optimization improves the quality of the learner; (2) data rebalancing considerably impacts the prediction performance of the models; test set rebalancing may provide over-optimistic results, and should therefore be cautiously applied; (3) despite working on sequence data, convolutional models obtain performance close to those of feed forward models working on epigenomic information, which suggests that also sequence data carries informative content for CRR-activity prediction. We therefore suggest combining both models/data types in future works.
Collapse
Affiliation(s)
- Luca Cappelletti
- grid.4708.b0000 0004 1757 2822AnacletoLab, Dipartimento di Informatica, Università degli Studi di Milano, Milan, Italy
| | - Alessandro Petrini
- grid.4708.b0000 0004 1757 2822AnacletoLab, Dipartimento di Informatica, Università degli Studi di Milano, Milan, Italy
| | - Jessica Gliozzo
- grid.4708.b0000 0004 1757 2822AnacletoLab, Dipartimento di Informatica, Università degli Studi di Milano, Milan, Italy
| | - Elena Casiraghi
- grid.4708.b0000 0004 1757 2822AnacletoLab, Dipartimento di Informatica, Università degli Studi di Milano, Milan, Italy
| | - Max Schubach
- grid.6363.00000 0001 2218 4662Berlin Institute of Health at Charité, Universitätsmedizin Berlin, Berlin, Germany
| | - Martin Kircher
- grid.6363.00000 0001 2218 4662Berlin Institute of Health at Charité, Universitätsmedizin Berlin, Berlin, Germany
| | - Giorgio Valentini
- grid.4708.b0000 0004 1757 2822AnacletoLab, Dipartimento di Informatica, Università degli Studi di Milano, Milan, Italy ,European Laboratory for Learning and Intelligent Systems (ELLIS), Berlin, Germany ,CINI National Laboratory of Artificial Intelligence and Intelligent Systems (AIIS), Rome, Italy ,grid.4708.b0000 0004 1757 2822Data Science Research Center, Università degli Studi di Milano, Milan, Italy
| |
Collapse
|
16
|
Chung J, Kim D, Choi J, Yune S, Song KD, Kim S, Chua M, Succi MD, Conklin J, Longo MGF, Ackman JB, Petranovic M, Lev MH, Do S. Prediction of oxygen requirement in patients with COVID-19 using a pre-trained chest radiograph xAI model: efficient development of auditable risk prediction models via a fine-tuning approach. Sci Rep 2022; 12:21164. [PMID: 36476724 PMCID: PMC9729627 DOI: 10.1038/s41598-022-24721-5] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2022] [Accepted: 11/18/2022] [Indexed: 12/12/2022] Open
Abstract
Risk prediction requires comprehensive integration of clinical information and concurrent radiological findings. We present an upgraded chest radiograph (CXR) explainable artificial intelligence (xAI) model, which was trained on 241,723 well-annotated CXRs obtained prior to the onset of the COVID-19 pandemic. Mean area under the receiver operating characteristic curve (AUROC) for detection of 20 radiographic features was 0.955 (95% CI 0.938-0.955) on PA view and 0.909 (95% CI 0.890-0.925) on AP view. Coexistent and correlated radiographic findings are displayed in an interpretation table, and calibrated classifier confidence is displayed on an AI scoreboard. Retrieval of similar feature patches and comparable CXRs from a Model-Derived Atlas provides justification for model predictions. To demonstrate the feasibility of a fine-tuning approach for efficient and scalable development of xAI risk prediction models, we applied our CXR xAI model, in combination with clinical information, to predict oxygen requirement in COVID-19 patients. Prediction accuracy for high flow oxygen (HFO) and mechanical ventilation (MV) was 0.953 and 0.934 at 24 h and 0.932 and 0.836 at 72 h from the time of emergency department (ED) admission, respectively. Our CXR xAI model is auditable and captures key pathophysiological manifestations of cardiorespiratory diseases and cardiothoracic comorbidities. This model can be efficiently and broadly applied via a fine-tuning approach to provide fully automated risk and outcome predictions in various clinical scenarios in real-world practice.
Collapse
Affiliation(s)
- Joowon Chung
- Department of Radiology, Massachusetts General Brigham and Harvard Medical School, Boston, MA, USA
| | - Doyun Kim
- Department of Radiology, Massachusetts General Brigham and Harvard Medical School, Boston, MA, USA
| | - Jongmun Choi
- Department of Radiology, Massachusetts General Brigham and Harvard Medical School, Boston, MA, USA
| | - Sehyo Yune
- Department of Radiology, Massachusetts General Brigham and Harvard Medical School, Boston, MA, USA
| | - Kyoung Doo Song
- Department of Radiology, Massachusetts General Brigham and Harvard Medical School, Boston, MA, USA
- Department of Radiology, Samsung Medical Center, Sungkyunkwan University School of Medicine, 81 Irwon-Ro, Gangnam-Gu, Seoul, 06351, Republic of Korea
| | - Seonkyoung Kim
- Department of Radiology, Massachusetts General Brigham and Harvard Medical School, Boston, MA, USA
| | - Michelle Chua
- Department of Radiology, Massachusetts General Brigham and Harvard Medical School, Boston, MA, USA
| | - Marc D Succi
- Department of Radiology, Massachusetts General Brigham and Harvard Medical School, Boston, MA, USA
| | - John Conklin
- Department of Radiology, Massachusetts General Brigham and Harvard Medical School, Boston, MA, USA
| | - Maria G Figueiro Longo
- Department of Radiology, Massachusetts General Brigham and Harvard Medical School, Boston, MA, USA
| | - Jeanne B Ackman
- Department of Radiology, Massachusetts General Brigham and Harvard Medical School, Boston, MA, USA
| | - Milena Petranovic
- Department of Radiology, Massachusetts General Brigham and Harvard Medical School, Boston, MA, USA
| | - Michael H Lev
- Department of Radiology, Massachusetts General Brigham and Harvard Medical School, Boston, MA, USA
| | - Synho Do
- Department of Radiology, Massachusetts General Brigham and Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
17
|
Wang DD, Li YF, Zhang C, He SM, Chen X. Predicting the effect of sirolimus on disease activity in patients with systemic lupus erythematosus using machine learning. J Clin Pharm Ther 2022; 47:1845-1850. [PMID: 36131617 DOI: 10.1111/jcpt.13778] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Revised: 08/03/2022] [Accepted: 09/04/2022] [Indexed: 11/30/2022]
Abstract
WHAT IS KNOWN AND OBJECTIVES The present study aimed to predict the effect of sirolimus on disease activity in patients with systemic lupus erythematosus (SLE) using machine learning and to recommend appropriate sirolimus dosage regimen for patients with SLE. METHODS The Emax model was selected for machine learning, where the evaluation indicator was the change rate of systemic lupus erythematosus disease activity index from baseline value. RESULTS A total 103 patients with SLE were included for modelling, where the Emax , ET50 were -53.9%, 1.53 months in the final model respectively, and the evaluation of the final model was good. Further simulation found that the follow-up time to achieve 25%, 50%, 75% and 80% (plateau) Emax of sirolimus effecting on disease activity in patients with SLE were 0.51, 1.53, 4.59 and 6.12 months, respectively. In addition, the sirolimus dosage was flexible and adjusted according to drug concentration, where the intersection of sirolimus concentration range included in this study was about 8-10 ng/ml. WHAT IS NEW AND CONCLUSIONS This study was the first time to predict the effect of sirolimus on disease activity in patients with SLE and in order to achieve better therapeutic effect maintaining a concentration of 8-10 ng/ml sirolimus for at least 6.12 months was necessary.
Collapse
Affiliation(s)
- Dong-Dong Wang
- Jiangsu Key Laboratory of New Drug Research and Clinical Pharmacy, School of Pharmacy, Xuzhou Medical University, Xuzhou, Jiangsu, China
| | - Ya-Feng Li
- Department of Pharmacy, Feng Xian People's Hospital, Xuzhou, Jiangsu, China
| | - Cun Zhang
- Department of Pharmacy, Xuzhou Oriental Hospital Affiliated to Xuzhou Medical University, Xuzhou, Jiangsu, China
| | - Su-Mei He
- Department of Pharmacy, Suzhou Science & Technology Town Hospital, Gusu School, Nanjing Medical University, Suzhou, Jiangsu, China
| | - Xiao Chen
- School of Nursing, Xuzhou Medical University, Xuzhou, Jiangsu, China
| |
Collapse
|
18
|
Delgado J, de Manuel A, Parra I, Moyano C, Rueda J, Guersenzvaig A, Ausin T, Cruz M, Casacuberta D, Puyol A. Bias in algorithms of AI systems developed for COVID-19: A scoping review. JOURNAL OF BIOETHICAL INQUIRY 2022; 19:407-419. [PMID: 35857214 PMCID: PMC9463236 DOI: 10.1007/s11673-022-10200-z] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2022] [Accepted: 05/06/2022] [Indexed: 06/15/2023]
Abstract
To analyze which ethically relevant biases have been identified by academic literature in artificial intelligence (AI) algorithms developed either for patient risk prediction and triage, or for contact tracing to deal with the COVID-19 pandemic. Additionally, to specifically investigate whether the role of social determinants of health (SDOH) have been considered in these AI developments or not. We conducted a scoping review of the literature, which covered publications from March 2020 to April 2021. Studies mentioning biases on AI algorithms developed for contact tracing and medical triage or risk prediction regarding COVID-19 were included. From 1054 identified articles, 20 studies were finally included. We propose a typology of biases identified in the literature based on bias, limitations and other ethical issues in both areas of analysis. Results on health disparities and SDOH were classified into five categories: racial disparities, biased data, socio-economic disparities, unequal accessibility and workforce, and information communication. SDOH needs to be considered in the clinical context, where they still seem underestimated. Epidemiological conditions depend on geographic location, so the use of local data in studies to develop international solutions may increase some biases. Gender bias was not specifically addressed in the articles included. The main biases are related to data collection and management. Ethical problems related to privacy, consent, and lack of regulation have been identified in contact tracing while some bias-related health inequalities have been highlighted. There is a need for further research focusing on SDOH and these specific AI apps.
Collapse
Affiliation(s)
- Janet Delgado
- Department of Philosophy 1, Faculty of Philosophy, University of Granada, Granada, Spain
| | - Alicia de Manuel
- Department of Philosophy, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Iris Parra
- Department of Philosophy, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Cristian Moyano
- Department of Philosophy, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Jon Rueda
- FiloLab Scientific Unit of Excellence of the University of Granada, Granada, Spain
| | | | - Txetxu Ausin
- Institute for Philosophy of the Spanish National Research Council (CSIC), Madrid, Spain
| | - Maite Cruz
- Andalusian School of Public Health (EASP), Granada, Spain
| | - David Casacuberta
- Department of Philosophy, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Angel Puyol
- Department of Philosophy, Universitat Autònoma de Barcelona, Barcelona, Spain
| |
Collapse
|
19
|
Teng Q, Liu Z, Song Y, Han K, Lu Y. A survey on the interpretability of deep learning in medical diagnosis. MULTIMEDIA SYSTEMS 2022; 28:2335-2355. [PMID: 35789785 PMCID: PMC9243744 DOI: 10.1007/s00530-022-00960-4] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Accepted: 05/29/2022] [Indexed: 06/15/2023]
Abstract
Deep learning has demonstrated remarkable performance in the medical domain, with accuracy that rivals or even exceeds that of human experts. However, it has a significant problem that these models are "black-box" structures, which means they are opaque, non-intuitive, and difficult for people to understand. This creates a barrier to the application of deep learning models in clinical practice due to lack of interpretability, trust, and transparency. To overcome this problem, several studies on interpretability have been proposed. Therefore, in this paper, we comprehensively review the interpretability of deep learning in medical diagnosis based on the current literature, including some common interpretability methods used in the medical domain, various applications with interpretability for disease diagnosis, prevalent evaluation metrics, and several disease datasets. In addition, the challenges of interpretability and future research directions are also discussed here. To the best of our knowledge, this is the first time that various applications of interpretability methods for disease diagnosis have been summarized.
Collapse
Affiliation(s)
- Qiaoying Teng
- School of Computer Science, Jilin Normal University, Siping, 136000 China
| | - Zhe Liu
- School of Computer Science and Communication Engineering, Jiangsu University, Zhenjiang, 212013 China
| | - Yuqing Song
- School of Computer Science and Communication Engineering, Jiangsu University, Zhenjiang, 212013 China
| | - Kai Han
- School of Computer Science and Communication Engineering, Jiangsu University, Zhenjiang, 212013 China
| | - Yang Lu
- School of Computer Science, Jilin Normal University, Siping, 136000 China
| |
Collapse
|
20
|
Gliozzo J, Mesiti M, Notaro M, Petrini A, Patak A, Puertas-Gallardo A, Paccanaro A, Valentini G, Casiraghi E. Heterogeneous data integration methods for patient similarity networks. Brief Bioinform 2022; 23:6604996. [PMID: 35679533 PMCID: PMC9294435 DOI: 10.1093/bib/bbac207] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2021] [Revised: 04/14/2022] [Accepted: 05/04/2022] [Indexed: 12/29/2022] Open
Abstract
Patient similarity networks (PSNs), where patients are represented as nodes and their similarities as weighted edges, are being increasingly used in clinical research. These networks provide an insightful summary of the relationships among patients and can be exploited by inductive or transductive learning algorithms for the prediction of patient outcome, phenotype and disease risk. PSNs can also be easily visualized, thus offering a natural way to inspect complex heterogeneous patient data and providing some level of explainability of the predictions obtained by machine learning algorithms. The advent of high-throughput technologies, enabling us to acquire high-dimensional views of the same patients (e.g. omics data, laboratory data, imaging data), calls for the development of data fusion techniques for PSNs in order to leverage this rich heterogeneous information. In this article, we review existing methods for integrating multiple biomedical data views to construct PSNs, together with the different patient similarity measures that have been proposed. We also review methods that have appeared in the machine learning literature but have not yet been applied to PSNs, thus providing a resource to navigate the vast machine learning literature existing on this topic. In particular, we focus on methods that could be used to integrate very heterogeneous datasets, including multi-omics data as well as data derived from clinical information and medical imaging.
Collapse
Affiliation(s)
- Jessica Gliozzo
- AnacletoLab - Computer Science Department, Universitá degli Studi di Milano, Via Celoria 18, 20135, Milan, Italy.,European Commission, Joint Research Centre (JRC), Ispra (VA), Italy.,CINI, Infolife National Laboratory, Roma, Italy
| | - Marco Mesiti
- AnacletoLab - Computer Science Department, Universitá degli Studi di Milano, Via Celoria 18, 20135, Milan, Italy.,CINI, Infolife National Laboratory, Roma, Italy
| | - Marco Notaro
- AnacletoLab - Computer Science Department, Universitá degli Studi di Milano, Via Celoria 18, 20135, Milan, Italy.,CINI, Infolife National Laboratory, Roma, Italy
| | - Alessandro Petrini
- AnacletoLab - Computer Science Department, Universitá degli Studi di Milano, Via Celoria 18, 20135, Milan, Italy.,CINI, Infolife National Laboratory, Roma, Italy
| | - Alex Patak
- European Commission, Joint Research Centre (JRC), Ispra (VA), Italy
| | | | - Alberto Paccanaro
- Department of Computer Science, Royal Holloway, University of London, Egham, TW20 0EX UK.,School of Applied Mathematics (EMAp), Fundação Getúlio Vargas, Rio de Janeiro Brazil
| | - Giorgio Valentini
- AnacletoLab - Computer Science Department, Universitá degli Studi di Milano, Via Celoria 18, 20135, Milan, Italy.,CINI, Infolife National Laboratory, Roma, Italy.,DSRC UNIMI, Data Science Research Center, Milano, 20135, Italy.,ELLIS, European Laboratory for Learning and Intelligent Systems, Berlin, Germany
| | - Elena Casiraghi
- AnacletoLab - Computer Science Department, Universitá degli Studi di Milano, Via Celoria 18, 20135, Milan, Italy.,CINI, Infolife National Laboratory, Roma, Italy
| |
Collapse
|
21
|
Comito C, Pizzuti C. Artificial intelligence for forecasting and diagnosing COVID-19 pandemic: A focused review. Artif Intell Med 2022; 128:102286. [PMID: 35534142 PMCID: PMC8958821 DOI: 10.1016/j.artmed.2022.102286] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2021] [Revised: 03/16/2022] [Accepted: 03/21/2022] [Indexed: 02/05/2023]
Abstract
The outbreak of novel corona virus 2019 (COVID-19) has been treated as a public health crisis of global concern by the World Health Organization (WHO). COVID-19 pandemic hugely affected countries worldwide raising the need to exploit novel, alternative and emerging technologies to respond to the emergency created by the weak health-care systems. In this context, Artificial Intelligence (AI) techniques can give a valid support to public health authorities, complementing traditional approaches with advanced tools. This study provides a comprehensive review of methods, algorithms, applications, and emerging AI technologies that can be utilized for forecasting and diagnosing COVID-19. The main objectives of this review are summarized as follows. (i) Understanding the importance of AI approaches such as machine learning and deep learning for COVID-19 pandemic; (ii) discussing the efficiency and impact of these methods for COVID-19 forecasting and diagnosing; (iii) providing an extensive background description of AI techniques to help non-expert to better catch the underlying concepts; (iv) for each work surveyed, give a detailed analysis of the rationale behind the approach, highlighting the method used, the type and size of data analyzed, the validation method, the target application and the results achieved; (v) focusing on some future challenges in COVID-19 forecasting and diagnosing.
Collapse
Affiliation(s)
- Carmela Comito
- National Research Council of Italy (CNR), Institute for High Performance Computing and Networking (ICAR), Rende, Italy.
| | - Clara Pizzuti
- National Research Council of Italy (CNR), Institute for High Performance Computing and Networking (ICAR), Rende, Italy.
| |
Collapse
|
22
|
Subash Chandra Bose S, Vinoth Kumar A, Premkumar A, Deepika M, Gokilavani M. Biserial targeted feature projection based radial kernel regressive deep belief neural learning for covid-19 prediction. Soft comput 2022; 27:1651-1662. [PMID: 35378723 PMCID: PMC8968782 DOI: 10.1007/s00500-022-06943-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/21/2022] [Indexed: 01/31/2023]
Abstract
Coronavirus disease 2019 (COVID-19) is a highly infectious viral disease caused by the novel SARS-CoV-2 virus. Different prediction techniques have been developed to predict the coronavirus disease's existence in patients. However, the accurate prediction was not improved and time consumption was not minimized. In order to address these existing problems, a novel technique called Biserial Targeted Feature Projection-based Radial Kernel Regressive Deep Belief Neural Learning (BTFP-RKRDBNL) is introduced to perform accurate disease prediction with lesser time consumption. The BTFP-RKRDBNL techniques perform disease prediction with the help of different layers such as two visible layers namely input and layer and two hidden layers. Initially, the features and data are collected from the dataset and transmitted to the input layer. The Point Biserial Correlative Target feature projection is used to select relevant features and other irrelevant features are removed with minimizing the disease prediction time. Then the relevant features are sent to the hidden layer 2. Next, Radial Kernel Regression is applied to analyze the training features and testing disease features to identify the disease with higher accuracy and a lesser false positive rate. Experimental analysis is planned to measure the prediction accuracy, sensitivity, and specificity, and prediction time for different numbers of patients. The result illustrates that the method increases the prediction accuracy, sensitivity, and specificity by 10, 6, and 21% and reduces the prediction time by 10% as compared to state-of-the-art works.
Collapse
Affiliation(s)
- S. Subash Chandra Bose
- Department of Information Technology, Guru Nanak College, Velachery, Chennai, Tamil Nadu India
| | - A. Vinoth Kumar
- Department of Electronics and Communication Engineering, Dr MGR Educational and Research Institute, Chennai, Tamil Nadu India
| | - Anitha Premkumar
- Department of Computer Science, Presidency University, Bangalore, 560064 India
| | - M. Deepika
- Computer Science and Engineering, ASIET, Kalady, Kerala India
| | - M. Gokilavani
- Computer Science and Engineering, KL University, Guntur, Andra Pradesh India
| |
Collapse
|
23
|
Hu J, Han Z, Heidari AA, Shou Y, Ye H, Wang L, Huang X, Chen H, Chen Y, Wu P. Detection of COVID-19 severity using blood gas analysis parameters and Harris hawks optimized extreme learning machine. Comput Biol Med 2022; 142:105166. [PMID: 35077935 PMCID: PMC8701842 DOI: 10.1016/j.compbiomed.2021.105166] [Citation(s) in RCA: 45] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2021] [Revised: 12/20/2021] [Accepted: 12/20/2021] [Indexed: 01/08/2023]
Abstract
Coronavirus disease-2019 (COVID-19) has made the world more cautious about widespread viruses, and a tragic pandemic that was caused by a novel coronavirus has harmed human beings in recent years. The new coronavirus pneumonia outbreak is spreading rapidly worldwide. We collect arterial blood samples from 51 patients with a COVID-19 diagnosis. Blood gas analysis is performed using a Siemens RAPID Point 500 blood gas analyzer. To accurately determine the factors that play a decisive role in the early recognition and discrimination of COVID-19 severity, a prediction framework that is based on an improved binary Harris hawk optimization (HHO) algorithm in combination with a kernel extreme learning machine is proposed in this paper. This method uses specular reflection learning to improve the original HHO algorithm and is referred to as HHOSRL. The experimental results show that the selected indicators, such as age, partial pressure of oxygen, oxygen saturation, sodium ion concentration, and lactic acid, are essential for the early accurate assessment of COVID-19 severity by the proposed feature selection method. The simulation results show that the established methodlogy can achieve promising performance. We believe that our proposed model provides an effective strategy for accurate early assessment of COVID-19 and distinguishing disease severity. The codes of HHO will be updated in https://aliasgharheidari.com/HHO.html.
Collapse
Affiliation(s)
- Jiao Hu
- Department of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, 325035, China.
| | - Zhengyuan Han
- Department of Pulmonary and Critical Care Medicine, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, 325000, PR China.
| | - Ali Asghar Heidari
- School of Surveying and Geospatial Engineering, College of Engineering, University of Tehran, Tehran, Iran.
| | - Yeqi Shou
- Department of Pulmonary and Critical Care Medicine, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, 325000, PR China.
| | - Hua Ye
- Department of Pulmonary and Critical Care Medicine, Affiliated Yueqing Hospital, Wenzhou Medical University, Yueqing, 325600, China.
| | - Liangxing Wang
- Department of Pulmonary and Critical Care Medicine, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, 325000, PR China.
| | - Xiaoying Huang
- Department of Pulmonary and Critical Care Medicine, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, 325000, PR China.
| | - Huiling Chen
- Department of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, 325035, China.
| | - Yanfan Chen
- Department of Pulmonary and Critical Care Medicine, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, 325000, PR China.
| | - Peiliang Wu
- Department of Pulmonary and Critical Care Medicine, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, 325000, PR China.
| |
Collapse
|
24
|
Kianfar N, Mesgari MS, Mollalo A, Kaveh M. Spatio-temporal modeling of COVID-19 prevalence and mortality using artificial neural network algorithms. Spat Spatiotemporal Epidemiol 2022; 40:100471. [PMID: 35120681 PMCID: PMC8580864 DOI: 10.1016/j.sste.2021.100471] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Revised: 10/03/2021] [Accepted: 11/04/2021] [Indexed: 01/09/2023]
Abstract
The outbreak of coronavirus disease (COVID-19) has become one of the most challenging global concerns in recent years. Due to inadequate worldwide studies on spatio-temporal modeling of COVID-19, this research aims to examine the relative significance of potential explanatory variables (n = 75) concerning COVID-19 prevalence and mortality using multilayer perceptron artificial neural network topology. We utilized ten variable importance analysis methods to identify the relative importance of the explanatory variables. The main findings indicated that several variables were persistently among the most influential variables in all periods. Regarding COVID-19 prevalence, unemployment and population density were among the most influential variables with the highest importance scores. While for COVID-19 mortality, health-related variables such as diabetes prevalence and number of hospital beds were among the most significant variables. The obtained findings from this study might provide general insights for public health policymakers to monitor the spread of disease and support decision-making.
Collapse
Affiliation(s)
- Nima Kianfar
- Faculty of Geodesy and Geomatics, K. N. Toosi University of Technology, Tehran 19967-15433, Iran.
| | - Mohammad Saadi Mesgari
- Faculty of Geodesy and Geomatics, K. N. Toosi University of Technology, Tehran 19967-15433, Iran
| | - Abolfazl Mollalo
- Department of Public Health and Prevention Science, School of Health Sciences, Baldwin Wallace University, Berea, OH 44017, USA
| | - Mehrdad Kaveh
- Faculty of Geodesy and Geomatics, K. N. Toosi University of Technology, Tehran 19967-15433, Iran
| |
Collapse
|
25
|
de Moura LV, Mattjie C, Dartora CM, Barros RC, Marques da Silva AM. Explainable Machine Learning for COVID-19 Pneumonia Classification With Texture-Based Features Extraction in Chest Radiography. Front Digit Health 2022; 3:662343. [PMID: 35112097 PMCID: PMC8801500 DOI: 10.3389/fdgth.2021.662343] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Accepted: 11/29/2021] [Indexed: 12/18/2022] Open
Abstract
Both reverse transcription-PCR (RT-PCR) and chest X-rays are used for the diagnosis of the coronavirus disease-2019 (COVID-19). However, COVID-19 pneumonia does not have a defined set of radiological findings. Our work aims to investigate radiomic features and classification models to differentiate chest X-ray images of COVID-19-based pneumonia and other types of lung patterns. The goal is to provide grounds for understanding the distinctive COVID-19 radiographic texture features using supervised ensemble machine learning methods based on trees through the interpretable Shapley Additive Explanations (SHAP) approach. We use 2,611 COVID-19 chest X-ray images and 2,611 non-COVID-19 chest X-rays. After segmenting the lung in three zones and laterally, a histogram normalization is applied, and radiomic features are extracted. SHAP recursive feature elimination with cross-validation is used to select features. Hyperparameter optimization of XGBoost and Random Forest ensemble tree models is applied using random search. The best classification model was XGBoost, with an accuracy of 0.82 and a sensitivity of 0.82. The explainable model showed the importance of the middle left and superior right lung zones in classifying COVID-19 pneumonia from other lung patterns.
Collapse
Affiliation(s)
- Luís Vinícius de Moura
- Medical Image Computing Laboratory, School of Technology, Pontifical Catholic University of Rio Grande do Sul, PUCRS, Porto Alegre, Brazil
| | - Christian Mattjie
- Medical Image Computing Laboratory, School of Technology, Pontifical Catholic University of Rio Grande do Sul, PUCRS, Porto Alegre, Brazil
- Graduate Program in Biomedical Gerontology, School of Medicine, Pontifical Catholic University of Rio Grande do Sul, PUCRS, Porto Alegre, Brazil
| | - Caroline Machado Dartora
- Medical Image Computing Laboratory, School of Technology, Pontifical Catholic University of Rio Grande do Sul, PUCRS, Porto Alegre, Brazil
- Graduate Program in Biomedical Gerontology, School of Medicine, Pontifical Catholic University of Rio Grande do Sul, PUCRS, Porto Alegre, Brazil
| | - Rodrigo C. Barros
- Machine Learning Theory and Applications Lab, School of Technology, Pontifical Catholic University of Rio Grande do Sul, PUCRS, Porto Alegre, Brazil
| | - Ana Maria Marques da Silva
- Medical Image Computing Laboratory, School of Technology, Pontifical Catholic University of Rio Grande do Sul, PUCRS, Porto Alegre, Brazil
- Graduate Program in Biomedical Gerontology, School of Medicine, Pontifical Catholic University of Rio Grande do Sul, PUCRS, Porto Alegre, Brazil
| |
Collapse
|
26
|
A pilot study for investigating the feasibility of supervised machine learning approaches for the classification of pedestrians struck by vehicles. J Forensic Leg Med 2021; 84:102256. [PMID: 34678617 DOI: 10.1016/j.jflm.2021.102256] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Revised: 09/17/2021] [Accepted: 09/27/2021] [Indexed: 12/23/2022]
Abstract
This research focuses on the application of Artificial Intelligence (AI) methodologies to the problem of classifying vehicles involved in lethal pedestrian collisions. Specifically, the vehicle type is predicted on the basis of traumatic injury suffered by casualties, exploiting machine learning algorithms. In the present study, AI-assisted diagnosis was shown to have correct prediction about 70% of the time. In pedestrians struck by trucks, more severe injuries were appreciated in the facial skeleton, lungs, major airways, liver, and spleen as well as in the sternum/clavicle/rib complex, whereas the lower extremities were more affected by fractures in pedestrians struck by cars. Although the distinction of the striking vehicle should develop beyond autopsy evidence alone, the presented approach which is novel in the realm of forensic science, is shown to be effective in building automated decision support systems. Outcomes from this system can provide valuable information after the execution of autoptic examinations supporting the forensic investigation. Preliminary results from the application of machine learning algorithms with real-world datasets seem to highlight the efficacy of the proposed approach, which could be used for further studies concerning this topic.
Collapse
|
27
|
Ortíz-Barrios MA, Coba-Blanco DM, Alfaro-Saíz JJ, Stand-González D. Process Improvement Approaches for Increasing the Response of Emergency Departments against the COVID-19 Pandemic: A Systematic Review. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:8814. [PMID: 34444561 PMCID: PMC8392152 DOI: 10.3390/ijerph18168814] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Revised: 08/15/2021] [Accepted: 08/17/2021] [Indexed: 12/23/2022]
Abstract
The COVID-19 pandemic has strongly affected the dynamics of Emergency Departments (EDs) worldwide and has accentuated the need for tackling different operational inefficiencies that decrease the quality of care provided to infected patients. The EDs continue to struggle against this outbreak by implementing strategies maximizing their performance within an uncertain healthcare environment. The efforts, however, have remained insufficient in view of the growing number of admissions and increased severity of the coronavirus disease. Therefore, the primary aim of this paper is to review the literature on process improvement interventions focused on increasing the ED response to the current COVID-19 outbreak to delineate future research lines based on the gaps detected in the practical scenario. Therefore, we applied the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines to perform a review containing the research papers published between December 2019 and April 2021 using ISI Web of Science, Scopus, PubMed, IEEE, Google Scholar, and Science Direct databases. The articles were further classified taking into account the research domain, primary aim, journal, and publication year. A total of 65 papers disseminated in 51 journals were concluded to satisfy the inclusion criteria. Our review found that most applications have been directed towards predicting the health outcomes in COVID-19 patients through machine learning and data analytics techniques. In the overarching pandemic, healthcare decision makers are strongly recommended to integrate artificial intelligence techniques with approaches from the operations research (OR) and quality management domains to upgrade the ED performance under social-economic restrictions.
Collapse
Affiliation(s)
- Miguel Angel Ortíz-Barrios
- Department of Productivity and Innovation, Universidad de la Costa CUC, Barranquilla 081001, Colombia; (D.M.C.-B.); (D.S.-G.)
| | - Dayana Milena Coba-Blanco
- Department of Productivity and Innovation, Universidad de la Costa CUC, Barranquilla 081001, Colombia; (D.M.C.-B.); (D.S.-G.)
| | - Juan-José Alfaro-Saíz
- Research Centre on Production Management and Engineering, Universitat Politècnica de València, 46022 Valencia, Spain;
| | - Daniela Stand-González
- Department of Productivity and Innovation, Universidad de la Costa CUC, Barranquilla 081001, Colombia; (D.M.C.-B.); (D.S.-G.)
| |
Collapse
|
28
|
AlJame M, Imtiaz A, Ahmad I, Mohammed A. Deep forest model for diagnosing COVID-19 from routine blood tests. Sci Rep 2021; 11:16682. [PMID: 34404838 PMCID: PMC8371014 DOI: 10.1038/s41598-021-95957-w] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Accepted: 08/03/2021] [Indexed: 02/07/2023] Open
Abstract
The Coronavirus Disease 2019 (COVID-19) global pandemic has threatened the lives of people worldwide and posed considerable challenges. Early and accurate screening of infected people is vital for combating the disease. To help with the limited quantity of swab tests, we propose a machine learning prediction model to accurately diagnose COVID-19 from clinical and/or routine laboratory data. The model exploits a new ensemble-based method called the deep forest (DF), where multiple classifiers in multiple layers are used to encourage diversity and improve performance. The cascade level employs the layer-by-layer processing and is constructed from three different classifiers: extra trees, XGBoost, and LightGBM. The prediction model was trained and evaluated on two publicly available datasets. Experimental results show that the proposed DF model has an accuracy of 99.5%, sensitivity of 95.28%, and specificity of 99.96%. These performance metrics are comparable to other well-established machine learning techniques, and hence DF model can serve as a fast screening tool for COVID-19 patients at places where testing is scarce.
Collapse
Affiliation(s)
- Maryam AlJame
- Department of Computer Engineering, Kuwait University, Kuwait City, Kuwait.
| | | | - Imtiaz Ahmad
- Department of Computer Engineering, Kuwait University, Kuwait City, Kuwait
| | - Ameer Mohammed
- Department of Computer Engineering, Kuwait University, Kuwait City, Kuwait
| |
Collapse
|
29
|
Peyvandi A, Majidi B, Peyvandi S, Patra J. Computer-Aided-Diagnosis as a Service on Decentralized Medical Cloud for Efficient and Rapid Emergency Response Intelligence. NEW GENERATION COMPUTING 2021; 39:677-700. [PMID: 34219860 PMCID: PMC8236221 DOI: 10.1007/s00354-021-00131-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/19/2020] [Accepted: 06/19/2021] [Indexed: 05/05/2023]
Abstract
The COVID-19 pandemic resulted in a significant increase in the workload for the emergency systems and healthcare providers all around the world. The emergency systems are dealing with large number of patients in various stages of deteriorating conditions which require significant medical expertise for accurate and rapid diagnosis and treatment. This issue will become more prominent in places with lack of medical experts and state-of-the-art clinical equipment, especially in developing countries. The machine intelligence aided medical diagnosis systems can provide rapid, dependable, autonomous, and low-cost solutions for medical diagnosis in emergency conditions. In this paper, a privacy-preserving computer-aided diagnosis (CAD) framework, called Decentralized deep Emergency response Intelligence (D-EI), which provides secure machine learning based medical diagnosis on the cloud is proposed. The proposed framework provides a blockchain based decentralized machine learning solution to aid the health providers with medical diagnosis in emergency conditions. The D-EI uses blockchain smart contracts to train the CAD machine learning models using all the data on the medical cloud while preserving the privacy of patients' records. Using the proposed framework, the data of each patient helps to increase the overall accuracy of the CAD model by balancing the diagnosis datasets with minority classes and special cases. As a case study, the D-EI is demonstrated as a solution for COVID-19 diagnosis. The D-EI framework can help in pandemic management by providing rapid and accurate diagnosis in overwhelming medical workload conditions.
Collapse
Affiliation(s)
| | - Babak Majidi
- Department of Computer Engineering, Khatam University, Tehran, Iran
- Emergency and Rapid Response Simulation (ADERSIM) Artificial Intelligence Group, Faculty of Liberal Arts and Professional Studies, York University, Toronto, Canada
| | - Soodeh Peyvandi
- Process Management and Business Intelligence, University of Applied Sciences Upper Austria, Steyr, Austria
| | - Jagdish Patra
- Faculty of Science, Engineering and Technology, Swinburne University of Technology, Melbourne, Australia
| |
Collapse
|
30
|
Janko V, Slapničar G, Dovgan E, Reščič N, Kolenik T, Gjoreski M, Smerkol M, Gams M, Luštrek M. Machine Learning for Analyzing Non-Countermeasure Factors Affecting Early Spread of COVID-19. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:6750. [PMID: 34201618 PMCID: PMC8268491 DOI: 10.3390/ijerph18136750] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Revised: 06/20/2021] [Accepted: 06/21/2021] [Indexed: 12/24/2022]
Abstract
The COVID-19 pandemic affected the whole world, but not all countries were impacted equally. This opens the question of what factors can explain the initial faster spread in some countries compared to others. Many such factors are overshadowed by the effect of the countermeasures, so we studied the early phases of the infection when countermeasures had not yet taken place. We collected the most diverse dataset of potentially relevant factors and infection metrics to date for this task. Using it, we show the importance of different factors and factor categories as determined by both statistical methods and machine learning (ML) feature selection (FS) approaches. Factors related to culture (e.g., individualism, openness), development, and travel proved the most important. A more thorough factor analysis was then made using a novel rule discovery algorithm. We also show how interconnected these factors are and caution against relying on ML analysis in isolation. Importantly, we explore potential pitfalls found in the methodology of similar work and demonstrate their impact on COVID-19 data analysis. Our best models using the decision tree classifier can predict the infection class with roughly 80% accuracy.
Collapse
Affiliation(s)
- Vito Janko
- Jožef Stefan Institute, 1000 Ljubljana, Slovenia; (G.S.); (E.D.); (N.R.); (T.K.); (M.G.); (M.S.); (M.G.); (M.L.)
| | | | | | | | | | | | | | | | | |
Collapse
|
31
|
Evaluating Service-Oriented and Microservice Architecture Patterns to Deploy eHealth Applications in Cloud Computing Environment. APPLIED SCIENCES-BASEL 2021. [DOI: 10.3390/app11104350] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
This article proposes a new framework for a Cloud-based eHealth platform concept focused on Cloud computing environments, since current and emerging approaches using digital clinical history increasingly demonstrate their potential in maintaining the quality of the benefits in medical care services, especially in computer-assisted clinical diagnosis within the field of infectious diseases and due to the worsening of chronic pathologies. Our objective is to evaluate and contrast the performance of the architectural patterns most commonly used for developing eHealth applications (i.e., service-oriented architecture (SOA) and microservices architecture (MSA)), using as reference the quantitative values obtained from the various performance tests and their ability to adapt to the required software attribute (i.e., versatile high-performance). Therefore, it was necessary to modify our platform to fit two architectural variants. As a follow-up to this activity, corresponding tests were performed that showed that the MSA variant functions better in terms of performance and response time compared to the SOA variant; however, it consumed significantly more bandwidth than SOA, and scalability in SOA is generally not possible or requires significant effort to be achieved. We conclude that the implementation of SOA and MSA depends on the nature and needs of organizations (e.g., performance or interoperability).
Collapse
|
32
|
Adamidi ES, Mitsis K, Nikita KS. Artificial intelligence in clinical care amidst COVID-19 pandemic: A systematic review. Comput Struct Biotechnol J 2021; 19:2833-2850. [PMID: 34025952 PMCID: PMC8123783 DOI: 10.1016/j.csbj.2021.05.010] [Citation(s) in RCA: 52] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2021] [Revised: 05/01/2021] [Accepted: 05/02/2021] [Indexed: 12/23/2022] Open
Abstract
The worldwide health crisis caused by the SARS-Cov-2 virus has resulted in>3 million deaths so far. Improving early screening, diagnosis and prognosis of the disease are critical steps in assisting healthcare professionals to save lives during this pandemic. Since WHO declared the COVID-19 outbreak as a pandemic, several studies have been conducted using Artificial Intelligence techniques to optimize these steps on clinical settings in terms of quality, accuracy and most importantly time. The objective of this study is to conduct a systematic literature review on published and preprint reports of Artificial Intelligence models developed and validated for screening, diagnosis and prognosis of the coronavirus disease 2019. We included 101 studies, published from January 1st, 2020 to December 30th, 2020, that developed AI prediction models which can be applied in the clinical setting. We identified in total 14 models for screening, 38 diagnostic models for detecting COVID-19 and 50 prognostic models for predicting ICU need, ventilator need, mortality risk, severity assessment or hospital length stay. Moreover, 43 studies were based on medical imaging and 58 studies on the use of clinical parameters, laboratory results or demographic features. Several heterogeneous predictors derived from multimodal data were identified. Analysis of these multimodal data, captured from various sources, in terms of prominence for each category of the included studies, was performed. Finally, Risk of Bias (RoB) analysis was also conducted to examine the applicability of the included studies in the clinical setting and assist healthcare providers, guideline developers, and policymakers.
Collapse
Key Words
- ABG, Arterial Blood Gas
- ADA, Adenosine Deaminase
- AI, Artificial Intelligence
- ANN, Artificial Neural Networks
- APTT, Activated Partial Thromboplastin Time
- ARMED, Attribute Reduction with Multi-objective Decomposition Ensemble optimizer
- AUC, Area Under the Curve
- Acc, Accuracy
- Adaboost, Adaptive Boosting
- Apol AI, Apolipoprotein AI
- Apol B, Apolipoprotein B
- Artificial intelligence
- BNB, Bernoulli Naïve Bayes
- BUN, Blood Urea Nitrogen
- CI, Confidence Interval
- CK-MB, Creatine Kinase isoenzyme
- CNN, Convolutional Neural Networks
- COVID-19
- CPP, COVID-19 Positive Patients
- CRP, C-Reactive Protein
- CRT, Classification and Regression Decision Tree
- CoxPH, Cox Proportional Hazards
- DCNN, Deep Convolutional Neural Networks
- DL, Deep Learning
- DLC, Density Lipoprotein Cholesterol
- DNN, Deep Neural Networks
- DT, Decision Tree
- Diagnosis
- ED, Emergency Department
- ESR, Erythrocyte Sedimentation Rate
- ET, Extra Trees
- FCV, Fold Cross Validation
- FL, Federated Learning
- FiO2, Fraction of Inspiration O2
- GBDT, Gradient Boost Decision Tree
- GBM light, Gradient Boosting Machine light
- GDCNN, Genetic Deep Learning Convolutional Neural Network
- GFR, Glomerular Filtration Rate
- GFS, Gradient boosted feature selection
- GGT, Glutamyl Transpeptidase
- GNB, Gaussian Naïve Bayes
- HDLC, High Density Lipoprotein Cholesterol
- INR, International Normalized Ratio
- Inception Resnet, Inception Residual Neural Network
- L1LR, L1 Regularized Logistic Regression
- LASSO, Least Absolute Shrinkage and Selection Operator
- LDA, Linear Discriminant Analysis
- LDH, Lactate Dehydrogenase
- LDLC, Low Density Lipoprotein Cholesterol
- LR, Logistic Regression
- LSTM, Long-Short Term Memory
- MCHC, Mean Corpuscular Hemoglobin Concentration
- MCV, Mean corpuscular volume
- ML, Machine Learning
- MLP, MultiLayer Perceptron
- MPV, Mean Platelet Volume
- MRMR, Maximum Relevance Minimum Redundancy
- Multimodal data
- NB, Naïve Bayes
- NLP, Natural Language Processing
- NPV, Negative Predictive Values
- Nadam optimizer, Nesterov Accelerated Adaptive Moment optimizer
- OB, Occult Blood test
- PCT, Thrombocytocrit
- PPV, Positive Predictive Values
- PWD, Platelet Distribution Width
- PaO2, Arterial Oxygen Tension
- Paco2, Arterial Carbondioxide Tension
- Prognosis
- RBC, Red Blood Cell
- RBF, Radial Basis Function
- RBP, Retinol Binding Protein
- RDW, Red blood cell Distribution Width
- RF, Random Forest
- RFE, Recursive Feature Elimination
- RSV, Respiratory Syncytial Virus
- SEN, Sensitivity
- SG, Specific Gravity
- SMOTE, Synthetic Minority Oversampling Technique
- SPE, Specificity
- SRLSR, Sparse Rescaled Linear Square Regression
- SVM, Support Vector Machine
- SaO2, Arterial Oxygen saturation
- Screening
- TBA, Total Bile Acid
- TTS, Training Test Split
- WBC, White Blood Cell count
- XGB, eXtreme Gradient Boost
- k-NN, K-Nearest Neighbor
Collapse
Affiliation(s)
- Eleni S. Adamidi
- Biomedical Simulations and Imaging Lab, School of Electrical and Computer Engineering, National Technical University of Athens, Greece
| | - Konstantinos Mitsis
- Biomedical Simulations and Imaging Lab, School of Electrical and Computer Engineering, National Technical University of Athens, Greece
| | - Konstantina S. Nikita
- Biomedical Simulations and Imaging Lab, School of Electrical and Computer Engineering, National Technical University of Athens, Greece
| |
Collapse
|
33
|
Chatzitofis A, Cancian P, Gkitsas V, Carlucci A, Stalidis P, Albanis G, Karakottas A, Semertzidis T, Daras P, Giannitto C, Casiraghi E, Sposta FM, Vatteroni G, Ammirabile A, Lofino L, Ragucci P, Laino ME, Voza A, Desai A, Cecconi M, Balzarini L, Chiti A, Zarpalas D, Savevski V. Volume-of-Interest Aware Deep Neural Networks for Rapid Chest CT-Based COVID-19 Patient Risk Assessment. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:2842. [PMID: 33799509 PMCID: PMC7998401 DOI: 10.3390/ijerph18062842] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/22/2021] [Revised: 02/19/2021] [Accepted: 03/03/2021] [Indexed: 02/06/2023]
Abstract
Since December 2019, the world has been devastated by the Coronavirus Disease 2019 (COVID-19) pandemic. Emergency Departments have been experiencing situations of urgency where clinical experts, without long experience and mature means in the fight against COVID-19, have to rapidly decide the most proper patient treatment. In this context, we introduce an artificially intelligent tool for effective and efficient Computed Tomography (CT)-based risk assessment to improve treatment and patient care. In this paper, we introduce a data-driven approach built on top of volume-of-interest aware deep neural networks for automatic COVID-19 patient risk assessment (discharged, hospitalized, intensive care unit) based on lung infection quantization through segmentation and, subsequently, CT classification. We tackle the high and varying dimensionality of the CT input by detecting and analyzing only a sub-volume of the CT, the Volume-of-Interest (VoI). Differently from recent strategies that consider infected CT slices without requiring any spatial coherency between them, or use the whole lung volume by applying abrupt and lossy volume down-sampling, we assess only the "most infected volume" composed of slices at its original spatial resolution. To achieve the above, we create, present and publish a new labeled and annotated CT dataset with 626 CT samples from COVID-19 patients. The comparison against such strategies proves the effectiveness of our VoI-based approach. We achieve remarkable performance on patient risk assessment evaluated on balanced data by reaching 88.88%, 89.77%, 94.73% and 88.88% accuracy, sensitivity, specificity and F1-score, respectively.
Collapse
Affiliation(s)
- Anargyros Chatzitofis
- Centre for Research and Technology Hellas, Information Technologies Institute, 6th km Charilaou—Thermi, P.O. Box 60361, 57001 Thessaloniki, Greece; (A.C.); (V.G.); (P.S.); (G.A.); (A.K.); (T.S.); (P.D.)
| | - Pierandrea Cancian
- Humanitas AI Center, Humanitas Research Hospital, Via Alessandro Manzoni 56, 20089 Rozzano, Italy; (P.C.); (A.C.); (M.E.L.); (V.S.)
| | - Vasileios Gkitsas
- Centre for Research and Technology Hellas, Information Technologies Institute, 6th km Charilaou—Thermi, P.O. Box 60361, 57001 Thessaloniki, Greece; (A.C.); (V.G.); (P.S.); (G.A.); (A.K.); (T.S.); (P.D.)
| | - Alessandro Carlucci
- Humanitas AI Center, Humanitas Research Hospital, Via Alessandro Manzoni 56, 20089 Rozzano, Italy; (P.C.); (A.C.); (M.E.L.); (V.S.)
| | - Panagiotis Stalidis
- Centre for Research and Technology Hellas, Information Technologies Institute, 6th km Charilaou—Thermi, P.O. Box 60361, 57001 Thessaloniki, Greece; (A.C.); (V.G.); (P.S.); (G.A.); (A.K.); (T.S.); (P.D.)
| | - Georgios Albanis
- Centre for Research and Technology Hellas, Information Technologies Institute, 6th km Charilaou—Thermi, P.O. Box 60361, 57001 Thessaloniki, Greece; (A.C.); (V.G.); (P.S.); (G.A.); (A.K.); (T.S.); (P.D.)
| | - Antonis Karakottas
- Centre for Research and Technology Hellas, Information Technologies Institute, 6th km Charilaou—Thermi, P.O. Box 60361, 57001 Thessaloniki, Greece; (A.C.); (V.G.); (P.S.); (G.A.); (A.K.); (T.S.); (P.D.)
| | - Theodoros Semertzidis
- Centre for Research and Technology Hellas, Information Technologies Institute, 6th km Charilaou—Thermi, P.O. Box 60361, 57001 Thessaloniki, Greece; (A.C.); (V.G.); (P.S.); (G.A.); (A.K.); (T.S.); (P.D.)
| | - Petros Daras
- Centre for Research and Technology Hellas, Information Technologies Institute, 6th km Charilaou—Thermi, P.O. Box 60361, 57001 Thessaloniki, Greece; (A.C.); (V.G.); (P.S.); (G.A.); (A.K.); (T.S.); (P.D.)
| | - Caterina Giannitto
- Radiology Department, Humanitas Research Hospital, Via Alessandro Manzoni 56, 20089 Rozzano, Italy; (C.G.); (F.M.S.); (G.V.); (A.A.); (L.L.); (P.R.); (L.B.)
| | - Elena Casiraghi
- Dipartimento di Informatica/Computer Science Department “Giovanni degli Antoni”, Università degli Studi di Milano, Via Celoria 18, 20133 Milan, Italy;
| | - Federica Mrakic Sposta
- Radiology Department, Humanitas Research Hospital, Via Alessandro Manzoni 56, 20089 Rozzano, Italy; (C.G.); (F.M.S.); (G.V.); (A.A.); (L.L.); (P.R.); (L.B.)
| | - Giulia Vatteroni
- Radiology Department, Humanitas Research Hospital, Via Alessandro Manzoni 56, 20089 Rozzano, Italy; (C.G.); (F.M.S.); (G.V.); (A.A.); (L.L.); (P.R.); (L.B.)
- Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, 20090 Pieve Emanuele, Italy; (A.D.); (M.C.); (A.C.)
| | - Angela Ammirabile
- Radiology Department, Humanitas Research Hospital, Via Alessandro Manzoni 56, 20089 Rozzano, Italy; (C.G.); (F.M.S.); (G.V.); (A.A.); (L.L.); (P.R.); (L.B.)
- Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, 20090 Pieve Emanuele, Italy; (A.D.); (M.C.); (A.C.)
| | - Ludovica Lofino
- Radiology Department, Humanitas Research Hospital, Via Alessandro Manzoni 56, 20089 Rozzano, Italy; (C.G.); (F.M.S.); (G.V.); (A.A.); (L.L.); (P.R.); (L.B.)
- Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, 20090 Pieve Emanuele, Italy; (A.D.); (M.C.); (A.C.)
| | - Pasquala Ragucci
- Radiology Department, Humanitas Research Hospital, Via Alessandro Manzoni 56, 20089 Rozzano, Italy; (C.G.); (F.M.S.); (G.V.); (A.A.); (L.L.); (P.R.); (L.B.)
| | - Maria Elena Laino
- Humanitas AI Center, Humanitas Research Hospital, Via Alessandro Manzoni 56, 20089 Rozzano, Italy; (P.C.); (A.C.); (M.E.L.); (V.S.)
- Radiology Department, Humanitas Research Hospital, Via Alessandro Manzoni 56, 20089 Rozzano, Italy; (C.G.); (F.M.S.); (G.V.); (A.A.); (L.L.); (P.R.); (L.B.)
| | - Antonio Voza
- Emergency Department, Humanitas Research Hospital, Via Alessandro Manzoni 56, 20089 Rozzano, Italy;
| | - Antonio Desai
- Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, 20090 Pieve Emanuele, Italy; (A.D.); (M.C.); (A.C.)
- Emergency Department, Humanitas Research Hospital, Via Alessandro Manzoni 56, 20089 Rozzano, Italy;
| | - Maurizio Cecconi
- Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, 20090 Pieve Emanuele, Italy; (A.D.); (M.C.); (A.C.)
- Intensive Care Unit, Humanitas Research Hospital, Via Alessandro Manzoni 56, 20089 Rozzano, Italy
| | - Luca Balzarini
- Radiology Department, Humanitas Research Hospital, Via Alessandro Manzoni 56, 20089 Rozzano, Italy; (C.G.); (F.M.S.); (G.V.); (A.A.); (L.L.); (P.R.); (L.B.)
| | - Arturo Chiti
- Department of Biomedical Sciences, Humanitas University, Via Rita Levi Montalcini 4, 20090 Pieve Emanuele, Italy; (A.D.); (M.C.); (A.C.)
- Humanitas Clinical and Research Center—IRCCS, Via Manzoni 56, 20089 Rozzano, Italy
| | - Dimitrios Zarpalas
- Centre for Research and Technology Hellas, Information Technologies Institute, 6th km Charilaou—Thermi, P.O. Box 60361, 57001 Thessaloniki, Greece; (A.C.); (V.G.); (P.S.); (G.A.); (A.K.); (T.S.); (P.D.)
| | - Victor Savevski
- Humanitas AI Center, Humanitas Research Hospital, Via Alessandro Manzoni 56, 20089 Rozzano, Italy; (P.C.); (A.C.); (M.E.L.); (V.S.)
| |
Collapse
|
34
|
Esposito A, Casiraghi E, Chiaraviglio F, Scarabelli A, Stellato E, Plensich G, Lastella G, Di Meglio L, Fusco S, Avola E, Jachetti A, Giannitto C, Malchiodi D, Frasca M, Beheshti A, Robinson PN, Valentini G, Forzenigo L, Carrafiello G. Artificial Intelligence in Predicting Clinical Outcome in COVID-19 Patients from Clinical, Biochemical and a Qualitative Chest X-Ray Scoring System. REPORTS IN MEDICAL IMAGING 2021. [DOI: 10.2147/rmi.s292314] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
|
35
|
Kumar Singh V, Abdel-Nasser M, Pandey N, Puig D. LungINFseg: Segmenting COVID-19 Infected Regions in Lung CT Images Based on a Receptive-Field-Aware Deep Learning Framework. Diagnostics (Basel) 2021; 11:158. [PMID: 33498999 PMCID: PMC7910858 DOI: 10.3390/diagnostics11020158] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Revised: 01/07/2021] [Accepted: 01/14/2021] [Indexed: 12/13/2022] Open
Abstract
COVID-19 is a fast-growing disease all over the world, but facilities in the hospitals are restricted. Due to unavailability of an appropriate vaccine or medicine, early identification of patients suspected to have COVID-19 plays an important role in limiting the extent of disease. Lung computed tomography (CT) imaging is an alternative to the RT-PCR test for diagnosing COVID-19. Manual segmentation of lung CT images is time consuming and has several challenges, such as the high disparities in texture, size, and location of infections. Patchy ground-glass and consolidations, along with pathological changes, limit the accuracy of the existing deep learning-based CT slices segmentation methods. To cope with these issues, in this paper we propose a fully automated and efficient deep learning-based method, called LungINFseg, to segment the COVID-19 infections in lung CT images. Specifically, we propose the receptive-field-aware (RFA) module that can enlarge the receptive field of the segmentation models and increase the learning ability of the model without information loss. RFA includes convolution layers to extract COVID-19 features, dilated convolution consolidated with learnable parallel-group convolution to enlarge the receptive field, frequency domain features obtained by discrete wavelet transform, which also enlarges the receptive field, and an attention mechanism to promote COVID-19-related features. Large receptive fields could help deep learning models to learn contextual information and COVID-19 infection-related features that yield accurate segmentation results. In our experiments, we used a total of 1800+ annotated CT slices to build and test LungINFseg. We also compared LungINFseg with 13 state-of-the-art deep learning-based segmentation methods to demonstrate its effectiveness. LungINFseg achieved a dice score of 80.34% and an intersection-over-union (IoU) score of 68.77%-higher than the ones of the other 13 segmentation methods. Specifically, the dice and IoU scores of LungINFseg were 10% better than those of the popular biomedical segmentation method U-Net.
Collapse
Affiliation(s)
- Vivek Kumar Singh
- Department of Computer Engineering and Mathematics, Universitat Rovira i Virgili, 43007 Tarragona, Spain; (V.K.S.); (D.P.)
| | - Mohamed Abdel-Nasser
- Department of Computer Engineering and Mathematics, Universitat Rovira i Virgili, 43007 Tarragona, Spain; (V.K.S.); (D.P.)
- Department of Electrical Engineering, Aswan University, 81542 Aswan, Egypt
| | - Nidhi Pandey
- Department of Medicine and Health Sciences, Universitat Rovira i Virgili, 43201 Reus, Spain;
| | - Domenec Puig
- Department of Computer Engineering and Mathematics, Universitat Rovira i Virgili, 43007 Tarragona, Spain; (V.K.S.); (D.P.)
| |
Collapse
|