1
|
Si Z, Li J, Leary E. Machine learning in public health informatics: Evidence that complex sampling structures may not be needed for prediction models with imbalanced outcomes. Ann Epidemiol 2025; 102:75-80. [PMID: 39798678 DOI: 10.1016/j.annepidem.2024.12.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2024] [Revised: 12/16/2024] [Accepted: 12/30/2024] [Indexed: 01/15/2025]
Abstract
PURPOSE The objective of this study is to investigate the predictive ability of machine learning models for imbalanced outcomes from national survey data without the use of sampling weights. METHODS We evaluated the predictive performance of machine learning models on imbalanced outcomes from the US National Health and Nutrition Examination Survey (USNHANES) without using sampling weights. Four machine learning models (support vector machine, random forest, least absolute shrinkage and selection operator regression, and deep neural network) were compared with a logistic model that incorporates the survey's complex sampling design. Three resampling methods (oversampling, undersampling, and combined) were used to address class imbalance during the model training process. RESULTS For all models, the balanced accuracy was similar (ranging from 0.72 to 0.76) and the specificity was smaller than sensitivity except for random forest. The support vector machine and neural networks performed best with sensitivity (ranging from 0.79 to 0.83), while the random forest had the largest specificity (ranging from 0.86 to 0.96), with one exception. PR-AUC scores and Brier scores were low ranging from 0.2529 to 0.3313 (lower scores worse) and 0.1005-0.3245 (lower scores better), respectively CONCLUSIONS: The machine learning models had overall similar predictive capacity to the recommended methods which integrate the complex sampling design for the prediction of osteoarthritis occurrence with USNHANES.
Collapse
Affiliation(s)
- Zhengye Si
- Department of Orthopaedic Surgery, Thompson Laboratory for Regenerative Orthopaedics, School of Medicine, University of Missouri, 1100 Virginia Avenue, Columbia , MO 65211, USA
| | - Jinpu Li
- Department of Orthopaedic Surgery, Thompson Laboratory for Regenerative Orthopaedics, School of Medicine, University of Missouri, 1100 Virginia Avenue, Columbia , MO 65211, USA
| | - Emily Leary
- Department of Orthopaedic Surgery, Thompson Laboratory for Regenerative Orthopaedics, School of Medicine, University of Missouri, 1100 Virginia Avenue, Columbia , MO 65211, USA.
| |
Collapse
|
2
|
van Kleef LA, Francque SM, Prieto-Ortiz JE, Sonneveld MJ, Sanchez-Luque CB, Prieto-Ortiz RG, Kwanten WJ, Vonghia L, Verrijken A, De Block C, Gadi Z, Janssen HLA, de Knegt RJ, Brouwer WP. Metabolic Dysfunction-Associated Fibrosis 5 (MAF-5) Score Predicts Liver Fibrosis Risk and Outcome in the General Population With Metabolic Dysfunction. Gastroenterology 2024; 167:357-367.e9. [PMID: 38513745 DOI: 10.1053/j.gastro.2024.03.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/17/2023] [Revised: 03/10/2024] [Accepted: 03/13/2024] [Indexed: 03/23/2024]
Abstract
BACKGROUND & AIMS There is an unmet need for noninvasive tests to improve case-finding and aid primary care professionals in referring patients at high risk of liver disease. METHODS A metabolic dysfunction-associated fibrosis (MAF-5) score was developed and externally validated in a total of 21,797 individuals with metabolic dysfunction in population-based (National Health and Nutrition Examination Survey 2017-2020, National Health and Nutrition Examination Survey III, and Rotterdam Study) and hospital-based (from Antwerp and Bogota) cohorts. Fibrosis was defined as liver stiffness ≥8.0 kPa. Diagnostic accuracy was compared with FIB-4, nonalcoholic fatty liver disease fibrosis score (NFS), LiverRisk score and steatosis-associated fibrosis estimator (SAFE). MAF-5 was externally validated with liver stiffness measurement ≥8.0 kPa, with shear-wave elastography ≥7.5 kPa, and biopsy-proven steatotic liver disease according to Metavir and Nonalcoholic Steatohepatitis Clinical Research Network scores, and was tested for prognostic performance (all-cause mortality). RESULTS The MAF-5 score comprised waist circumference, body mass index (calculated as kg / m2), diabetes, aspartate aminotransferase, and platelets. With this score, 60.9% was predicted at low, 14.1% at intermediate, and 24.9% at high risk of fibrosis. The observed prevalence was 3.3%, 7.9%, and 28.1%, respectively. The area under the receiver operator curve of MAF-5 (0.81) was significantly higher than FIB-4 (0.61), and outperformed the FIB-4 among young people (negative predictive value [NPV], 99%; area under the curve [AUC], 0.86 vs NPV, 94%; AUC, 0.51) and older adults (NPV, 94%; AUC, 0.75 vs NPV, 88%; AUC, 0.55). MAF-5 showed excellent performance to detect liver stiffness measurement ≥12 kPa (AUC, 0.86 training; AUC, 0.85 validation) and good performance in detecting liver stiffness and biopsy-proven liver fibrosis among the external validation cohorts. MAF-5 score >1 was associated with increased risk of all-cause mortality in (un)adjusted models (adjusted hazard ratio, 1.59; 95% CI, 1.47-1.73). CONCLUSIONS The MAF-5 score is a validated, age-independent, inexpensive referral tool to identify individuals at high risk of liver fibrosis and all-cause mortality in primary care populations, using simple variables.
Collapse
Affiliation(s)
- Laurens A van Kleef
- Department of Gastroenterology and Hepatology, Erasmus Medical Center, University Medical Center, Rotterdam, The Netherlands
| | - Sven M Francque
- Department of Gastroenterology and Hepatology, Antwerp University Hospital, Antwerp, Belgium; Laboratory of Experimental Medicine and Paediatrics, University of Antwerp, Antwerp, Belgium
| | | | - Milan J Sonneveld
- Department of Gastroenterology and Hepatology, Erasmus Medical Center, University Medical Center, Rotterdam, The Netherlands
| | | | | | - Wilhelmus J Kwanten
- Department of Gastroenterology and Hepatology, Antwerp University Hospital, Antwerp, Belgium; Laboratory of Experimental Medicine and Paediatrics, University of Antwerp, Antwerp, Belgium
| | - Luisa Vonghia
- Department of Gastroenterology and Hepatology, Antwerp University Hospital, Antwerp, Belgium; Laboratory of Experimental Medicine and Paediatrics, University of Antwerp, Antwerp, Belgium
| | - An Verrijken
- Laboratory of Experimental Medicine and Paediatrics, University of Antwerp, Antwerp, Belgium; Department of Endocrinology, Diabetology and Metabolism, Antwerp University Hospital, Antwerp, Belgium
| | - Christophe De Block
- Laboratory of Experimental Medicine and Paediatrics, University of Antwerp, Antwerp, Belgium; Department of Endocrinology, Diabetology and Metabolism, Antwerp University Hospital, Antwerp, Belgium
| | - Zouhir Gadi
- Department of Gastroenterology and Hepatology, Antwerp University Hospital, Antwerp, Belgium; Laboratory of Experimental Medicine and Paediatrics, University of Antwerp, Antwerp, Belgium
| | - Harry L A Janssen
- Department of Gastroenterology and Hepatology, Erasmus Medical Center, University Medical Center, Rotterdam, The Netherlands; Toronto Centre for Liver Disease, Toronto General Hospital, University Health Network, Canada
| | - Robert J de Knegt
- Department of Gastroenterology and Hepatology, Erasmus Medical Center, University Medical Center, Rotterdam, The Netherlands
| | - Willem Pieter Brouwer
- Department of Gastroenterology and Hepatology, Erasmus Medical Center, University Medical Center, Rotterdam, The Netherlands.
| |
Collapse
|
3
|
Hassoun S, Bruckmann C, Ciardullo S, Perseghin G, Marra F, Curto A, Arena U, Broccolo F, Di Gaudio F. NAIF: A novel artificial intelligence-based tool for accurate diagnosis of stage F3/F4 liver fibrosis in the general adult population, validated with three external datasets. Int J Med Inform 2024; 185:105373. [PMID: 38395017 DOI: 10.1016/j.ijmedinf.2024.105373] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Revised: 02/05/2024] [Accepted: 02/07/2024] [Indexed: 02/25/2024]
Abstract
OBJECTIVE The purpose of this study was to determine the effectiveness of a new AI-based tool called NAIF (NAFLD-AI-Fibrosis) in identifying individuals from the general population with advanced liver fibrosis (stage F3/F4). We compared NAIF's performance to two existing risk score calculators, aspartate aminotransferase-to-platelet ratio index (APRI) and fibrosis-4 (Fib4). METHODS To set up the algorithm for diagnosing severe liver fibrosis (defined as Fibroscan® values E ≥ 9.7 KPa), we used 19 blood biochemistry parameters and two demographic parameters in a group of 5,962 individuals from the NHANES population (2017-2020 pre-pandemic, public database). We then assessed the algorithm's performance by comparing its accuracy, precision, sensitivity, specificity, and F1 score values to those of APRI and Fib4 scoring systems. RESULTS In a kept-out sub dataset of the NHANES population, NAIF achieved a predictive precision of 72 %, a sensitivity of 61 %, and a specificity of 77 % in correctly identifying adults (aged 18-79 years) with severe liver fibrosis. Additionally, NAIF performed well when tested with two external datasets of Italian patients with a Fibroscan® score E ≥ 9.7 kPa, and with an external dataset of patients with diagnosis of severe liver fibrosis through biopsy. CONCLUSIONS The results of our study suggest that NAIF, using routinely available parameters, outperforms in sensitivity existing scoring methods (Fib4 and APRI) in diagnosing severe liver fibrosis, even when tested with external validation datasets. NAIF uses routinely available parameters, making it a promising tool for identifying individuals with advanced liver fibrosis from the general population. Word count abstract: 236.
Collapse
Affiliation(s)
- Samir Hassoun
- Unità Operativa Centro Controllo Qualità e Rischio Chimico (CQRC), Azienda Ospedaliera Villa Sofia Cervello, viale Strasburgo 233, 90146 Palermo, Italy.
| | - Chiara Bruckmann
- Unità Operativa Centro Controllo Qualità e Rischio Chimico (CQRC), Azienda Ospedaliera Villa Sofia Cervello, viale Strasburgo 233, 90146 Palermo, Italy.
| | - Stefano Ciardullo
- Department of Medicine and Surgery, University of Milano-Bicocca, via Modigliani 10, 20900 Monza, Italy; Department of Medicine and Rehabilitation, Policlinico di Monza, Monza, via Modigliani 10, 20900 Monza, Italy
| | - Gianluca Perseghin
- Department of Medicine and Surgery, University of Milano-Bicocca, via Modigliani 10, 20900 Monza, Italy; Department of Medicine and Rehabilitation, Policlinico di Monza, Monza, via Modigliani 10, 20900 Monza, Italy
| | - Fabio Marra
- Dipartimento di Medicina Sperimentale e Clinica, University of Florence, Largo Giovanni Alessandro Brambilla, 3, 50134 Firenze Italy
| | - Armando Curto
- Dipartimento di Medicina Sperimentale e Clinica, University of Florence, Largo Giovanni Alessandro Brambilla, 3, 50134 Firenze Italy
| | - Umberto Arena
- Dipartimento di Medicina Sperimentale e Clinica, University of Florence, Largo Giovanni Alessandro Brambilla, 3, 50134 Firenze Italy
| | - Francesco Broccolo
- Department of Experimental Medicine, University of Salento, 73100 Lecce, Italy.
| | - Francesca Di Gaudio
- Unità Operativa Centro Controllo Qualità e Rischio Chimico (CQRC), Azienda Ospedaliera Villa Sofia Cervello, viale Strasburgo 233, 90146 Palermo, Italy; PROMISE-Promotion of Health, Maternal-Childhood, Internal and Specialized Medicine of Excellence G. D'Alessandro, Piazza delle Cliniche, 2, 90127 Palermo, Italy
| |
Collapse
|
4
|
Bottrighi A, Pennisi M. Exploring the State of Machine Learning and Deep Learning in Medicine: A Survey of the Italian Research Community. INFORMATION 2023; 14:513. [DOI: 10.3390/info14090513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2025] Open
Abstract
Artificial intelligence (AI) is becoming increasingly important, especially in the medical field. While AI has been used in medicine for some time, its growth in the last decade is remarkable. Specifically, machine learning (ML) and deep learning (DL) techniques in medicine have been increasingly adopted due to the growing abundance of health-related data, the improved suitability of such techniques for managing large datasets, and more computational power. ML and DL methodologies are fostering the development of new “intelligent” tools and expert systems to process data, to automatize human–machine interactions, and to deliver advanced predictive systems that are changing every aspect of the scientific research, industry, and society. The Italian scientific community was instrumental in advancing this research area. This article aims to conduct a comprehensive investigation of the ML and DL methodologies and applications used in medicine by the Italian research community in the last five years. To this end, we selected all the papers published in the last five years with at least one of the authors affiliated to an Italian institution that in the title, in the abstract, or in the keywords present the terms “machine learning” or “deep learning” and reference a medical area. We focused our research on journal papers under the hypothesis that Italian researchers prefer to present novel but well-established research in scientific journals. We then analyzed the selected papers considering different dimensions, including the medical topic, the type of data, the pre-processing methods, the learning methods, and the evaluation methods. As a final outcome, a comprehensive overview of the Italian research landscape is given, highlighting how the community has increasingly worked on a very heterogeneous range of medical problems.
Collapse
Affiliation(s)
- Alessio Bottrighi
- Dipartimento di Scienze e Innovazione Tecnologica (DiSIT), Computer Science Institute, Università del Piemonte Orientale, 15121 Alessandria, Italy
- Laboratorio Integrato di Intelligenza Artificiale e Informatica in Medicina, Azienda Ospedaliera SS. Antonio e Biagio e Cesare Arrigo, Alessandria—e DiSIT—Università del Piemonte Orientale, 15121 Alessandria, Italy
| | - Marzio Pennisi
- Dipartimento di Scienze e Innovazione Tecnologica (DiSIT), Computer Science Institute, Università del Piemonte Orientale, 15121 Alessandria, Italy
- Laboratorio Integrato di Intelligenza Artificiale e Informatica in Medicina, Azienda Ospedaliera SS. Antonio e Biagio e Cesare Arrigo, Alessandria—e DiSIT—Università del Piemonte Orientale, 15121 Alessandria, Italy
| |
Collapse
|
5
|
Weng S, Hu D, Chen J, Yang Y, Peng D. Prediction of Fatty Liver Disease in a Chinese Population Using Machine-Learning Algorithms. Diagnostics (Basel) 2023; 13:diagnostics13061168. [PMID: 36980476 PMCID: PMC10047083 DOI: 10.3390/diagnostics13061168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2023] [Revised: 03/13/2023] [Accepted: 03/16/2023] [Indexed: 03/30/2023] Open
Abstract
BACKGROUND Fatty liver disease (FLD) is an important risk factor for liver cancer and cardiovascular disease and can lead to significant social and economic burden. However, there is currently no nationwide epidemiological survey for FLD in China, making early FLD screening crucial for the Chinese population. Unfortunately, liver biopsy and abdominal ultrasound, the preferred methods for FLD diagnosis, are not practical for primary medical institutions. Therefore, the aim of this study was to develop machine learning (ML) models for screening individuals at high risk of FLD, and to provide a new perspective on early FLD diagnosis. METHODS This study included a total of 30,574 individuals between the ages of 18 and 70 who completed abdominal ultrasound and the related clinical examinations. Among them, 3474 individuals were diagnosed with FLD by abdominal ultrasound. We used 11 indicators to build eight classification models to predict FLD. The model prediction ability was evaluated by the area under the curve, sensitivity, specificity, positive predictive value, negative predictive value, and kappa value. Feature importance analysis was assessed by Shapley value or root mean square error loss after permutations. RESULTS Among the eight ML models, the prediction accuracy of the extreme gradient boosting (XGBoost) model was highest at 89.77%. By feature importance analysis, we found that the body mass index, triglyceride, and alanine aminotransferase play important roles in FLD prediction. CONCLUSION XGBoost improves the efficiency and cost of large-scale FLD screening.
Collapse
Affiliation(s)
- Shuwei Weng
- Department of Cardiovascular Medicine, The Second Xiangya Hospital, Central South University, Changsha 410011, China
- Research Institute of Blood Lipid and Atherosclerosis, Central South University, Changsha 410011, China
| | - Die Hu
- Department of Cardiovascular Medicine, The Second Xiangya Hospital, Central South University, Changsha 410011, China
- Research Institute of Blood Lipid and Atherosclerosis, Central South University, Changsha 410011, China
| | - Jin Chen
- Department of Cardiovascular Medicine, The Second Xiangya Hospital, Central South University, Changsha 410011, China
- Research Institute of Blood Lipid and Atherosclerosis, Central South University, Changsha 410011, China
| | - Yanyi Yang
- Health Management Center, The Second Xiangya Hospital, Central South University, Changsha 410011, China
| | - Daoquan Peng
- Department of Cardiovascular Medicine, The Second Xiangya Hospital, Central South University, Changsha 410011, China
- Research Institute of Blood Lipid and Atherosclerosis, Central South University, Changsha 410011, China
| |
Collapse
|