Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: van der Ploeg T, Austin PC, Steyerberg EW. Modern modelling techniques are data hungry: a simulation study for predicting dichotomous endpoints. BMC Med Res Methodol 2014;14:137. [PMID: 25532820 PMCID: PMC4289553 DOI: 10.1186/1471-2288-14-137] [Citation(s) in RCA: 327] [Impact Index Per Article: 32.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2014] [Accepted: 12/19/2014] [Indexed: 12/27/2022] Open

For:	van der Ploeg T, Austin PC, Steyerberg EW. Modern modelling techniques are data hungry: a simulation study for predicting dichotomous endpoints. BMC Med Res Methodol 2014;14:137. [PMID: 25532820 PMCID: PMC4289553 DOI: 10.1186/1471-2288-14-137] [Citation(s) in RCA: 327] [Impact Index Per Article: 32.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2014] [Accepted: 12/19/2014] [Indexed: 12/27/2022] Open

Number

Cited by Other Article(s)

Li YX, Liu YC, Wang M, Huang YL. Prediction of gestational diabetes mellitus at the first trimester: machine-learning algorithms. Arch Gynecol Obstet 2024;309:2557-2566. [PMID: 37477677 DOI: 10.1007/s00404-023-07131-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Accepted: 06/27/2023] [Indexed: 07/22/2023]

Sokou R, Parastatidou S, Konstantinidi A, Tsantes AG, Iacovidou N, Piovani D, Bonovas S, Tsantes AE. Bleeding Scoring Systems in Neonates: A Systematic Review. Semin Thromb Hemost 2024;50:620-637. [PMID: 38016650 DOI: 10.1055/s-0043-1777070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2023]

Asiimwe IG, S'fiso Ndzamba B, Mouksassi S, Pillai GC, Lombard A, Lang J. Machine-Learning Assisted Screening of Correlated Covariates: Application to Clinical Data of Desipramine. AAPS J 2024;26:63. [PMID: 38816519 DOI: 10.1208/s12248-024-00934-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Accepted: 05/14/2024] [Indexed: 06/01/2024] Open

Abstract

Stepwise covariate modeling (SCM) has a high computational burden and can select the wrong covariates. Machine learning (ML) has been proposed as a screening tool to improve the efficiency of covariate selection, but little is known about how to apply ML on actual clinical data. First, we simulated datasets based on clinical data to compare the performance of various ML and traditional pharmacometrics (PMX) techniques with and without accounting for highly-correlated covariates. This simulation step identified the ML algorithm and the number of top covariates to select when using the actual clinical data. A previously developed desipramine population-pharmacokinetic model was used to simulate virtual subjects. Fifteen covariates were considered with four having an effect included. Based on the F1 score (an accuracy measure), ridge regression was the most accurate ML technique on 200 simulated datasets (F1 score = 0.475 ± 0.231), a performance which almost doubled when highly-correlated covariates were accounted for (F1 score = 0.860 ± 0.158). These performances were better than forwards selection with SCM (F1 score = 0.251 ± 0.274 and 0.499 ± 0.381 without/with correlations respectively). In terms of computational cost, ridge regression (0.42 ± 0.07 seconds/simulated dataset, 1 thread) was ~20,000 times faster than SCM (2.30 ± 2.29 hours, 15 threads). On the clinical dataset, prescreening with the selected ML algorithm reduced SCM runtime by 42.86% (from 1.75 to 1.00 days) and produced the same final model as SCM only. In conclusion, we have demonstrated that accounting for highly-correlated covariates improves ML prescreening accuracy. The choice of ML method and the proportion of important covariates (unknown a priori) can be guided by simulations.

Collapse

Taha-Mehlitz S, Wentzler L, Angehrn F, Hendie A, Ochs V, Wolleb J, Staartjes VE, Enodien B, Baltuonis M, Vorburger S, Frey DM, Rosenberg R, von Flüe M, Müller-Stich B, Cattin PC, Taha A, Steinemann D. Machine learning-based preoperative analytics for the prediction of anastomotic leakage in colorectal surgery: a swiss pilot study. Surg Endosc 2024:10.1007/s00464-024-10926-4. [PMID: 38777894 DOI: 10.1007/s00464-024-10926-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2023] [Accepted: 05/05/2024] [Indexed: 05/25/2024]

Abstract

BACKGROUND

Anastomotic leakage (AL), a severe complication following colorectal surgery, arises from defects at the anastomosis site. This study evaluates the feasibility of predicting AL using machine learning (ML) algorithms based on preoperative data.

METHODS

We retrospectively analyzed data including 21 predictors from patients undergoing colorectal surgery with bowel anastomosis at four Swiss hospitals. Several ML algorithms were applied for binary classification into AL or non-AL groups, utilizing a five-fold cross-validation strategy with a 90% training and 10% validation split. Additionally, a holdout test set from an external hospital was employed to assess the models' robustness in external validation.

RESULTS

Among 1244 patients, 112 (9.0%) suffered from AL. The Random Forest model showed an AUC-ROC of 0.78 (SD: ± 0.01) on the internal test set, which significantly decreased to 0.60 (SD: ± 0.05) on the external holdout test set comprising 198 patients, including 7 (3.5%) with AL. Conversely, the Logistic Regression model demonstrated more consistent AUC-ROC values of 0.69 (SD: ± 0.01) on the internal set and 0.61 (SD: ± 0.05) on the external set. Accuracy measures for Random Forest were 0.82 (SD: ± 0.04) internally and 0.87 (SD: ± 0.08) externally, while Logistic Regression achieved accuracies of 0.81 (SD: ± 0.10) and 0.88 (SD: ± 0.15). F1 Scores for Random Forest moved from 0.58 (SD: ± 0.03) internally to 0.51 (SD: ± 0.03) externally, with Logistic Regression maintaining more stable scores of 0.53 (SD: ± 0.04) and 0.51 (SD: ± 0.02).

CONCLUSION

In this pilot study, we evaluated ML-based prediction models for AL post-colorectal surgery and identified ten patient-related risk factors associated with AL. Highlighting the need for multicenter data, external validation, and larger sample sizes, our findings emphasize the potential of ML in enhancing surgical outcomes and inform future development of a web-based application for broader clinical use.

Collapse

Affiliation(s)

Stephanie Taha-Mehlitz Clarunis, University Center for Gastrointestinal and Liver Diseases, St. Clara Hospital and University Hospital Basel, 4002, Basel, Switzerland
Larissa Wentzler Medical Faculty, University Basel, 4056, Basel, Switzerland Center for Gastrointestinal and Liver Diseases, Cantonal Hospital Basel-Landschaft, 4410, Liestal, Switzerland
Fiorenzo Angehrn Clarunis, University Center for Gastrointestinal and Liver Diseases, St. Clara Hospital and University Hospital Basel, 4002, Basel, Switzerland
Ahmad Hendie Department of Computer Engineering, McGill University, Montreal, H3A 0E9, Canada
Vincent Ochs Department of Biomedical Engineering, Faculty of Medicine, University of Basel, Hegenheimermattweg 167C Allschwil, 4123, Basel, Switzerland
Julia Wolleb Department of Biomedical Engineering, Faculty of Medicine, University of Basel, Hegenheimermattweg 167C Allschwil, 4123, Basel, Switzerland
Victor E Staartjes Machine Intelligence in Clinical Neuroscience (MICN) Laboratory, Department of Neurosurgery, University Hospital Zurich, 8091, Zurich, Switzerland
Bassey Enodien Department of Surgery, GZO-Hospital, 8620, Wetzikon, Switzerland
Martinas Baltuonis Department of Surgery, Emmental Teaching Hospital, 3400, Burgdorf, Switzerland
Stephan Vorburger Department of Surgery, Emmental Teaching Hospital, 3400, Burgdorf, Switzerland
Daniel M Frey Department of Surgery, GZO-Hospital, 8620, Wetzikon, Switzerland
Robert Rosenberg Center for Gastrointestinal and Liver Diseases, Cantonal Hospital Basel-Landschaft, 4410, Liestal, Switzerland
Markus von Flüe Hirslanden Klinik St. Anna, 6006, Lucerne, Switzerland
Beat Müller-Stich Clarunis, University Center for Gastrointestinal and Liver Diseases, St. Clara Hospital and University Hospital Basel, 4002, Basel, Switzerland
Philippe C Cattin Department of Biomedical Engineering, Faculty of Medicine, University of Basel, Hegenheimermattweg 167C Allschwil, 4123, Basel, Switzerland
Anas Taha Center for Gastrointestinal and Liver Diseases, Cantonal Hospital Basel-Landschaft, 4410, Liestal, Switzerland. Department of Biomedical Engineering, Faculty of Medicine, University of Basel, Hegenheimermattweg 167C Allschwil, 4123, Basel, Switzerland. Department of Surgery, Brody School of Medicine, East Carolina University, Greenville, NC, USA.
Daniel Steinemann Clarunis, University Center for Gastrointestinal and Liver Diseases, St. Clara Hospital and University Hospital Basel, 4002, Basel, Switzerland

Collapse

Mickley JP, Grove AF, Rouzrokh P, Yang L, Larson AN, Sanchez-Sotello J, Maradit Kremers H, Wyles CC. A Stepwise Approach to Analyzing Musculoskeletal Imaging Data With Artificial Intelligence. Arthritis Care Res (Hoboken) 2024;76:590-599. [PMID: 37849415 DOI: 10.1002/acr.25260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2023] [Revised: 08/27/2023] [Accepted: 10/13/2023] [Indexed: 10/19/2023]

Böck A, Urner K, Eckert JK, Salvermoser M, Laubhahn K, Kunze S, Kumbrink J, Hoeppner MP, Kalkbrenner K, Kreimeier S, Beyer K, Hamelmann E, Kabesch M, Depner M, Hansen G, Riedler J, Roponen M, Schmausser-Hechfellner E, Barnig C, Divaret-Chauveau A, Karvonen AM, Pekkanen J, Frei R, Roduit C, Lauener R, Schaub B. An integrated molecular risk score early in life for subsequent childhood asthma risk. Clin Exp Allergy 2024;54:314-328. [PMID: 38556721 DOI: 10.1111/cea.14475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Revised: 03/05/2024] [Accepted: 03/07/2024] [Indexed: 04/02/2024]

Abstract

BACKGROUND

Numerous children present with early wheeze symptoms, yet solely a subgroup develops childhood asthma. Early identification of children at risk is key for clinical monitoring, timely patient-tailored treatment, and preventing chronic, severe sequelae. For early prediction of childhood asthma, we aimed to define an integrated risk score combining established risk factors with genome-wide molecular markers at birth, complemented by subsequent clinical symptoms/diagnoses (wheezing, atopic dermatitis, food allergy).

METHODS

Three longitudinal birth cohorts (PAULINA/PAULCHEN, n = 190 + 93 = 283, PASTURE, n = 1133) were used to predict childhood asthma (age 5-11) including epidemiological characteristics and molecular markers: genotype, DNA methylation and mRNA expression (RNASeq/NanoString). Apparent (ap) and optimism-corrected (oc) performance (AUC/R2) was assessed leveraging evidence from independent studies (Naïve-Bayes approach) combined with high-dimensional logistic regression models (LASSO).

RESULTS

Asthma prediction with epidemiological characteristics at birth (maternal asthma, sex, farm environment) yielded an ocAUC = 0.65. Inclusion of molecular markers as predictors resulted in an improvement in apparent prediction performance, however, for optimism-corrected performance only a moderate increase was observed (upto ocAUC = 0.68). The greatest discriminate power was reached by adding the first symptoms/diagnosis (up to ocAUC = 0.76; increase of 0.08, p = .002). Longitudinal analysis of selected mRNA expression in PASTURE (cord blood, 1, 4.5, 6 years) showed that expression at age six had the strongest association with asthma and correlation of genes getting larger over time (r = .59, p < .001, 4.5-6 years).

CONCLUSION

Applying epidemiological predictors alone showed moderate predictive abilities. Molecular markers from birth modestly improved prediction. Allergic symptoms/diagnoses enhanced the power of prediction, which is important for clinical practice and for the design of future studies with molecular markers.

Collapse

Affiliation(s)

Andreas Böck Pediatric Allergology, Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, LMU Munich, Munich, Germany Member of the CHildhood Allergy and Tolerance Consortium (CHAMP), LMU Munich, Munich, Germany
Kathrin Urner Pediatric Allergology, Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, LMU Munich, Munich, Germany Member of the CHildhood Allergy and Tolerance Consortium (CHAMP), LMU Munich, Munich, Germany
Jana Kristin Eckert Pediatric Allergology, Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, LMU Munich, Munich, Germany Member of the CHildhood Allergy and Tolerance Consortium (CHAMP), LMU Munich, Munich, Germany
Michael Salvermoser Pediatric Allergology, Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, LMU Munich, Munich, Germany Member of the CHildhood Allergy and Tolerance Consortium (CHAMP), LMU Munich, Munich, Germany
Kristina Laubhahn Pediatric Allergology, Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, LMU Munich, Munich, Germany Comprehensive Pneumology Center - Munich (CPC-M), German Center for Lung Research (DZL), Munich, Germany
Sonja Kunze Research Unit of Molecular Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany Institute of Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
Jörg Kumbrink Institute of Pathology, Medical Faculty, LMU Munich, Munich, Germany
Marc P Hoeppner Institute of Clinical Molecular Biology, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
Kathrin Kalkbrenner Pediatric Allergology, Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, LMU Munich, Munich, Germany Member of the CHildhood Allergy and Tolerance Consortium (CHAMP), LMU Munich, Munich, Germany
Simone Kreimeier Member of the CHildhood Allergy and Tolerance Consortium (CHAMP), LMU Munich, Munich, Germany Department of Health Economics and Health Care Management, School of Public Health, Bielefeld University, Bielefeld, Germany
Kirsten Beyer Member of the CHildhood Allergy and Tolerance Consortium (CHAMP), LMU Munich, Munich, Germany Department of Pediatric Respiratory Medicine, Immunology and Critical Care Medicine, Charité Universitätsmedizin Berlin, Berlin, Germany
Eckard Hamelmann Member of the CHildhood Allergy and Tolerance Consortium (CHAMP), LMU Munich, Munich, Germany Department for Pediatrics, Children's Center Bethel, University Hospital OWL, Bielefeld University, Bielefeld, Germany
Michael Kabesch Member of the CHildhood Allergy and Tolerance Consortium (CHAMP), LMU Munich, Munich, Germany University Children's Hospital Regensburg (KUNO), St. Hedwig's Hospital of the Order of St. John and the University of Regensburg, Regensburg, Germany
Martin Depner Member of the CHildhood Allergy and Tolerance Consortium (CHAMP), LMU Munich, Munich, Germany Institute of Asthma and Allergy Prevention, Helmholtz Zentrum München, German Research Centre for Environmental Health, Neuherberg, Germany
Gesine Hansen Member of the CHildhood Allergy and Tolerance Consortium (CHAMP), LMU Munich, Munich, Germany Department of Pediatric Pneumology, Allergology and Neonatology, Hannover Medical School, Hannover, Germany Biomedical Research in Endstage and Obstructive Lung Disease Biomedical Research in Endstage and Obstructive Lung Disease (BREATH), Member of the German Center for Lung Research (DZL), Hannover, Germany Excellence Cluster Resolving Infection Susceptibility RESIST (EXC 2155), Deutsche Forschungsgemeinschaft, Hannover Medical School, Hannover, Germany
Josef Riedler Children's Hospital Schwarzach, Schwarzach, Austria
Marjut Roponen Department of Environmental and Biological Sciences, University of Eastern Finland, Kuopio, Finland
Elisabeth Schmausser-Hechfellner Institute of Asthma and Allergy Prevention, Helmholtz Zentrum München, German Research Centre for Environmental Health, Neuherberg, Germany
Cindy Barnig Department of Respiratory Disease, University Hospital, Besanҫon, France INSERM, EFS BFC, LabEx LipSTIC, UMR1098, Interactions Hôte-Greffon-Tumeur/Ingénierie Cellulaire et Génique, Univ. Bourgogne Franche-Comté, Besançon, France
Amandine Divaret-Chauveau Pediatric Allergy Department, Children's Hospital, University Hospital of Nancy, Vandoeuvre les Nancy, France EA3450 Development, Adaptation and Handicap (devah), Pediatric Allergy Department, University of Lorraine, Nancy, France UMR/CNRS 6249 Chrono-environment, University of Franche Comté, Besançon, France
Anne M Karvonen Department of Health Security, Finnish Institute for Health and Welfare, Kuopio, Finland
Juha Pekkanen Department of Health Security, Finnish Institute for Health and Welfare, Kuopio, Finland Department of Public Health, University of Helsinki, Helsinki, Finland
Remo Frei Christine Kühne Center for Allergy Research and Education (CK-CARE), Davos, Switzerland Division of Respiratory Medicine and Allergology, Department of Paediatrics, Inselspital, University of Bern, Bern, Switzerland
Caroline Roduit Christine Kühne Center for Allergy Research and Education (CK-CARE), Davos, Switzerland Division of Respiratory Medicine and Allergology, Department of Paediatrics, Inselspital, University of Bern, Bern, Switzerland Children's Hospital of Eastern Switzerland, St. Gallen, Switzerland Children's Hospital, University of Zürich, Zürich, Switzerland
Roger Lauener Christine Kühne Center for Allergy Research and Education (CK-CARE), Davos, Switzerland Children's Hospital of Eastern Switzerland, St. Gallen, Switzerland
Bianca Schaub Pediatric Allergology, Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, LMU Munich, Munich, Germany Member of the CHildhood Allergy and Tolerance Consortium (CHAMP), LMU Munich, Munich, Germany Comprehensive Pneumology Center - Munich (CPC-M), German Center for Lung Research (DZL), Munich, Germany

Collapse

Wang S, Meng F, Chen P, Lv Y, Wu M, Tang H, Bao H, Wu X, Shao Y, Wang J, Dai J, Xu L, Wang X, Yin R. Cell-free DNA assay for malignancy classification of high-risk lung nodules. J Thorac Cardiovasc Surg 2024:S0022-5223(24)00370-2. [PMID: 38670484 DOI: 10.1016/j.jtcvs.2024.04.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Revised: 03/18/2024] [Accepted: 04/15/2024] [Indexed: 04/28/2024]

Affiliation(s)

Siwei Wang Department of Thoracic Surgery, Jiangsu Key Laboratory of Molecular and Translational Cancer Research, Nanjing Medical University Affiliated Cancer Hospital & Jiangsu Cancer Hospital & Jiangsu Institute of Cancer Research, Nanjing, Jiangsu, China; Clinical Research Institute of Traditional Chinese Medicine, Jiangsu Province Hospital of Chinese Medicine, The Affiliated Hospital of Nanjing University of Chinese Medicine, Nanjing, Jiangsu, China
Fanchen Meng Department of Thoracic Surgery, Jiangsu Key Laboratory of Molecular and Translational Cancer Research, Nanjing Medical University Affiliated Cancer Hospital & Jiangsu Cancer Hospital & Jiangsu Institute of Cancer Research, Nanjing, Jiangsu, China
Peng Chen Department of Thoracic Surgery, Jiangsu Key Laboratory of Molecular and Translational Cancer Research, Nanjing Medical University Affiliated Cancer Hospital & Jiangsu Cancer Hospital & Jiangsu Institute of Cancer Research, Nanjing, Jiangsu, China
Yang Lv Department of Information Center, The Affiliated Jiangning Hospital of Nanjing Medical University, Nanjing, Jiangsu, China
Min Wu Geneseeq Research Institute, Nanjing Geneseeq Technology Inc, Nanjing, Jiangsu, China
Haimeng Tang Geneseeq Research Institute, Nanjing Geneseeq Technology Inc, Nanjing, Jiangsu, China
Hua Bao Geneseeq Research Institute, Nanjing Geneseeq Technology Inc, Nanjing, Jiangsu, China
Xue Wu Geneseeq Research Institute, Nanjing Geneseeq Technology Inc, Nanjing, Jiangsu, China
Yang Shao Geneseeq Research Institute, Nanjing Geneseeq Technology Inc, Nanjing, Jiangsu, China; School of Public Health, Nanjing Medical University, Nanjing, Jiangsu, China
Jie Wang Department of Science and Technology, Nanjing Medical University Affiliated Cancer Hospital & Jiangsu Cancer Hospital & Jiangsu Institute of Cancer Research, Nanjing, Jiangsu, China; Biobank of Lung Cancer, Jiangsu Biobank of Clinical Resources, Nanjing, Jiangsu, China
Juncheng Dai School of Public Health, Nanjing Medical University, Nanjing, Jiangsu, China; Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, Nanjing, Jiangsu, China
Lin Xu Department of Thoracic Surgery, Jiangsu Key Laboratory of Molecular and Translational Cancer Research, Nanjing Medical University Affiliated Cancer Hospital & Jiangsu Cancer Hospital & Jiangsu Institute of Cancer Research, Nanjing, Jiangsu, China; Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, Nanjing, Jiangsu, China
Xiaoxiao Wang Clinical Research Institute of Traditional Chinese Medicine, Jiangsu Province Hospital of Chinese Medicine, The Affiliated Hospital of Nanjing University of Chinese Medicine, Nanjing, Jiangsu, China.
Rong Yin Department of Thoracic Surgery, Jiangsu Key Laboratory of Molecular and Translational Cancer Research, Nanjing Medical University Affiliated Cancer Hospital & Jiangsu Cancer Hospital & Jiangsu Institute of Cancer Research, Nanjing, Jiangsu, China; Department of Science and Technology, Nanjing Medical University Affiliated Cancer Hospital & Jiangsu Cancer Hospital & Jiangsu Institute of Cancer Research, Nanjing, Jiangsu, China; Biobank of Lung Cancer, Jiangsu Biobank of Clinical Resources, Nanjing, Jiangsu, China; Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, Nanjing, Jiangsu, China.

Collapse

Kwong JCC, Wu J, Malik S, Khondker A, Gupta N, Bodnariuc N, Narayana K, Malik M, van der Kwast TH, Johnson AEW, Zlotta AR, Kulkarni GS. Predicting non-muscle invasive bladder cancer outcomes using artificial intelligence: a systematic review using APPRAISE-AI. NPJ Digit Med 2024;7:98. [PMID: 38637674 PMCID: PMC11026453 DOI: 10.1038/s41746-024-01088-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Accepted: 03/29/2024] [Indexed: 04/20/2024] Open

Affiliation(s)

Jethro C C Kwong Division of Urology, Department of Surgery, University of Toronto, Toronto, ON, Canada Temerty Centre for AI Research and Education in Medicine, University of Toronto, Toronto, ON, Canada
Jeremy Wu Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
Shamir Malik Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
Adree Khondker Division of Urology, Department of Surgery, University of Toronto, Toronto, ON, Canada
Naveen Gupta Georgetown University School of Medicine, Georgetown University, Washington, DC, USA Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA, USA
Nicole Bodnariuc Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
Krishnateja Narayana Western University, London, ON, Canada
Mikail Malik Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
Theodorus H van der Kwast Laboratory Medicine Program, University Health Network, Princess Margaret Cancer Centre, University of Toronto, Toronto, ON, Canada
Alistair E W Johnson Temerty Centre for AI Research and Education in Medicine, University of Toronto, Toronto, ON, Canada Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
Alexandre R Zlotta Division of Urology, Department of Surgery, University of Toronto, Toronto, ON, Canada Division of Urology, Department of Surgery, Mount Sinai Hospital, Sinai Health System, Toronto, ON, Canada Division of Urology, Department of Surgery, Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
Girish S Kulkarni Division of Urology, Department of Surgery, University of Toronto, Toronto, ON, Canada. Temerty Centre for AI Research and Education in Medicine, University of Toronto, Toronto, ON, Canada. Division of Urology, Department of Surgery, Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada.

Collapse

Norris ML, Obeid N, El-Emam K. Examining the role of artificial intelligence to advance knowledge and address barriers to research in eating disorders. Int J Eat Disord 2024. [PMID: 38597344 DOI: 10.1002/eat.24215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Revised: 03/22/2024] [Accepted: 03/22/2024] [Indexed: 04/11/2024]

Čihák M, Horáková H, Vyhnálek M, Veverová K, Matušková V, Laczó J, Hort J, Nikolai T. Evaluation of Differential Diagnostics Potential of Uniform Data Set 2 Neuropsychology Battery Using Alzheimer's Disease Biomarkers. Arch Clin Neuropsychol 2024:acae028. [PMID: 38582748 DOI: 10.1093/arclin/acae028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Revised: 02/20/2024] [Accepted: 03/12/2024] [Indexed: 04/08/2024] Open

Liao C, Wang L, Quon G. Microbiome-based classification models for fresh produce safety and quality evaluation. Microbiol Spectr 2024;12:e0344823. [PMID: 38445872 PMCID: PMC10986475 DOI: 10.1128/spectrum.03448-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 02/17/2024] [Indexed: 03/07/2024] Open

Abstract

Small sample sizes and loss of sequencing reads during the microbiome data preprocessing can limit the statistical power of differentiating fresh produce phenotypes and prevent the detection of important bacterial species associated with produce contamination or quality reduction. Here, we explored a machine learning-based k-mer hash analysis strategy to identify DNA signatures predictive of produce safety (PS) and produce quality (PQ) and compared it against the amplicon sequence variant (ASV) strategy that uses a typical denoising step and ASV-based taxonomy strategy. Random forest-based classifiers for PS and PQ using 7-mer hash data sets had significantly higher classification accuracy than those using the ASV data sets. We also demonstrated that the proposed combination of integrating multiple data sets and leveraging a 7-mer hash strategy leads to better classification performance for PS and PQ compared to the ASV method but presents lower PS classification accuracy compared to the feature-selected ASV-based taxonomy strategy. Due to the current limitation of generating taxonomy using the 7-mer hash strategy, the ASV-based taxonomy strategy with remarkably less computing time and memory usage is more efficient for PS and PQ classification and applicable for important taxa identification. Results generated from this study lay the foundation for future studies that wish and need to incorporate and/or compare different microbiome sequencing data sets for the application of machine learning in the area of microbial safety and quality of food.

IMPORTANCE

Identification of generalizable indicators for produce safety (PS) and produce quality (PQ) improves the detection of produce contamination and quality decline. However, effective sequencing read loss during microbiome data preprocessing and the limited sample size of individual studies restrain statistical power to identify important features contributing to differentiating PS and PQ phenotypes. We applied machine learning-based models using individual and integrated k-mer hash and amplicon sequence variant (ASV) data sets for PS and PQ classification and evaluated their classification performance and found that random forest (RF)-based models using integrated 7-mer hash data sets achieved significantly higher PS and PQ classification accuracy. Due to the limitation of taxonomic analysis for the 7-mer hash, we also developed RF-based models using feature-selected ASV-based taxonomic data sets, which performed better PS classification than those using the integrated 7-mer hash data set. The RF feature selection method identified 480 PS indicators and 263 PQ indicators with a positive contribution to the PS and PQ classification.

Collapse

Chen X, Zeng Q, Tao L, Yuan J, Hang J, Lu G, Shao J, Li Y, Yu H. Machine Learning-Based Clinical Prediction Models for Acute Ischemic Stroke Based on Serum Xanthine Oxidase Levels. World Neurosurg 2024;184:e695-e707. [PMID: 38340801 DOI: 10.1016/j.wneu.2024.02.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Accepted: 02/04/2024] [Indexed: 02/12/2024]

Abstract

OBJECTIVE

Early prediction of the onset, progression and prognosis of acute ischemic stroke (AIS) is helpful for treatment decision-making and proactive management. Although several biomarkers have been found to predict the progression and prognosis of AIS, these biomarkers have not been widely used in routine clinical practice. Xanthine oxidase (XO) is a form of xanthine oxidoreductase (XOR), which is widespread in various organs of the human body and plays an important role in redox reactions and ischemia‒reperfusion injury. Our previous studies have shown that serum XO levels on admission have certain clinical predictive value for AIS. The purpose of this study was to utilize serum XO levels and clinical data to establish machine learning models for predicting the onset, progression, and prognosis of AIS.

METHODS

We enrolled 328 consecutive patients with AIS and 107 healthy controls from October 2020 to September 2021. Serum XO levels and stroke-related clinical data were collected. We established 5 machine learning models-the logistic regression (LR), support vector machine (SVM), decision tree, random forest, and K-nearest neighbor (KNN) models-to predict the onset, progression, and prognosis of AIS. The area under the receiver operating characteristic curve (AUROC), accuracy, sensitivity, specificity, negative predictive value, and positive predictive value were used to evaluate the predictive performance of each model.

RESULTS

Among the 5 machine learning models predicting AIS onset, the AUROC values of 4 prediction models were over 0.7, while that of the KNN model was lower (AUROC = 0.6708, 95% CI 0.576-0.765). The LR model showed the best AUROC value (AUROC = 0.9586, 95% CI 0.927-0.991). Although the 5 machine learning models showed relatively poor predictive value for the progression of AIS (all AUROCs <0.7), the LR model still showed the highest AUROC value (AUROC = 0.6543, 95% CI 0.453-0.856). We compared the value of 5 machine learning models in predicting the prognosis of AIS, and the LR model showed the best predictive value (AUROC = 0.8124, 95% CI 0.715-0.910).

CONCLUSIONS

The tested machine learning models based on serum levels of XO could predict the onset and prognosis of AIS. Among the 5 machine learning models, we found that the LR model showed the best predictive performance. Machine learning algorithms improve accuracy in the early diagnosis of AIS and can be used to make treatment decisions.

Collapse

Affiliation(s)

Xin Chen Clinical Medical College of Yangzhou University, Yangzhou, China; Department of Neuro Intensive Care Unit, Clinical Medical College of Yangzhou University, Yangzhou, China; Department of Neurology, Northern Jiangsu People's Hospital, Yangzhou, China
Qingping Zeng School of Nursing, Medical College of Yangzhou University, Yangzhou University, Yangzhou, China
Luhang Tao Clinical Medical College of Yangzhou University, Yangzhou, China; Department of Neuro Intensive Care Unit, Clinical Medical College of Yangzhou University, Yangzhou, China; Department of Neurology, Northern Jiangsu People's Hospital, Yangzhou, China
Jing Yuan Clinical Medical College of Yangzhou University, Yangzhou, China; Department of Echocardiography, Northern Jiangsu People's Hospital, Yangzhou, China
Jing Hang Clinical Medical College of Yangzhou University, Yangzhou, China; Department of Neuro Intensive Care Unit, Clinical Medical College of Yangzhou University, Yangzhou, China; Department of Neurology, Northern Jiangsu People's Hospital, Yangzhou, China
Guangyu Lu School of Public Health, Medical College of Yangzhou University, Yangzhou University, Yangzhou, China
Jun Shao Clinical Medical College of Yangzhou University, Yangzhou, China; Department of Cardiac Intensive Care Unit, Northern Jiangsu People's Hospital, Yangzhou, China
Yuping Li Clinical Medical College of Yangzhou University, Yangzhou, China; Department of Neuro Intensive Care Unit, Clinical Medical College of Yangzhou University, Yangzhou, China; Institute of Neurosurgery, Northern Jiangsu People's Hospital, Yangzhou, China
Hailong Yu Clinical Medical College of Yangzhou University, Yangzhou, China; Department of Neuro Intensive Care Unit, Clinical Medical College of Yangzhou University, Yangzhou, China; Department of Neurology, Northern Jiangsu People's Hospital, Yangzhou, China.

Collapse

Yoon J, Kim JH, Chung Y, Park J, Leigh JH, Kim SS. Change in employment status and its causal effect on suicidal ideation and depressive symptoms: A marginal structural model with machine learning algorithms. Scand J Work Environ Health 2024;50:218-227. [PMID: 38466615 PMCID: PMC11106614 DOI: 10.5271/sjweh.4150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Indexed: 03/13/2024] Open

El Emam K, Mosquera L, Fang X, El-Hussuna A. An evaluation of the replicability of analyses using synthetic health data. Sci Rep 2024;14:6978. [PMID: 38521806 PMCID: PMC10960851 DOI: 10.1038/s41598-024-57207-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2023] [Accepted: 03/15/2024] [Indexed: 03/25/2024] Open

Abstract

Synthetic data generation is being increasingly used as a privacy preserving approach for sharing health data. In addition to protecting privacy, it is important to ensure that generated data has high utility. A common way to assess utility is the ability of synthetic data to replicate results from the real data. Replicability has been defined using two criteria: (a) replicate the results of the analyses on real data, and (b) ensure valid population inferences from the synthetic data. A simulation study using three heterogeneous real-world datasets evaluated the replicability of logistic regression workloads. Eight replicability metrics were evaluated: decision agreement, estimate agreement, standardized difference, confidence interval overlap, bias, confidence interval coverage, statistical power, and precision (empirical SE). The analysis of synthetic data used a multiple imputation approach whereby up to 20 datasets were generated and the fitted logistic regression models were combined using combining rules for fully synthetic datasets. The effects of synthetic data amplification were evaluated, and two types of generative models were used: sequential synthesis using boosted decision trees and a generative adversarial network (GAN). Privacy risk was evaluated using a membership disclosure metric. For sequential synthesis, adjusted model parameters after combining at least ten synthetic datasets gave high decision and estimate agreement, low standardized difference, as well as high confidence interval overlap, low bias, the confidence interval had nominal coverage, and power close to the nominal level. Amplification had only a marginal benefit. Confidence interval coverage from a single synthetic dataset without applying combining rules were erroneous, and statistical power, as expected, was artificially inflated when amplification was used. Sequential synthesis performed considerably better than the GAN across multiple datasets. Membership disclosure risk was low for all datasets and models. For replicable results, the statistical analysis of fully synthetic data should be based on at least ten generated datasets of the same size as the original whose analyses results are combined. Analysis results from synthetic data without applying combining rules can be misleading. Replicability results are dependent on the type of generative model used, with our study suggesting that sequential synthesis has good replicability characteristics for common health research workloads.

Collapse

Plurphanswat N, Selya A, Rodu B. Questionable Effects of Electronic Cigarette Use on Cardiovascular Diseases From the National Health Interview Survey (NHIS, 2014-2021). Cureus 2024;16:e57119. [PMID: 38681373 PMCID: PMC11055619 DOI: 10.7759/cureus.57119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/27/2024] [Indexed: 05/01/2024] Open

Ren X, Mi Z, Georgopoulos PG. Socioexposomics of COVID-19 across New Jersey: a comparison of geostatistical and machine learning approaches. JOURNAL OF EXPOSURE SCIENCE & ENVIRONMENTAL EPIDEMIOLOGY 2024;34:197-207. [PMID: 36725924 PMCID: PMC9889956 DOI: 10.1038/s41370-023-00518-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Revised: 12/29/2022] [Accepted: 01/06/2023] [Indexed: 06/18/2023]

Abstract

BACKGROUND

Disparities in adverse COVID-19 health outcomes have been associated with multiple social and environmental stressors. However, research is needed to evaluate the consistency and efficiency of methods for studying these associations at local scales.

OBJECTIVE

To assess socioexposomic associations with COVID-19 outcomes across New Jersey and evaluate consistency of findings from multiple modeling approaches.

METHODS

We retrieved data for COVID-19 cases and deaths for the 565 municipalities of New Jersey up to the end of the first phase of the pandemic, and calculated mortality rates with and without long-term-care (LTC) facility deaths. We considered 84 spatially heterogeneous environmental, demographic and socioeconomic factors from publicly available databases, including air pollution, proximity to industrial sites/facilities, transportation-related noise, occupation and commuting, neighborhood and housing characteristics, age structure, racial/ethnic composition, poverty, etc. Six geostatistical models (Poisson/Negative-Binomial regression, Poison/Negative-Binomial mixed effect model, Poisson/Negative-Binomial Bersag-York-Mollie spatial model) and two Machine Learning (ML) methods (Random Forest, Extreme Gradient Boosting) were implemented to assess association patterns. The Shapley effects plot was established for explainable ML and change of support validation was introduced to compare performances of different approaches.

RESULTS

We found robust positive associations of COVID-19 mortality with historic exposures to NO2, population density, percentage of minority and below high school education, and other social and environmental factors. Exclusion of LTC deaths does not significantly affect correlations for most factors but findings can be substantially influenced by model structures and assumptions. The best performing geostatistical models involved flexible structures representing data variations. ML methods captured association patterns consistent with the best performing geostatistical models, and furthermore detected consistent nonlinear associations not captured by geostatistical models.

SIGNIFICANCE

The findings of this work improve the understanding of how social and environmental disparities impacted COVID-19 outcomes across New Jersey.

Collapse

Aghakhani A, Yousefi M, Yekaninejad MS. Machine Learning Models for Predicting Sudden Sensorineural Hearing Loss Outcome: A Systematic Review. Ann Otol Rhinol Laryngol 2024;133:268-276. [PMID: 37864312 DOI: 10.1177/00034894231206902] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2023]

Ratano D, Zhang B, Dianti J, Georgopoulos D, Brochard LJ, Chan TCY, Goligher EC. Lung- and diaphragm-protective strategies in acute respiratory failure: an in silico trial. Intensive Care Med Exp 2024;12:20. [PMID: 38416269 PMCID: PMC10902250 DOI: 10.1186/s40635-024-00606-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Accepted: 02/21/2024] [Indexed: 02/29/2024] Open

Abstract

BACKGROUND

Lung- and diaphragm-protective (LDP) ventilation may prevent diaphragm atrophy and patient self-inflicted lung injury in acute respiratory failure, but feasibility is uncertain. The objectives of this study were to estimate the proportion of patients achieving LDP targets in different modes of ventilation, and to identify predictors of need for extracorporeal carbon dioxide removal (ECCO2R) to achieve LDP targets.

METHODS

An in silico clinical trial was conducted using a previously published mathematical model of patient-ventilator interaction in a simulated patient population (n = 5000) with clinically relevant physiological characteristics. Ventilation and sedation were titrated according to a pre-defined algorithm in pressure support ventilation (PSV) and proportional assist ventilation (PAV+) modes, with or without adjunctive ECCO2R, and using ECCO2R alone (without ventilation or sedation). Random forest modelling was employed to identify patient-level factors associated with achieving targets.

RESULTS

After titration, the proportion of patients achieving targets was lower in PAV+ vs. PSV (37% vs. 43%, odds ratio 0.78, 95% CI 0.73-0.85). Adjunctive ECCO2R substantially increased the probability of achieving targets in both PSV and PAV+ (85% vs. 84%). ECCO2R alone without ventilation or sedation achieved LDP targets in 9%. The main determinants of success without ECCO2R were lung compliance, ventilatory ratio, and strong ion difference. In silico trial results corresponded closely with the results obtained in a clinical trial of the LDP titration algorithm (n = 30).

CONCLUSIONS

In this in silico trial, many patients required ECCO2R in combination with mechanical ventilation and sedation to achieve LDP targets. ECCO2R increased the probability of achieving LDP targets in patients with intermediate degrees of derangement in elastance and ventilatory ratio.

Collapse

Pareek A, Ro DH, Karlsson J, Martin RK. Machine learning/artificial intelligence in sports medicine: state of the art and future directions. J ISAKOS 2024:S2059-7754(24)00013-0. [PMID: 38336099 DOI: 10.1016/j.jisako.2024.01.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Revised: 12/30/2023] [Accepted: 01/25/2024] [Indexed: 02/12/2024]

Ferrante FJ, Migeot J, Birba A, Amoruso L, Pérez G, Hesse E, Tagliazucchi E, Estienne C, Serrano C, Slachevsky A, Matallana D, Reyes P, Ibáñez A, Fittipaldi S, Campo CG, García AM. Multivariate word properties in fluency tasks reveal markers of Alzheimer's dementia. Alzheimers Dement 2024;20:925-940. [PMID: 37823470 PMCID: PMC10916979 DOI: 10.1002/alz.13472] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Revised: 08/15/2023] [Accepted: 08/20/2023] [Indexed: 10/13/2023]

Affiliation(s)

Franco J. Ferrante Centro de Neurociencias CognitivasUniversidad de San AndrésVictoriaProvincia de Buenos AiresArgentina Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET)Ciudad Autónoma de Buenos AiresArgentina Facultad de IngenieríaUniversidad de Buenos Aires (FIUBA)CABAArgentina
Joaquín Migeot Latin American Brain Health (BrainLat) InstituteUniversidad Adolfo IbáñezPeñalolénRegión MetropolitanaChile Center for Social and Cognitive Neuroscience (CSCN)School of PsychologyUniversidad Adolfo IbáñezLas CondesChile
Agustina Birba Centro de Neurociencias CognitivasUniversidad de San AndrésVictoriaProvincia de Buenos AiresArgentina Instituto Universitario de NeurocienciaUniversidad de La LagunaLa LagunaTenerifeEspaña Cognitive Department of PsychologyUniversidad de La LagunaLa LagunaTenerifeEspaña
Lucía Amoruso Centro de Neurociencias CognitivasUniversidad de San AndrésVictoriaProvincia de Buenos AiresArgentina Basque Center on Cognition Brain and Language (BCBL)San SebastiánGipuzkoaEspaña IkerbasqueBasque Foundation for ScienceBilbaoSpain
Gonzalo Pérez Centro de Neurociencias CognitivasUniversidad de San AndrésVictoriaProvincia de Buenos AiresArgentina Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET)Ciudad Autónoma de Buenos AiresArgentina Facultad de IngenieríaUniversidad de Buenos Aires (FIUBA)CABAArgentina
Eugenia Hesse Centro de Neurociencias CognitivasUniversidad de San AndrésVictoriaProvincia de Buenos AiresArgentina Departamento de Matemática y CienciasUniversidad de San AndrésVictoriaProvincia de Buenos AiresArgentina
Enzo Tagliazucchi Latin American Brain Health (BrainLat) InstituteUniversidad Adolfo IbáñezPeñalolénRegión MetropolitanaChile Departamento de FísicaUniversidad de Buenos Aires and Instituto de Física de Buenos Aires (IFIBA‐CONICET)CABAArgentina
Claudio Estienne Instituto de Ingeniería BiomédicaUniversidad de Buenos AiresBuenos AiresArgentina
Cecilia Serrano Unidad de Neurología CognitivaHospital César MilsteinCABAArgentina
Andrea Slachevsky Neuropsychology and Clinical Neuroscience Laboratory (LANNEC)Physiopathology Department ‐ ICBMNeurocience and East Neuroscience DepartmentsFaculty of MedicineUniversity of ChileProvidenciaSantiagoChile Geroscience Center for Brain Health and Metabolism (GERO)Faculty of MedicineUniversity of ChileProvidenciaSantiagoChile Memory and Neuropsychiatric Clinic (CMYN) Neurology DepartmentHospital del Salvador and Faculty of MedicineUniversity of ChileProvidenciaSantiagoChile Servicio de NeurologíaDepartamento de MedicinaClínica Alemana‐Universidad del DesarrolloLas CondesRegión MetropolitanaChile
Diana Matallana Instituto de EnvejecimientoDepartment of PsychiatrySchool of MedicinePontifical Xaverian UniversityBogotáColombia Department of Mental HealthHospital Universitario Santa Fe de BogotáBogotáColombia
Pablo Reyes Centro de Memoria y CogniciónIntellectus‐Hospital Universitario San IgnacioBogotáColombia Pontificia Universidad JaverianaDepartments of PhysiologyPsychiatry and Aging InstituteBogotáColombia
Agustín Ibáñez Centro de Neurociencias CognitivasUniversidad de San AndrésVictoriaProvincia de Buenos AiresArgentina Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET)Ciudad Autónoma de Buenos AiresArgentina Latin American Brain Health (BrainLat) InstituteUniversidad Adolfo IbáñezPeñalolénRegión MetropolitanaChile Global Brain Health Institute, University of California San Francisco, San Francisco, California, USATrinity College DublinDublinIreland
Sol Fittipaldi Centro de Neurociencias CognitivasUniversidad de San AndrésVictoriaProvincia de Buenos AiresArgentina Latin American Brain Health (BrainLat) InstituteUniversidad Adolfo IbáñezPeñalolénRegión MetropolitanaChile Global Brain Health Institute, University of California San Francisco, San Francisco, California, USATrinity College DublinDublinIreland
Cecilia Gonzalez Campo Centro de Neurociencias CognitivasUniversidad de San AndrésVictoriaProvincia de Buenos AiresArgentina Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET)Ciudad Autónoma de Buenos AiresArgentina
Adolfo M. García Centro de Neurociencias CognitivasUniversidad de San AndrésVictoriaProvincia de Buenos AiresArgentina Latin American Brain Health (BrainLat) InstituteUniversidad Adolfo IbáñezPeñalolénRegión MetropolitanaChile Global Brain Health Institute, University of California San Francisco, San Francisco, California, USATrinity College DublinDublinIreland Departamento de Lingüística y LiteraturaFacultad de HumanidadesUniversidad de Santiago de ChileEstación CentralSantiagoChile

Collapse

Billichová M, Coan LJ, Czanner S, Kováčová M, Sharifian F, Czanner G. Comparing the performance of statistical, machine learning, and deep learning algorithms to predict time-to-event: A simulation study for conversion to mild cognitive impairment. PLoS One 2024;19:e0297190. [PMID: 38252622 PMCID: PMC10802955 DOI: 10.1371/journal.pone.0297190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Accepted: 01/01/2024] [Indexed: 01/24/2024] Open

Abstract

Mild Cognitive Impairment (MCI) is a condition characterized by a decline in cognitive abilities, specifically in memory, language, and attention, that is beyond what is expected due to normal aging. Detection of MCI is crucial for providing appropriate interventions and slowing down the progression of dementia. There are several automated predictive algorithms for prediction using time-to-event data, but it is not clear which is best to predict the time to conversion to MCI. There is also confusion if algorithms with fewer training weights are less accurate. We compared three algorithms, from smaller to large numbers of training weights: a statistical predictive model (Cox proportional hazards model, CoxPH), a machine learning model (Random Survival Forest, RSF), and a deep learning model (DeepSurv). To compare the algorithms under different scenarios, we created a simulated dataset based on the Alzheimer NACC dataset. We found that the CoxPH model was among the best-performing models, in all simulated scenarios. In a larger sample size (n = 6,000), the deep learning algorithm (DeepSurv) exhibited comparable accuracy (73.1%) to the CoxPH model (73%). In the past, ignoring heterogeneity in the CoxPH model led to the conclusion that deep learning methods are superior. We found that when using the CoxPH model with heterogeneity, its accuracy is comparable to that of DeepSurv and RSF. Furthermore, when unobserved heterogeneity is present, such as missing features in the training, all three models showed a similar drop in accuracy. This simulation study suggests that in some applications an algorithm with a smaller number of training weights is not disadvantaged in terms of accuracy. Since algorithms with fewer weights are inherently easier to explain, this study can help artificial intelligence research develop a principled approach to comparing statistical, machine learning, and deep learning algorithms for time-to-event predictions.

Collapse

de Hond A, van Buchem M, Fanconi C, Roy M, Blayney D, Kant I, Steyerberg E, Hernandez-Boussard T. Predicting Depression Risk in Patients With Cancer Using Multimodal Data: Algorithm Development Study. JMIR Med Inform 2024;12:e51925. [PMID: 38236635 PMCID: PMC10835583 DOI: 10.2196/51925] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Revised: 11/11/2023] [Accepted: 12/08/2023] [Indexed: 01/19/2024] Open

Abstract

BACKGROUND

Patients with cancer starting systemic treatment programs, such as chemotherapy, often develop depression. A prediction model may assist physicians and health care workers in the early identification of these vulnerable patients.

OBJECTIVE

This study aimed to develop a prediction model for depression risk within the first month of cancer treatment.

METHODS

We included 16,159 patients diagnosed with cancer starting chemo- or radiotherapy treatment between 2008 and 2021. Machine learning models (eg, least absolute shrinkage and selection operator [LASSO] logistic regression) and natural language processing models (Bidirectional Encoder Representations from Transformers [BERT]) were used to develop multimodal prediction models using both electronic health record data and unstructured text (patient emails and clinician notes). Model performance was assessed in an independent test set (n=5387, 33%) using area under the receiver operating characteristic curve (AUROC), calibration curves, and decision curve analysis to assess initial clinical impact use.

RESULTS

Among 16,159 patients, 437 (2.7%) received a depression diagnosis within the first month of treatment. The LASSO logistic regression models based on the structured data (AUROC 0.74, 95% CI 0.71-0.78) and structured data with email classification scores (AUROC 0.74, 95% CI 0.71-0.78) had the best discriminative performance. The BERT models based on clinician notes and structured data with email classification scores had AUROCs around 0.71. The logistic regression model based on email classification scores alone performed poorly (AUROC 0.54, 95% CI 0.52-0.56), and the model based solely on clinician notes had the worst performance (AUROC 0.50, 95% CI 0.49-0.52). Calibration was good for the logistic regression models, whereas the BERT models produced overly extreme risk estimates even after recalibration. There was a small range of decision thresholds for which the best-performing model showed promising clinical effectiveness use. The risks were underestimated for female and Black patients.

CONCLUSIONS

The results demonstrated the potential and limitations of machine learning and multimodal models for predicting depression risk in patients with cancer. Future research is needed to further validate these models, refine the outcome label and predictors related to mental health, and address biases across subgroups.

Collapse

Affiliation(s)

Anne de Hond Clinical AI Implementation and Research Lab, Leiden University Medical Centre, Leiden, Netherlands Department of Biomedical Data Sciences, Leiden University Medical Centre, Leiden, Netherlands Department of Medicine (Biomedical Informatics), Stanford Medicine, Stanford University, Stanford, CA, United States
Marieke van Buchem Clinical AI Implementation and Research Lab, Leiden University Medical Centre, Leiden, Netherlands Department of Biomedical Data Sciences, Leiden University Medical Centre, Leiden, Netherlands Department of Medicine (Biomedical Informatics), Stanford Medicine, Stanford University, Stanford, CA, United States
Claudio Fanconi Department of Medicine (Biomedical Informatics), Stanford Medicine, Stanford University, Stanford, CA, United States Department of Electrical Engineering and Information Technology, ETH Zürich, Zürich, Switzerland
Mohana Roy Department of Medical Oncology, Stanford Medicine, Stanford University, Stanford, CA, United States
Douglas Blayney Department of Medical Oncology, Stanford Medicine, Stanford University, Stanford, CA, United States
Ilse Kant Clinical AI Implementation and Research Lab, Leiden University Medical Centre, Leiden, Netherlands Department of Digital Health, University Medical Centre Utrecht, Utrecht, Netherlands
Ewout Steyerberg Clinical AI Implementation and Research Lab, Leiden University Medical Centre, Leiden, Netherlands Department of Biomedical Data Sciences, Leiden University Medical Centre, Leiden, Netherlands
Tina Hernandez-Boussard Department of Medicine (Biomedical Informatics), Stanford Medicine, Stanford University, Stanford, CA, United States Department of Biomedical Data Science, Stanford University, Stanford, CA, United States Department of Epidemiology & Population Health (by courtesy), Stanford University, Stanford, CA, United States

Collapse

Koru G, Zhang Y, Felix H. Identifying the process and agency characteristics associated with poor utilization outcomes in home healthcare. Home Health Care Serv Q 2024:1-15. [PMID: 38230702 DOI: 10.1080/01621424.2024.2305933] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2024]

Huang HY, Lin SP, Wang HY, Liou JY, Chang WK, Ting CK. Logistic Regression Is Non-Inferior to the Response Surface Model in Patient Response Prediction of Video-Assisted Thoracoscopic Surgery. Pharmaceuticals (Basel) 2024;17:95. [PMID: 38256927 PMCID: PMC10819298 DOI: 10.3390/ph17010095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Revised: 12/24/2023] [Accepted: 01/09/2024] [Indexed: 01/24/2024] Open

Abstract

Response surface models (RSMs) are a new trend in modern anesthesia. RSMs have demonstrated significant applicability in the field of anesthesia. However, the comparative analysis between RSMs and logistic regression (LR) in different surgeries remains relatively limited in the current literature. We hypothesized that using a total intravenous anesthesia (TIVA) technique with the response surface model (RSM) and logistic regression (LR) would predict the emergence from anesthesia in patients undergoing video-assisted thoracotomy surgery (VATS). This study aimed to prove that LR, like the RSM, can be used to improve patient safety and achieve enhanced recovery after surgery (ERAS). This was a prospective, observational study with data reanalysis. Twenty-nine patients (American Society of Anesthesiologists (ASA) class II and III) who underwent VATS for elective pulmonary or mediastinal surgery under TIVA were enrolled. We monitored the emergence from anesthesia, and the precise time point of regained response (RR) was noted. The influence of varying concentrations was examined and incorporated into both the RSM and LR. The receiver operating characteristic (ROC) curve area for Greco and LR models was 0.979 (confidence interval: 0.987 to 0.990) and 0.989 (confidence interval: 0.989 to 0.990), respectively. The two models had no significant differences in predicting the probability of regaining response. In conclusion, the LR model was effective and can be applied to patients undergoing VATS or other procedures of similar modalities. Furthermore, the RSM is significantly more sophisticated and has an accuracy similar to that of the LR model; however, the LR model is more accessible. Therefore, the LR model is a simpler tool for predicting arousal in patients undergoing VATS under TIVA with Remifentanil and Propofol.

Collapse

Li J, Hao Y, Liu Y, Wu L, Liang H, Ni L, Wang F, Wang S, Duan Y, Xu Q, Xiao J, Yang D, Gao G, Ding Y, Gao C, Xiao J, Zhao H. Supervised machine learning algorithms to predict the duration and risk of long-term hospitalization in HIV-infected individuals: a retrospective study. Front Public Health 2024;11:1282324. [PMID: 38249414 PMCID: PMC10796994 DOI: 10.3389/fpubh.2023.1282324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Accepted: 12/13/2023] [Indexed: 01/23/2024] Open

Affiliation(s)

Jialu Li Clinical and Research Center of AIDS, Beijing Ditan Hospital, Capital Medical University, Beijing, China
Yiwei Hao Division of Medical Record and Statistics, Beijing Ditan Hospital, Capital Medical University, Beijing, China
Ying Liu Clinical and Research Center of AIDS, Beijing Ditan Hospital, Capital Medical University, Beijing, China
Liang Wu Clinical and Research Center of AIDS, Beijing Ditan Hospital, Capital Medical University, Beijing, China
Hongyuan Liang Clinical and Research Center of AIDS, Beijing Ditan Hospital, Capital Medical University, Beijing, China
Liang Ni Clinical and Research Center of AIDS, Beijing Ditan Hospital, Capital Medical University, Beijing, China
Fang Wang Clinical and Research Center of AIDS, Beijing Ditan Hospital, Capital Medical University, Beijing, China
Sa Wang Clinical and Research Center of AIDS, Beijing Ditan Hospital, Capital Medical University, Beijing, China
Yujiao Duan Clinical and Research Center of AIDS, Beijing Ditan Hospital, Capital Medical University, Beijing, China
Qiuhua Xu Clinical and Research Center of AIDS, Beijing Ditan Hospital, Capital Medical University, Beijing, China
Jinjing Xiao Department of Clinical Medicine, Zhengzhou University, Zhengzhou, China
Di Yang Clinical and Research Center of AIDS, Beijing Ditan Hospital, Capital Medical University, Beijing, China
Guiju Gao Clinical and Research Center of AIDS, Beijing Ditan Hospital, Capital Medical University, Beijing, China
Yi Ding Clinical and Research Center of AIDS, Beijing Ditan Hospital, Capital Medical University, Beijing, China
Chengyu Gao Clinical and Research Center of AIDS, Beijing Ditan Hospital, Capital Medical University, Beijing, China
Jiang Xiao Clinical and Research Center of AIDS, Beijing Ditan Hospital, Capital Medical University, Beijing, China
Hongxin Zhao Clinical and Research Center of AIDS, Beijing Ditan Hospital, Capital Medical University, Beijing, China

Collapse

Grabman JH, Dodson CS. Unskilled, underperforming, or unaware? Testing three accounts of individual differences in metacognitive monitoring. Cognition 2024;242:105659. [PMID: 37939445 DOI: 10.1016/j.cognition.2023.105659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Revised: 10/26/2023] [Accepted: 10/31/2023] [Indexed: 11/10/2023]

Arina P, Kaczorek MR, Hofmaenner DA, Pisciotta W, Refinetti P, Singer M, Mazomenos EB, Whittle J. Prediction of Complications and Prognostication in Perioperative Medicine: A Systematic Review and PROBAST Assessment of Machine Learning Tools. Anesthesiology 2024;140:85-101. [PMID: 37944114 DOI: 10.1097/aln.0000000000004764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2023]

Abstract

BACKGROUND

The utilization of artificial intelligence and machine learning as diagnostic and predictive tools in perioperative medicine holds great promise. Indeed, many studies have been performed in recent years to explore the potential. The purpose of this systematic review is to assess the current state of machine learning in perioperative medicine, its utility in prediction of complications and prognostication, and limitations related to bias and validation.

METHODS

A multidisciplinary team of clinicians and engineers conducted a systematic review using the Preferred Reporting Items for Systematic Review and Meta-Analysis (PRISMA) protocol. Multiple databases were searched, including Scopus, Cumulative Index to Nursing and Allied Health Literature (CINAHL), the Cochrane Library, PubMed, Medline, Embase, and Web of Science. The systematic review focused on study design, type of machine learning model used, validation techniques applied, and reported model performance on prediction of complications and prognostication. This review further classified outcomes and machine learning applications using an ad hoc classification system. The Prediction model Risk Of Bias Assessment Tool (PROBAST) was used to assess risk of bias and applicability of the studies.

RESULTS

A total of 103 studies were identified. The models reported in the literature were primarily based on single-center validations (75%), with only 13% being externally validated across multiple centers. Most of the mortality models demonstrated a limited ability to discriminate and classify effectively. The PROBAST assessment indicated a high risk of systematic errors in predicted outcomes and artificial intelligence or machine learning applications.

CONCLUSIONS

The findings indicate that the development of this field is still in its early stages. This systematic review indicates that application of machine learning in perioperative medicine is still at an early stage. While many studies suggest potential utility, several key challenges must be first overcome before their introduction into clinical practice.

EDITOR’S PERSPECTIVE

Collapse

Shojaee-Mend H, Velayati F, Tayefi B, Babaee E. Prediction of Diabetes Using Data Mining and Machine Learning Algorithms: A Cross-Sectional Study. Healthc Inform Res 2024;30:73-82. [PMID: 38359851 PMCID: PMC10879823 DOI: 10.4258/hir.2024.30.1.73] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2023] [Revised: 01/24/2024] [Accepted: 01/24/2024] [Indexed: 02/17/2024] Open

Gerry S, Collins GS. A prediction model for venous thromboembolism in the intensive care unit: flawed methods may lead to inaccurate predictions. Crit Care 2023;27:495. [PMID: 38111055 PMCID: PMC10726531 DOI: 10.1186/s13054-023-04778-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Accepted: 12/12/2023] [Indexed: 12/20/2023] Open

Hsiung SY, Deng SX, Li J, Huang SY, Liaw CK, Huang SY, Wang CC, Hsieh YSY. Machine learning-based monosaccharide profiling for tissue-specific classification of Wolfiporia extensa samples. Carbohydr Polym 2023;322:121338. [PMID: 37839831 DOI: 10.1016/j.carbpol.2023.121338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Revised: 08/09/2023] [Accepted: 08/26/2023] [Indexed: 10/17/2023]

Nickson D, Singmann H, Meyer C, Toro C, Walasek L. Replicability and reproducibility of predictive models for diagnosis of depression among young adults using Electronic Health Records. Diagn Progn Res 2023;7:25. [PMID: 38049919 PMCID: PMC10696659 DOI: 10.1186/s41512-023-00160-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Accepted: 10/10/2023] [Indexed: 12/06/2023] Open

Abstract

BACKGROUND

Recent advances in machine learning combined with the growing availability of digitized health records offer new opportunities for improving early diagnosis of depression. An emerging body of research shows that Electronic Health Records can be used to accurately predict cases of depression on the basis of individual's primary care records. The successes of these studies are undeniable, but there is a growing concern that their results may not be replicable, which could cast doubt on their clinical usefulness.

METHODS

To address this issue in the present paper, we set out to reproduce and replicate the work by Nichols et al. (2018), who trained predictive models of depression among young adults using Electronic Healthcare Records. Our contribution consists of three parts. First, we attempt to replicate the methodology used by the original authors, acquiring a more up-to-date set of primary health care records to the same specification and reproducing their data processing and analysis. Second, we test models presented in the original paper on our own data, thus providing out-of-sample prediction of the predictive models. Third, we extend past work by considering several novel machine-learning approaches in an attempt to improve the predictive accuracy achieved in the original work.

RESULTS

In summary, our results demonstrate that the work of Nichols et al. is largely reproducible and replicable. This was the case both for the replication of the original model and the out-of-sample replication applying NRCBM coefficients to our new EHRs data. Although alternative predictive models did not improve model performance over standard logistic regression, our results indicate that stepwise variable selection is not stable even in the case of large data sets.

CONCLUSION

We discuss the challenges associated with the research on mental health and Electronic Health Records, including the need to produce interpretable and robust models. We demonstrated some potential issues associated with the reliance on EHRs, including changes in the regulations and guidelines (such as the QOF guidelines in the UK) and reliance on visits to GP as a predictor of specific disorders.

Collapse

Park YW, Han K, Kim SH. Response to "Medical Statistics Unlock the Gateway to Further Research: Using Deep Learning to Predict CDKN2A/B Homozygous Deletion in Isocitrate Dehydrogenase-Mutant Astrocytoma". Korean J Radiol 2023;24:1306-1308. [PMID: 38016689 DOI: 10.3348/kjr.2023.0940] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2023] [Accepted: 09/25/2023] [Indexed: 11/30/2023] Open

Ledger A, Ceusters J, Valentin L, Testa A, Van Holsbeke C, Franchi D, Bourne T, Froyman W, Timmerman D, Van Calster B. Multiclass risk models for ovarian malignancy: an illustration of prediction uncertainty due to the choice of algorithm. BMC Med Res Methodol 2023;23:276. [PMID: 38001421 PMCID: PMC10668424 DOI: 10.1186/s12874-023-02103-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Accepted: 11/14/2023] [Indexed: 11/26/2023] Open

Abstract

BACKGROUND

Assessing malignancy risk is important to choose appropriate management of ovarian tumors. We compared six algorithms to estimate the probabilities that an ovarian tumor is benign, borderline malignant, stage I primary invasive, stage II-IV primary invasive, or secondary metastatic.

METHODS

This retrospective cohort study used 5909 patients recruited from 1999 to 2012 for model development, and 3199 patients recruited from 2012 to 2015 for model validation. Patients were recruited at oncology referral or general centers and underwent an ultrasound examination and surgery ≤ 120 days later. We developed models using standard multinomial logistic regression (MLR), Ridge MLR, random forest (RF), XGBoost, neural networks (NN), and support vector machines (SVM). We used nine clinical and ultrasound predictors but developed models with or without CA125.

RESULTS

Most tumors were benign (3980 in development and 1688 in validation data), secondary metastatic tumors were least common (246 and 172). The c-statistic (AUROC) to discriminate benign from any type of malignant tumor ranged from 0.89 to 0.92 for models with CA125, from 0.89 to 0.91 for models without. The multiclass c-statistic ranged from 0.41 (SVM) to 0.55 (XGBoost) for models with CA125, and from 0.42 (SVM) to 0.51 (standard MLR) for models without. Multiclass calibration was best for RF and XGBoost. Estimated probabilities for a benign tumor in the same patient often differed by more than 0.2 (20% points) depending on the model. Net Benefit for diagnosing malignancy was similar for algorithms at the commonly used 10% risk threshold, but was slightly higher for RF at higher thresholds. Comparing models, between 3% (XGBoost vs. NN, with CA125) and 30% (NN vs. SVM, without CA125) of patients fell on opposite sides of the 10% threshold.

CONCLUSION

Although several models had similarly good performance, individual probability estimates varied substantially.

Collapse

Affiliation(s)

Ashleigh Ledger Department of Development and Regeneration, KU Leuven, Herestraat 49 box 805, Leuven, 3000, Belgium
Jolien Ceusters Department of Development and Regeneration, KU Leuven, Herestraat 49 box 805, Leuven, 3000, Belgium Department of Oncology, Leuven Cancer Institute, Laboratory of Tumor Immunology and Immunotherapy, KU Leuven, Leuven, Belgium
Lil Valentin Department of Obstetrics and Gynecology, Skåne University Hospital, Malmö, Sweden Department of Clinical Sciences Malmö, Lund University, Malmö, Sweden
Antonia Testa Department of Woman, Child and Public Health, Fondazione Policlinico Universitario A. Gemelli IRCCS, Rome, Italy Dipartimento Universitario Scienze della Vita e Sanità Pubblica, Università Cattolica del Sacro Cuore, Rome, Italy
Caroline Van Holsbeke Department of Obstetrics and Gynecology, Ziekenhuis Oost-Limburg, Genk, Belgium
Dorella Franchi Preventive Gynecology Unit, Division of Gynecology, European Institute of Oncology IRCCS, Milan, Italy
Tom Bourne Department of Development and Regeneration, KU Leuven, Herestraat 49 box 805, Leuven, 3000, Belgium Department of Obstetrics and Gynecology, University Hospitals Leuven, Leuven, Belgium Queen Charlotte's and Chelsea Hospital, Imperial College, London, UK
Wouter Froyman Department of Development and Regeneration, KU Leuven, Herestraat 49 box 805, Leuven, 3000, Belgium Department of Obstetrics and Gynecology, University Hospitals Leuven, Leuven, Belgium
Dirk Timmerman Department of Development and Regeneration, KU Leuven, Herestraat 49 box 805, Leuven, 3000, Belgium Department of Obstetrics and Gynecology, University Hospitals Leuven, Leuven, Belgium
Ben Van Calster Department of Development and Regeneration, KU Leuven, Herestraat 49 box 805, Leuven, 3000, Belgium. Department of Biomedical Data Sciences, Leiden University Medical Centre (LUMC), Leiden, Netherlands. Leuven Unit for Health Technology Assessment Research (LUHTAR), KU Leuven, Leuven, Belgium.

Collapse

Moon SJ, Lee S, Hwang J, Lee J, Kang S, Cha HS. Performances of machine learning algorithms in discriminating sacroiliitis features on MRI: a systematic review. RMD Open 2023;9:e003783. [PMID: 37996126 PMCID: PMC10668284 DOI: 10.1136/rmdopen-2023-003783] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Accepted: 11/08/2023] [Indexed: 11/25/2023] Open

Buick JE, Austin PC, Cheskes S, Ko DT, Atzema CL. Prediction models in prehospital and emergency medicine research: How to derive and internally validate a clinical prediction model. Acad Emerg Med 2023;30:1150-1160. [PMID: 37266925 DOI: 10.1111/acem.14756] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2023] [Revised: 05/24/2023] [Accepted: 05/29/2023] [Indexed: 06/03/2023]

Zhang YF, Shen YJ, Huang Q, Wu CP, Zhou L, Ren HL. Predicting survival of advanced laryngeal squamous cell carcinoma: comparison of machine learning models and Cox regression models. Sci Rep 2023;13:18498. [PMID: 37898687 PMCID: PMC10613248 DOI: 10.1038/s41598-023-45831-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Accepted: 10/24/2023] [Indexed: 10/30/2023] Open

Mohsen F, Al-Absi HRH, Yousri NA, El Hajj N, Shah Z. A scoping review of artificial intelligence-based methods for diabetes risk prediction. NPJ Digit Med 2023;6:197. [PMID: 37880301 PMCID: PMC10600138 DOI: 10.1038/s41746-023-00933-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2023] [Accepted: 09/25/2023] [Indexed: 10/27/2023] Open

Macken AA, Macken LC, Oosterhoff JHF, Boileau P, Athwal GS, Doornberg JN, Lafosse L, Lafosse T, van den Bekerom MPJ, Buijze GA. Developing a machine learning algorithm to predict the probability of aseptic loosening of the glenoid component after anatomical total shoulder arthroplasty: protocol for a retrospective, multicentre study. BMJ Open 2023;13:e074700. [PMID: 37852772 PMCID: PMC10603397 DOI: 10.1136/bmjopen-2023-074700] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Accepted: 09/25/2023] [Indexed: 10/20/2023] Open

Abstract

INTRODUCTION

Despite technological advancements in recent years, glenoid component loosening remains a common complication after anatomical total shoulder arthroplasty (ATSA) and is one of the main causes of revision surgery. Increasing emphasis is placed on the prevention of glenoid component failure. Previous studies have successfully predicted range of motion, patient-reported outcomes and short-term complications after ATSA using machine learning methods, but an accurate predictive model for (glenoid component) revision is currently lacking. This study aims to use a large international database to accurately predict aseptic loosening of the glenoid component after ATSA using machine learning algorithms.

METHODS AND ANALYSIS

For this multicentre, retrospective study, individual patient data will be compiled from previously published studies reporting revision of ATSA. A systematic literature search will be performed in Medline (PubMed) identifying all studies reporting outcomes of ATSA. Authors will be contacted and invited to participate in the Machine Learning Consortium by sharing their anonymised databases. All databases reporting revisions after ATSA will be included, and individual patients with a follow-up less than 2 years or a fracture as the indication for ATSA will be excluded. First, features (predictive variables) will be identified using a random forest feature selection. The resulting features from the compiled database will be used to train various machine learning algorithms (stochastic gradient boosting, random forest, support vector machine, neural network and elastic-net penalised logistic regression). The developed and validated algorithms will be evaluated across discrimination (c-statistic), calibration, the Brier score and the decision curve analysis. The best-performing algorithm will be used to create an open-access online prediction tool.

ETHICS AND DISSEMINATION

Data will be collected adhering to the WHO regulation on data sharing. An Institutional Review Board review is not applicable. The study results will be published in a peer-reviewed journal.

Collapse

McLean KA, Goel T, Lawday S, Riad A, Simoes J, Knight SR, Ghosh D, Glasbey JC, Bhangu A, Harrison EM. Prognostic models for surgical-site infection in gastrointestinal surgery: systematic review. Br J Surg 2023;110:1441-1450. [PMID: 37433918 PMCID: PMC10564404 DOI: 10.1093/bjs/znad187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Revised: 04/11/2023] [Accepted: 05/20/2023] [Indexed: 07/13/2023]

Abstract

BACKGROUND

Identification of patients at high risk of surgical-site infection may allow clinicians to target interventions and monitoring to minimize associated morbidity. The aim of this systematic review was to identify and evaluate prognostic tools for the prediction of surgical-site infection in gastrointestinal surgery.

METHODS

This systematic review sought to identify original studies describing the development and validation of prognostic models for 30-day SSI after gastrointestinal surgery (PROSPERO: CRD42022311019). MEDLINE, Embase, Global Health, and IEEE Xplore were searched from 1 January 2000 to 24 February 2022. Studies were excluded if prognostic models included postoperative parameters or were procedure specific. A narrative synthesis was performed, with sample-size sufficiency, discriminative ability (area under the receiver operating characteristic curve), and prognostic accuracy compared.

RESULTS

Of 2249 records reviewed, 23 eligible prognostic models were identified. A total of 13 (57 per cent) reported no internal validation and only 4 (17 per cent) had undergone external validation. Most identified operative contamination (57 per cent, 13 of 23) and duration (52 per cent, 12 of 23) as important predictors; however, there remained substantial heterogeneity in other predictors identified (range 2-28). All models demonstrated a high risk of bias due to the analytic approach, with overall low applicability to an undifferentiated gastrointestinal surgical population. Model discrimination was reported in most studies (83 per cent, 19 of 23); however, calibration (22 per cent, 5 of 23) and prognostic accuracy (17 per cent, 4 of 23) were infrequently assessed. Of externally validated models (of which there were four), none displayed 'good' discrimination (area under the receiver operating characteristic curve greater than or equal to 0.7).

CONCLUSION

The risk of surgical-site infection after gastrointestinal surgery is insufficiently described by existing risk-prediction tools, which are not suitable for routine use. Novel risk-stratification tools are required to target perioperative interventions and mitigate modifiable risk factors.

Collapse

Yang H, Wu X, Liu W, Yuan Y, Zeng H, Li J, Ye B, Wang L, Luo S, Li Z, Liu H. A quantitative analysis framework of placenta accreta spectrum: placenta subtype, intraoperative bleeding, and hysterectomy risk evaluation based on magnetic resonance imaging-anatomical-clinical features. Quant Imaging Med Surg 2023;13:7105-7116. [PMID: 37869322 PMCID: PMC10585581 DOI: 10.21037/qims-23-142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2023] [Accepted: 08/07/2023] [Indexed: 10/24/2023]

Abstract

Background

Placenta accreta spectrum (PAS) is a significant contributor to maternal morbidity and mortality. Our objective was to develop a quantitative analysis framework utilizing magnetic resonance imaging (MRI)-anatomical-clinical features to predict 3 clinically significant parameters in patients with PAS: placenta subtype (invasive vs. non-invasive placenta), intraoperative bleeding (≥1,500 vs. <1,500 mL), and hysterectomy risk (hysterectomy vs. non-hysterectomy).

Methods

A total of 125 pregnant women with PAS from 2 medical centers were enrolled into an internal training set and an external testing set. Some 21 MRI-anatomical-clinical features were integrated as input into the framework. The proposed quantitative analytic framework contains mainly 3 classifiers built by extreme gradient boosting (XGBoost) and their testing in external datasets. We also further compared the accuracy of placenta subtype prediction between the proposed model and 4 radiologists. A quantitative model interpretation method called SHapley Additive exPlanations (SHAP) was conducted to explore the contribution of each feature.

Results

The placenta subtype (invasive vs. non-invasive), intraoperative bleeding (≥1,500 vs. <1,500 mL), and hysterectomy risk (hysterectomy vs. non-hysterectomy) demonstrated impressive area under the receiver operating characteristic curve (AUROC) values of 0.93, 0.88, and 0.90, respectively, in the internal validation set. Even in the external testing set, these metrics maintained their strength, achieving AUROC values of 0.91, 0.82, and 0.82, respectively. Comparing our proposed framework to the 4 radiologists, our model exhibited superior accuracy, specificity, and sensitivity in predicting placental subtypes within the external testing cohort. The features associated with intraplacental dark T2 bands played a crucial role in the decision-making process of all 3 prediction models.

Conclusions

The quantitative analysis framework can provide a robust method for classification of placenta subtype (invasive vs. non-invasive placenta), intraoperative bleeding (≥1,500 vs. <1,500 mL), and hysterectomy risk (hysterectomy vs. non-hysterectomy) based on MRI-anatomical-clinical features in PAS.

Collapse

Altuhaifa FA, Win KT, Su G. Predicting lung cancer survival based on clinical data using machine learning: A review. Comput Biol Med 2023;165:107338. [PMID: 37625260 DOI: 10.1016/j.compbiomed.2023.107338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Revised: 07/31/2023] [Accepted: 08/07/2023] [Indexed: 08/27/2023]

Patel AA, Schwab JH, Amanatullah DF, Divi SN. AOA Critical Issues Symposium: Shaping the Impact of Artificial Intelligence within Orthopaedic Surgery. J Bone Joint Surg Am 2023;105:1475-1479. [PMID: 37172106 DOI: 10.2106/jbjs.22.01330] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]

Abstract

ABSTRACT

Artificial intelligence (AI) is a broad term that is widely used but inconsistently understood. It refers to the ability of any machine to exhibit human-like intelligence by making decisions, solving problems, or learning from experience. With its ability to rapidly process large amounts of information, AI has already transformed many industries such as entertainment, transportation, and communications through consumer-facing products and business-to-business applications. Given its potential, AI is also anticipated to impact the practice of medicine and the delivery of health care. Interest in AI-based techniques has grown rapidly within the orthopaedic community, resulting in an increasing number of publications on this topic. Topics of interest have ranged from the use of AI for imaging interpretation to AI-based techniques for predicting postoperative outcomes.The highly technical and data-driven nature of orthopaedic surgery creates the potential for AI, and its subdisciplines machine learning (ML) and deep learning (DL), to fundamentally transform our understanding of musculoskeletal care. However, AI-based techniques are not well known to most orthopaedic surgeons, nor are they taught with the same level of insight and critical thinking as traditional statistical methodology. With a clear understanding of the science behind AI-based techniques, orthopaedic surgeons will be able to identify the potential pitfalls of the application of AI to musculoskeletal health. Additionally, with increased understanding of AI, surgeons and their patients may have more trust in the results of AI-based analytics, thereby expanding the potential use of AI in clinical care and amplifying the impact it could have in improving quality and value. The purpose of this American Orthopaedic Association (AOA) symposium was to facilitate understanding and development of AI and AI-based techniques within orthopaedic surgery by defining common terminology related to AI, demonstrating the existing clinical utility of AI, and presenting future applications of AI in surgical care.

Collapse

Reeve K, On BI, Havla J, Burns J, Gosteli-Peter MA, Alabsawi A, Alayash Z, Götschi A, Seibold H, Mansmann U, Held U. Prognostic models for predicting clinical disease progression, worsening and activity in people with multiple sclerosis. Cochrane Database Syst Rev 2023;9:CD013606. [PMID: 37681561 PMCID: PMC10486189 DOI: 10.1002/14651858.cd013606.pub2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 09/09/2023]

Abstract

BACKGROUND

Multiple sclerosis (MS) is a chronic inflammatory disease of the central nervous system that affects millions of people worldwide. The disease course varies greatly across individuals and many disease-modifying treatments with different safety and efficacy profiles have been developed recently. Prognostic models evaluated and shown to be valid in different settings have the potential to support people with MS and their physicians during the decision-making process for treatment or disease/life management, allow stratified and more precise interpretation of interventional trials, and provide insights into disease mechanisms. Many researchers have turned to prognostic models to help predict clinical outcomes in people with MS; however, to our knowledge, no widely accepted prognostic model for MS is being used in clinical practice yet.

OBJECTIVES

To identify and summarise multivariable prognostic models, and their validation studies for quantifying the risk of clinical disease progression, worsening, and activity in adults with MS.

SEARCH METHODS

We searched MEDLINE, Embase, and the Cochrane Database of Systematic Reviews from January 1996 until July 2021. We also screened the reference lists of included studies and relevant reviews, and references citing the included studies.

SELECTION CRITERIA

We included all statistically developed multivariable prognostic models aiming to predict clinical disease progression, worsening, and activity, as measured by disability, relapse, conversion to definite MS, conversion to progressive MS, or a composite of these in adult individuals with MS. We also included any studies evaluating the performance of (i.e. validating) these models. There were no restrictions based on language, data source, timing of prognostication, or timing of outcome.

DATA COLLECTION AND ANALYSIS

Pairs of review authors independently screened titles/abstracts and full texts, extracted data using a piloted form based on the Checklist for Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies (CHARMS), assessed risk of bias using the Prediction Model Risk Of Bias Assessment Tool (PROBAST), and assessed reporting deficiencies based on the checklist items in Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD). The characteristics of the included models and their validations are described narratively. We planned to meta-analyse the discrimination and calibration of models with at least three external validations outside the model development study but no model met this criterion. We summarised between-study heterogeneity narratively but again could not perform the planned meta-regression.

MAIN RESULTS

We included 57 studies, from which we identified 75 model developments, 15 external validations corresponding to only 12 (16%) of the models, and six author-reported validations. Only two models were externally validated multiple times. None of the identified external validations were performed by researchers independent of those that developed the model. The outcome was related to disease progression in 39 (41%), relapses in 8 (8%), conversion to definite MS in 17 (18%), and conversion to progressive MS in 27 (28%) of the 96 models or validations. The disease and treatment-related characteristics of included participants, and definitions of considered predictors and outcome, were highly heterogeneous amongst the studies. Based on the publication year, we observed an increase in the percent of participants on treatment, diversification of the diagnostic criteria used, an increase in consideration of biomarkers or treatment as predictors, and increased use of machine learning methods over time. Usability and reproducibility All identified models contained at least one predictor requiring the skills of a medical specialist for measurement or assessment. Most of the models (44; 59%) contained predictors that require specialist equipment likely to be absent from primary care or standard hospital settings. Over half (52%) of the developed models were not accompanied by model coefficients, tools, or instructions, which hinders their application, independent validation or reproduction. The data used in model developments were made publicly available or reported to be available on request only in a few studies (two and six, respectively). Risk of bias We rated all but one of the model developments or validations as having high overall risk of bias. The main reason for this was the statistical methods used for the development or evaluation of prognostic models; we rated all but two of the included model developments or validations as having high risk of bias in the analysis domain. None of the model developments that were externally validated or these models' external validations had low risk of bias. There were concerns related to applicability of the models to our research question in over one-third (38%) of the models or their validations. Reporting deficiencies Reporting was poor overall and there was no observable increase in the quality of reporting over time. The items that were unclearly reported or not reported at all for most of the included models or validations were related to sample size justification, blinding of outcome assessors, details of the full model or how to obtain predictions from it, amount of missing data, and treatments received by the participants. Reporting of preferred model performance measures of discrimination and calibration was suboptimal.

AUTHORS' CONCLUSIONS

The current evidence is not sufficient for recommending the use of any of the published prognostic prediction models for people with MS in clinical routine today due to lack of independent external validations. The MS prognostic research community should adhere to the current reporting and methodological guidelines and conduct many more state-of-the-art external validation studies for the existing or newly developed models.

Collapse

Deng Y, Ma Y, Fu J, Wang X, Yu C, Lv J, Man S, Wang B, Li L. Combinatorial Use of Machine Learning and Logistic Regression for Predicting Carotid Plaque Risk Among 5.4 Million Adults With Fatty Liver Disease Receiving Health Check-Ups: Population-Based Cross-Sectional Study. JMIR Public Health Surveill 2023;9:e47095. [PMID: 37676713 PMCID: PMC10514774 DOI: 10.2196/47095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Revised: 04/28/2023] [Accepted: 07/25/2023] [Indexed: 09/08/2023] Open

Abstract

BACKGROUND

Carotid plaque can progress into stroke, myocardial infarction, etc, which are major global causes of death. Evidence shows a significant increase in carotid plaque incidence among patients with fatty liver disease. However, unlike the high detection rate of fatty liver disease, screening for carotid plaque in the asymptomatic population is not yet prevalent due to cost-effectiveness reasons, resulting in a large number of patients with undetected carotid plaques, especially among those with fatty liver disease.

OBJECTIVE

This study aimed to combine the advantages of machine learning (ML) and logistic regression to develop a straightforward prediction model among the population with fatty liver disease to identify individuals at risk of carotid plaque.

METHODS

Our study included 5,420,640 participants with fatty liver from Meinian Health Care Center. We used random forest, elastic net (EN), and extreme gradient boosting ML algorithms to select important features from potential predictors. Features acknowledged by all 3 models were enrolled in logistic regression analysis to develop a carotid plaque prediction model. Model performance was evaluated based on the area under the receiver operating characteristic curve, calibration curve, Brier score, and decision curve analysis both in a randomly split internal validation data set, and an external validation data set comprising 32,682 participants from MJ Health Check-up Center. Risk cutoff points for carotid plaque were determined based on the Youden index, predicted probability distribution, and prevalence rate of the internal validation data set to classify participants into high-, intermediate-, and low-risk groups. This risk classification was further validated in the external validation data set.

RESULTS

Among the participants, 26.23% (1,421,970/5,420,640) were diagnosed with carotid plaque in the development data set, and 21.64% (7074/32,682) were diagnosed in the external validation data set. A total of 6 features, including age, systolic blood pressure, low-density lipoprotein cholesterol (LDL-C), total cholesterol, fasting blood glucose, and hepatic steatosis index (HSI) were collectively selected by all 3 ML models out of 27 predictors. After eliminating the issue of collinearity between features, the logistic regression model established with the 5 independent predictors reached an area under the curve of 0.831 in the internal validation data set and 0.801 in the external validation data set, and showed good calibration capability graphically. Its predictive performance was comprehensively competitive compared with the single use of either logistic regression or ML algorithms. Optimal predicted probability cutoff points of 25% and 65% were determined for classifying individuals into low-, intermediate-, and high-risk categories for carotid plaque.

CONCLUSIONS

The combination of ML and logistic regression yielded a practical carotid plaque prediction model, and was of great public health implications in the early identification and risk assessment of carotid plaque among individuals with fatty liver.

Collapse

Affiliation(s)

Yuhan Deng Chongqing Research Institute of Big Data, Peking University, Chongqing, China Meinian Institute of Health, Beijing, China
Yuan Ma School of Population Medicine and Public Health, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
Jingzhu Fu Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China Peking University Health Science Center Meinian Public Health Institute, Beijing, China Key Laboratory of Epidemiology of Major Diseases (Peking University), Ministry of Education, Beijing, China
Xiaona Wang MJ Health Screening Center, Beijing, China
Canqing Yu Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China Peking University Health Science Center Meinian Public Health Institute, Beijing, China Key Laboratory of Epidemiology of Major Diseases (Peking University), Ministry of Education, Beijing, China Peking University Center for Public Health and Epidemic Preparedness & Response, Beijing, China
Jun Lv Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China Peking University Health Science Center Meinian Public Health Institute, Beijing, China Key Laboratory of Epidemiology of Major Diseases (Peking University), Ministry of Education, Beijing, China Peking University Center for Public Health and Epidemic Preparedness & Response, Beijing, China
Sailimai Man Meinian Institute of Health, Beijing, China Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China Peking University Health Science Center Meinian Public Health Institute, Beijing, China Key Laboratory of Epidemiology of Major Diseases (Peking University), Ministry of Education, Beijing, China
Bo Wang Meinian Institute of Health, Beijing, China Peking University Health Science Center Meinian Public Health Institute, Beijing, China Peking University Center for Public Health and Epidemic Preparedness & Response, Beijing, China
Liming Li Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China Peking University Health Science Center Meinian Public Health Institute, Beijing, China Key Laboratory of Epidemiology of Major Diseases (Peking University), Ministry of Education, Beijing, China Peking University Center for Public Health and Epidemic Preparedness & Response, Beijing, China

Collapse

Kwong JCC, Khondker A, Lajkosz K, McDermott MBA, Frigola XB, McCradden MD, Mamdani M, Kulkarni GS, Johnson AEW. APPRAISE-AI Tool for Quantitative Evaluation of AI Studies for Clinical Decision Support. JAMA Netw Open 2023;6:e2335377. [PMID: 37747733 PMCID: PMC10520738 DOI: 10.1001/jamanetworkopen.2023.35377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Accepted: 08/14/2023] [Indexed: 09/26/2023] Open

Abstract

Importance

Artificial intelligence (AI) has gained considerable attention in health care, yet concerns have been raised around appropriate methods and fairness. Current AI reporting guidelines do not provide a means of quantifying overall quality of AI research, limiting their ability to compare models addressing the same clinical question.

Objective

To develop a tool (APPRAISE-AI) to evaluate the methodological and reporting quality of AI prediction models for clinical decision support.

Design, Setting, and Participants

This quality improvement study evaluated AI studies in the model development, silent, and clinical trial phases using the APPRAISE-AI tool, a quantitative method for evaluating quality of AI studies across 6 domains: clinical relevance, data quality, methodological conduct, robustness of results, reporting quality, and reproducibility. These domains included 24 items with a maximum overall score of 100 points. Points were assigned to each item, with higher points indicating stronger methodological or reporting quality. The tool was applied to a systematic review on machine learning to estimate sepsis that included articles published until September 13, 2019. Data analysis was performed from September to December 2022.

Main Outcomes and Measures

The primary outcomes were interrater and intrarater reliability and the correlation between APPRAISE-AI scores and expert scores, 3-year citation rate, number of Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2) low risk-of-bias domains, and overall adherence to the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) statement.

Results

A total of 28 studies were included. Overall APPRAISE-AI scores ranged from 33 (low quality) to 67 (high quality). Most studies were moderate quality. The 5 lowest scoring items included source of data, sample size calculation, bias assessment, error analysis, and transparency. Overall APPRAISE-AI scores were associated with expert scores (Spearman ρ, 0.82; 95% CI, 0.64-0.91; P < .001), 3-year citation rate (Spearman ρ, 0.69; 95% CI, 0.43-0.85; P < .001), number of QUADAS-2 low risk-of-bias domains (Spearman ρ, 0.56; 95% CI, 0.24-0.77; P = .002), and adherence to the TRIPOD statement (Spearman ρ, 0.87; 95% CI, 0.73-0.94; P < .001). Intraclass correlation coefficient ranges for interrater and intrarater reliability were 0.74 to 1.00 for individual items, 0.81 to 0.99 for individual domains, and 0.91 to 0.98 for overall scores.

Conclusions and Relevance

In this quality improvement study, APPRAISE-AI demonstrated strong interrater and intrarater reliability and correlated well with several study quality measures. This tool may provide a quantitative approach for investigators, reviewers, editors, and funding organizations to compare the research quality across AI studies for clinical decision support.

Collapse

Affiliation(s)

Jethro C. C. Kwong Division of Urology, Department of Surgery, University of Toronto, Toronto, Ontario, Canada Temerty Centre for AI Research and Education in Medicine, University of Toronto, Toronto, Ontario, Canada
Adree Khondker Division of Urology, Department of Surgery, University of Toronto, Toronto, Ontario, Canada
Katherine Lajkosz Division of Urology, Department of Surgery, University of Toronto, Toronto, Ontario, Canada Department of Biostatistics, University Health Network, University of Toronto, Toronto, Ontario, Canada
Matthew B. A. McDermott Department of Biomedical Informatics, Massachusetts Institute of Technology, Cambridge
Xavier Borrat Frigola Laboratory for Computational Physiology, Harvard–Massachusetts Institute of Technology Division of Health Sciences and Technology, Cambridge Anesthesiology and Critical Care Department, Hospital Clinic de Barcelona, Barcelona, Spain
Melissa D. McCradden Department of Bioethics, The Hospital for Sick Children, Toronto, Ontario, Canada Genetics & Genome Biology Research Program, Peter Gilgan Centre for Research and Learning, Toronto, Ontario, Canada Division of Clinical and Public Health, Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada
Muhammad Mamdani Temerty Centre for AI Research and Education in Medicine, University of Toronto, Toronto, Ontario, Canada Data Science and Advanced Analytics, Unity Health Toronto, Toronto, Ontario, Canada
Girish S. Kulkarni Division of Urology, Department of Surgery, University of Toronto, Toronto, Ontario, Canada Princess Margaret Cancer Centre, University Health Network, University of Toronto, Toronto, Ontario, Canada
Alistair E. W. Johnson Temerty Centre for AI Research and Education in Medicine, University of Toronto, Toronto, Ontario, Canada Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada Child Health Evaluative Sciences, The Hospital for Sick Children, University of Toronto, Toronto, Ontario, Canada

Collapse

Wu X, Jin N, Gao H, Yan M, Chen Q, Sun T, Hao C, Zhao Y, Han X, Pan Y, Huang X, Li W, Wang K, Yin Y. Effectiveness and Safety of Palbociclib Plus Endocrine Therapy in Patients with Advanced Breast Cancer: A Multi-Center Study in China. Cancers (Basel) 2023;15:4360. [PMID: 37686645 PMCID: PMC10487219 DOI: 10.3390/cancers15174360] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2023] [Revised: 08/23/2023] [Accepted: 08/29/2023] [Indexed: 09/10/2023] Open

Affiliation(s)

Xinyu Wu Department of Medicine, Nanjing Medical University, Nanjing 210029, China; (X.W.); (N.J.); (X.H.); (W.L.) Department of Oncology, The First Affiliated Hospital of Nanjing Medical University, Nanjing 210029, China
Nan Jin Department of Medicine, Nanjing Medical University, Nanjing 210029, China; (X.W.); (N.J.); (X.H.); (W.L.) Department of Oncology, The First Affiliated Hospital of Nanjing Medical University, Nanjing 210029, China
Hongfei Gao Department of Breast Cancer, Cancer Center, Guangdong Provincial People’s Hospital, Southern Medical University, Guangzhou 510515, China;
Min Yan Department of Breast Disease, Henan Breast Cancer Center, The Affiliated Cancer Hospital of Zhengzhou University, Zhengzhou 450008, China;
Qianjun Chen Department of Breast Disease, Guangdong Provincial Hospital of Chinese Medicine, Guangzhou 510120, China;
Tao Sun Department of Medical Oncology, Liaoning Cancer Hospital & Institute, Shenyang 110042, China;
Chunfang Hao Department of Breast Oncology, Tianjin Medical University Cancer Institute and Hospital, Tianjin 300060, China;
Yanxia Zhao Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430074, China;
Xinhua Han Division of Life Science and Medicine, Department of Oncology, The First Affiliated Hospital of University of Science and Technology of China (USTC), University of Science and Technology of China, Hefei 230026, China; (X.H.); (Y.P.)
Yueyin Pan Division of Life Science and Medicine, Department of Oncology, The First Affiliated Hospital of University of Science and Technology of China (USTC), University of Science and Technology of China, Hefei 230026, China; (X.H.); (Y.P.)
Xiang Huang Department of Medicine, Nanjing Medical University, Nanjing 210029, China; (X.W.); (N.J.); (X.H.); (W.L.) Department of Oncology, The First Affiliated Hospital of Nanjing Medical University, Nanjing 210029, China
Wei Li Department of Medicine, Nanjing Medical University, Nanjing 210029, China; (X.W.); (N.J.); (X.H.); (W.L.) Department of Oncology, The First Affiliated Hospital of Nanjing Medical University, Nanjing 210029, China
Kun Wang Department of Breast Cancer, Cancer Center, Guangdong Provincial People’s Hospital, Southern Medical University, Guangzhou 510515, China;
Yongmei Yin Department of Medicine, Nanjing Medical University, Nanjing 210029, China; (X.W.); (N.J.); (X.H.); (W.L.) Department of Oncology, The First Affiliated Hospital of Nanjing Medical University, Nanjing 210029, China

Collapse

Solomon A, Cipăian CR, Negrea MO, Boicean A, Mihaila R, Beca C, Popa ML, Grama SM, Teodoru M, Neamtu B. Hepatic Involvement across the Metabolic Syndrome Spectrum: Non-Invasive Assessment and Risk Prediction Using Machine Learning. J Clin Med 2023;12:5657. [PMID: 37685725 PMCID: PMC10488813 DOI: 10.3390/jcm12175657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Revised: 08/23/2023] [Accepted: 08/28/2023] [Indexed: 09/10/2023] Open

Abstract

Metabolic-dysfunction-associated steatotic liver disease (MASLD) and metabolic syndrome (MetS) are inextricably linked conditions, both of which are experiencing an upward trend in prevalence, thereby exerting a substantial clinical and economic burden. The presence of MetS should prompt the search for metabolic-associated liver disease. Liver fibrosis is the main predictor of liver-related morbidity and mortality. Non-invasive tests (NIT) such as the Fibrosis-4 index (FIB4), aspartate aminotransferase-to-platelet ratio index (APRI), aspartate aminotransferase-to-alanine aminotransferase ratio (AAR), hepatic steatosis index (HIS), transient elastography (TE), and combined scores (AGILE3+, AGILE4) facilitate the detection of liver fibrosis or steatosis. Our study enrolled 217 patients with suspected MASLD, 109 of whom were diagnosed with MetS. We implemented clinical and biological evaluations complemented by transient elastography (TE) to discern the most robust predictors for liver disease manifestation patterns. Patients with MetS had significantly higher values of FIB4, APRI, HSI, liver stiffness, and steatosis parameters measured by TE, as well as AGILE3+ and AGILE4 scores. Machine-learning algorithms enhanced our evaluation. A two-step cluster algorithm yielded three clusters with reliable model quality. Cluster 1 contained patients without significant fibrosis or steatosis, while clusters 2 and 3 showed a higher prevalence of significant liver fibrosis or at least moderate steatosis as measured by TE. A decision tree algorithm identified age, BMI, liver enzyme levels, and metabolic syndrome characteristics as significant factors in predicting cluster membership with an overall accuracy of 89.4%. Combining NITs improves the accuracy of detecting patterns of liver involvement in patients with suspected MASLD.

Collapse

Affiliation(s)

Adelaida Solomon Faculty of Medicine, “Lucian Blaga” University, 550024 Sibiu, Romania; (A.S.); (A.B.); (R.M.); (M.L.P.); (S.M.G.); (M.T.); (B.N.) County Clinical Emergency Hospital of Sibiu, 2–4 Corneliu Coposu Str., 550245 Sibiu, Romania;
Călin Remus Cipăian Faculty of Medicine, “Lucian Blaga” University, 550024 Sibiu, Romania; (A.S.); (A.B.); (R.M.); (M.L.P.); (S.M.G.); (M.T.); (B.N.) County Clinical Emergency Hospital of Sibiu, 2–4 Corneliu Coposu Str., 550245 Sibiu, Romania;
Mihai Octavian Negrea Faculty of Medicine, “Lucian Blaga” University, 550024 Sibiu, Romania; (A.S.); (A.B.); (R.M.); (M.L.P.); (S.M.G.); (M.T.); (B.N.) County Clinical Emergency Hospital of Sibiu, 2–4 Corneliu Coposu Str., 550245 Sibiu, Romania;
Adrian Boicean Faculty of Medicine, “Lucian Blaga” University, 550024 Sibiu, Romania; (A.S.); (A.B.); (R.M.); (M.L.P.); (S.M.G.); (M.T.); (B.N.) County Clinical Emergency Hospital of Sibiu, 2–4 Corneliu Coposu Str., 550245 Sibiu, Romania;
Romeo Mihaila Faculty of Medicine, “Lucian Blaga” University, 550024 Sibiu, Romania; (A.S.); (A.B.); (R.M.); (M.L.P.); (S.M.G.); (M.T.); (B.N.) County Clinical Emergency Hospital of Sibiu, 2–4 Corneliu Coposu Str., 550245 Sibiu, Romania;
Corina Beca County Clinical Emergency Hospital of Sibiu, 2–4 Corneliu Coposu Str., 550245 Sibiu, Romania;
Mirela Livia Popa Faculty of Medicine, “Lucian Blaga” University, 550024 Sibiu, Romania; (A.S.); (A.B.); (R.M.); (M.L.P.); (S.M.G.); (M.T.); (B.N.) County Clinical Emergency Hospital of Sibiu, 2–4 Corneliu Coposu Str., 550245 Sibiu, Romania;
Sebastian Mihai Grama Faculty of Medicine, “Lucian Blaga” University, 550024 Sibiu, Romania; (A.S.); (A.B.); (R.M.); (M.L.P.); (S.M.G.); (M.T.); (B.N.)
Minodora Teodoru Faculty of Medicine, “Lucian Blaga” University, 550024 Sibiu, Romania; (A.S.); (A.B.); (R.M.); (M.L.P.); (S.M.G.); (M.T.); (B.N.) County Clinical Emergency Hospital of Sibiu, 2–4 Corneliu Coposu Str., 550245 Sibiu, Romania;
Bogdan Neamtu Faculty of Medicine, “Lucian Blaga” University, 550024 Sibiu, Romania; (A.S.); (A.B.); (R.M.); (M.L.P.); (S.M.G.); (M.T.); (B.N.) Department of Clinical Research, Pediatric Clinical Hospital Sibiu, 550166 Sibiu, Romania

Collapse

Pei B, Jin C, Cao S, Ji N, Xia M, Jiang H. Geometric morphometrics and machine learning from three-dimensional facial scans for difficult mask ventilation prediction. Front Med (Lausanne) 2023;10:1203023. [PMID: 37636580 PMCID: PMC10447910 DOI: 10.3389/fmed.2023.1203023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2023] [Accepted: 07/31/2023] [Indexed: 08/29/2023] Open

Abstract

Background

Unanticipated difficult mask ventilation (DMV) is a potentially life-threatening event in anesthesia. Nevertheless, predicting DMV currently remains a challenge. This study aimed to verify whether three dimensional (3D) facial scans could predict DMV in patients scheduled for general anesthesia.

Methods

The 3D facial scans were taken on 669 adult patients scheduled for elective surgery under general anesthesia. Clinical variables currently used as predictors of DMV were also collected. The DMV was defined as the inability to provide adequate and stable ventilation. Spatially dense landmarks were digitized on 3D scans to describe sufficient details for facial features and then processed by 3D geometric morphometrics. Ten different machine learning (ML) algorithms, varying from simple to more advanced, were introduced. The performance of ML models for DMV prediction was compared with that of the DIFFMASK score. The area under the receiver operating characteristic curves (AUC) with its 95% confidence interval (95% CI) as well as the specificity and sensitivity were used to evaluate the predictive value of the model.

Results

The incidence of DMV was 35/669 (5.23%). The logistic regression (LR) model performed best among the 10 ML models. The AUC of the LR model was 0.825 (95% CI, 0.765-0.885). The sensitivity and specificity of the model were 0.829 (95% CI, 0.629-0.914) and 0.733 (95% CI, 0.532-0.819), respectively. The LR model demonstrated better predictive performance than the DIFFMASK score, which obtained an AUC of 0.785 (95% CI, 0.710-0.860) and a sensitivity of 0.686 (95% CI, 0.578-0.847). Notably, we identified a significant morphological difference in the mandibular region between the DMV group and the easy mask ventilation group.

Conclusion

Our study indicated a distinct morphological difference in the mandibular region between the DMV group and the easy mask ventilation group. 3D geometric morphometrics with ML could be a rapid, efficient, and non-invasive tool for DMV prediction to improve anesthesia safety.

Collapse

Wang Y, Zhang H, Liu L, Li Z, Zhou Y, Wei J, Xu Y, Zhou Y, Tang Y. Cognitive function and cardiovascular health in the elderly: network analysis based on hypertension, diabetes, cerebrovascular disease, and coronary heart disease. Front Aging Neurosci 2023;15:1229559. [PMID: 37600511 PMCID: PMC10436622 DOI: 10.3389/fnagi.2023.1229559] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Accepted: 07/21/2023] [Indexed: 08/22/2023] Open

Abstract

Introduction

Cognitive decline in the elderly population is a growing concern, and vascular factors, such as hypertension, diabetes, cerebrovascular disease, and coronary heart disease, have been associated with cognitive impairments. This study aims to provide deeper insights into the structure of cognitive function networks under these different vascular factors and explore their potential associations with specific cognitive domains.

Methods

Cognitive function was assessed using a modified Chinese version of the mini-mental state examination (MMSE) scale, and intensity centrality and side weights were estimated by network modeling. The network structure of cognitive function was compared across subgroups by including vascular factors as subgroup variables while controlling for comorbidities and confounders.

Results

The results revealed that cerebrovascular disease and coronary heart disease had a more significant impact on cognitive function. Cerebrovascular disease was associated with weaker centrality in memory and spatial orientation, and a sparser cognitive network structure. Coronary heart disease was associated with weaker centrality in memory, repetition, executive function, recall, attention, and calculation, as well as a sparser cognitive network structure. The NCT analyses further highlighted significant differences between the cerebrovascular disease and coronary heart disease groups compared to controls in terms of overall network structure and connection strength.

Conclusion

Our findings suggest that specific cognitive domains may be more vulnerable to impairments in patients with cerebrovascular disease and coronary heart disease. These insights could be used to improve the accuracy and sensitivity of cognitive screening in these patient populations, inform personalized cognitive intervention strategies, and provide a better understanding of the potential mechanisms underlying cognitive decline in patients with vascular diseases.

Collapse

Ishøi L, Thorborg K, Kallemose T, Kemp JL, Reiman MP, Nielsen MF, Hölmich P. Stratified care in hip arthroscopy: can we predict successful and unsuccessful outcomes? Development and external temporal validation of multivariable prediction models. Br J Sports Med 2023;57:1025-1034. [PMID: 37001982 DOI: 10.1136/bjsports-2022-105534] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/24/2023] [Indexed: 04/03/2023]

Abstract

OBJECTIVE

Although hip arthroscopy is a widely adopted treatment option for hip-related pain, it is unknown whether preoperative clinical information can be used to assist surgical decision-making to avoid offering surgery to patients with limited potential for a successful outcome. We aimed to develop and validate clinical prediction models to identify patients more likely to have an unsuccessful or successful outcome 1 year post hip arthroscopy based on the patient acceptable symptom state.

METHODS

Patient records were extracted from the Danish Hip Arthroscopy Registry (DHAR). A priori, 26 common clinical variables from DHAR were selected as prognostic factors, including demographics, radiographic parameters of hip morphology and self-reported measures. We used 1082 hip arthroscopy patients (surgery performed 25 April 2012 to 4 October 2017) to develop the clinical prediction models based on logistic regression analyses. The development models were internally validated using bootstrapping and shrinkage before temporal external validation was performed using 464 hip arthroscopy patients (surgery performed 5 October 2017 to 13 May 2019).

RESULTS

The prediction model for unsuccessful outcomes showed best and acceptable predictive performance on the external validation dataset for all multiple imputations (Nagelkerke R2 range: 0.25-0.26) and calibration (intercept range: -0.10 to -0.11; slope range: 1.06-1.09), and acceptable discrimination (area under the curve range: 0.76-0.77). The prediction model for successful outcomes did not calibrate well, while also showing poor discrimination.

CONCLUSION

Common clinical variables including demographics, radiographic parameters of hip morphology and self-reported measures were able to predict the probability of having an unsuccessful outcome 1 year after hip arthroscopy, while the model for successful outcome showed unacceptable accuracy. The externally validated prediction model can be used to support clinical evaluation and shared decision making by informing the orthopaedic surgeon and patient about the risk of an unsuccessful outcome, and thus when surgery may not be appropriate.

Collapse