1
|
Tsegaye B, Snell KIE, Archer L, Kirtley S, Riley RD, Sperrin M, Van Calster B, Collins GS, Dhiman P. Larger sample sizes are needed when developing a clinical prediction model using machine learning in oncology: methodological systematic review. J Clin Epidemiol 2025; 180:111675. [PMID: 39814217 DOI: 10.1016/j.jclinepi.2025.111675] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Revised: 12/17/2024] [Accepted: 01/07/2025] [Indexed: 01/18/2025]
Abstract
BACKGROUND AND OBJECTIVES Having a sufficient sample size is crucial when developing a clinical prediction model. We reviewed details of sample size in studies developing prediction models for binary outcomes using machine learning (ML) methods within oncology and compared the sample size used to develop the models with the minimum required sample size needed when developing a regression-based model (Nmin). METHODS We searched the Medline (via OVID) database for studies developing a prediction model using ML methods published in December 2022. We reviewed how sample size was justified. We calculated Nmin, which is the Nmin, and compared this with the sample size that was used to develop the models. RESULTS Only one of 36 included studies justified their sample size. We were able to calculate Nmin for 17 (47%) studies. 5/17 studies met Nmin, allowing to precisely estimate the overall risk and minimize overfitting. There was a median deficit of 302 participants with the event (n = 17; range: -21,331 to 2298) when developing the ML models. An additional three out of the 17 studies met the required sample size to precisely estimate the overall risk only. CONCLUSION Studies developing a prediction model using ML in oncology seldom justified their sample size and sample sizes were often smaller than Nmin. As ML models almost certainly require a larger sample size than regression models, the deficit is likely larger. We recommend that researchers consider and report their sample size and at least meet the minimum sample size required when developing a regression-based model.
Collapse
Affiliation(s)
- Biruk Tsegaye
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford OX3 7LD, UK.
| | - Kym I E Snell
- Institute of Applied Health Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham B15 2TT, UK; Institute of Translational Medicine, National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, Birmingham, UK
| | - Lucinda Archer
- Institute of Applied Health Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham B15 2TT, UK; Institute of Translational Medicine, National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, Birmingham, UK
| | - Shona Kirtley
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford OX3 7LD, UK
| | - Richard D Riley
- Institute of Applied Health Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham B15 2TT, UK; Institute of Translational Medicine, National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, Birmingham, UK
| | - Matthew Sperrin
- Division of Imaging, Informatics and Data Science, Manchester Academic Health Science Centre, University of Manchester, Manchester M13 9PL, UK
| | - Ben Van Calster
- Department of Development and Regeneration, KU Leuven, Leuven, Belgium; Department of Biomedical Data Sciences, Leiden University Medical Centre, Leiden, The Netherlands; Leuven Unit for Health Technology Assessment Research (LUHTAR), KU Leuven, Leuven, Belgium
| | - Gary S Collins
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford OX3 7LD, UK
| | - Paula Dhiman
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford OX3 7LD, UK
| |
Collapse
|
2
|
Moons KGM, Damen JAA, Kaul T, Hooft L, Andaur Navarro C, Dhiman P, Beam AL, Van Calster B, Celi LA, Denaxas S, Denniston AK, Ghassemi M, Heinze G, Kengne AP, Maier-Hein L, Liu X, Logullo P, McCradden MD, Liu N, Oakden-Rayner L, Singh K, Ting DS, Wynants L, Yang B, Reitsma JB, Riley RD, Collins GS, van Smeden M. PROBAST+AI: an updated quality, risk of bias, and applicability assessment tool for prediction models using regression or artificial intelligence methods. BMJ 2025; 388:e082505. [PMID: 40127903 PMCID: PMC11931409 DOI: 10.1136/bmj-2024-082505] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 01/16/2025] [Indexed: 03/26/2025]
Affiliation(s)
- Karel G M Moons
- Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, 3508 GA Utrecht, Netherlands
| | - Johanna A A Damen
- Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, 3508 GA Utrecht, Netherlands
- Cochrane Netherlands, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
| | - Tabea Kaul
- Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, 3508 GA Utrecht, Netherlands
| | - Lotty Hooft
- Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, 3508 GA Utrecht, Netherlands
- Cochrane Netherlands, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
| | - Constanza Andaur Navarro
- Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, 3508 GA Utrecht, Netherlands
| | - Paula Dhiman
- Centre for Statistics in Medicine, UK EQUATOR Centre, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, UK
| | - Andrew L Beam
- Department of Epidemiology, Harvard T H Chan School of Public Health, Boston, MA, USA
| | - Ben Van Calster
- Department of Development and Regeneration, KU Leuven, Leuven, Belgium
- Leuven Unit for Health Technology Assessment Research (LUHTAR), KU Leuven, Leuven, Belgium
| | - Leo Anthony Celi
- Department of Biostatistics, Harvard T H Chan School of Public Health, Boston, MA, USA
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Spiros Denaxas
- Institute of Health Informatics, University College London, London, UK
- British Heart Foundation Data Science Centre, Health Data Research Centre UK, London, United Kingdom
| | | | - Marzyeh Ghassemi
- Department of Electrical Engineering and Computer Science, Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Georg Heinze
- Institute of Clinical Biometrics, Centre for Medical Data Science, Medical University of Vienna, Vienna, Austria
| | | | - Lena Maier-Hein
- Division of Intelligent Medical Systems, German Cancer Research Centre (DKFZ), Heidelberg, Germany
- National Centre for Tumour Diseases (NCT) Heidelberg, Heidelberg, Germany
| | - Xiaoxuan Liu
- College of Medicine and Health, University of Birmingham, Birmingham, UK
- University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
- School of Health Sciences, College of Medicine and Health, University of Birmingham, Birmingham, UK
- NIHR Birmingham Biomedical Research Centre, Birmingham, UK
| | - Patricia Logullo
- Centre for Statistics in Medicine, UK EQUATOR Centre, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, UK
| | - Melissa D McCradden
- Department of Bioethics, The Hospital for Sick Children, Toronto, ON, Canada
| | - Nan Liu
- Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore, Singapore
| | - Lauren Oakden-Rayner
- Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia
| | - Karandeep Singh
- Department of Learning Health Sciences, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Daniel S Ting
- Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore, Singapore
- AI Office, Singapore Health Service, Duke-NUS Medical School, Singapore, Singapore
| | - Laure Wynants
- Department of Development and Regeneration, KU Leuven, Leuven, Belgium
- Department of Epidemiology, CAPHRI Care and Public Health Research Institute, Maastricht University, Maastricht, Netherlands
| | - Bada Yang
- Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, 3508 GA Utrecht, Netherlands
- Cochrane Netherlands, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
| | - Johannes B Reitsma
- Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, 3508 GA Utrecht, Netherlands
| | - Richard D Riley
- School of Health Sciences, College of Medicine and Health, University of Birmingham, Birmingham, UK
- NIHR Birmingham Biomedical Research Centre, Birmingham, UK
| | - Gary S Collins
- Centre for Statistics in Medicine, UK EQUATOR Centre, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, UK
| | - Maarten van Smeden
- Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, 3508 GA Utrecht, Netherlands
| |
Collapse
|
3
|
Maru S, Kuwatsuru R, Matthias MD, Simpson RJ. Public Disclosure of Results From Artificial Intelligence/Machine Learning Research in Health Care: Comprehensive Analysis of ClinicalTrials.gov, PubMed, and Scopus Data (2010-2023). J Med Internet Res 2025; 27:e60148. [PMID: 40117574 PMCID: PMC11971578 DOI: 10.2196/60148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2024] [Revised: 01/13/2025] [Accepted: 01/18/2025] [Indexed: 03/23/2025] Open
Abstract
BACKGROUND Despite the rapid growth of research in artificial intelligence/machine learning (AI/ML), little is known about how often study results are disclosed years after study completion. OBJECTIVE We aimed to estimate the proportion of AI/ML research that reported results through ClinicalTrials.gov or peer-reviewed publications indexed in PubMed or Scopus. METHODS Using data from the Clinical Trials Transformation Initiative Aggregate Analysis of ClinicalTrials.gov, we identified studies initiated and completed between January 2010 and December 2023 that contained AI/ML-specific terms in the official title, brief summary, interventions, conditions, detailed descriptions, primary outcomes, or keywords. For 842 completed studies, we searched PubMed and Scopus for publications containing study identifiers and AI/ML-specific terms in relevant fields, such as the title, abstract, and keywords. We calculated disclosure rates within 3 years of study completion and median times to disclosure-from the "primary completion date" to the "results first posted date" on ClinicalTrials.gov or the earliest date of journal publication. RESULTS When restricted to studies completed before 2021, ensuring at least 3 years of follow-up in which to report results, 7.0% (22/316) disclosed results on ClinicalTrials.gov, 16.5% (52/316) in journal publications, and 20.6% (65/316) through either route within 3 years of completion. Higher disclosure rates were observed for trials: 11.0% (15/136) on ClinicalTrials.gov, 25.0% (34/136) in journal publications, and 30.1% (41/136) through either route. Randomized controlled trials had even higher disclosure rates: 12.2% (9/74) on ClinicalTrials.gov, 31.1% (23/74) in journal publications, and 36.5% (27/74) through either route. Nevertheless, most study findings (79.4%; 251/316) remained undisclosed 3 years after study completion. Trials using randomization (vs nonrandomized) or masking (vs open label) had higher disclosure rates and shorter times to disclosure. Most trials (85%; 305/357) had sample sizes of ≤1000, yet larger trials (n>1000) had higher publication rates (30.8%; 16/52) than smaller trials (n≤1000) (17.4%; 53/305). Hospitals (12.4%; 42/340), academia (15.1%; 39/259), and industry (13.7%; 20/146) published the most. High-income countries accounted for 82.4% (89/108) of all published studies. Of studies with disclosed results, the median times to report through ClinicalTrials.gov and in journal publications were 505 days (IQR 399-676) and 407 days (IQR 257-674), respectively. Open-label trials were common (60%; 214/357). Single-center designs were prevalent in both trials (83.3%; 290/348) and observational studies (82.3%; 377/458). CONCLUSIONS For nearly 80% of completed studies, findings remained undisclosed within the 3 years of follow-up, raising questions about the representativeness of publicly available evidence. While methodological rigor was generally associated with higher publication rates, the predominance of single-center designs and high-income countries may limit the generalizability of the results currently accessible.
Collapse
Affiliation(s)
- Shoko Maru
- Real‑World Evidence and Data Assessment (READS), Graduate School of Medicine, Juntendo University, Tokyo, Japan
| | - Ryohei Kuwatsuru
- Real‑World Evidence and Data Assessment (READS), Graduate School of Medicine, Juntendo University, Tokyo, Japan
- Department of Radiology, School of Medicine, Juntendo University, Tokyo, Japan
| | | | - Ross J Simpson
- Division of Cardiology, School of Medicine, University of North Carolina Chapel Hill, Chapel Hill, NC, United States
| |
Collapse
|
4
|
Jones C, Taylor M, Sperrin M, Grant SW. A systematic review of cardiac surgery clinical prediction models that include intra-operative variables. Perfusion 2025; 40:328-342. [PMID: 38649154 PMCID: PMC11849261 DOI: 10.1177/02676591241237758] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/25/2024]
Abstract
BACKGROUND Most cardiac surgery clinical prediction models (CPMs) are developed using pre-operative variables to predict post-operative outcomes. Some CPMs are developed with intra-operative variables, but none are widely used. The objective of this systematic review was to identify CPMs with intra-operative variables that predict short-term outcomes following adult cardiac surgery. METHODS Ovid MEDLINE and EMBASE databases were searched from inception to December 2022, for studies developing a CPM with at least one intra-operative variable. Data were extracted using a critical appraisal framework and bias assessment tool. Model performance was analysed using discrimination and calibration measures. RESULTS A total of 24 models were identified. Frequent predicted outcomes were acute kidney injury (9/24 studies) and peri-operative mortality (6/24 studies). Frequent pre-operative variables were age (18/24 studies) and creatinine/eGFR (18/24 studies). Common intra-operative variables were cardiopulmonary bypass time (16/24 studies) and transfusion (13/24 studies). Model discrimination was acceptable for all internally validated models (AUC 0.69-0.91). Calibration was poor (15/24 studies) or unreported (8/24 studies). Most CPMs were at a high or indeterminate risk of bias (23/24 models). The added value of intra-operative variables was assessed in six studies with statistically significantly improved discrimination demonstrated in two. CONCLUSION Weak reporting and methodological limitations may restrict wider applicability and adoption of existing CPMs that include intra-operative variables. There is some evidence that CPM discrimination is improved with the addition of intra-operative variables. Further work is required to understand the role of intra-operative CPMs in the management of cardiac surgery patients.
Collapse
Affiliation(s)
- Ceri Jones
- Division of Informatics, Imaging and Data Science, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, University of Manchester, Manchester, UK
- Department of Clinical Perfusion, University Hospital Southampton NHS Foundation Trust, Southampton General Hospital, Southampton, UK
| | - Marcus Taylor
- Department of Cardiothoracic Surgery, Manchester University Hospital Foundation Trust, Wythenshawe Hospital, , Manchester, UK
| | - Matthew Sperrin
- Division of Informatics, Imaging and Data Science, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, University of Manchester, Manchester, UK
| | - Stuart W. Grant
- Division of Cardiovascular Sciences, ERC, Manchester University Hospitals Foundation Trust, University of Manchester, Manchester, UK
- South Tees Academic Cardiovascular Unit, South Tees Hospitals NHS Foundation Trust, Middlesbrough, UK
| |
Collapse
|
5
|
Critelli B, Hassan A, Lahooti I, Noh L, Park JS, Tong K, Lahooti A, Matzko N, Adams JN, Liss L, Quion J, Restrepo D, Nikahd M, Culp S, Lacy-Hulbert A, Speake C, Buxbaum J, Bischof J, Yazici C, Evans-Phillips A, Terp S, Weissman A, Conwell D, Hart P, Ramsey M, Krishna S, Han S, Park E, Shah R, Akshintala V, Windsor JA, Mull NK, Papachristou G, Celi LA, Lee P. A systematic review of machine learning-based prognostic models for acute pancreatitis: Towards improving methods and reporting quality. PLoS Med 2025; 22:e1004432. [PMID: 39992936 PMCID: PMC11870378 DOI: 10.1371/journal.pmed.1004432] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/03/2024] [Revised: 02/28/2025] [Accepted: 01/07/2025] [Indexed: 02/26/2025] Open
Abstract
BACKGROUND An accurate prognostic tool is essential to aid clinical decision-making (e.g., patient triage) and to advance personalized medicine. However, such a prognostic tool is lacking for acute pancreatitis (AP). Increasingly machine learning (ML) techniques are being used to develop high-performing prognostic models in AP. However, methodologic and reporting quality has received little attention. High-quality reporting and study methodology are critical for model validity, reproducibility, and clinical implementation. In collaboration with content experts in ML methodology, we performed a systematic review critically appraising the quality of methodology and reporting of recently published ML AP prognostic models. METHODS/FINDINGS Using a validated search strategy, we identified ML AP studies from the databases MEDLINE and EMBASE published between January 2021 and December 2023. We also searched pre-print servers medRxiv, bioRxiv, and arXiv for pre-prints registered between January 2021 and December 2023. Eligibility criteria included all retrospective or prospective studies that developed or validated new or existing ML models in patients with AP that predicted an outcome following an episode of AP. Meta-analysis was considered if there was homogeneity in the study design and in the type of outcome predicted. For risk of bias (ROB) assessment, we used the Prediction Model Risk of Bias Assessment Tool. Quality of reporting was assessed using the Transparent Reporting of a Multivariable Prediction Model of Individual Prognosis or Diagnosis-Artificial Intelligence (TRIPOD+AI) statement that defines standards for 27 items that should be reported in publications using ML prognostic models. The search strategy identified 6,480 publications of which 30 met the eligibility criteria. Studies originated from China (22), the United States (4), and other (4). All 30 studies developed a new ML model and none sought to validate an existing ML model, producing a total of 39 new ML models. AP severity (23/39) or mortality (6/39) were the most common outcomes predicted. The mean area under the curve for all models and endpoints was 0.91 (SD 0.08). The ROB was high for at least one domain in all 39 models, particularly for the analysis domain (37/39 models). Steps were not taken to minimize over-optimistic model performance in 27/39 models. Due to heterogeneity in the study design and in how the outcomes were defined and determined, meta-analysis was not performed. Studies reported on only 15/27 items from TRIPOD+AI standards, with only 7/30 justifying sample size and 13/30 assessing data quality. Other reporting deficiencies included omissions regarding human-AI interaction (28/30), handling low-quality or incomplete data in practice (27/30), sharing analytical codes (25/30), study protocols (25/30), and reporting source data (19/30). CONCLUSIONS There are significant deficiencies in the methodology and reporting of recently published ML based prognostic models in AP patients. These undermine the validity, reproducibility, and implementation of these prognostic models despite their promise of superior predictive accuracy. REGISTRATION Research Registry (reviewregistry1727).
Collapse
Affiliation(s)
- Brian Critelli
- Department of Gastroenterology and Hepatology, Weill Cornell Medical College, New York, New York, United States of America
| | - Amier Hassan
- Department of Gastroenterology and Hepatology, Weill Cornell Medical College, New York, New York, United States of America
| | - Ila Lahooti
- Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Lydia Noh
- Northeast Ohio Medical School, Rootstown, Ohio, United States of America
| | - Jun Sung Park
- Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Kathleen Tong
- Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Ali Lahooti
- Department of Gastroenterology and Hepatology, Weill Cornell Medical College, New York, New York, United States of America
| | - Nathan Matzko
- Department of Gastroenterology and Hepatology, Weill Cornell Medical College, New York, New York, United States of America
| | - Jan Niklas Adams
- Department of Process and Data Science, Rheinisch-Westfälische Technische Hochschule Aachen University, Aachen, Germany
| | - Lukas Liss
- Department of Process and Data Science, Rheinisch-Westfälische Technische Hochschule Aachen University, Aachen, Germany
| | - Justin Quion
- Department of Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - David Restrepo
- Department of Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Melica Nikahd
- Department of Bioinformatics, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Stacey Culp
- Department of Bioinformatics, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Adam Lacy-Hulbert
- Department of Systems Immunology, Benaroya Research Institute at Virginia Mason, Seattle, Washington, United States of America
| | - Cate Speake
- Department of Interventional Immunology, Benaroya Research Institute at Virginia Mason, Seattle, Washington, United States of America
| | - James Buxbaum
- Department of Gastroenterology, University of Southern California, Los Angeles, California, United States of America
| | - Jason Bischof
- Department of Emergency Medicine, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Cemal Yazici
- Department of Gastroenterology, University of Illinois at Chicago, Chicago, Illinois, United States of America
| | - Anna Evans-Phillips
- Department of Gastroenterology, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania, United States of America
| | - Sophie Terp
- Department of Emergency Medicine, University of Southern California, Los Angeles, California, United States of America
| | - Alexandra Weissman
- Department of Emergency Medicine, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania, United States of America
| | - Darwin Conwell
- Department of Medicine, University of Kentucky, Lexington, Kentucky, United States of America
| | - Philip Hart
- Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Mitchell Ramsey
- Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Somashekar Krishna
- Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Samuel Han
- Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Erica Park
- Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Raj Shah
- Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Venkata Akshintala
- Department of Gastroenterology, Johns Hopkins Medical Center, Baltimore, Maryland, United States of America
| | - John A. Windsor
- Department of Surgical and Translational Research Centre, University of Auckland, Auckland, New Zealand
| | - Nikhil K. Mull
- Department of Hospital Medicine and Penn Medicine Center for Evidence-based Practice, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Georgios Papachristou
- Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Leo Anthony Celi
- Department of Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Department of Critical Care, Beth Israel Medical Center, Boston, Massachusetts, United States of America
| | - Peter Lee
- Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| |
Collapse
|
6
|
Chung Z, Mbuagbaw L, Mokete L, Bhandari M, Thabane L. Quality and completeness of, and spin in reporting of, pilot and feasibility studies in hip and knee arthroplasty: a protocol for a methodological survey. BMJ Open 2024; 14:e085441. [PMID: 39384225 PMCID: PMC11474718 DOI: 10.1136/bmjopen-2024-085441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Accepted: 09/24/2024] [Indexed: 10/11/2024] Open
Abstract
INTRODUCTION Pilot or feasibility trials examine the feasibility, viability and recruitment potential of larger, main trials. Specifically, a pilot trial can be instrumental in identifying methodological issues essential to the development of an effective research protocol. However, numerous studies published as pilot or feasibility studies have demonstrated notable inconsistencies in the nature of information reported, resulting in poor-quality and incomplete reporting. It is unclear whether such low quality or incompleteness of reporting is also prevalent in arthroplasty pilot trials. METHODS AND ANALYSIS This protocol outlines a methodological survey examining the completeness of reporting among hip and knee arthroplasty pilot trials in accordance with the Consolidated Standards of Reporting Trials (CONSORT) 2010 extension to pilot trials. Secondary objectives include: (1) determining the prevalence of 'spin' practices, defined as: (a) placing a focus on statistical significance rather than feasibility, (b) presenting results that show the trial to be non-feasible as feasible or (c) emphasising the effectiveness or potential intervention benefits rather than feasibility; (2) determining factors associated with incomplete reporting, and 'spin'. A search of PubMed will be conducted for pilot trials in hip or knee arthroplasty published between 01 January 2017 and 31 December 2023. Following screening, appropriate data will be extracted from eligible publications and reported as descriptive statistics, encompassing elements of the CONSORT checklist associated with completeness of reporting. Logistic regression analysis and Poisson regression will be used to analyse factors associated with completeness of reporting and spin. ETHICS AND DISSEMINATION This methodological review does not require formal ethical approval, as it will solely involve the use of published and publicly reported literature. The results of this study will be disseminated through submission to peer-reviewed journals and academic conference presentations. Study details will be sent to McMaster University's media coordinators to be shared through the institution's research-focused platforms.
Collapse
Affiliation(s)
- Zinnia Chung
- Michael G. DeGroote School of Medicine, McMaster University, Hamilton, Ontario, Canada
| | - Lawrence Mbuagbaw
- Department of Health Research Methods Evidence and Impact, McMaster University, Hamilton, Ontario, Canada
- Biostatistics Unit, Father Sean O'Sullivan Research Centre, St Joseph's Healthcare Hamilton, Hamilton, Ontario, Canada
| | - Lipalo Mokete
- Division of Orthopaedic Surgery, University of the Witwatersrand Johannesburg, Johannesburg, Gauteng, South Africa
| | - Mohit Bhandari
- Department of Surgery, McMaster University, Hamilton, Ontario, Canada
| | - Lehana Thabane
- Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, Ontario, Canada
- Research Institute, St Joseph's Healthcare Hamilton, Hamilton, Ontario, Canada
| |
Collapse
|
7
|
Drogt J, Milota M, van den Brink A, Jongsma K. Ethical guidance for reporting and evaluating claims of AI outperforming human doctors. NPJ Digit Med 2024; 7:271. [PMID: 39358556 PMCID: PMC11447248 DOI: 10.1038/s41746-024-01255-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2024] [Accepted: 09/08/2024] [Indexed: 10/04/2024] Open
Affiliation(s)
| | - Megan Milota
- University Medical Center, Utrecht, The Netherlands
| | | | | |
Collapse
|
8
|
Shah A, Dhiman P. Artificial intelligence in peri-operative prediction model research: are we there yet? Anaesthesia 2024; 79:1017-1022. [PMID: 38747301 DOI: 10.1111/anae.16315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/26/2024] [Indexed: 09/16/2024]
Affiliation(s)
- Akshay Shah
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, UK
| | - Paula Dhiman
- Nuffield Department of Orthopaedics Rheumatology and Musculoskeletal Sciences, Centre for Statistics in Medicine, University of Oxford, Oxford, UK
| |
Collapse
|
9
|
Shrier I, Impellizzeri FM, Stovitz SD. Identifying and Minimizing Incentives for Competing Interests in Sports Medicine Publications. Sports Med 2024; 54:1991-2000. [PMID: 38714641 DOI: 10.1007/s40279-024-02037-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/17/2024] [Indexed: 05/10/2024]
Abstract
Academics in sports medicine as well as other medical fields are generally expected to publish research and opinions in peer-reviewed journals. The peer-review process is intended to protect against the publication of flawed research and unsubstantiated claims. However, both financial and non-financial competing interests may result in sub-optimal results by affecting investigators, editors, peer reviewers, academic institutions, and publishers. In this article, we focus on the non-financial competing interests created in our current academic system. Because these competing interests are embedded in our current scholastic framework, the potential biases are difficult to quantify. To minimize the effect of these competing interests, we review and highlight some underlying incentives for each stakeholder and some potential solutions to mitigate their effects.
Collapse
Affiliation(s)
- Ian Shrier
- Centre for Clinical Epidemiology, Lady Davis Institute, Jewish General Hospital, McGill University, 3755 Cote Sainte Catherine Road, Montreal, QC, H3T 1E2, Canada.
| | - Franco M Impellizzeri
- School of Sport, Exercise and Rehabilitation, Faculty of Health, University of Technology Sydney, Sydney, NSW, Australia
| | - Steven D Stovitz
- Department of Family Medicine and Community Health, University of Minnesota, Minnesota, USA
| |
Collapse
|
10
|
Clift AK, Mahon H, Khan G, Boardman-Pretty F, Worker A, Marchini E, Buendia O, Fish P, Khan MS. Identifying patients with undiagnosed small intestinal neuroendocrine tumours in primary care using statistical and machine learning: model development and validation study. Br J Cancer 2024; 131:305-311. [PMID: 38831012 PMCID: PMC11263687 DOI: 10.1038/s41416-024-02736-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Revised: 05/10/2024] [Accepted: 05/23/2024] [Indexed: 06/05/2024] Open
Abstract
BACKGROUND Neuroendocrine tumours (NETs) are increasing in incidence, often diagnosed at advanced stages, and individuals may experience years of diagnostic delay, particularly when arising from the small intestine (SI). Clinical prediction models could present novel opportunities for case finding in primary care. METHODS An open cohort of adults (18+ years) contributing data to the Optimum Patient Care Research Database between 1st Jan 2000 and 30th March 2023 was identified. This database collects de-identified data from general practices in the UK. Model development approaches comprised logistic regression, penalised regression, and XGBoost. Performance (discrimination and calibration) was assessed using internal-external cross-validation. Decision analysis curves compared clinical utility. RESULTS Of 11.7 million individuals, 382 had recorded SI NET diagnoses (0.003%). The XGBoost model had the highest AUC (0.869, 95% confidence interval [CI]: 0.841-0.898) but was mildly miscalibrated (slope 1.165, 95% CI: 1.088-1.243; calibration-in-the-large 0.010, 95% CI: -0.164 to 0.185). Clinical utility was similar across all models. DISCUSSION Multivariable prediction models may have clinical utility in identifying individuals with undiagnosed SI NETs using information in their primary care records. Further evaluation including external validation and health economics modelling may identify cost-effective strategies for case finding for this uncommon tumour.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Peter Fish
- Mendelian, The Trampery Old Street, London, UK
| | - Mohid S Khan
- South Wales Neuroendocrine Cancer Service, University Hospital of Wales, Cardiff and Vale University Health Board, Heath Park, Cardiff, UK
- Cardiff University, School of Medicine, University Hospital of Wales, Cardiff, UK
| |
Collapse
|
11
|
Lyman GH, Kuderer NM. Artificial Intelligence in Cancer Clinical Research: II. Development and Validation of Clinical Prediction Models. Cancer Invest 2024; 42:447-451. [PMID: 38775011 DOI: 10.1080/07357907.2024.2354991] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/13/2024]
Affiliation(s)
- Gary H Lyman
- Editor-in-Chief, Cancer Investigation Division of Public Health Sciences, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Nicole M Kuderer
- Deputy Editor, Cancer Investigation Advanced Cancer Research Group, Kirkland, WA, USA
| |
Collapse
|
12
|
Song J, Zhang J, Liu G, Guo Z, Liao H, Feng W, Lin W, Li L, Zhang Y, Yang Y, Liu B, Luo R, Chen H, Wang S, Liu JH. PET/CT deep learning prognosis for treatment decision support in esophageal squamous cell carcinoma. Insights Imaging 2024; 15:161. [PMID: 38913225 PMCID: PMC11196479 DOI: 10.1186/s13244-024-01737-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2024] [Accepted: 06/02/2024] [Indexed: 06/25/2024] Open
Abstract
OBJECTIVES The clinical decision-making regarding choosing surgery alone (SA) or surgery followed by postoperative adjuvant chemotherapy (SPOCT) in esophageal squamous cell carcinoma (ESCC) remains controversial. We aim to propose a pre-therapy PET/CT image-based deep learning approach to improve the survival benefit and clinical management of ESCC patients. METHODS This retrospective multicenter study included 837 ESCC patients from three institutions. Prognostic biomarkers integrating six networks were developed to build an ESCC prognosis (ESCCPro) model and predict the survival probability of ESCC patients treated with SA and SPOCT. Patients who did not undergo surgical resection were in a control group. Overall survival (OS) was the primary end-point event. The expected improvement in survival prognosis with the application of ESCCPro to assign treatment protocols was estimated by comparing the survival of patients in each subgroup. Seven clinicians with varying experience evaluated how ESCCPro performed in assisting clinical decision-making. RESULTS In this retrospective multicenter study, patients receiving SA had a median OS 9.2 months longer than controls. No significant differences in survival were found between SA patients with predicted poor outcomes and the controls (p > 0.05). It was estimated that if ESCCPro was used to determine SA and SPOCT eligibility, the median OS in the ESCCPro-recommended SA group and SPOCT group would have been 15.3 months and 24.9 months longer, respectively. In addition, ESCCPro also significantly improved prognosis accuracy, certainty, and the efficiency of clinical experts. CONCLUSION ESCCPro assistance improved the survival benefit of ESCC patients and the clinical decision-making among the two treatment approaches. CRITICAL RELEVANCE STATEMENT The ESCCPro model for treatment decision-making is promising to improve overall survival in ESCC patients undergoing surgical resection and patients undergoing surgery followed by postoperative adjuvant chemotherapy. KEY POINTS ESCC is associated with a poor prognosis and unclear ideal treatments. ESCCPro predicts the survival of patients with ESCC and the expected benefit from SA. ESCCPro improves clinicians' stratification of patients' prognoses.
Collapse
Affiliation(s)
- Jiangdian Song
- School of Health Management, China Medical University, Shenyang, China.
| | - Jie Zhang
- Department of Medical Imaging, Zhuhai People's Hospital (Zhuhai Clinical Medical College of Jinan University), Zhuhai, China
| | - Guichao Liu
- The Fifth Affiliated Hospital of Sun Yat-Sen University, Guangdong, China
| | - Zhexu Guo
- Department of Surgical Oncology and General Surgery, The First Hospital of China Medical University, Key Laboratory of Precision Diagnosis and Treatment of Gastrointestinal Tumors China Medical University, Ministry of Education, Shenyang, China
- Department of VIP In-Patient Ward, The First Hospital of China Medical University, Shenyang, China
| | - Hongxian Liao
- Department of Medical Imaging, Zhuhai People's Hospital (Zhuhai Clinical Medical College of Jinan University), Zhuhai, China
| | - Wenhui Feng
- Department of Medical Imaging, Zhuhai People's Hospital (Zhuhai Clinical Medical College of Jinan University), Zhuhai, China
| | - Wenxiang Lin
- Department of Medical Imaging, Zhuhai People's Hospital (Zhuhai Clinical Medical College of Jinan University), Zhuhai, China
| | - Lei Li
- Department of Medical Imaging, Zhuhai People's Hospital (Zhuhai Clinical Medical College of Jinan University), Zhuhai, China
| | - Yi Zhang
- Department of Medical Imaging, Zhuhai People's Hospital (Zhuhai Clinical Medical College of Jinan University), Zhuhai, China
| | - Yuxiang Yang
- Department of Radiology, The Second People's Hospital of Xiangzhou, Zhuhai, China
| | - Bin Liu
- Department of Radiology, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
| | - Ruibang Luo
- Department of Computer Science, The University of Hong Kong, Hong Kong, Hong Kong
| | - Hao Chen
- Department of Gastroenterology, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Southern Medical University, Guangzhou, China.
| | - Siyun Wang
- Department of Oncology, Cancer Center, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China.
| | - Jian-Hua Liu
- Department of Oncology, Cancer Center, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China.
| |
Collapse
|
13
|
Andaur Navarro CL, Damen JAA, Ghannad M, Dhiman P, van Smeden M, Reitsma JB, Collins GS, Riley RD, Moons KGM, Hooft L. SPIN-PM: a consensus framework to evaluate the presence of spin in studies on prediction models. J Clin Epidemiol 2024; 170:111364. [PMID: 38631529 DOI: 10.1016/j.jclinepi.2024.111364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Revised: 04/01/2024] [Accepted: 04/08/2024] [Indexed: 04/19/2024]
Abstract
OBJECTIVES To develop a framework to identify and evaluate spin practices and its facilitators in studies on clinical prediction model regardless of the modeling technique. STUDY DESIGN AND SETTING We followed a three-phase consensus process: (1) premeeting literature review to generate items to be included; (2) a series of structured meetings to provide comments discussed and exchanged viewpoints on items to be included with a panel of experienced researchers; and (3) postmeeting review on final list of items and examples to be included. Through this iterative consensus process, a framework was derived after all panel's researchers agreed. RESULTS This consensus process involved a panel of eight researchers and resulted in SPIN-Prediction Models which consists of two categories of spin (misleading interpretation and misleading transportability), and within these categories, two forms of spin (spin practices and facilitators of spin). We provide criteria and examples. CONCLUSION We proposed this guidance aiming to facilitate not only the accurate reporting but also an accurate interpretation and extrapolation of clinical prediction models which will likely improve the reporting quality of subsequent research, as well as reduce research waste.
Collapse
Affiliation(s)
- Constanza L Andaur Navarro
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands; Cochrane Netherlands, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands.
| | - Johanna A A Damen
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands; Cochrane Netherlands, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Mona Ghannad
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands; Cochrane Netherlands, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Paula Dhiman
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology & Musculoskeletal Sciences, University of Oxford, Oxford, UK; NIHR Oxford Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, Oxford, UK
| | - Maarten van Smeden
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Johannes B Reitsma
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Gary S Collins
- Centre for Statistics in Medicine, Nuffield Department of Orthopaedics, Rheumatology & Musculoskeletal Sciences, University of Oxford, Oxford, UK; NIHR Oxford Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, Oxford, UK
| | - Richard D Riley
- Institute of Applied Health Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham B15 2TT, UK
| | - Karel G M Moons
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands; Cochrane Netherlands, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Lotty Hooft
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands; Cochrane Netherlands, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| |
Collapse
|
14
|
Nordblom N, Büttner M, Schwendicke F. Artificial Intelligence in Orthodontics: Critical Review. J Dent Res 2024; 103:577-584. [PMID: 38682436 PMCID: PMC11118788 DOI: 10.1177/00220345241235606] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/01/2024] Open
Abstract
With increasing digitalization in orthodontics, certain orthodontic manufacturing processes such as the fabrication of indirect bonding trays, aligner production, or wire bending can be automated. However, orthodontic treatment planning and evaluation remains a specialist's task and responsibility. As the prediction of growth in orthodontic patients and response to orthodontic treatment is inherently complex and individual, orthodontists make use of features gathered from longitudinal, multimodal, and standardized orthodontic data sets. Currently, these data sets are used by the orthodontist to make informed, rule-based treatment decisions. In research, artificial intelligence (AI) has been successfully applied to assist orthodontists with the extraction of relevant data from such data sets. Here, AI has been applied for the analysis of clinical imagery, such as automated landmark detection in lateral cephalograms but also for evaluation of intraoral scans or photographic data. Furthermore, AI is applied to help orthodontists with decision support for treatment decisions such as the need for orthognathic surgery or for orthodontic tooth extractions. One major challenge in current AI research in orthodontics is the limited generalizability, as most studies use unicentric data with high risks of bias. Moreover, comparing AI across different studies and tasks is virtually impossible as both outcomes and outcome metrics vary widely, and underlying data sets are not standardized. Notably, only few AI applications in orthodontics have reached full clinical maturity and regulatory approval, and researchers in the field are tasked with tackling real-world evaluation and implementation of AI into the orthodontic workflow.
Collapse
Affiliation(s)
- N.F. Nordblom
- Department of Oral Diagnostics, Digital Health and Health Services Research, Charité–Universitätsmedizin Berlin, Berlin, Germany
| | - M. Büttner
- Department of Oral Diagnostics, Digital Health and Health Services Research, Charité–Universitätsmedizin Berlin, Berlin, Germany
| | - F. Schwendicke
- Department of Conservative Dentistry and Periodontology, University Hospital, Ludwig-Maximilians University of Munich, Munich, Germany
| |
Collapse
|
15
|
Collins GS. Making the black box more transparent: improving the reporting of artificial intelligence studies in healthcare. BMJ 2024; 385:q832. [PMID: 38626954 DOI: 10.1136/bmj.q832] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 04/19/2024]
Affiliation(s)
- Gary S Collins
- Centre for Statistics in Medicine, UK EQUATOR Centre, Nuffield Department of Orthopaedics, Rheumatology, and Musculoskeletal Sciences, University of Oxford, Oxford OX3 7LD, UK
| |
Collapse
|
16
|
Collins GS, Moons KGM, Dhiman P, Riley RD, Beam AL, Van Calster B, Ghassemi M, Liu X, Reitsma JB, van Smeden M, Boulesteix AL, Camaradou JC, Celi LA, Denaxas S, Denniston AK, Glocker B, Golub RM, Harvey H, Heinze G, Hoffman MM, Kengne AP, Lam E, Lee N, Loder EW, Maier-Hein L, Mateen BA, McCradden MD, Oakden-Rayner L, Ordish J, Parnell R, Rose S, Singh K, Wynants L, Logullo P. TRIPOD+AI statement: updated guidance for reporting clinical prediction models that use regression or machine learning methods. BMJ 2024; 385:e078378. [PMID: 38626948 PMCID: PMC11019967 DOI: 10.1136/bmj-2023-078378] [Citation(s) in RCA: 285] [Impact Index Per Article: 285.0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 01/17/2024] [Indexed: 04/19/2024]
Affiliation(s)
- Gary S Collins
- Centre for Statistics in Medicine, UK EQUATOR Centre, Nuffield Department of Orthopaedics, Rheumatology, and Musculoskeletal Sciences, University of Oxford, Oxford OX3 7LD, UK
| | - Karel G M Moons
- Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
| | - Paula Dhiman
- Centre for Statistics in Medicine, UK EQUATOR Centre, Nuffield Department of Orthopaedics, Rheumatology, and Musculoskeletal Sciences, University of Oxford, Oxford OX3 7LD, UK
| | - Richard D Riley
- Institute of Applied Health Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
- National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, Birmingham, UK
| | - Andrew L Beam
- Department of Epidemiology, Harvard T H Chan School of Public Health, Boston, MA, USA
| | - Ben Van Calster
- Department of Development and Regeneration, KU Leuven, Leuven, Belgium
- Department of Biomedical Data Science, Leiden University Medical Centre, Leiden, Netherlands
| | - Marzyeh Ghassemi
- Department of Electrical Engineering and Computer Science, Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Xiaoxuan Liu
- Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
- University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
| | - Johannes B Reitsma
- Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
| | - Maarten van Smeden
- Julius Centre for Health Sciences and Primary Care, University Medical Centre Utrecht, Utrecht University, Utrecht, Netherlands
| | - Anne-Laure Boulesteix
- Institute for Medical Information Processing, Biometry and Epidemiology, Faculty of Medicine, Ludwig-Maximilians-University of Munich and Munich Centre of Machine Learning, Germany
| | - Jennifer Catherine Camaradou
- Patient representative, Health Data Research UK patient and public involvement and engagement group
- Patient representative, University of East Anglia, Faculty of Health Sciences, Norwich Research Park, Norwich, UK
| | - Leo Anthony Celi
- Beth Israel Deaconess Medical Center, Boston, MA, USA
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Biostatistics, Harvard T H Chan School of Public Health, Boston, MA, USA
| | - Spiros Denaxas
- Institute of Health Informatics, University College London, London, UK
- British Heart Foundation Data Science Centre, London, UK
| | - Alastair K Denniston
- National Institute for Health and Care Research (NIHR) Birmingham Biomedical Research Centre, Birmingham, UK
- Institute of Inflammation and Ageing, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
| | - Ben Glocker
- Department of Computing, Imperial College London, London, UK
| | - Robert M Golub
- Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | | | - Georg Heinze
- Section for Clinical Biometrics, Centre for Medical Data Science, Medical University of Vienna, Vienna, Austria
| | - Michael M Hoffman
- Princess Margaret Cancer Centre, University Health Network, Toronto, ON, Canada
- Department of Medical Biophysics, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON, Canada
| | | | - Emily Lam
- Patient representative, Health Data Research UK patient and public involvement and engagement group
| | - Naomi Lee
- National Institute for Health and Care Excellence, London, UK
| | - Elizabeth W Loder
- The BMJ, London, UK
- Department of Neurology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Lena Maier-Hein
- Department of Intelligent Medical Systems, German Cancer Research Centre, Heidelberg, Germany
| | - Bilal A Mateen
- Institute of Health Informatics, University College London, London, UK
- Wellcome Trust, London, UK
- Alan Turing Institute, London, UK
| | - Melissa D McCradden
- Department of Bioethics, Hospital for Sick Children Toronto, ON, Canada
- Genetics and Genome Biology, SickKids Research Institute, Toronto, ON, Canada
| | - Lauren Oakden-Rayner
- Australian Institute for Machine Learning, University of Adelaide, Adelaide, SA, Australia
| | - Johan Ordish
- Medicines and Healthcare products Regulatory Agency, London, UK
| | - Richard Parnell
- Patient representative, Health Data Research UK patient and public involvement and engagement group
| | - Sherri Rose
- Department of Health Policy and Center for Health Policy, Stanford University, Stanford, CA, USA
| | - Karandeep Singh
- Department of Epidemiology, CAPHRI Care and Public Health Research Institute, Maastricht University, Maastricht, Netherlands
| | - Laure Wynants
- Department of Epidemiology, CAPHRI Care and Public Health Research Institute, Maastricht University, Maastricht, Netherlands
| | - Patricia Logullo
- Centre for Statistics in Medicine, UK EQUATOR Centre, Nuffield Department of Orthopaedics, Rheumatology, and Musculoskeletal Sciences, University of Oxford, Oxford OX3 7LD, UK
| |
Collapse
|
17
|
Hassan A, Critelli B, Lahooti I, Lahooti A, Matzko N, Adams JN, Liss L, Quion J, Restrepo D, Nikahd M, Culp S, Noh L, Tong K, Park JS, Akshintala V, Windsor JA, Mull NK, Papachristou GI, Celi LA, Lee PJ. Critical appraisal of machine learning prognostic models for acute pancreatitis: protocol for a systematic review. Diagn Progn Res 2024; 8:6. [PMID: 38561864 PMCID: PMC10986113 DOI: 10.1186/s41512-024-00169-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Accepted: 02/15/2024] [Indexed: 04/04/2024] Open
Abstract
Acute pancreatitis (AP) is an acute inflammatory disorder that is common, costly, and is increasing in incidence worldwide with over 300,000 hospitalizations occurring yearly in the United States alone. As its course and outcomes vary widely, a critical knowledge gap in the field has been a lack of accurate prognostic tools to forecast AP patients' outcomes. Despite several published studies in the last three decades, the predictive performance of published prognostic models has been found to be suboptimal. Recently, non-regression machine learning models (ML) have garnered intense interest in medicine for their potential for better predictive performance. Each year, an increasing number of AP models are being published. However, their methodologic quality relating to transparent reporting and risk of bias in study design has never been systematically appraised. Therefore, through collaboration between a group of clinicians and data scientists with appropriate content expertise, we will perform a systematic review of papers published between January 2021 and December 2023 containing artificial intelligence prognostic models in AP. To systematically assess these studies, the authors will leverage the CHARMS checklist, PROBAST tool for risk of bias assessment, and the most current version of the TRIPOD-AI. (Research Registry ( http://www.reviewregistry1727 .).
Collapse
Affiliation(s)
- Amier Hassan
- Division of Gastroenterology and Hepatology, Weill Cornell Medical College, New York, USA
| | - Brian Critelli
- Division of Gastroenterology and Hepatology, Weill Cornell Medical College, New York, USA
| | - Ila Lahooti
- Division of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, OH, USA
| | - Ali Lahooti
- Division of Gastroenterology and Hepatology, Weill Cornell Medical College, New York, USA
| | - Nate Matzko
- Division of Gastroenterology and Hepatology, Weill Cornell Medical College, New York, USA
| | - Jan Niklas Adams
- Division of Process and Data Science, Rheinisch-Westfälische Technische Hochschule Aachen University, Aachen, Germany
| | - Lukas Liss
- Division of Process and Data Science, Rheinisch-Westfälische Technische Hochschule Aachen University, Aachen, Germany
| | - Justin Quion
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, USA
| | - David Restrepo
- Laboratory for Computational Physiology, Massachusetts Institute of Technology, Cambridge, USA
| | - Melica Nikahd
- Division of Bioinformatics, Ohio State University Wexner Medical Center, Columbus, USA
| | - Stacey Culp
- Division of Bioinformatics, Ohio State University Wexner Medical Center, Columbus, USA
| | - Lydia Noh
- Northeast Ohio Medical School, Rootstown, USA
| | - Kathleen Tong
- Division of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, OH, USA
| | - Jun Sung Park
- Division of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, OH, USA
| | - Venkata Akshintala
- Division of Gastroenterology, Johns Hopkins Medical Center, Baltimore, USA
| | - John A Windsor
- Department of Surgery, University of Auckland, Auckland, New Zealand
| | - Nikhil K Mull
- Division of Hospital Medicine and Penn Medicine Center for Evidence-based Practice, University of Pennsylvania, Philadelphia, USA
| | - Georgios I Papachristou
- Division of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, OH, USA
| | - Leo Anthony Celi
- Department of Surgery, University of Auckland, Auckland, New Zealand
- Division of Critical Care, Beth Israel Medical Center, Boston, USA
| | - Peter J Lee
- Division of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, OH, USA.
| |
Collapse
|
18
|
O'Connor S, Vercell A, Wong D, Yorke J, Fallatah FA, Cave L, Anny Chen LY. The application and use of artificial intelligence in cancer nursing: A systematic review. Eur J Oncol Nurs 2024; 68:102510. [PMID: 38310664 DOI: 10.1016/j.ejon.2024.102510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Revised: 01/07/2024] [Accepted: 01/10/2024] [Indexed: 02/06/2024]
Abstract
PURPOSE Artificial Intelligence is being applied in oncology to improve patient and service outcomes. Yet, there is a limited understanding of how these advanced computational techniques are employed in cancer nursing to inform clinical practice. This review aimed to identify and synthesise evidence on artificial intelligence in cancer nursing. METHODS CINAHL, MEDLINE, PsycINFO, and PubMed were searched using key terms between January 2010 and December 2022. Titles, abstracts, and then full texts were screened against eligibility criteria, resulting in twenty studies being included. Critical appraisal was undertaken, and relevant data extracted and analysed. The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines were followed. RESULTS Artificial intelligence was used in numerous areas including breast, colorectal, liver, and ovarian cancer care among others. Algorithms were trained and tested on primary and secondary datasets to build predictive models of health problems related to cancer. Studies reported this led to improvements in the accuracy of predicting health outcomes or identifying variables that improved outcome prediction. While nurses led most studies, few deployed an artificial intelligence based digital tool with cancer nurses in a real-world setting as studies largely focused on developing and validating predictive models. CONCLUSION Electronic cancer nursing datasets should be established to enable artificial intelligence techniques to be tested and if effective implemented in digital prediction and other AI-based tools. Cancer nurses need more education on machine learning and natural language processing, so they can lead and contribute to artificial intelligence developments in oncology.
Collapse
Affiliation(s)
- Siobhan O'Connor
- Florence Nightingale Faculty of Nursing, Midwifery and Palliative Care, King's College London, London, United Kingdom.
| | - Amy Vercell
- Florence Nightingale Faculty of Nursing, Midwifery and Palliative Care, King's College London, London, United Kingdom; The Christie NHS Foundation Trust, Wilmslow Rd, Manchester, M20 4BX, United Kingdom.
| | - David Wong
- Leeds Institute for Health Informatics, University of Leeds, Leeds, United Kingdom.
| | - Janelle Yorke
- Florence Nightingale Faculty of Nursing, Midwifery and Palliative Care, King's College London, London, United Kingdom; The Christie NHS Foundation Trust, Wilmslow Rd, Manchester, M20 4BX, United Kingdom.
| | - Fatmah Abdulsamad Fallatah
- Department of Nursing Affairs, King Faisal Specialist Hospital and Research Centre, Riyadh, Saudi Arabia.
| | - Louise Cave
- NHS Transformation Directorate, NHS England, England, United Kingdom.
| | - Lu-Yen Anny Chen
- Institute of Clinical Nursing, College of Nursing, National Yang Ming Chiao Tung University, Taipei, Taiwan.
| |
Collapse
|
19
|
Jin Z, Ma F, Chen H, Guo S. Leveraging machine learning to distinguish between bacterial and viral induced pharyngitis using hematological markers: a retrospective cohort study. Sci Rep 2023; 13:22899. [PMID: 38129529 PMCID: PMC10739959 DOI: 10.1038/s41598-023-49925-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Accepted: 12/13/2023] [Indexed: 12/23/2023] Open
Abstract
Accurate differentiation between bacterial and viral-induced pharyngitis is recognized as essential for personalized treatment and judicious antibiotic use. From a cohort of 693 patients with pharyngitis, data from 197 individuals clearly diagnosed with bacterial or viral infections were meticulously analyzed in this study. By integrating detailed hematological insights with several machine learning algorithms, including Random Forest, Neural Networks, Decision Trees, Support Vector Machine, Naive Bayes, and Lasso Regression, for potential biomarkers were identified, with an emphasis being placed on the diagnostic significance of the Monocyte-to-Lymphocyte Ratio. Distinct inflammatory signatures associated with bacterial infections were spotlighted in this study. An innovation introduced in this research was the adaptation of the high-accuracy Lasso Regression model for the TI-84 calculator, with an AUC (95% CI) of 0.94 (0.925-0.955) being achieved. Using this adaptation, pivotal laboratory parameters can be input on-the-spot and infection probabilities can be computed subsequently. This methodology embodies an improvement in diagnostics, facilitating more effective distinction between bacterial and viral infections while fostering judicious antibiotic use.
Collapse
Affiliation(s)
- Zhe Jin
- School of Medical Technology, Hebei Medical University, Shijiazhuang, 050017, People's Republic of China
| | - Fengmei Ma
- Department of Otorhinolaryngology, Hebei Provincial Hospital of Traditional Chinese Medicine, Shijiazhuang, 050011, People's Republic of China
| | - Haoyang Chen
- Medicine-Education Coordination and Medical Education Research Center, Hebei Medical University, Shijiazhuang, 050017, People's Republic of China
| | - Shufan Guo
- Department of Otorhinolaryngology, Hebei Provincial Hospital of Traditional Chinese Medicine, Shijiazhuang, 050011, People's Republic of China.
| |
Collapse
|
20
|
Truchot A, Raynaud M, Loupy A. The authors reply. Kidney Int 2023; 104:1036. [PMID: 37863625 DOI: 10.1016/j.kint.2023.07.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Accepted: 07/26/2023] [Indexed: 10/22/2023]
Affiliation(s)
- Agathe Truchot
- Paris Institute for Transplantation and Organ Regeneration, Université Paris Cité, Institut National de la Santé de Et de la Recherche Médicale (INSERM), U-970, AP-HP, Paris, France
| | - Marc Raynaud
- Paris Institute for Transplantation and Organ Regeneration, Université Paris Cité, Institut National de la Santé de Et de la Recherche Médicale (INSERM), U-970, AP-HP, Paris, France
| | - Alexandre Loupy
- Paris Institute for Transplantation and Organ Regeneration, Université Paris Cité, Institut National de la Santé de Et de la Recherche Médicale (INSERM), U-970, AP-HP, Paris, France.
| |
Collapse
|