1
|
Migiddorj B, Batterham M, Win KT. Systematic literature review on the application of explainable artificial intelligence in palliative care studies. Int J Med Inform 2025; 200:105914. [PMID: 40250167 DOI: 10.1016/j.ijmedinf.2025.105914] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2024] [Revised: 03/02/2025] [Accepted: 04/07/2025] [Indexed: 04/20/2025]
Abstract
BACKGROUND As machine learning models become increasingly prevalent in palliative care, explainability has become a critical factor in their successful deployment in this sensitive field, where decisions can profoundly impact patient health and quality of life. To address these concerns, Explainable AI (XAI) aims to make complex AI models more understandable and trustworthy. OBJECTIVE This study aims to assess the current state of machine learning models in palliative care, specifically focusing on their compliance with the principles of XAI. METHODS A comprehensive literature search in four databases was conducted to identify articles on machine learning in palliative care studies published until May 2024, followed by the Preferred Reporting Items for Systematic Reviews and Meta-Analysis guideline. The Checklist for Assessment of Medical Artificial Intelligence was used to evaluate the quality of the studies. RESULTS Mortality and survival prediction were the primary focus areas in 15 (54%) of the included 28 studies. Regarding data explainability, 20 studies (71%) documented their data preprocessing methods. However, a notable concern is that 45% of the studies did not address handling missing data. Across these studies, 74 machine learning algorithms were employed. Complex models, including Random Forest, Support Vector Machines, Gradient Boosting Machines, and Deep Neural Networks, were predominantly used (64%) due to their high predictive power, achieving AUC values between 0.82 and 0.96. Post-hoc explanation techniques were applied in only 11 studies, using seven different XAI techniques, focusing on global explanations to enhance understanding of model behavior. CONCLUSION Given the critical role of AI-driven decisions in patient care, adopting XAI techniques is essential for fostering trust and usability. Although progress has been made, significant gaps persist. A main challenge remains the trade-off between model performance and interpretability, as highly accurate models often lack the transparency required to build trust in clinical settings. Additionally, complex models frequently provide inadequate explanations for their outputs, lack consistent documentation, and have limited XAI applications, reducing the interpretability of machine learning studies for clinicians and decision-makers.
Collapse
Affiliation(s)
- Battushig Migiddorj
- Faculty of Engineering and Information Science, University of Wollongong, Australia.
| | - Marijka Batterham
- Faculty of Engineering and Information Science, University of Wollongong, Australia
| | - Khin Than Win
- Faculty of Engineering and Information Science, University of Wollongong, Australia
| |
Collapse
|
2
|
Cary MP, Bessias S, McCall J, Pencina MJ, Grady SD, Lytle K, Economou‐Zavlanos NJ. Empowering nurses to champion Health equity & BE FAIR: Bias elimination for fair and responsible AI in healthcare. J Nurs Scholarsh 2025; 57:130-139. [PMID: 39075715 PMCID: PMC11771545 DOI: 10.1111/jnu.13007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Revised: 06/20/2024] [Accepted: 07/09/2024] [Indexed: 07/31/2024]
Abstract
BACKGROUND The concept of health equity by design encompasses a multifaceted approach that integrates actions aimed at eliminating biased, unjust, and correctable differences among groups of people as a fundamental element in the design of algorithms. As algorithmic tools are increasingly integrated into clinical practice at multiple levels, nurses are uniquely positioned to address challenges posed by the historical marginalization of minority groups and its intersections with the use of "big data" in healthcare settings; however, a coherent framework is needed to ensure that nurses receive appropriate training in these domains and are equipped to act effectively. PURPOSE We introduce the Bias Elimination for Fair AI in Healthcare (BE FAIR) framework, a comprehensive strategic approach that incorporates principles of health equity by design, for nurses to employ when seeking to mitigate bias and prevent discriminatory practices arising from the use of clinical algorithms in healthcare. By using examples from a "real-world" AI governance framework, we aim to initiate a wider discourse on equipping nurses with the skills needed to champion the BE FAIR initiative. METHODS Drawing on principles recently articulated by the Office of the National Coordinator for Health Information Technology, we conducted a critical examination of the concept of health equity by design. We also reviewed recent literature describing the risks of artificial intelligence (AI) technologies in healthcare as well as their potential for advancing health equity. Building on this context, we describe the BE FAIR framework, which has the potential to enable nurses to take a leadership role within health systems by implementing a governance structure to oversee the fairness and quality of clinical algorithms. We then examine leading frameworks for promoting health equity to inform the operationalization of BE FAIR within a local AI governance framework. RESULTS The application of the BE FAIR framework within the context of a working governance system for clinical AI technologies demonstrates how nurses can leverage their expertise to support the development and deployment of clinical algorithms, mitigating risks such as bias and promoting ethical, high-quality care powered by big data and AI technologies. CONCLUSION AND RELEVANCE As health systems learn how well-intentioned clinical algorithms can potentially perpetuate health disparities, we have an opportunity and an obligation to do better. New efforts empowering nurses to advocate for BE FAIR, involving them in AI governance, data collection methods, and the evaluation of tools intended to reduce bias, mark important steps in achieving equitable healthcare for all.
Collapse
Affiliation(s)
- Michael P. Cary
- Duke University School of NursingDurhamNorth CarolinaUSA
- Duke University School of MedicineDurhamNorth CarolinaUSA
- Duke University Health SystemDurhamNorth CarolinaUSA
| | - Sophia Bessias
- Duke University School of MedicineDurhamNorth CarolinaUSA
- Duke University Health SystemDurhamNorth CarolinaUSA
| | | | - Michael J. Pencina
- Duke University School of MedicineDurhamNorth CarolinaUSA
- Duke University Health SystemDurhamNorth CarolinaUSA
| | | | - Kay Lytle
- Duke University School of NursingDurhamNorth CarolinaUSA
- Duke University Health SystemDurhamNorth CarolinaUSA
| | | |
Collapse
|
3
|
Gatineau G, Shevroja E, Vendrami C, Gonzalez-Rodriguez E, Leslie WD, Lamy O, Hans D. Development and reporting of artificial intelligence in osteoporosis management. J Bone Miner Res 2024; 39:1553-1573. [PMID: 39163489 PMCID: PMC11523092 DOI: 10.1093/jbmr/zjae131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Revised: 07/17/2024] [Accepted: 08/01/2024] [Indexed: 08/22/2024]
Abstract
An abundance of medical data and enhanced computational power have led to a surge in artificial intelligence (AI) applications. Published studies involving AI in bone and osteoporosis research have increased exponentially, raising the need for transparent model development and reporting strategies. This review offers a comprehensive overview and systematic quality assessment of AI articles in osteoporosis while highlighting recent advancements. A systematic search in the PubMed database, from December 17, 2020 to February 1, 2023 was conducted to identify AI articles that relate to osteoporosis. The quality assessment of the studies relied on the systematic evaluation of 12 quality items derived from the minimum information about clinical artificial intelligence modeling checklist. The systematic search yielded 97 articles that fell into 5 areas; bone properties assessment (11 articles), osteoporosis classification (26 articles), fracture detection/classification (25 articles), risk prediction (24 articles), and bone segmentation (11 articles). The average quality score for each study area was 8.9 (range: 7-11) for bone properties assessment, 7.8 (range: 5-11) for osteoporosis classification, 8.4 (range: 7-11) for fracture detection, 7.6 (range: 4-11) for risk prediction, and 9.0 (range: 6-11) for bone segmentation. A sixth area, AI-driven clinical decision support, identified the studies from the 5 preceding areas that aimed to improve clinician efficiency, diagnostic accuracy, and patient outcomes through AI-driven models and opportunistic screening by automating or assisting with specific clinical tasks in complex scenarios. The current work highlights disparities in study quality and a lack of standardized reporting practices. Despite these limitations, a wide range of models and examination strategies have shown promising outcomes to aid in the earlier diagnosis and improve clinical decision-making. Through careful consideration of sources of bias in model performance assessment, the field can build confidence in AI-based approaches, ultimately leading to improved clinical workflows and patient outcomes.
Collapse
Affiliation(s)
- Guillaume Gatineau
- Interdisciplinary Center of Bone Diseases, Rheumatology Unit, Bone and Joint Department, Lausanne University Hospital and University of Lausanne, Av. Pierre-Decker 4, 1011 Lausanne, Switzerland
| | - Enisa Shevroja
- Interdisciplinary Center of Bone Diseases, Rheumatology Unit, Bone and Joint Department, Lausanne University Hospital and University of Lausanne, Av. Pierre-Decker 4, 1011 Lausanne, Switzerland
| | - Colin Vendrami
- Interdisciplinary Center of Bone Diseases, Rheumatology Unit, Bone and Joint Department, Lausanne University Hospital and University of Lausanne, Av. Pierre-Decker 4, 1011 Lausanne, Switzerland
| | - Elena Gonzalez-Rodriguez
- Interdisciplinary Center of Bone Diseases, Rheumatology Unit, Bone and Joint Department, Lausanne University Hospital and University of Lausanne, Av. Pierre-Decker 4, 1011 Lausanne, Switzerland
| | - William D Leslie
- Department of Medicine, University of Manitoba, Winnipeg, MB R3T 2N2, Canada
| | - Olivier Lamy
- Internal Medicine Unit, Internal Medicine Department, Lausanne University Hospital and University of Lausanne, 1005 Lausanne, Switzerland
| | - Didier Hans
- Interdisciplinary Center of Bone Diseases, Rheumatology Unit, Bone and Joint Department, Lausanne University Hospital and University of Lausanne, Av. Pierre-Decker 4, 1011 Lausanne, Switzerland
| |
Collapse
|
4
|
Lei M, Feng T, Chen M, Shen J, Liu J, Chang F, Chen J, Sun X, Mao Z, Li Y, Yin P, Tang P, Zhang L. Establishment and validation of an artificial intelligence web application for predicting postoperative in-hospital mortality in patients with hip fracture: a national cohort study of 52 707 cases. Int J Surg 2024; 110:4876-4892. [PMID: 38752505 PMCID: PMC11325965 DOI: 10.1097/js9.0000000000001599] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Accepted: 04/26/2024] [Indexed: 05/19/2024]
Abstract
BACKGROUND In-hospital mortality following hip fractures is a significant concern, and accurate prediction of this outcome is crucial for appropriate clinical management. Nonetheless, there is a lack of effective prediction tools in clinical practice. By utilizing artificial intelligence (AI) and machine learning techniques, this study aims to develop a predictive model that can assist clinicians in identifying geriatric hip fracture patients at a higher risk of in-hospital mortality. METHODS A total of 52 707 geriatric hip fracture patients treated with surgery from 90 hospitals were included in this study. The primary outcome was postoperative in-hospital mortality. The patients were randomly divided into two groups, with a ratio of 7:3. The majority of patients, assigned to the training cohort, were used to develop the AI models. The remaining patients, assigned to the validation cohort, were used to validate the models. Various machine learning algorithms, including logistic regression (LR), decision tree (DT), naïve bayesian (NB), neural network (NN), eXGBoosting machine (eXGBM), and random forest (RF), were employed for model development. A comprehensive scoring system, incorporating 10 evaluation metrics, was developed to assess the prediction performance, with higher scores indicating superior predictive capability. Based on the best machine learning-based model, an AI application was developed on the Internet. In addition, a comparative testing of prediction performance between doctors and the AI application. FINDINGS The eXGBM model exhibited the best prediction performance, with an area under the curve (AUC) of 0.908 (95% CI: 0.881-0.932), as well as the highest accuracy (0.820), precision (0.817), specificity (0.814), and F1 score (0.822), and the lowest Brier score (0.120) and log loss (0.374). Additionally, the model showed favorable calibration, with a slope of 0.999 and an intercept of 0.028. According to the scoring system incorporating 10 evaluation metrics, the eXGBM model achieved the highest score (56), followed by the RF model (48) and NN model (41). The LR, DT, and NB models had total scores of 27, 30, and 13, respectively. The AI application has been deployed online at https://in-hospitaldeathinhipfracture-l9vhqo3l55fy8dkdvuskvu.streamlit.app/ , based on the eXGBM model. The comparative testing revealed that the AI application's predictive capabilities significantly outperformed those of the doctors in terms of AUC values (0.908 vs. 0.682, P <0.001). CONCLUSIONS The eXGBM model demonstrates promising predictive performance in assessing the risk of postoperative in-hospital mortality among geriatric hip fracture patients. The developed AI model serves as a valuable tool to enhance clinical decision-making.
Collapse
Affiliation(s)
- Mingxing Lei
- Department of Orthopedics, National Clinical Research Center for Orthopedics, Sports Medicine and Rehabilitation, PLA General Hospital
- Department of Orthopedics, Chinese PLA General Hospital
- Chinese PLA Medical School
- Department of Orthopedics, Hainan Hospital of Chinese PLA General Hospital, Hainan, People's Republic of China
| | - Taojin Feng
- Department of Orthopedics, Chinese PLA General Hospital
- Chinese PLA Medical School
| | - Ming Chen
- Department of Orthopedics, Chinese PLA General Hospital
- Chinese PLA Medical School
| | - Junmin Shen
- Department of Orthopedics, Chinese PLA General Hospital
- Chinese PLA Medical School
| | - Jiang Liu
- Department of Orthopedics, Chinese PLA General Hospital
- Chinese PLA Medical School
| | - Feifan Chang
- Department of Orthopedics, Chinese PLA General Hospital
- Chinese PLA Medical School
| | - Junyu Chen
- Department of Orthopedics, Chinese PLA General Hospital
- Chinese PLA Medical School
| | - Xinyu Sun
- Department of Orthopedics, Chinese PLA General Hospital
- Chinese PLA Medical School
| | - Zhi Mao
- Department of Emergency, The First Medical Center of PLA General Hospital, Beijing
| | - Yi Li
- Department of Orthopedics, National Clinical Research Center for Orthopedics, Sports Medicine and Rehabilitation, PLA General Hospital
- Department of Orthopedics, Chinese PLA General Hospital
| | - Pengbin Yin
- Department of Orthopedics, National Clinical Research Center for Orthopedics, Sports Medicine and Rehabilitation, PLA General Hospital
- Department of Orthopedics, Chinese PLA General Hospital
| | - Peifu Tang
- Department of Orthopedics, National Clinical Research Center for Orthopedics, Sports Medicine and Rehabilitation, PLA General Hospital
- Department of Orthopedics, Chinese PLA General Hospital
| | - Licheng Zhang
- Department of Orthopedics, National Clinical Research Center for Orthopedics, Sports Medicine and Rehabilitation, PLA General Hospital
- Department of Orthopedics, Chinese PLA General Hospital
| |
Collapse
|
5
|
Salvador Comino MR, Youssef P, Heinzelmann A, Bernhardt F, Seifert C, Tewes M. Machine Learning-Based Prediction of 1-Year Survival Using Subjective and Objective Parameters in Patients With Cancer. JCO Clin Cancer Inform 2024; 8:e2400041. [PMID: 39197123 DOI: 10.1200/cci.24.00041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2024] [Revised: 06/25/2024] [Accepted: 07/15/2024] [Indexed: 08/30/2024] Open
Abstract
PURPOSE Palliative care is recommended for patients with cancer with a life expectancy of <12 months. Machine learning (ML) techniques can help in predicting survival outcomes among patients with cancer and may help distinguish who benefits the most from palliative care support. We aim to explore the importance of several objective and subjective self-reported variables. Subjective variables were collected through electronic psycho-oncologic and palliative care self-assessment screenings. We used these variables to predict 1-year mortality. MATERIALS AND METHODS Between April 1, 2020, and March 31, 2021, a total of 265 patients with advanced cancer completed a patient-reported outcome tool. We documented objective and subjective variables collected from electronic health records, self-reported subjective variables, and all clinical variables combined. We used logistic regression (LR), 20-fold cross-validation, decision trees, and random forests to predict 1-year mortality. We analyzed the receiver operating characteristic (ROC) curve-AUC, the precision-recall curve-AUC (PR-AUC)-and the feature importance of the ML models. RESULTS The performance of clinical nonpatient variables in predictions (LR reaches 0.81 [ROC-AUC] and 0.72 [F1 score]) are much more predictive than that of subjective patient-reported variables (LR reaches 0.55 [ROC-AUC] and 0.52 [F1 score]). CONCLUSION The results show that objective variables used in this study are much more predictive than subjective patient-reported variables, which measure subjective burden. These findings indicate that subjective burden cannot be reliably used to predict survival. Further research is needed to clarify the role of self-reported patient burden and mortality prediction using ML.
Collapse
Affiliation(s)
- Maria Rosa Salvador Comino
- Department of Palliative Medicine, West German Cancer Center, University Hospital Essen, University of Duisburg-Essen, Essen, Germany
| | - Paul Youssef
- Institute for Artificial Intelligence in Medicine (IKIM), University of Duisburg-Essen, Essen, Germany
- Department of Mathematics and Computer Science, University of Marburg, Marburg, Germany
| | - Anna Heinzelmann
- Department of Palliative Medicine, West German Cancer Center, University Hospital Essen, University of Duisburg-Essen, Essen, Germany
| | - Florian Bernhardt
- Department of Palliative Care, West German Cancer Center, University Hospital Muenster, University of Muenster, Muenster, Germany
| | - Christin Seifert
- Department of Mathematics and Computer Science, University of Marburg, Marburg, Germany
| | - Mitra Tewes
- Department of Palliative Medicine, West German Cancer Center, University Hospital Essen, University of Duisburg-Essen, Essen, Germany
| |
Collapse
|
6
|
Mosfeldt M, Jørgensen HL, Lauritzen JB, Jansson KÅ. Development and Internal Validation of a Multivariable Prediction Model for Mortality After Hip Fracture with Machine Learning Techniques. Calcif Tissue Int 2024; 114:568-582. [PMID: 38625579 PMCID: PMC11090964 DOI: 10.1007/s00223-024-01208-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/12/2023] [Accepted: 03/11/2024] [Indexed: 04/17/2024]
Abstract
In order to estimate the likelihood of 1, 3, 6 and 12 month mortality in patients with hip fractures, we applied a variety of machine learning methods using readily available, preoperative data. We used prospectively collected data from a single university hospital in Copenhagen, Denmark for consecutive patients with hip fractures, aged 60 years and older, treated between September 2008 to September 2010 (n = 1186). Preoperative biochemical and anamnestic data were used as predictors and outcome was survival at 1, 3, 6 and 12 months after the fracture. After feature selection for each timepoint a stratified split was done (70/30) before training and validating Random Forest models, extreme gradient boosting (XGB) and Generalized Linear Models. We evaluated and compared each model using receiver operator characteristic (ROC), calibration slope and intercept, Spiegelhalter's z- test and Decision Curve Analysis. Using combinations of between 10 and 13 anamnestic and biochemical parameters we were able to successfully estimate the likelihood of mortality with an area under the curve on ROC curves of 0.79, 0.80, 0.79 and 0.81 for 1, 3, 6 and 12 month, respectively. The XGB was the overall best calibrated and most promising model. The XGB model most successfully estimated the likelihood of mortality postoperatively. An easy-to-use model could be helpful in perioperative decisions concerning level of care, focused research and information to patients. External validation is necessary before widespread use and is currently underway, an online tool has been developed for educational/experimental purposes ( https://hipfx.shinyapps.io/hipfx/ ).
Collapse
Affiliation(s)
- Mathias Mosfeldt
- Department of Orthopaedics, Karolinska University Hospital, Stockholm, Sweden.
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden.
| | - Henrik Løvendahl Jørgensen
- Department of Clinical Biochemistry, Hvidovre Hospital, Hvidovre, Denmark
- Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark
| | - Jes Bruun Lauritzen
- Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark
- Department of Orthopaedic Surgery, Bispebjerg Hospital, University of Copenhagen, Copenhagen, Denmark
| | - Karl-Åke Jansson
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden
- Department of Orthopaedics, Södersjukhuset, Stockholm, Sweden
| |
Collapse
|
7
|
Cary MP, De Gagne JC, Kauschinger ED, Carter BM. Advancing Health Equity Through Artificial Intelligence: An Educational Framework for Preparing Nurses in Clinical Practice and Research. Creat Nurs 2024; 30:154-164. [PMID: 38689433 DOI: 10.1177/10784535241249193] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/02/2024]
Abstract
The integration of artificial intelligence (AI) into health care offers the potential to enhance patient care, improve diagnostic precision, and broaden access to health-care services. Nurses, positioned at the forefront of patient care, play a pivotal role in utilizing AI to foster a more efficient and equitable health-care system. However, to fulfil this role, nurses will require education that prepares them with the necessary skills and knowledge for the effective and ethical application of AI. This article proposes a framework for nurses which includes AI principles, skills, competencies, and curriculum development focused on the practical use of AI, with an emphasis on care that aims to achieve health equity. By adopting this educational framework, nurses will be prepared to make substantial contributions to reducing health disparities and fostering a health-care system that is more efficient and equitable.
Collapse
Affiliation(s)
- Michael P Cary
- Duke University School of Nursing, Durham, NC, USA
- Duke University School of Medicine, Durham, NC, USA
- Duke AI Health, Durham, NC, USA
- American Association of Colleges of Nursing, Durham, NC, USA
| | - Jennie C De Gagne
- Duke University School of Nursing, Durham, NC, USA
- Duke University School of Medicine, Durham, NC, USA
- Duke AI Health, Durham, NC, USA
- American Association of Colleges of Nursing, Durham, NC, USA
| | - Elaine D Kauschinger
- Duke University School of Nursing, Durham, NC, USA
- Duke University School of Medicine, Durham, NC, USA
- Duke AI Health, Durham, NC, USA
- American Association of Colleges of Nursing, Durham, NC, USA
| | - Brigit M Carter
- Duke University School of Nursing, Durham, NC, USA
- Duke University School of Medicine, Durham, NC, USA
- Duke AI Health, Durham, NC, USA
- American Association of Colleges of Nursing, Durham, NC, USA
| |
Collapse
|
8
|
Dijkstra H, van de Kuit A, de Groot T, Canta O, Groot OQ, Oosterhoff JH, Doornberg JN. Systematic review of machine-learning models in orthopaedic trauma. Bone Jt Open 2024; 5:9-19. [PMID: 38226447 PMCID: PMC10790183 DOI: 10.1302/2633-1462.51.bjo-2023-0095.r1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/17/2024] Open
Abstract
Aims Machine-learning (ML) prediction models in orthopaedic trauma hold great promise in assisting clinicians in various tasks, such as personalized risk stratification. However, an overview of current applications and critical appraisal to peer-reviewed guidelines is lacking. The objectives of this study are to 1) provide an overview of current ML prediction models in orthopaedic trauma; 2) evaluate the completeness of reporting following the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) statement; and 3) assess the risk of bias following the Prediction model Risk Of Bias Assessment Tool (PROBAST) tool. Methods A systematic search screening 3,252 studies identified 45 ML-based prediction models in orthopaedic trauma up to January 2023. The TRIPOD statement assessed transparent reporting and the PROBAST tool the risk of bias. Results A total of 40 studies reported on training and internal validation; four studies performed both development and external validation, and one study performed only external validation. The most commonly reported outcomes were mortality (33%, 15/45) and length of hospital stay (9%, 4/45), and the majority of prediction models were developed in the hip fracture population (60%, 27/45). The overall median completeness for the TRIPOD statement was 62% (interquartile range 30 to 81%). The overall risk of bias in the PROBAST tool was low in 24% (11/45), high in 69% (31/45), and unclear in 7% (3/45) of the studies. High risk of bias was mainly due to analysis domain concerns including small datasets with low number of outcomes, complete-case analysis in case of missing data, and no reporting of performance measures. Conclusion The results of this study showed that despite a myriad of potential clinically useful applications, a substantial part of ML studies in orthopaedic trauma lack transparent reporting, and are at high risk of bias. These problems must be resolved by following established guidelines to instil confidence in ML models among patients and clinicians. Otherwise, there will remain a sizeable gap between the development of ML prediction models and their clinical application in our day-to-day orthopaedic trauma practice.
Collapse
Affiliation(s)
- Hidde Dijkstra
- Department of Orthopaedic Surgery, University Medical Centre Groningen, Groningen, Netherlands
- University Center for Geriatric Medicine, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
| | - Anouk van de Kuit
- Department of Orthopaedic Surgery, University Medical Centre Groningen, Groningen, Netherlands
| | - Tom de Groot
- Department of Orthopaedic Surgery, University Medical Centre Groningen, Groningen, Netherlands
- Department of Orthopaedic Surgery, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts, USA
| | - Olga Canta
- Department of Orthopaedic Surgery, University Medical Centre Groningen, Groningen, Netherlands
| | - Olivier Q. Groot
- Department of Orthopaedic Surgery, University Medical Centre Utrecht, University of Utrecht, Utrecht, Netherlands
| | - Jacobien H. Oosterhoff
- Department of Engineering Systems & Services, Faculty Technology Policy and Management, Delft University of Technology, Delft, Netherlands
| | - Job N. Doornberg
- Department of Orthopaedic Surgery, University Medical Centre Groningen, Groningen, Netherlands
- Department of Orthopaedic Trauma Surgery, Flinders Medical Center, Flinders University, Adelaide, Australia
| | | |
Collapse
|
9
|
Turhan S, Canbek U, Dubektas-Canbek T, Dogu E. Predicting Prolonged Wound Drainage after Hemiarthroplasty for Hip Fractures: A Stacked Machine Learning Study. Clin Orthop Surg 2023; 15:894-901. [PMID: 38045590 PMCID: PMC10689231 DOI: 10.4055/cios22181] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Revised: 09/24/2022] [Accepted: 09/27/2022] [Indexed: 12/05/2023] Open
Abstract
Background Prolonged wound drainage (PWD) is one of the most important reasons that increase the risk of early periprosthetic joint infection after arthroplasty. It is very important to evaluate the risk factors for PWD in the surgical field after arthroplasty surgery. This can be accomplished using machine learning or artificial intelligence methods. Our aim in this study was to compare machine learning methods in predicting possible PWD. Methods The study was carried out on clinical, laboratory, and radiological data of 313 patients who underwent hemiarthroplasty (HA) for proximal femur fractures. We preprocessed the dataset and trained and tested machine learning methods using cross validation. We compared various machine learning algorithms (linear discriminant analysis, decision tree, k-nearest neighbors, gradient boosting machine, and logistic regression [LR]) based on performance measures. We also combined the most successful algorithms with a metaclassifier. To help understand the relationship between risk factors, we provided a risk factor severity ranking. Results To estimate the risk of PWD, classification was performed with first-level classifiers and then integrated as a LR-based meta-learner stacking method. More performance improvements were achieved with the stacking method. Conclusions We found that the stacking method was superior to other methods in PWD classification. We determined that the volume of fluid collected from the drain, morbid obesity class, blood transfusion, and body mass index score were the four most important risk factors according to stacking.
Collapse
Affiliation(s)
- Sultan Turhan
- Department of Statistics, Mugla Sitki Kocman University, Mugla, Türkiye
| | - Umut Canbek
- Department of Orthopedics and Traumatology, Mugla Sitki Kocman University College of Medicine, Mugla, Türkiye
| | - Tugba Dubektas-Canbek
- Department of Internal Medicine, Mugla Sitki Kocman University College of Medicine, Mugla, Türkiye
| | - Eralp Dogu
- Department of Statistics, Mugla Sitki Kocman University, Mugla, Türkiye
| |
Collapse
|
10
|
Wang J, Chen H, Wang H, Liu W, Peng D, Zhao Q, Xiao M. A Risk Prediction Model for Physical Restraints Among Older Chinese Adults in Long-term Care Facilities: Machine Learning Study. J Med Internet Res 2023; 25:e43815. [PMID: 37023416 PMCID: PMC10131772 DOI: 10.2196/43815] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Revised: 01/07/2023] [Accepted: 03/12/2023] [Indexed: 04/08/2023] Open
Abstract
BACKGROUND Numerous studies have identified risk factors for physical restraint (PR) use in older adults in long-term care facilities. Nevertheless, there is a lack of predictive tools to identify high-risk individuals. OBJECTIVE We aimed to develop machine learning (ML)-based models to predict the risk of PR in older adults. METHODS This study conducted a cross-sectional secondary data analysis based on 1026 older adults from 6 long-term care facilities in Chongqing, China, from July 2019 to November 2019. The primary outcome was the use of PR (yes or no), identified by 2 collectors' direct observation. A total of 15 candidate predictors (older adults' demographic and clinical factors) that could be commonly and easily collected from clinical practice were used to build 9 independent ML models: Gaussian Naïve Bayesian (GNB), k-nearest neighbor (KNN), decision tree (DT), logistic regression (LR), support vector machine (SVM), random forest (RF), multilayer perceptron (MLP), extreme gradient boosting (XGBoost), and light gradient boosting machine (Lightgbm), as well as stacking ensemble ML. Performance was evaluated using accuracy, precision, recall, an F score, a comprehensive evaluation indicator (CEI) weighed by the above indicators, and the area under the receiver operating characteristic curve (AUC). A net benefit approach using the decision curve analysis (DCA) was performed to evaluate the clinical utility of the best model. Models were tested via 10-fold cross-validation. Feature importance was interpreted using Shapley Additive Explanations (SHAP). RESULTS A total of 1026 older adults (mean 83.5, SD 7.6 years; n=586, 57.1% male older adults) and 265 restrained older adults were included in the study. All ML models performed well, with an AUC above 0.905 and an F score above 0.900. The 2 best independent models are RF (AUC 0.938, 95% CI 0.914-0.947) and SVM (AUC 0.949, 95% CI 0.911-0.953). The DCA demonstrated that the RF model displayed better clinical utility than other models. The stacking model combined with SVM, RF, and MLP performed best with AUC (0.950) and CEI (0.943) values, as well as the DCA curve indicated the best clinical utility. The SHAP plots demonstrated that the significant contributors to model performance were related to cognitive impairment, care dependency, mobility decline, physical agitation, and an indwelling tube. CONCLUSIONS The RF and stacking models had high performance and clinical utility. ML prediction models for predicting the probability of PR in older adults could offer clinical screening and decision support, which could help medical staff in the early identification and PR management of older adults.
Collapse
Affiliation(s)
- Jun Wang
- Department of Nursing, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China
| | - Hongmei Chen
- Department of Gynecology, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China
| | - Houwei Wang
- College of Mathematics and Physics, Chongqing University of Science and Technology, Chongqing, China
| | - Weichu Liu
- Department of Gynecology, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China
| | - Daomei Peng
- Aged Care Unit, The First Social Welfare Home of Chongqing, Chongqing, China
| | - Qinghua Zhao
- Department of Nursing, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China
| | - Mingzhao Xiao
- Department of Urology, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China
| |
Collapse
|
11
|
Dijkstra H, Oosterhoff JHF, van de Kuit A, IJpma FFA, Schwab JH, Poolman RW, Sprague S, Bzovsky S, Bhandari M, Swiontkowski M, Schemitsch EH, Doornberg JN, Hendrickx LAM. Development of machine-learning algorithms for 90-day and one-year mortality prediction in the elderly with femoral neck fractures based on the HEALTH and FAITH trials. Bone Jt Open 2023; 4:168-181. [PMID: 37051847 PMCID: PMC10032237 DOI: 10.1302/2633-1462.43.bjo-2022-0162.r1] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 04/14/2023] Open
Abstract
To develop prediction models using machine-learning (ML) algorithms for 90-day and one-year mortality prediction in femoral neck fracture (FNF) patients aged 50 years or older based on the Hip fracture Evaluation with Alternatives of Total Hip arthroplasty versus Hemiarthroplasty (HEALTH) and Fixation using Alternative Implants for the Treatment of Hip fractures (FAITH) trials. This study included 2,388 patients from the HEALTH and FAITH trials, with 90-day and one-year mortality proportions of 3.0% (71/2,388) and 6.4% (153/2,388), respectively. The mean age was 75.9 years (SD 10.8) and 65.9% of patients (1,574/2,388) were female. The algorithms included patient and injury characteristics. Six algorithms were developed, internally validated and evaluated across discrimination (c-statistic; discriminative ability between those with risk of mortality and those without), calibration (observed outcome compared to the predicted probability), and the Brier score (composite of discrimination and calibration). The developed algorithms distinguished between patients at high and low risk for 90-day and one-year mortality. The penalized logistic regression algorithm had the best performance metrics for both 90-day (c-statistic 0.80, calibration slope 0.95, calibration intercept -0.06, and Brier score 0.039) and one-year (c-statistic 0.76, calibration slope 0.86, calibration intercept -0.20, and Brier score 0.074) mortality prediction in the hold-out set. Using high-quality data, the ML-based prediction models accurately predicted 90-day and one-year mortality in patients aged 50 years or older with a FNF. The final models must be externally validated to assess generalizability to other populations, and prospectively evaluated in the process of shared decision-making.
Collapse
Affiliation(s)
- Hidde Dijkstra
- Department of Orthopaedic Surgery, University Medical Centre Groningen, University of Groningen, Groningen, The Netherlands
- Department of Trauma Surgery, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
- Department of Geriatric Medicine, University Medical Center of Groningen, University of Groningen, Groningen, The Netherlands
| | - Jacobien H. F. Oosterhoff
- Department of Orthopaedic Surgery, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts, USA
- Department of Orthopaedic Surgery, Amsterdam Movement Sciences, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, The Netherlands
- Department of Engineering Systems and Services, Faculty Technology Policy Management, Delft University of Technology, Delt, Netherlands
| | - Anouk van de Kuit
- Department of Orthopaedic Surgery, University Medical Centre Groningen, University of Groningen, Groningen, The Netherlands
- Department of Trauma Surgery, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Frank F. A. IJpma
- Department of Trauma Surgery, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Joseph H. Schwab
- Department of Orthopaedic Surgery, Massachusetts General Hospital, Harvard Medical School, Boston, Massachusetts, USA
| | - Rudolf W. Poolman
- Department of Orthopaedic Surgery, Leiden University Medical Center, Leiden, The Netherlands
- Department of Orthopaedic Surgery, Onze Lieve Vrouw Gasthuis, Amsterdam, The Netherlands
| | - Sheila Sprague
- Division of Orthopaedic Surgery, Department of Surgery, McMaster University, Hamilton, Canada
- Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, Canada
| | - Sofia Bzovsky
- Division of Orthopaedic Surgery, Department of Surgery, McMaster University, Hamilton, Canada
| | - Mohit Bhandari
- Division of Orthopaedic Surgery, Department of Surgery, McMaster University, Hamilton, Canada
- Department of Health Research Methods, Evidence, and Impact, McMaster University, Hamilton, Canada
| | - Marc Swiontkowski
- Department of Orthopaedic Surgery, University of Minnesota, Minneapolis, Minnesota, USA
| | | | - Job N. Doornberg
- Department of Orthopaedic Surgery, University Medical Centre Groningen, University of Groningen, Groningen, The Netherlands
| | - Laurent A. M. Hendrickx
- Department of Orthopaedic Surgery, Amsterdam Movement Sciences, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, The Netherlands
| | | | | | | |
Collapse
|
12
|
Applications of Machine Learning in Palliative Care: A Systematic Review. Cancers (Basel) 2023; 15:cancers15051596. [PMID: 36900387 PMCID: PMC10001037 DOI: 10.3390/cancers15051596] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Revised: 02/24/2023] [Accepted: 03/02/2023] [Indexed: 03/08/2023] Open
Abstract
Objective: To summarize the available literature on using machine learning (ML) for palliative care practice as well as research and to assess the adherence of the published studies to the most important ML best practices. Methods: The MEDLINE database was searched for the use of ML in palliative care practice or research, and the records were screened according to PRISMA guidelines. Results: In total, 22 publications using machine learning for mortality prediction (n = 15), data annotation (n = 5), predicting morbidity under palliative therapy (n = 1), and predicting response to palliative therapy (n = 1) were included. Publications used a variety of supervised or unsupervised models, but mostly tree-based classifiers and neural networks. Two publications had code uploaded to a public repository, and one publication uploaded the dataset. Conclusions: Machine learning in palliative care is mainly used to predict mortality. Similarly to other applications of ML, external test sets and prospective validations are the exception.
Collapse
|
13
|
Lex JR, Di Michele J, Koucheki R, Pincus D, Whyne C, Ravi B. Artificial Intelligence for Hip Fracture Detection and Outcome Prediction: A Systematic Review and Meta-analysis. JAMA Netw Open 2023; 6:e233391. [PMID: 36930153 PMCID: PMC10024206 DOI: 10.1001/jamanetworkopen.2023.3391] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 03/18/2023] Open
Abstract
IMPORTANCE Artificial intelligence (AI) enables powerful models for establishment of clinical diagnostic and prognostic tools for hip fractures; however the performance and potential impact of these newly developed algorithms are currently unknown. OBJECTIVE To evaluate the performance of AI algorithms designed to diagnose hip fractures on radiographs and predict postoperative clinical outcomes following hip fracture surgery relative to current practices. DATA SOURCES A systematic review of the literature was performed using the MEDLINE, Embase, and Cochrane Library databases for all articles published from database inception to January 23, 2023. A manual reference search of included articles was also undertaken to identify any additional relevant articles. STUDY SELECTION Studies developing machine learning (ML) models for the diagnosis of hip fractures from hip or pelvic radiographs or to predict any postoperative patient outcome following hip fracture surgery were included. DATA EXTRACTION AND SYNTHESIS This study followed the Preferred Reporting Items for Systematic Reviews and Meta-analyses and was registered with PROSPERO. Eligible full-text articles were evaluated and relevant data extracted independently using a template data extraction form. For studies that predicted postoperative outcomes, the performance of traditional predictive statistical models, either multivariable logistic or linear regression, was recorded and compared with the performance of the best ML model on the same out-of-sample data set. MAIN OUTCOMES AND MEASURES Diagnostic accuracy of AI models was compared with the diagnostic accuracy of expert clinicians using odds ratios (ORs) with 95% CIs. Areas under the curve for postoperative outcome prediction between traditional statistical models (multivariable linear or logistic regression) and ML models were compared. RESULTS Of 39 studies that met all criteria and were included in this analysis, 18 (46.2%) used AI models to diagnose hip fractures on plain radiographs and 21 (53.8%) used AI models to predict patient outcomes following hip fracture surgery. A total of 39 598 plain radiographs and 714 939 hip fractures were used for training, validating, and testing ML models specific to diagnosis and postoperative outcome prediction, respectively. Mortality and length of hospital stay were the most predicted outcomes. On pooled data analysis, compared with clinicians, the OR for diagnostic error of ML models was 0.79 (95% CI, 0.48-1.31; P = .36; I2 = 60%) for hip fracture radiographs. For the ML models, the mean (SD) sensitivity was 89.3% (8.5%), specificity was 87.5% (9.9%), and F1 score was 0.90 (0.06). The mean area under the curve for mortality prediction was 0.84 with ML models compared with 0.79 for alternative controls (P = .09). CONCLUSIONS AND RELEVANCE The findings of this systematic review and meta-analysis suggest that the potential applications of AI to aid with diagnosis from hip radiographs are promising. The performance of AI in diagnosing hip fractures was comparable with that of expert radiologists and surgeons. However, current implementations of AI for outcome prediction do not seem to provide substantial benefit over traditional multivariable predictive statistics.
Collapse
Affiliation(s)
- Johnathan R. Lex
- Division of Orthopaedic Surgery, Department of Surgery, University of Toronto, Toronto, Ontario, Canada
- Institute of Biomedical Engineering, University of Toronto, Toronto, Ontario, Canada
- Orthopaedics Biomechanics Laboratory, Sunnybrook Research Institute, Toronto, Ontario, Canada
| | - Joseph Di Michele
- Division of Orthopaedic Surgery, Department of Surgery, University of Toronto, Toronto, Ontario, Canada
| | - Robert Koucheki
- Institute of Biomedical Engineering, University of Toronto, Toronto, Ontario, Canada
- Temerty Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Daniel Pincus
- Division of Orthopaedic Surgery, Department of Surgery, University of Toronto, Toronto, Ontario, Canada
- Division of Orthopaedic Surgery, Sunnybrook Health Sciences Centre, Toronto, Ontario, Canada
| | - Cari Whyne
- Orthopaedics Biomechanics Laboratory, Sunnybrook Research Institute, Toronto, Ontario, Canada
| | - Bheeshma Ravi
- Division of Orthopaedic Surgery, Department of Surgery, University of Toronto, Toronto, Ontario, Canada
- Division of Orthopaedic Surgery, Sunnybrook Health Sciences Centre, Toronto, Ontario, Canada
| |
Collapse
|
14
|
Bowers A, Drake C, Makarkin AE, Monzyk R, Maity B, Telle A. Predicting Patient Mortality for Earlier Palliative Care Identification in Medicare Advantage Plans: Features of a Machine Learning Model. JMIR AI 2023; 2:e42253. [PMID: 38875557 PMCID: PMC11041411 DOI: 10.2196/42253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Revised: 11/21/2022] [Accepted: 12/20/2022] [Indexed: 06/16/2024]
Abstract
BACKGROUND Machine learning (ML) can offer greater precision and sensitivity in predicting Medicare patient end of life and potential need for palliative services compared to provider recommendations alone. However, earlier ML research on older community dwelling Medicare beneficiaries has provided insufficient exploration of key model feature impacts and the role of the social determinants of health. OBJECTIVE This study describes the development of a binary classification ML model predicting 1-year mortality among Medicare Advantage plan members aged ≥65 years (N=318,774) and further examines the top features of the predictive model. METHODS A light gradient-boosted trees model configuration was selected based on 5-fold cross-validation. The model was trained with 80% of cases (n=255,020) using randomized feature generation periods, with 20% (n=63,754) reserved as a holdout for validation. The final algorithm used 907 feature inputs extracted primarily from claims and administrative data capturing patient diagnoses, service utilization, demographics, and census tract-based social determinants index measures. RESULTS The total sample had an actual mortality prevalence of 3.9% in the 2018 outcome period. The final model correctly predicted 44.2% of patient expirations among the top 1% of highest risk members (AUC=0.84; 95% CI 0.83-0.85) versus 24.0% predicted by the model iteration using only age, gender, and select high-risk utilization features (AUC=0.74; 95% CI 0.73-0.74). The most important algorithm features included patient demographics, diagnoses, pharmacy utilization, mean costs, and certain social determinants of health. CONCLUSIONS The final ML model better predicts Medicare Advantage member end of life using a variety of routinely collected data and supports earlier patient identification for palliative care.
Collapse
Affiliation(s)
- Anne Bowers
- Evernorth Health, Inc, St. Louis, MO, United States
| | | | | | | | | | - Andrew Telle
- Evernorth Health, Inc, St. Louis, MO, United States
| |
Collapse
|
15
|
Lee W, Schwartz N, Bansal A, Khor S, Hammarlund N, Basu A, Devine B. A Scoping Review of the Use of Machine Learning in Health Economics and Outcomes Research: Part 2-Data From Nonwearables. VALUE IN HEALTH : THE JOURNAL OF THE INTERNATIONAL SOCIETY FOR PHARMACOECONOMICS AND OUTCOMES RESEARCH 2022; 25:2053-2061. [PMID: 35989154 DOI: 10.1016/j.jval.2022.07.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Revised: 06/10/2022] [Accepted: 07/10/2022] [Indexed: 06/15/2023]
Abstract
OBJECTIVES Despite the increasing interest in applying machine learning (ML) methods in health economics and outcomes research (HEOR), stakeholders face uncertainties in when and how ML can be used. We reviewed the recent applications of ML in HEOR. METHODS We searched PubMed for studies published between January 2020 and March 2021 and randomly chose 20% of the identified studies for the sake of manageability. Studies that were in HEOR and applied an ML technique were included. Studies related to wearable devices were excluded. We abstracted information on the ML applications, data types, and ML methods and analyzed it using descriptive statistics. RESULTS We retrieved 805 articles, of which 161 (20%) were randomly chosen. Ninety-two of the random sample met the eligibility criteria. We found that ML was primarily used for predicting future events (86%) rather than current events (14%). The most common response variables were clinical events or disease incidence (42%) and treatment outcomes (22%). ML was less used to predict economic outcomes such as health resource utilization (16%) or costs (3%). Although electronic medical records (35%) were frequently used for model development, claims data were used less frequently (9%). Tree-based methods (eg, random forests and boosting) were the most commonly used ML methods (31%). CONCLUSIONS The use of ML techniques in HEOR is growing rapidly, but there remain opportunities to apply them to predict economic outcomes, especially using claims databases, which could inform the development of cost-effectiveness models.
Collapse
Affiliation(s)
- Woojung Lee
- The Comparative Health Outcomes, Policy, and Economics (CHOICE) Institute, School of Pharmacy, University of Washington, Seattle, WA, USA.
| | - Naomi Schwartz
- The Comparative Health Outcomes, Policy, and Economics (CHOICE) Institute, School of Pharmacy, University of Washington, Seattle, WA, USA
| | - Aasthaa Bansal
- The Comparative Health Outcomes, Policy, and Economics (CHOICE) Institute, School of Pharmacy, University of Washington, Seattle, WA, USA
| | - Sara Khor
- The Comparative Health Outcomes, Policy, and Economics (CHOICE) Institute, School of Pharmacy, University of Washington, Seattle, WA, USA
| | - Noah Hammarlund
- Department of Health Services Research, Management & Policy, University of Florida, Gainesville, FL, USA
| | - Anirban Basu
- The Comparative Health Outcomes, Policy, and Economics (CHOICE) Institute, School of Pharmacy, University of Washington, Seattle, WA, USA
| | - Beth Devine
- The Comparative Health Outcomes, Policy, and Economics (CHOICE) Institute, School of Pharmacy, University of Washington, Seattle, WA, USA
| |
Collapse
|
16
|
Yang R, Hui D, Li X, Wang K, Li C, Li Z. Prediction of single pulmonary nodule growth by CT radiomics and clinical features - a one-year follow-up study. Front Oncol 2022; 12:1034817. [PMID: 36387220 PMCID: PMC9650464 DOI: 10.3389/fonc.2022.1034817] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Accepted: 10/05/2022] [Indexed: 09/07/2023] Open
Abstract
Background With the development of imaging technology, an increasing number of pulmonary nodules have been found. Some pulmonary nodules may gradually grow and develop into lung cancer, while others may remain stable for many years. Accurately predicting the growth of pulmonary nodules in advance is of great clinical significance for early treatment. The purpose of this study was to establish a predictive model using radiomics and to study its value in predicting the growth of pulmonary nodules. Materials and methods According to the inclusion and exclusion criteria, 228 pulmonary nodules in 228 subjects were included in the study. During the one-year follow-up, 69 nodules grew larger, and 159 nodules remained stable. All the nodules were randomly divided into the training group and validation group in a proportion of 7:3. For the training data set, the t test, Chi-square test and Fisher exact test were used to analyze the sex, age and nodule location of the growth group and stable group. Two radiologists independently delineated the ROIs of the nodules to extract the radiomics characteristics using Pyradiomics. After dimension reduction by the LASSO algorithm, logistic regression analysis was performed on age and ten selected radiological features, and a prediction model was established and tested in the validation group. SVM, RF, MLP and AdaBoost models were also established, and the prediction effect was evaluated by ROC analysis. Results There was a significant difference in age between the growth group and the stable group (P < 0.05), but there was no significant difference in sex or nodule location (P > 0.05). The interclass correlation coefficients between the two observers were > 0.75. After dimension reduction by the LASSO algorithm, ten radiomic features were selected, including two shape-based features, one gray-level-cooccurence-matrix (GLCM), one first-order feature, one gray-level-run-length-matrix (GLRLM), three gray-level-dependence-matrix (GLDM) and two gray-level-size-zone-matrix (GLSZM). The logistic regression model combining age and radiomics features achieved an AUC of 0.87 and an accuracy of 0.82 in the training group and an AUC of 0.82 and an accuracy of 0.84 in the verification group for the prediction of nodule growth. For nonlinear models, in the training group, the AUCs of the SVM, RF, MLP and boost models were 0.95, 1.0, 1.0 and 1.0, respectively. In the validation group, the AUCs of the SVM, RF, MLP and boost models were 0.81, 0.77, 0.81, and 0.71, respectively. Conclusions In this study, we established several machine learning models that can successfully predict the growth of pulmonary nodules within one year. The logistic regression model combining age and imaging parameters has the best accuracy and generalization. This model is very helpful for the early treatment of pulmonary nodules and has important clinical significance.
Collapse
Affiliation(s)
- Ran Yang
- Department of Radiology, Second People’s Hospital of JiuLongPo District, Chongqing, China
| | - Dongming Hui
- Department of Radiology, Second People’s Hospital of JiuLongPo District, Chongqing, China
| | - Xing Li
- Department of Radiology, Chongqing Western Hospital, Chongqing, China
| | - Kun Wang
- Department of Radiology, Chongqing Western Hospital, Chongqing, China
| | - Caiyong Li
- Department of Radiology, Chongqing Western Hospital, Chongqing, China
| | - Zhichao Li
- Department of Radiology, Second People’s Hospital of JiuLongPo District, Chongqing, China
| |
Collapse
|
17
|
Kitcharanant N, Chotiyarnwong P, Tanphiriyakun T, Vanitcharoenkul E, Mahaisavariya C, Boonyaprapa W, Unnanuntana A. Development and internal validation of a machine-learning-developed model for predicting 1-year mortality after fragility hip fracture. BMC Geriatr 2022; 22:451. [PMID: 35610589 PMCID: PMC9131628 DOI: 10.1186/s12877-022-03152-x] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Accepted: 05/19/2022] [Indexed: 11/17/2022] Open
Abstract
Background Fragility hip fracture increases morbidity and mortality in older adult patients, especially within the first year. Identification of patients at high risk of death facilitates modification of associated perioperative factors that can reduce mortality. Various machine learning algorithms have been developed and are widely used in healthcare research, particularly for mortality prediction. This study aimed to develop and internally validate 7 machine learning models to predict 1-year mortality after fragility hip fracture. Methods This retrospective study included patients with fragility hip fractures from a single center (Siriraj Hospital, Bangkok, Thailand) from July 2016 to October 2018. A total of 492 patients were enrolled. They were randomly categorized into a training group (344 cases, 70%) or a testing group (148 cases, 30%). Various machine learning techniques were used: the Gradient Boosting Classifier (GB), Random Forests Classifier (RF), Artificial Neural Network Classifier (ANN), Logistic Regression Classifier (LR), Naive Bayes Classifier (NB), Support Vector Machine Classifier (SVM), and K-Nearest Neighbors Classifier (KNN). All models were internally validated by evaluating their performance and the area under a receiver operating characteristic curve (AUC). Results For the testing dataset, the accuracies were GB model = 0.93, RF model = 0.95, ANN model = 0.94, LR model = 0.91, NB model = 0.89, SVM model = 0.90, and KNN model = 0.90. All models achieved high AUCs that ranged between 0.81 and 0.99. The RF model also provided a negative predictive value of 0.96, a positive predictive value of 0.93, a specificity of 0.99, and a sensitivity of 0.68. Conclusions Our machine learning approach facilitated the successful development of an accurate model to predict 1-year mortality after fragility hip fracture. Several machine learning algorithms (eg, Gradient Boosting and Random Forest) had the potential to provide high predictive performance based on the clinical parameters of each patient. The web application is available at www.hipprediction.com. External validation in a larger group of patients or in different hospital settings is warranted to evaluate the clinical utility of this tool. Trial registration Thai Clinical Trials Registry (22 February 2021; reg. no. TCTR20210222003). Supplementary Information The online version contains supplementary material available at 10.1186/s12877-022-03152-x.
Collapse
Affiliation(s)
- Nitchanant Kitcharanant
- Department of Orthopaedics, Faculty of Medicine, Chiang Mai University, Chiang Mai, Thailand.,Department of Orthopaedic Surgery, Faculty of Medicine Siriraj Hospital, Mahidol University, 2 Wanglang Road, Bangkoknoi, 10700, Bangkok, Thailand
| | - Pojchong Chotiyarnwong
- Department of Orthopaedic Surgery, Faculty of Medicine Siriraj Hospital, Mahidol University, 2 Wanglang Road, Bangkoknoi, 10700, Bangkok, Thailand.
| | - Thiraphat Tanphiriyakun
- Department of Orthopaedics, Faculty of Medicine, Chiang Mai University, Chiang Mai, Thailand.,Biomedical Informatics Center, Faculty of Medicine, Chiang Mai University, Chiang Mai, Thailand
| | - Ekasame Vanitcharoenkul
- Department of Orthopaedic Surgery, Faculty of Medicine Siriraj Hospital, Mahidol University, 2 Wanglang Road, Bangkoknoi, 10700, Bangkok, Thailand
| | - Chantas Mahaisavariya
- Golden Jubilee Medical Center, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
| | - Wichian Boonyaprapa
- Siriraj Information Technology Department, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
| | - Aasis Unnanuntana
- Department of Orthopaedic Surgery, Faculty of Medicine Siriraj Hospital, Mahidol University, 2 Wanglang Road, Bangkoknoi, 10700, Bangkok, Thailand
| |
Collapse
|
18
|
Kistler CE, Sloane PD, Zimmerman S. New Findings on Palliative Care Issues Near the End-of-Life. J Am Med Dir Assoc 2020; 22:265-267. [PMID: 33378649 DOI: 10.1016/j.jamda.2020.12.025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Accepted: 12/21/2020] [Indexed: 10/22/2022]
Affiliation(s)
- Christine E Kistler
- Cecil G. Sheps Center for Health Services Research, University of North Carolina at Chapel Hill, NC, USA; Department of Family Medicine, School of Medicine, University of North Carolina at Chapel Hill, NC, USA.
| | - Philip D Sloane
- Cecil G. Sheps Center for Health Services Research, University of North Carolina at Chapel Hill, NC, USA; Department of Family Medicine, School of Medicine, University of North Carolina at Chapel Hill, NC, USA
| | - Sheryl Zimmerman
- Cecil G. Sheps Center for Health Services Research, University of North Carolina at Chapel Hill, NC, USA; Schools of Social Work and Public Health, University of North Carolina at Chapel Hill, NC, USA
| |
Collapse
|