1
|
Farmani V, Kniep H, Maros ME, Lyashevska O, Malone F, Fiehler J, Morris L. Estimating individualized effectiveness of receiving successful recanalization for ischemic stroke cases using machine learning techniques. J Stroke Cerebrovasc Dis 2025; 34:108324. [PMID: 40254242 DOI: 10.1016/j.jstrokecerebrovasdis.2025.108324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2025] [Revised: 04/07/2025] [Accepted: 04/18/2025] [Indexed: 04/22/2025] Open
Abstract
OBJECTIVES Directly measuring the causal effect of mechanical thrombectomy (MT) for each ischemic stroke patient remains challenging, as it is impossible to observe the outcomes for both with and without successful recanalization in the same individual. In this study, we aimed to use machine learning to identify characteristics influencing the likelihood of not benefiting from successful recanalization. MATERIALS & METHODS A total of 1718 non-reperfused patients (Thrombolysis in Cerebral Infarction [TICI] ≤ 2a) and 10339 reperfused patients (TICI ≥ 2b) were included in the study as nonreperfusion and reperfusion groups, respectively. The primary target variable was probability of poor functional outcome after three months, defined by the modified Rankin Scale score of 3 to 6. Two random forest (RF) models trained on pre-treatment covariates of nonreperfusion and reperfusion groups, were used to predict the probability of poor outcome under unsuccessful and successful recanalization scenarios, respectively. The individual effect of successful recanalization was defined as the difference in predicted probabilities returned by the two models. RESULTS Strong calibration was achieved by the RF models trained on nonreperfusion group (intercept:0.027, slope: 1.030) and reperfused group (intercept:0.010, slope: 1.017). The average risk reduction under successful recanalization scenario was 22.0 % (95 % CI [21.7 % - 22.3 %]) for the reperfused group and 19.8 % (95 % CI [19.1 % - 20.5 %]) for the nonreperfusion group. Key factors associated with not benefiting from successful recanalization included older age, higher pre-stroke mRS scores and higher National Institutes of Health Stroke Scale score at admission. CONCLUSIONS This study highlights the potential of predictive ML techniques to estimate the individual effect of successful recanalization on ischemic stroke patients undergoing MT.
Collapse
Affiliation(s)
- Vahid Farmani
- Galway Medical Technology Centre, Department of Mechanical and Industrial Engineering, Atlantic Technological University, Galway, Ireland.
| | - Helge Kniep
- Department of Diagnostic and Interventional Neuroradiology, University Medical Centre Hamburg-Eppendorf, Hamburg, Germany.
| | - Mate E Maros
- neur, University Medical Centre Hamburg-Eppendorf, Hamburg, Germany.
| | - Olga Lyashevska
- Marine and Freshwater Research Centre, Department of Natural Resources & the Environment, School of Science and Computing, Atlantic Technological University, Galway, Ireland; Netherlands eScience Center, Amsterdam, Netherlands.
| | - Fiona Malone
- Galway Medical Technology Centre, Department of Mechanical and Industrial Engineering, Atlantic Technological University, Galway, Ireland.
| | - Jens Fiehler
- Department of Diagnostic and Interventional Neuroradiology, University Medical Centre Hamburg-Eppendorf, Hamburg, Germany.
| | - Liam Morris
- Galway Medical Technology Centre, Department of Mechanical and Industrial Engineering, Atlantic Technological University, Galway, Ireland.
| |
Collapse
|
2
|
Saleem MA, Javeed A, Akarathanawat W, Chutinet A, Suwanwela NC, Kaewplung P, Chaitusaney S, Deelertpaiboon S, Srisiri W, Benjapolakul W. An intelligent learning system based on electronic health records for unbiased stroke prediction. Sci Rep 2024; 14:23052. [PMID: 39367027 PMCID: PMC11452373 DOI: 10.1038/s41598-024-73570-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Accepted: 09/18/2024] [Indexed: 10/06/2024] Open
Abstract
Stroke has a negative impact on people's lives and is one of the leading causes of death and disability worldwide. Early detection of symptoms can significantly help predict stroke and promote a healthy lifestyle. Researchers have developed several methods to predict strokes using machine learning (ML) techniques. However, the proposed systems have suffered from the following two main problems. The first problem is that the machine learning models are biased due to the uneven distribution of classes in the dataset. Recent research has not adequately addressed this problem, and no preventive measures have been taken. Synthetic Minority Oversampling (SMOTE) has been used to remove bias and balance the training of the proposed ML model. The second problem is to solve the problem of lower classification accuracy of machine learning models. We proposed a learning system that combines an autoencoder with a linear discriminant analysis (LDA) model to increase the accuracy of the proposed ML model for stroke prediction. Relevant features are extracted from the feature space using the autoencoder, and the extracted subset is then fed into the LDA model for stroke classification. The hyperparameters of the LDA model are found using a grid search strategy. However, the conventional accuracy metric does not truly reflect the performance of ML models. Therefore, we employed several evaluation metrics to validate the efficiency of the proposed model. Consequently, we evaluated the proposed model's accuracy, sensitivity, specificity, area under the curve (AUC), and receiver operator characteristic (ROC). The experimental results show that the proposed model achieves a sensitivity and specificity of 98.51% and 97.56%, respectively, with an accuracy of 99.24% and a balanced accuracy of 98.00%.
Collapse
Affiliation(s)
- Muhammad Asim Saleem
- Center of Excellence in Artificial Intelligence, Machine Learning and Smart Grid Technology, Department of Electrical Engineering, Faculty of Engineering, Chulalongkorn University, Bangkok, 10330, Thailand
| | - Ashir Javeed
- Aging Research Center, Karolinska Institutet, 171 65, Stockholm, Sweden
| | - Wasan Akarathanawat
- Division of Neurology, Department of Medicine, Faculty of Medicine, Chulalongkorn University, Bangkok, 10330, Thailand
- Chulalongkorn Stroke Center, King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, 10330, Thailand
- Chula Neuroscience Center, King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, 10330, Thailand
| | - Aurauma Chutinet
- Division of Neurology, Department of Medicine, Faculty of Medicine, Chulalongkorn University, Bangkok, 10330, Thailand
- Chulalongkorn Stroke Center, King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, 10330, Thailand
- Chula Neuroscience Center, King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, 10330, Thailand
| | - Nijasri Charnnarong Suwanwela
- Division of Neurology, Department of Medicine, Faculty of Medicine, Chulalongkorn University, Bangkok, 10330, Thailand
- Chulalongkorn Stroke Center, King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, 10330, Thailand
- Chula Neuroscience Center, King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, 10330, Thailand
| | - Pasu Kaewplung
- Center of Excellence in Artificial Intelligence, Machine Learning and Smart Grid Technology, Department of Electrical Engineering, Faculty of Engineering, Chulalongkorn University, Bangkok, 10330, Thailand.
| | - Surachai Chaitusaney
- Center of Excellence in Artificial Intelligence, Machine Learning and Smart Grid Technology, Department of Electrical Engineering, Faculty of Engineering, Chulalongkorn University, Bangkok, 10330, Thailand
| | - Sunchai Deelertpaiboon
- Center of Excellence in Artificial Intelligence, Machine Learning and Smart Grid Technology, Department of Electrical Engineering, Faculty of Engineering, Chulalongkorn University, Bangkok, 10330, Thailand
| | - Wattanasak Srisiri
- Center of Excellence in Artificial Intelligence, Machine Learning and Smart Grid Technology, Department of Electrical Engineering, Faculty of Engineering, Chulalongkorn University, Bangkok, 10330, Thailand
| | - Watit Benjapolakul
- Center of Excellence in Artificial Intelligence, Machine Learning and Smart Grid Technology, Department of Electrical Engineering, Faculty of Engineering, Chulalongkorn University, Bangkok, 10330, Thailand.
| |
Collapse
|
3
|
Soltan A, Washington P. Challenges in Reducing Bias Using Post-Processing Fairness for Breast Cancer Stage Classification with Deep Learning. ALGORITHMS 2024; 17:141. [PMID: 38962581 PMCID: PMC11221567 DOI: 10.3390/a17040141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 07/05/2024]
Abstract
Breast cancer is the most common cancer affecting women globally. Despite the significant impact of deep learning models on breast cancer diagnosis and treatment, achieving fairness or equitable outcomes across diverse populations remains a challenge when some demographic groups are underrepresented in the training data. We quantified the bias of models trained to predict breast cancer stage from a dataset consisting of 1000 biopsies from 842 patients provided by AIM-Ahead (Artificial Intelligence/Machine Learning Consortium to Advance Health Equity and Researcher Diversity). Notably, the majority of data (over 70%) were from White patients. We found that prior to post-processing adjustments, all deep learning models we trained consistently performed better for White patients than for non-White patients. After model calibration, we observed mixed results, with only some models demonstrating improved performance. This work provides a case study of bias in breast cancer medical imaging models and highlights the challenges in using post-processing to attempt to achieve fairness.
Collapse
Affiliation(s)
- Armin Soltan
- Hawaii Health Digital Lab, Information and Computer Science, University of Hawaii at Manoa, Honolulu, HI 96822, USA
| | - Peter Washington
- Hawaii Health Digital Lab, Information and Computer Science, University of Hawaii at Manoa, Honolulu, HI 96822, USA
| |
Collapse
|
4
|
Shaikh Y, Jeelani M, Gibbons MC, Livingston D, Williams DR, Wijesinghe S, Patterson J, Russell S. Centering and collaborating with community knowledge systems: piloting a novel participatory modeling approach. Int J Equity Health 2023; 22:45. [PMID: 36915080 PMCID: PMC10010640 DOI: 10.1186/s12939-023-01839-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2022] [Accepted: 01/21/2023] [Indexed: 03/16/2023] Open
Abstract
BACKGROUND Systems science approaches like simulation modeling can offer an opportunity for community voice to shape policies. In the episteme of many communities there are elders, leaders, and researchers who are seen as bearers of historic knowledge and can contextualize and interpret contemporary research using knowledge systems of the community. There is a need for a systematic methodology to collaborate with community Knowledge Bearers and Knowledge Interpreters. In this paper we report the results of piloting a systematic methodology for collaborating with a community Knowledge-Bearer and Knowledge-Interpreter to develop a conceptual model revealing the local-level influences and architecture of systems shaping community realities. The use case for this pilot is 'persistent poverty' in the United States, specifically within the inner-city African American community in Baltimore City. METHODS This pilot of a participatory modeling approach was conducted over a span of 7 sessions and included the following steps, each with an associated script: Step 1: Knowledge-Bearer and Knowledge-Interpreter recruitment Step 2: Relationship building Step 3: Session introduction, Vignette development & enrichment Step 4: Vignette analysis & constructing architecture of systems map Step 5: Augmenting architecture of systems map RESULTS: Each step of the participatory modeling approach resulted in artifacts that were valuable for both the communities and the research effort. Vignette construction resulted in narratives representing a spectrum of lived experiences, trajectories, and outcomes within a community. The collaborative analysis of vignettes yielded the Architecture of Systemic Factors map, that revealed how factors inter-relate to form a system in which lived experience of poverty occurs. A literature search provided an opportunity for the community to contextualize existing research about them using realities of lived experience. CONCLUSION This methodology showed that a community Knowledge Bearer can function as communicators and interpreters of their community's knowledge base, can develop coherent narratives of lived experiences within which research and knowledge is contextualized, and can collaboratively construct conceptual mappings necessary for simulation modeling. This participatory modeling approach showed that even if there already exists a vast body of research about a community, collaborating with community gives context to that research and brings together disparate findings within narratives of lived experience.
Collapse
Affiliation(s)
- Yahya Shaikh
- The MITRE Corp, 2275 Rolling Run Dr, Windsor Mill, Woodlawn, MD, 21244, USA.
| | - Muzamillah Jeelani
- International Islamic University of Malaysia, Jalan Gombak, 53100, Kuala Lumpur, Selangor, Malaysia
| | | | | | | | | | | | - Sybil Russell
- The MITRE Corp, 2275 Rolling Run Dr, Windsor Mill, Woodlawn, MD, 21244, USA
| |
Collapse
|
5
|
Celi LA, Cellini J, Charpignon ML, Dee EC, Dernoncourt F, Eber R, Mitchell WG, Moukheiber L, Schirmer J, Situ J, Paguio J, Park J, Wawira JG, Yao S. Sources of bias in artificial intelligence that perpetuate healthcare disparities-A global review. PLOS DIGITAL HEALTH 2022; 1:e0000022. [PMID: 36812532 PMCID: PMC9931338 DOI: 10.1371/journal.pdig.0000022] [Citation(s) in RCA: 163] [Impact Index Per Article: 54.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/10/2021] [Accepted: 02/07/2022] [Indexed: 01/17/2023]
Abstract
BACKGROUND While artificial intelligence (AI) offers possibilities of advanced clinical prediction and decision-making in healthcare, models trained on relatively homogeneous datasets, and populations poorly-representative of underlying diversity, limits generalisability and risks biased AI-based decisions. Here, we describe the landscape of AI in clinical medicine to delineate population and data-source disparities. METHODS We performed a scoping review of clinical papers published in PubMed in 2019 using AI techniques. We assessed differences in dataset country source, clinical specialty, and author nationality, sex, and expertise. A manually tagged subsample of PubMed articles was used to train a model, leveraging transfer-learning techniques (building upon an existing BioBERT model) to predict eligibility for inclusion (original, human, clinical AI literature). Of all eligible articles, database country source and clinical specialty were manually labelled. A BioBERT-based model predicted first/last author expertise. Author nationality was determined using corresponding affiliated institution information using Entrez Direct. And first/last author sex was evaluated using the Gendarize.io API. RESULTS Our search yielded 30,576 articles, of which 7,314 (23.9%) were eligible for further analysis. Most databases came from the US (40.8%) and China (13.7%). Radiology was the most represented clinical specialty (40.4%), followed by pathology (9.1%). Authors were primarily from either China (24.0%) or the US (18.4%). First and last authors were predominately data experts (i.e., statisticians) (59.6% and 53.9% respectively) rather than clinicians. And the majority of first/last authors were male (74.1%). INTERPRETATION U.S. and Chinese datasets and authors were disproportionately overrepresented in clinical AI, and almost all of the top 10 databases and author nationalities were from high income countries (HICs). AI techniques were most commonly employed for image-rich specialties, and authors were predominantly male, with non-clinical backgrounds. Development of technological infrastructure in data-poor regions, and diligence in external validation and model re-calibration prior to clinical implementation in the short-term, are crucial in ensuring clinical AI is meaningful for broader populations, and to avoid perpetuating global health inequity.
Collapse
Affiliation(s)
- Leo Anthony Celi
- Massachusetts Institute of Technology, Institute for Medical Engineering and Science, Cambridge, MA, United States of America
- Harvard TH Chan School of Public Health, Department of Biostatistics, Boston, MA, United States of America
- Beth Israel Deaconess Medical Center, Department of Medicine, Boston, MA, United States of America
| | - Jacqueline Cellini
- Harvard Medical School, Department of Library Services, Boston, MA, United States of America
| | - Marie-Laure Charpignon
- Massachusetts Institute of Technology, Institute for Data, Systems and Society, Cambridge, MA, United States of America
| | | | | | - Rene Eber
- Montpellier University, Montpellier Research in Management, Montpellier, France
| | | | - Lama Moukheiber
- Massachusetts General Hospital, Harvard Medical School, Boston, MA, United States of America
| | - Julian Schirmer
- Montpellier University, Montpellier Research in Management, Montpellier, France
| | - Julia Situ
- Massachusetts Institute of Technology, Department of Computer Science and Molecular Biology, Cambridge, MA, United States of America
| | - Joseph Paguio
- Einstein Medical Center Philadelphia, Department of Medicine, Philadelphia, PA, United States of America
| | - Joel Park
- BeiGene, Applied Innovation, Cambridge, MA, United States of America
| | - Judy Gichoya Wawira
- Emory University, Department of Radiology and Biomedical Informatics, Atlanta, GA, United States of America
| | - Seth Yao
- Einstein Medical Center Philadelphia, Department of Medicine, Philadelphia, PA, United States of America
| | | |
Collapse
|