1
|
Li Y, Huang H, Li Y, Ye Z, Li X, Liu K, Liu M, Liu L, Jiang J. Characterizing soil COPs eco-risk in China. JOURNAL OF HAZARDOUS MATERIALS 2025; 489:137588. [PMID: 39954439 DOI: 10.1016/j.jhazmat.2025.137588] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/18/2024] [Revised: 02/04/2025] [Accepted: 02/10/2025] [Indexed: 02/17/2025]
Abstract
Although soil combustible organic pollutants (COPs) pose a serious threat to human well-being, their spatial distribution patterns, responses to environmental constraints, and areas of risk throughout China are still unclear. This knowledge gap hinders the control of soil COPs, causing us to overlook their impact on climate change and the environment. In this study, a total of 420 soil samples, distributed in typical regions of China, were tested for COPs content, including black carbon (BC) and polycyclic aromatic hydrocarbons (PAHs). Interest points (POI) such as parking lots, gas stations, and car services have become the main factors that influence soil COPs enrichment, and can be considered new indicators in other organic pollution studies. By comparing various machine learning simulations and predictions, this study accurately predicted the content of soil COPs in China and pointed out that, as the "third pole of the world", the Qinghai Tibet Plateau will face an unprecedented crisis. We established a method for assessing the comprehensive risk of soil COPs and identified at-risk areas, which accounted for 38.9 % of China's total soil area. Our research findings emphasize the main driving factors for soil COPs and identify areas in China that require prioritized soil COPs control.
Collapse
Affiliation(s)
- Yan Li
- Collaborative Innovation Center of Sustainable Forestry, College of forestry, Nanjing Forestry University, Nanjing, Jiangsu, China; Key Laboratory of Geographic Information Science of the Ministry of Education, School of Geographic Sciences, East China Normal University, Shanghai, China
| | - Haoran Huang
- Collaborative Innovation Center of Sustainable Forestry, College of forestry, Nanjing Forestry University, Nanjing, Jiangsu, China
| | - Ye Li
- Key Laboratory of Geographic Information Science of the Ministry of Education, School of Geographic Sciences, East China Normal University, Shanghai, China.
| | - Zi Ye
- Collaborative Innovation Center of Sustainable Forestry, College of forestry, Nanjing Forestry University, Nanjing, Jiangsu, China
| | - Xiang Li
- School of Architectural Engineering, Jinling Institute of Technology, Nanjing, Jiangsu, China
| | - Ke Liu
- College of Resources and Environment, Henan University of Economics and Law, Zhengzhou, Henan, China
| | - Min Liu
- Key Laboratory of Geographic Information Science of the Ministry of Education, School of Geographic Sciences, East China Normal University, Shanghai, China.
| | - Lei Liu
- State Key Laboratory of Nutrient Use and Management, Beijing Key Laboratory of Farmland Soil Pollution Prevention and Remediation, College of Resources and Environmental Sciences, National Academy of Agriculture Green Development, China Agricultural University, Beijing 100193, China; College of Earth and Environmental Sciences, Lanzhou University, Lanzhou 730000, China
| | - Jiang Jiang
- Collaborative Innovation Center of Sustainable Forestry, College of forestry, Nanjing Forestry University, Nanjing, Jiangsu, China.
| |
Collapse
|
2
|
Joshanloo M. Identifying the key predictors of positive self-perceptions of aging using machine learning. Soc Sci Med 2025; 374:118060. [PMID: 40233632 DOI: 10.1016/j.socscimed.2025.118060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2024] [Revised: 02/17/2025] [Accepted: 04/08/2025] [Indexed: 04/17/2025]
Abstract
This study aimed to identify key predictors of self-perceptions of aging (SPA) among older adults by examining a comprehensive set of potential predictors across physical, psychological, social, and demographic domains. Data from over 4000 American adults (mean age ≈ 70) from the Health and Retirement Study were used. A machine learning approach using Random Forest regression was employed to assess the relative importance of 49 potential predictors of SPA. The results revealed that health status, age, and psychological resources emerged as the strongest predictors of SPA. The psychological resources included the positive triad of self-esteem, life satisfaction, and optimism, as well as sense of mastery. Emotional tendencies and experiences, financial satisfaction, personality traits, and social factors had substantially lower predictive power. This study provides a comprehensive understanding of the factors that predict SPA and their relative importance, offering insights for both theory and practice. The results highlight the potential for designing targeted, evidence-based interventions that enhance psychological resources, address health and functional well-being, provide tailored support across the lifespan, and incorporate lifestyle changes to foster positive aging perceptions.
Collapse
Affiliation(s)
- Mohsen Joshanloo
- Department of Psychology, Keimyung University, Daegu, South Korea.
| |
Collapse
|
3
|
Yang B, Park C, Lin S, Muralidharan V, Kado DM. Around the EQUATOR With Clin-STAR: AI-Based Randomized Controlled Trial Challenges and Opportunities in Aging Research. J Am Geriatr Soc 2025; 73:1365-1375. [PMID: 39907384 PMCID: PMC12100690 DOI: 10.1111/jgs.19362] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2024] [Revised: 11/25/2024] [Accepted: 12/08/2024] [Indexed: 02/06/2025]
Abstract
The CONSORT 2010 statement is a guideline that provides an evidence-based checklist of minimum reporting standards for randomized trials. With the rapid growth of Artificial Intelligence (AI) based interventions in the past 10 years, the CONSORT-AI extension was created in 2020 to provide guidelines for AI-based randomized controlled trials (RCT). The Clin-STAR "Around the EQUATOR" series features existing reported standards while also highlighting the inherent complexities of research involving research of older participants. In this work, we propose that when designing AI-based RCTs involving older adults, researchers adopt a conceptual framework (CONSORT-AI-5Ms) designed around the 5Ms (Mind, Mobility, Medications, Matters most, and Multi-complexity) of Age-Friendly Healthcare Systems. Employing the 5Ms in this context, we provide a detailed rationale and include specific examples of challenges and potential solutions to maximize the impact and value of AI RCTs in an older adult population. By combining the original intent of CONSORT-AI with the 5Ms framework, CONSORT-AI-5Ms provides a patient-centered and equitable perspective to consider when designing AI-based RCTs to address the diverse needs and challenges associated with geriatric care.
Collapse
Affiliation(s)
- Betsy Yang
- Section of Geriatric Medicine, Division of Primary Care and Population Health, Department of MedicineStanford School of MedicinePalo AltoCaliforniaUSA
- Geriatric Research Education Research and Clinical Center (GRECC)Veterans Administration Healthcare SystemPalo AltoCaliforniaUSA
- Stanford Healthcare AI Applied Research Team (HEA3RT)Stanford School of MedicinePalo AltoCaliforniaUSA
| | - Caroline Park
- Section of Geriatric Medicine, Division of Primary Care and Population Health, Department of MedicineStanford School of MedicinePalo AltoCaliforniaUSA
- Geriatric Research Education Research and Clinical Center (GRECC)Veterans Administration Healthcare SystemPalo AltoCaliforniaUSA
- Department of Family MedicineUSC Keck School of MedicinePasadenaCaliforniaUSA
| | - Steven Lin
- Stanford Healthcare AI Applied Research Team (HEA3RT)Stanford School of MedicinePalo AltoCaliforniaUSA
- Division of Primary Care and Population Health, Department of MedicineStanford School of MedicinePalo AltoCaliforniaUSA
| | | | - Deborah M. Kado
- Section of Geriatric Medicine, Division of Primary Care and Population Health, Department of MedicineStanford School of MedicinePalo AltoCaliforniaUSA
- Geriatric Research Education Research and Clinical Center (GRECC)Veterans Administration Healthcare SystemPalo AltoCaliforniaUSA
| |
Collapse
|
4
|
Wang L, Guo X, Shi H, Ma Y, Bao H, Jiang L, Zhao L, Feng Z, Zhu T, Lu L. CRISP: A causal relationships-guided deep learning framework for advanced ICU mortality prediction. BMC Med Inform Decis Mak 2025; 25:165. [PMID: 40234903 PMCID: PMC12001402 DOI: 10.1186/s12911-025-02981-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2024] [Accepted: 03/19/2025] [Indexed: 04/17/2025] Open
Abstract
BACKGROUND Mortality prediction is critical in clinical care, particularly in intensive care units (ICUs), where early identification of high-risk patients can inform treatment decisions. While deep learning (DL) models have demonstrated significant potential in this task, most suffer from limited generalizability, which hinders their widespread clinical application. Additionally, the class imbalance in electronic health records (EHRs) complicates model training. This study aims to develop a causally-informed prediction model that incorporates underlying causal relationships to mitigate class imbalance, enabling more stable mortality predictions. METHODS This study introduces the CRISP model (Causal Relationship Informed Superior Prediction), which leverages native counterfactuals to augment the minority class and constructs patient representations by incorporating causal structures to enhance mortality prediction. Patient data were obtained from the public MIMIC-III and MIMIC-IV databases, as well as an additional dataset from the West China Hospital of Sichuan University (WCHSU). RESULTS A total of 69,190 ICU cases were included, with 30,844 cases from MIMIC-III, 27,362 cases from MIMIC-IV, and 10,984 cases from WCHSU. The CRISP model demonstrated stable performance in mortality prediction across the 3 datasets, achieving AUROC (0.9042-0.9480) and AUPRC (0.4771-0.7611). CRISP's data augmentation module showed predictive performance comparable to commonly used interpolation-based oversampling techniques. CONCLUSION CRISP achieves better generalizability across different patient groups, compared to various baseline algorithms, thereby enhancing the practical application of DL in clinical decision support. TRIAL REGISTRATION Trial registration information for the WCHSU data is available on the Chinese Clinical Trial Registry website ( http://www.chictr.org.cn ), with the registration number ChiCTR1900025160. The recruitment period for the data was from August 5, 2019, to August 31, 2021.
Collapse
Affiliation(s)
- Linna Wang
- College of Computer Science, Sichuan University, 24 South Section 1, 1st Ring Road, Chengdu, Sichuan, 610065, China
| | - Xinyu Guo
- Department of Health Policy and Management, West China School of Public Health and West China Fourth Hospital, Sichuan University, 37 Guoxue Alley, Chengdu, Sichuan, 610041, China
| | - Haoyue Shi
- College of Mechanical Engineering, Sichuan University, 24 South Section 1, 1st Ring Road, Chengdu, Sichuan, 610065, China
| | - Yuehang Ma
- College of Computer Science, Sichuan University, 24 South Section 1, 1st Ring Road, Chengdu, Sichuan, 610065, China
| | - Han Bao
- College of Computer Science, Sichuan University, 24 South Section 1, 1st Ring Road, Chengdu, Sichuan, 610065, China
| | - Lihua Jiang
- Department of Health Policy and Management, West China School of Public Health and West China Fourth Hospital, Sichuan University, 37 Guoxue Alley, Chengdu, Sichuan, 610041, China
| | - Li Zhao
- Department of Health Policy and Management, West China School of Public Health and West China Fourth Hospital, Sichuan University, 37 Guoxue Alley, Chengdu, Sichuan, 610041, China
| | - Ziliang Feng
- College of Computer Science, Sichuan University, 24 South Section 1, 1st Ring Road, Chengdu, Sichuan, 610065, China
| | - Tao Zhu
- Department of Anesthesiology, National Clinical Research Center for Geriatrics, West China Hospital, Sichuan University, 37 Guoxue Alley, Chengdu, Sichuan, 610041, China
| | - Li Lu
- College of Computer Science, Sichuan University, 24 South Section 1, 1st Ring Road, Chengdu, Sichuan, 610065, China.
| |
Collapse
|
5
|
Wang Z, Ge S, Liao T, Yuan M, Qian W, Chen Q, Liang W, Cheng X, Zhou Q, Ju Z, Zhu H, Xiong W. Integrative single-cell metabolomics and phenotypic profiling reveals metabolic heterogeneity of cellular oxidation and senescence. Nat Commun 2025; 16:2740. [PMID: 40113759 PMCID: PMC11926267 DOI: 10.1038/s41467-025-57992-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Accepted: 03/10/2025] [Indexed: 03/22/2025] Open
Abstract
Emerging evidence has unveiled heterogeneity in phenotypic and transcriptional alterations at the single-cell level during oxidative stress and senescence. Despite the pivotal roles of cellular metabolism, a comprehensive elucidation of metabolomic heterogeneity in cells and its connection with cellular oxidative and senescent status remains elusive. By integrating single-cell live imaging with mass spectrometry (SCLIMS), we establish a cross-modality technique capturing both metabolome and oxidative level in individual cells. The SCLIMS demonstrates substantial metabolomic heterogeneity among cells with diverse oxidative levels. Furthermore, the single-cell metabolome predicted heterogeneous states of cells. Remarkably, the pre-existing metabolomic heterogeneity determines the divergent cellular fate upon oxidative insult. Supplementation of key metabolites screened by SCLIMS resulted in a reduction in cellular oxidative levels and an extension of C. elegans lifespan. Altogether, SCLIMS represents a potent tool for integrative metabolomics and phenotypic profiling at the single-cell level, offering innovative approaches to investigate metabolic heterogeneity in cellular processes.
Collapse
Affiliation(s)
- Ziyi Wang
- Department of Neurology, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, 230001, Hefei, China
| | - Siyuan Ge
- Department of Neurology, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, 230001, Hefei, China
| | - Tiepeng Liao
- Department of Neurology, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, 230001, Hefei, China
| | - Man Yuan
- Department of Neurology, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, 230001, Hefei, China
| | - Wenwei Qian
- Department of Neurology, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, 230001, Hefei, China
| | - Qi Chen
- Department of Neurology, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, 230001, Hefei, China
| | - Wei Liang
- Department of Neurology, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, 230001, Hefei, China
| | - Xiawei Cheng
- School of Pharmacy, Optogenetics & Synthetic Biology Interdisciplinary Research Center, East China University of Science and Technology, 200237, Shanghai, China
| | - Qinghua Zhou
- Key Laboratory of Regenerative Medicine of Ministry of Education, Institute of Aging and Regenerative Medicine, Department of Developmental & Regenerative Medicine, College of Life Science and Technology, Jinan University, 510632, Guangzhou, Guangdong, China
| | - Zhenyu Ju
- Key Laboratory of Regenerative Medicine of Ministry of Education, Institute of Aging and Regenerative Medicine, Department of Developmental & Regenerative Medicine, College of Life Science and Technology, Jinan University, 510632, Guangzhou, Guangdong, China
| | - Hongying Zhu
- Department of Neurology, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, 230001, Hefei, China.
- Anhui Province Key Laboratory of Biomedical Imaging and Intelligent Processing, Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, 230088, Hefei, China.
- CAS Key Laboratory of Brain Function and Disease, 230026, Hefei, China.
- Anhui Province Key Laboratory of Biomedical Aging Research, 230026, Hefei, China.
| | - Wei Xiong
- Department of Neurology, The First Affiliated Hospital of USTC, Division of Life Sciences and Medicine, University of Science and Technology of China, 230001, Hefei, China.
- Anhui Province Key Laboratory of Biomedical Imaging and Intelligent Processing, Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, 230088, Hefei, China.
- CAS Key Laboratory of Brain Function and Disease, 230026, Hefei, China.
- Anhui Province Key Laboratory of Biomedical Aging Research, 230026, Hefei, China.
| |
Collapse
|
6
|
Wang L, Chen B, Ouyang J, Mu Y, Zhen L, Yang L, Xu W, Tang L. Causal-inference machine learning reveals the drivers of China's 2022 ozone rebound. ENVIRONMENTAL SCIENCE AND ECOTECHNOLOGY 2025; 24:100524. [PMID: 39896320 PMCID: PMC11786889 DOI: 10.1016/j.ese.2025.100524] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/16/2024] [Revised: 01/04/2025] [Accepted: 01/05/2025] [Indexed: 02/04/2025]
Abstract
Ground-level ozone concentrations rebounded significantly across China in 2022, challenging air quality management and public health. Identifying the drivers of this rebound is crucial for designing effective mitigation strategies. Commonly used methods, such as chemical transport models and machine learning, provide valuable insights but face limitations-chemical transport models are computationally intensive, while machine learning often fails to address confounding factors or establish causality. Here we show that elevated temperatures and increased solar radiation, as primary meteorological drivers, collectively account for 57 % of the total ozone increase, based on an integrated analysis of ground-based monitoring data, satellite observations, and meteorological reanalysis information using explainable machine learning and causal inference techniques. Compared to the year 2021, 90 % of the stations reported an increase in the Formaldehyde to Nitrogen ratio, implying a growing sensitivity of ozone formation to nitrogen oxide levels. These findings highlight the significant causal role of meteorological changes in the ozone rebound, urging the adoption of targeted ozone mitigation strategies under climate warming, particularly through varied regional strategies that consider existing anthropogenic emission levels and the prospective increase in biogenic volatile organic compounds. This identification of causal relationships in air pollution dynamics can support data-driven and accurate decision-making.
Collapse
Affiliation(s)
- Lin Wang
- Key Laboratory of Urban Environment and Health, Institute of Urban Environment, Chinese Academy of Sciences, Xiamen, 361021, China
| | - Baihua Chen
- Key Laboratory of Urban Environment and Health, Institute of Urban Environment, Chinese Academy of Sciences, Xiamen, 361021, China
| | - Jingyi Ouyang
- Key Laboratory of Urban Environment and Health, Institute of Urban Environment, Chinese Academy of Sciences, Xiamen, 361021, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Yanshu Mu
- China School of Mathematics, Jilin University, Changchun, 130012, China
| | - Ling Zhen
- Key Laboratory of Urban Environment and Health, Institute of Urban Environment, Chinese Academy of Sciences, Xiamen, 361021, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Lin Yang
- Key Laboratory of Urban Environment and Health, Institute of Urban Environment, Chinese Academy of Sciences, Xiamen, 361021, China
| | - Wei Xu
- Key Laboratory of Urban Environment and Health, Institute of Urban Environment, Chinese Academy of Sciences, Xiamen, 361021, China
| | - Lina Tang
- Key Laboratory of Urban Environment and Health, Institute of Urban Environment, Chinese Academy of Sciences, Xiamen, 361021, China
| |
Collapse
|
7
|
Luo Q, Zhang Q, Liu H, Chen X, Yang S, Xu Q. Time-dependent interpretable survival prediction model for second primary NSCLC patients. Int J Med Inform 2025; 195:105771. [PMID: 39721115 DOI: 10.1016/j.ijmedinf.2024.105771] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2024] [Revised: 11/23/2024] [Accepted: 12/19/2024] [Indexed: 12/28/2024]
Abstract
OBJECTIVE Accurate predictive models for second primary non-small cell lung cancer (SP-NSCLC) are limited. This study aimed to develop and validate overall survival (OS) prediction models for SP-NSCLC patients using time-dependent interpretable survival machine learning algorithms. METHODS This study utilized data from the Surveillance, Epidemiology, and End Results (SEER) database, encompassing 8 and 12 registries, to extract data on patients aged 20-89 diagnosed with SP-NSCLC between 1988 and 2020. The dataset was divided into development, external temporal and spatial validation cohorts. Predictors included demographic, clinical, pathological and initial primary cancer-related features. Multiple survival machine learning algorithms were developed and validated, assessing model performance using C-index, time-dependent area under the receiver operating characteristic curve (time-AUC), and time-dependent Brier Score. The time-dependent interpretability analysis was employed to explore the time-dependent feature importance of key predictors. RESULTS The Blackboost model demonstrated excellent performance (C-index: 0.7517, and time-AUC: 0.8438), and good calibration (time-Brier Score of 0.0754). External validations and subgroup analyses demonstrated the robustness, generalizability, and fairness. Utilizing the optimal cutoff threshold, high-risk groups could be effectively identified. Surgery was the most critical predictor across the entire survival period. Combined stage (distant) and chemotherapy were the second most important predictors within 0 to 5 years, while age replaced from 5 to 20 years. Additionally, we developed an online visualization tool. CONCLUSIONS The Blackboost survival model achieved accurate, fair, and robust survival prediction for SP-NSCLC patients. Surgery, combined stage (distant), chemotherapy, and age contributed differently across various survival periods. The online visualization tool facilitated personalized survival predictions.
Collapse
Affiliation(s)
- Qiong Luo
- Department of Oncology Medicine, Fujian Medical University Union Hospital, Fuzhou, 350001, PR China
| | - Qianyuan Zhang
- Department of General Medicine, Fujian Medical University Union Hospital, Fuzhou, 350001, PR China
| | - Haiyu Liu
- Department of Pulmonary and Critical Care Medicine, Fujian Medical University Union Hospital, Fuzhou 350001, PR China
| | - Xiangqi Chen
- Department of Pulmonary and Critical Care Medicine, Fujian Medical University Union Hospital, Fuzhou 350001, PR China.
| | - Sheng Yang
- Department of Oncology Medicine, Fujian Medical University Union Hospital, Fuzhou, 350001, PR China.
| | - Qian Xu
- Department of Oncology Medicine, Fujian Medical University Union Hospital, Fuzhou, 350001, PR China.
| |
Collapse
|
8
|
Moges WK, Tegegne AS, Mitku AA, Tesfahun E, Hailemeskel S. Causal machine learning models for predicting low birth weight in midwife-led continuity care intervention in North Shoa Zone, Ethiopia. BMC Med Inform Decis Mak 2025; 25:64. [PMID: 39920662 PMCID: PMC11806756 DOI: 10.1186/s12911-025-02917-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2024] [Accepted: 02/03/2025] [Indexed: 02/09/2025] Open
Abstract
BACKGROUND Low birth weight (LBW) is a critical global health issue that affects infants disproportionately, particularly in developing countries. This study adopted causal machine learning (CML) algorithms for predicting LBW in newborns, drawing from midwife-led continuity care (MLCC). METHODS A quasi-experimental study was carried out in the North Shoa Zone of Ethiopia from August 2019 to September 2020. A total of 1166 women were allocated into two groups. The first group, the MLCC group, received all their antenatal, labor, birth, and immediate post-natal care from a single midwife. The second group received care from various staff members at different times throughout their pregnancy and childbirth. In this study, CML was implemented to predict LBW. Data preprocessing, including data cleaning, was conducted. CML was then employed to identify the most suitable classifier for predicting LBW. Gradient boosting algorithms were used to estimate the causal effect of MLCC on LBW. Moreover, meta-learner algorithms were utilized to estimate the individual treatment effect (ITE), the average treatment effect (ATE), and performance. Moreover, meta-learner algorithms were utilized to estimate the individual treatment effect (ITE), the average treatment effect (ATE), and performance. RESULTS The study results revealed that Causal K-Nearest Neighbors (CKNN) was the most effective classifier based on accuracy and estimated LBW using a 94.52% accuracy, 90.25% precision, 92.57% recall, and an F1 score of 88.2%. Meconium aspiration, perinatal mortality, pregnancy-induced hypertension, vacuum babies in need of resuscitation, and previous surgeries on their reproductive organs were identified as the top five features affecting LBW. The estimated impact of MLCC versus other professional groups on LBW was analyzed using gradient boosting algorithms and was found to be 0.237. The estimated ATE for the S-learner was 0.284, which is lower than the true ATE of 0.216. Additionally, the estimated ITE for both the T-learner and X-learner was less than -0.5, indicating that mothers would not choose to participate in the MLCC program. CONCLUSIONS Based on these findings, the CKNN classifier demonstrated a higher accuracy and effectiveness. The S-learner and R-learner models, utilizing the XGBoost Regressor and BaseSRegressor, provided accurate estimations of ITE for assessing the impact of the MLCC program. Promoting the MLCC program could help stabilize LBW outcomes.
Collapse
Affiliation(s)
- Wudneh Ketema Moges
- Department of Statistics, College of Science, Bahir Dar University, P.O.Box 79, Bahir Dar, Ethiopia.
- Department of Statistics, College of Science, Debre Berhan University, P.O.Box 445, Debre Berhan, Ethiopia.
- Department of Data Science, College of Computing, Debre Berhan University, P.O.Box 445, Debre Berhan, Ethiopia.
| | - Awoke Seyoum Tegegne
- Department of Statistics, College of Science, Bahir Dar University, P.O.Box 79, Bahir Dar, Ethiopia
| | - Aweke A Mitku
- Department of Statistics, College of Science, Bahir Dar University, P.O.Box 79, Bahir Dar, Ethiopia
- Global Change Institute (GCI), Faculty of Science, University of the Witwatersrand, Johannesburg, South Africa
| | - Esubalew Tesfahun
- Department of Public Health, College of Health Science, Debre Berhan University, P.O.Box 445, Debre Berhan, Ethiopia
| | - Solomon Hailemeskel
- Department of Midwifery, College of Health Science, Debre Berhan University, P.O.Box 445, Debre Berhan, Ethiopia
| |
Collapse
|
9
|
Hu X, Dong X, Wang Z. Common issues of data science on the eco-environmental risks of emerging contaminants. ENVIRONMENT INTERNATIONAL 2025; 196:109301. [PMID: 39884250 DOI: 10.1016/j.envint.2025.109301] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/22/2024] [Revised: 01/21/2025] [Accepted: 01/21/2025] [Indexed: 02/01/2025]
Abstract
Data-driven approaches (e.g., machine learning) are increasingly used to replace or assist laboratory studies in the study of emerging contaminants (ECs). In the past ten years, an increasing number of models or approaches have been applied to ECs, and the datasets used are continuously enriched. However, there are large knowledge gaps between what we have found and the natural eco-environmental meaning. For most published reviews, the contents are organized by the types of ECs, but the common issues of data science, regardless of the type of pollutant, are not sufficiently addressed. To close or narrow the knowledge gaps, we highlight the following issues ignored in the field of data-driven EC research. Complicated biological and ecological data and ensemble models revealing mechanisms and spatiotemporal trends with strong causal relationships and without data leakage deserve more attention in the future. In addition, the matrix influence, trace concentration, and complex scenario have often been ignored in previous works. Therefore, an integrated research framework related to natural fields, ecological systems, and large-scale environmental problems, rather than relying solely on laboratory data-related analysis, is urgently needed. Beyond the current prediction purposes, data science can inspire the discovery of scientific questions, and mutual inspiration among data science, process and mechanism models, and laboratory and field research is a critical direction. Focusing on the above urgent and common issues related to data, frameworks, and purposes, regardless of the type of pollutant, data science is expected to achieve great advancements in addressing the eco-environmental risks of ECs.
Collapse
Affiliation(s)
- Xiangang Hu
- Key Laboratory of Pollution Processes and Environmental Criteria (Ministry of Education), Carbon Neutrality Interdisciplinary Science Centre, College of Environmental Science and Engineering, Nankai University, Tianjin 300350, China.
| | - Xu Dong
- Key Laboratory of Pollution Processes and Environmental Criteria (Ministry of Education), Carbon Neutrality Interdisciplinary Science Centre, College of Environmental Science and Engineering, Nankai University, Tianjin 300350, China
| | - Zhangjia Wang
- Key Laboratory of Pollution Processes and Environmental Criteria (Ministry of Education), Carbon Neutrality Interdisciplinary Science Centre, College of Environmental Science and Engineering, Nankai University, Tianjin 300350, China
| |
Collapse
|
10
|
Ye Y, Pandey A, Bawden C, Sumsuzzman DM, Rajput R, Shoukat A, Singer BH, Moghadas SM, Galvani AP. Integrating artificial intelligence with mechanistic epidemiological modeling: a scoping review of opportunities and challenges. Nat Commun 2025; 16:581. [PMID: 39794317 PMCID: PMC11724045 DOI: 10.1038/s41467-024-55461-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2024] [Accepted: 12/12/2024] [Indexed: 01/13/2025] Open
Abstract
Integrating prior epidemiological knowledge embedded within mechanistic models with the data-mining capabilities of artificial intelligence (AI) offers transformative potential for epidemiological modeling. While the fusion of AI and traditional mechanistic approaches is rapidly advancing, efforts remain fragmented. This scoping review provides a comprehensive overview of emerging integrated models applied across the spectrum of infectious diseases. Through systematic search strategies, we identified 245 eligible studies from 15,460 records. Our review highlights the practical value of integrated models, including advances in disease forecasting, model parameterization, and calibration. However, key research gaps remain. These include the need for better incorporation of realistic decision-making considerations, expanded exploration of diverse datasets, and further investigation into biological and socio-behavioral mechanisms. Addressing these gaps will unlock the synergistic potential of AI and mechanistic modeling to enhance understanding of disease dynamics and support more effective public health planning and response.
Collapse
Affiliation(s)
- Yang Ye
- Center for Infectious Disease Modeling and Analysis, Yale School of Public Health, New Haven, CT, USA
| | - Abhishek Pandey
- Center for Infectious Disease Modeling and Analysis, Yale School of Public Health, New Haven, CT, USA
| | - Carolyn Bawden
- Department of Microbiology and Immunology, McGill University, Montréal, QC, Canada
- Agent-Based Modelling Laboratory, York University, Toronto, ON, Canada
| | | | - Rimpi Rajput
- Center for Infectious Disease Modeling and Analysis, Yale School of Public Health, New Haven, CT, USA
| | - Affan Shoukat
- Department of Mathematics and Statistics, University of Regina, Regina, SK, Canada
| | - Burton H Singer
- Emerging Pathogens Institute, University of Florida, Gainesville, FL, USA
| | - Seyed M Moghadas
- Agent-Based Modelling Laboratory, York University, Toronto, ON, Canada
| | - Alison P Galvani
- Center for Infectious Disease Modeling and Analysis, Yale School of Public Health, New Haven, CT, USA.
| |
Collapse
|
11
|
Liu X, Yue FJ, Wong WW, Guo TL, Li SL. Unravelling nitrate transformation mechanisms in karst catchments through the coupling of high-frequency sensor data and machine learning. WATER RESEARCH 2024; 267:122507. [PMID: 39342713 DOI: 10.1016/j.watres.2024.122507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/05/2024] [Revised: 08/25/2024] [Accepted: 09/22/2024] [Indexed: 10/01/2024]
Abstract
Nitrate dynamics within a catchment are critical to the earth's system process, yet the intricate details of its transport and transformation at high resolutions remain elusive. Hydrological effects on nitrate dynamics in particular have not been thoroughly assessed previously and this knowledge gap hampers our understanding and effective management of nitrogen cycling in watersheds. Here, machine learning (ML) models were employed to reconstruct the annual variation trend in nitrate dynamics and isotopes within a typical karst catchment. Random forest model demonstrates promising potential in predicting nitrate concentration and its isotopes, surpassing other ML models (including Long Short-term Memory, Convolutional Neural Network, and Support Vector Machine) in performance. The ML-modeled NO3--N concentrations, δ15N-NO3-, and δ18O-NO3- values were in close agreement with field data (NSE values of 0.95, 0.80, and 0.53, respectively), which are notably challenging to achieve for process models. During the transition from dry to wet period, approximately 23.0 % of the annual precipitation (∼269.1 mm) was identified as the threshold for triggering a rapid response in the wet period. The modeled nitrate isotope values were significantly supported by the field data, suggesting seasonal variations of nitrogen sources, with precipitation as the primary driving force for fertilizer sources. Mixing of multiple sources appeared to be the main control of the transport and transformation of nitrate during the rising limb in the wet period, whereas process control (denitrification) took precedence during the falling limb, and the fate of nitrate was controlled by biogeochemical processes during the dry period.
Collapse
Affiliation(s)
- Xin Liu
- Institute of Surface-Earth System Science, School of Earth System Science, Tianjin University, Tianjin 300072, China; Water Studies, School of Chemistry, Monash University, Clayton, Victoria 3800, Australia
| | - Fu-Jun Yue
- Institute of Surface-Earth System Science, School of Earth System Science, Tianjin University, Tianjin 300072, China.
| | - Wei Wen Wong
- Water Studies, School of Chemistry, Monash University, Clayton, Victoria 3800, Australia
| | - Tian-Li Guo
- College of Water Resources and Architectural Engineering, Northwest A&F University, Yangling 712100, China
| | - Si-Liang Li
- Institute of Surface-Earth System Science, School of Earth System Science, Tianjin University, Tianjin 300072, China.
| |
Collapse
|
12
|
Ibanez A, Maito M, Botero-Rodríguez F, Fittipaldi S, Coronel C, Migeot J, Lacroix A, Lawlor B, Duran-Aniotz C, Baez S, Santamaria-Garcia H. Healthy aging meta-analyses and scoping review of risk factors across Latin America reveal large heterogeneity and weak predictive models. NATURE AGING 2024; 4:1153-1165. [PMID: 38886210 PMCID: PMC11333291 DOI: 10.1038/s43587-024-00648-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Accepted: 05/13/2024] [Indexed: 06/20/2024]
Abstract
Models of healthy aging are typically based on the United States and Europe and may not apply to diverse and heterogeneous populations. In this study, our objectives were to conduct a meta-analysis to assess risk factors of cognition and functional ability across aging populations in Latin America and a scoping review focusing on methodological procedures. Our study design included randomized controlled trials and cohort, case-control and cross-sectional studies using multiple databases, including MEDLINE, the Virtual Health Library and Web of Science. From an initial pool of 455 studies, our meta-analysis included 38 final studies (28 assessing cognition and 10 assessing functional ability, n = 146,000 participants). Our results revealed significant but heterogeneous effects for cognition (odds ratio (OR) = 1.20, P = 0.03, confidence interval (CI) = (1.0127, 1.42); heterogeneity: I2 = 92.1%, CI = (89.8%, 94%)) and functional ability (OR = 1.20, P = 0.01, CI = (1.04, 1.39); I2 = 93.1%, CI = (89.3%, 95.5%)). Specific risk factors had limited effects, especially on functional ability, with moderate impacts for demographics and mental health and marginal effects for health status and social determinants of health. Methodological issues, such as outliers, inter-country differences and publication bias, influenced the results. Overall, we highlight the specific profile of risk factors associated with healthy aging in Latin America. The heterogeneity in results and methodological approaches in studying healthy aging call for greater harmonization and further regional research to understand healthy aging in Latin America.
Collapse
Affiliation(s)
- Agustin Ibanez
- Latin American Brain Health Institute (BrainLat), Universidad Adolfo Ibañez, Santiago de Chile, Chile.
- Global Brain Health Institute (GBHI), University of California, San Francisco (UCSF), San Francisco, CA, USA.
- University of Trinity Dublin, Dublin, Ireland.
- Cognitive Neuroscience Center (CNC), Universidad de San Andrés, Buenos Aires, Argentina.
- Trinity College Dublin, Dublin, Ireland.
| | - Marcelo Maito
- Latin American Brain Health Institute (BrainLat), Universidad Adolfo Ibañez, Santiago de Chile, Chile
| | - Felipe Botero-Rodríguez
- PhD Program of Neuroscience, Department of Psychiatry, Pontificia Universidad Javeriana, Bogotá, Colombia
- Hospital Universitario San Ignacio, Center for Brain and Cognition, Intellectus, Bogotá, Colombia
- Fundación para la Ciencia, Innovación y Tecnología - Fucintec, Bogotá, Colombia
| | - Sol Fittipaldi
- Latin American Brain Health Institute (BrainLat), Universidad Adolfo Ibañez, Santiago de Chile, Chile
- Trinity College Dublin, Dublin, Ireland
- Centro Interdisciplinario de Neurociencia de Valparaíso (CINV), Universidad de Valparaíso, Valparaíso, Chile
| | - Carlos Coronel
- Latin American Brain Health Institute (BrainLat), Universidad Adolfo Ibañez, Santiago de Chile, Chile
- Global Brain Health Institute (GBHI), University of California, San Francisco (UCSF), San Francisco, CA, USA
- University of Trinity Dublin, Dublin, Ireland
| | - Joaquin Migeot
- Latin American Brain Health Institute (BrainLat), Universidad Adolfo Ibañez, Santiago de Chile, Chile
| | - Andrea Lacroix
- Herbert Wertheim School of Public Health and Human Longevity Science, Health Sciences Office of Faculty Affairs, University California, San Diego (UCSD), San Diego, CA, USA
| | - Brian Lawlor
- Global Brain Health Institute (GBHI), University of California, San Francisco (UCSF), San Francisco, CA, USA
- University of Trinity Dublin, Dublin, Ireland
- Trinity College Dublin, Dublin, Ireland
| | - Claudia Duran-Aniotz
- Latin American Brain Health Institute (BrainLat), Universidad Adolfo Ibañez, Santiago de Chile, Chile
| | - Sandra Baez
- Global Brain Health Institute (GBHI), University of California, San Francisco (UCSF), San Francisco, CA, USA
- University of Trinity Dublin, Dublin, Ireland
- Universidad de los Andes, Bogotá, Colombia
| | - Hernando Santamaria-Garcia
- PhD Program of Neuroscience, Department of Psychiatry, Pontificia Universidad Javeriana, Bogotá, Colombia.
- Hospital Universitario San Ignacio, Center for Brain and Cognition, Intellectus, Bogotá, Colombia.
| |
Collapse
|
13
|
Labib SM. Greenness, air pollution, and temperature exposure effects in predicting premature mortality and morbidity: A small-area study using spatial random forest model. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 928:172387. [PMID: 38608883 DOI: 10.1016/j.scitotenv.2024.172387] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Revised: 04/08/2024] [Accepted: 04/08/2024] [Indexed: 04/14/2024]
Abstract
BACKGROUND Although studies have provided negative impacts of air pollution, heat or cold exposure on mortality and morbidity, and positive effects of increased greenness on reducing them, a few studies have focused on exploring combined and synergetic effects of these exposures in predicting these health outcomes, and most had ignored the spatial autocorrelation in analyzing their health effects. This study aims to investigate the health effects of air pollution, greenness, and temperature exposure on premature mortality and morbidity within a spatial machine-learning modeling framework. METHODS Years of potential life lost reflecting premature mortality and comparative illness and disability ratio reflecting chronic morbidity from 1673 small areas covering Greater Manchester for the year 2008-2013 obtained. Average annual levels of NO2 concentration, normalized difference vegetation index (NDVI) representing greenness, and annual average air temperature were utilized to assess exposure in each area. These exposures were linked to health outcomes using non-spatial and spatial random forest (RF) models while accounting for spatial autocorrelation. RESULTS Spatial-RF models provided the best predictive accuracy when accounted for spatial autocorrelation. Among the exposures considered, air pollution emerged as the most influential in predicting mortality and morbidity, followed by NDVI and temperature exposure. Nonlinear exposure-response relations were observed, and interactions between exposures illustrated specific ranges or sweet and sour spots of exposure thresholds where combined effects either exacerbate or moderate health conditions. CONCLUSION Air pollution exposure had a greater negative impact on health compared to greenness and temperature exposure. Combined exposure effects may indicate the highest influence of premature mortality and morbidity burden.
Collapse
Affiliation(s)
- S M Labib
- Department of Human Geography and Spatial Planning, Faculty of Geosciences, Utrecht University, the Netherlands.
| |
Collapse
|
14
|
Arbelaez Ossa L, Milford SR, Rost M, Leist AK, Shaw DM, Elger BS. AI Through Ethical Lenses: A Discourse Analysis of Guidelines for AI in Healthcare. SCIENCE AND ENGINEERING ETHICS 2024; 30:24. [PMID: 38833207 PMCID: PMC11150179 DOI: 10.1007/s11948-024-00486-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Accepted: 04/30/2024] [Indexed: 06/06/2024]
Abstract
While the technologies that enable Artificial Intelligence (AI) continue to advance rapidly, there are increasing promises regarding AI's beneficial outputs and concerns about the challenges of human-computer interaction in healthcare. To address these concerns, institutions have increasingly resorted to publishing AI guidelines for healthcare, aiming to align AI with ethical practices. However, guidelines as a form of written language can be analyzed to recognize the reciprocal links between its textual communication and underlying societal ideas. From this perspective, we conducted a discourse analysis to understand how these guidelines construct, articulate, and frame ethics for AI in healthcare. We included eight guidelines and identified three prevalent and interwoven discourses: (1) AI is unavoidable and desirable; (2) AI needs to be guided with (some forms of) principles (3) trust in AI is instrumental and primary. These discourses signal an over-spillage of technical ideals to AI ethics, such as over-optimism and resulting hyper-criticism. This research provides insights into the underlying ideas present in AI guidelines and how guidelines influence the practice and alignment of AI with ethical, legal, and societal values expected to shape AI in healthcare.
Collapse
Affiliation(s)
| | - Stephen R Milford
- Institute for Biomedical Ethics, University of Basel, Basel, Switzerland
| | - Michael Rost
- Institute for Biomedical Ethics, University of Basel, Basel, Switzerland
| | - Anja K Leist
- Institute for Research on Socio-Economic Inequality (IRSEI) in the Department of Social Sciences, University of Luxembourg, Esch-Sur-Alzette, Luxembourg
| | - David M Shaw
- Institute for Biomedical Ethics, University of Basel, Basel, Switzerland
- Care and Public Health Research Institute, Maastricht University, Maastricht, The Netherlands
| | - Bernice S Elger
- Institute for Biomedical Ethics, University of Basel, Basel, Switzerland
- Center for Legal Medicine (CURML), University of Geneva, Geneva, Switzerland
| |
Collapse
|
15
|
Gao Y, Cui Y. Optimizing clinico-genomic disease prediction across ancestries: a machine learning strategy with Pareto improvement. Genome Med 2024; 16:76. [PMID: 38835075 PMCID: PMC11149372 DOI: 10.1186/s13073-024-01345-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Accepted: 05/17/2024] [Indexed: 06/06/2024] Open
Abstract
BACKGROUND Accurate prediction of an individual's predisposition to diseases is vital for preventive medicine and early intervention. Various statistical and machine learning models have been developed for disease prediction using clinico-genomic data. However, the accuracy of clinico-genomic prediction of diseases may vary significantly across ancestry groups due to their unequal representation in clinical genomic datasets. METHODS We introduced a deep transfer learning approach to improve the performance of clinico-genomic prediction models for data-disadvantaged ancestry groups. We conducted machine learning experiments on multi-ancestral genomic datasets of lung cancer, prostate cancer, and Alzheimer's disease, as well as on synthetic datasets with built-in data inequality and distribution shifts across ancestry groups. RESULTS Deep transfer learning significantly improved disease prediction accuracy for data-disadvantaged populations in our multi-ancestral machine learning experiments. In contrast, transfer learning based on linear frameworks did not achieve comparable improvements for these data-disadvantaged populations. CONCLUSIONS This study shows that deep transfer learning can enhance fairness in multi-ancestral machine learning by improving prediction accuracy for data-disadvantaged populations without compromising prediction accuracy for other populations, thus providing a Pareto improvement towards equitable clinico-genomic prediction of diseases.
Collapse
Affiliation(s)
- Yan Gao
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, 38163, USA
- Center for Integrative and Translational Genomics, University of Tennessee Health Science Center, Memphis, TN, 38163, USA
| | - Yan Cui
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN, 38163, USA.
- Center for Integrative and Translational Genomics, University of Tennessee Health Science Center, Memphis, TN, 38163, USA.
- Center for Cancer Research, University of Tennessee Health Science Center, Memphis, TN, 38163, USA.
| |
Collapse
|
16
|
Kapoor S, Cantrell EM, Peng K, Pham TH, Bail CA, Gundersen OE, Hofman JM, Hullman J, Lones MA, Malik MM, Nanayakkara P, Poldrack RA, Raji ID, Roberts M, Salganik MJ, Serra-Garcia M, Stewart BM, Vandewiele G, Narayanan A. REFORMS: Consensus-based Recommendations for Machine-learning-based Science. SCIENCE ADVANCES 2024; 10:eadk3452. [PMID: 38691601 PMCID: PMC11092361 DOI: 10.1126/sciadv.adk3452] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Accepted: 03/29/2024] [Indexed: 05/03/2024]
Abstract
Machine learning (ML) methods are proliferating in scientific research. However, the adoption of these methods has been accompanied by failures of validity, reproducibility, and generalizability. These failures can hinder scientific progress, lead to false consensus around invalid claims, and undermine the credibility of ML-based science. ML methods are often applied and fail in similar ways across disciplines. Motivated by this observation, our goal is to provide clear recommendations for conducting and reporting ML-based science. Drawing from an extensive review of past literature, we present the REFORMS checklist (recommendations for machine-learning-based science). It consists of 32 questions and a paired set of guidelines. REFORMS was developed on the basis of a consensus of 19 researchers across computer science, data science, mathematics, social sciences, and biomedical sciences. REFORMS can serve as a resource for researchers when designing and implementing a study, for referees when reviewing papers, and for journals when enforcing standards for transparency and reproducibility.
Collapse
Affiliation(s)
- Sayash Kapoor
- Department of Computer Science, Princeton University, Princeton, NJ 08544, USA
- Center for Information Technology Policy, Princeton University, Princeton, NJ 08544, USA
| | - Emily M. Cantrell
- Department of Sociology, Princeton University, Princeton, NJ 08544, USA
- School of Public and International Affairs, Princeton University, Princeton, NJ 08544, USA
| | - Kenny Peng
- Department of Computer Science, Cornell University, Ithaca, NY 14850, USA
| | - Thanh Hien Pham
- Department of Computer Science, Princeton University, Princeton, NJ 08544, USA
- Center for Information Technology Policy, Princeton University, Princeton, NJ 08544, USA
| | - Christopher A. Bail
- Department of Sociology, Duke University, Durham, NC 27708, USA
- Department of Political Science, Duke University, Durham, NC 27708, USA
- Sanford School of Public Policy, Duke University, Durham, NC 27708, USA
| | - Odd Erik Gundersen
- Department of Computer Science, Norwegian University of Science and Technology, Trondheim, Norway
- Aneo AS, Trondheim, Norway
| | | | - Jessica Hullman
- Department of Computer Science, Northwestern University, Evanston, IL 60208, USA
| | - Michael A. Lones
- School of Mathematical and Computer Sciences, Heriot-Watt University, Edinburgh, UK
| | - Momin M. Malik
- Center for Digital Health, Mayo Clinic, Rochester, MN 55905, USA
- School of Social Policy & Practice, University of Pennsylvania, Philadelphia, PA 19104, USA
- Institute in Critical Quantitative, Computational, & Mixed Methodologies, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Priyanka Nanayakkara
- Department of Computer Science, Northwestern University, Evanston, IL 60208, USA
- Department of Communication Studies, Northwestern University, Evanston, IL 60208, USA
| | | | - Inioluwa Deborah Raji
- Department of Computer Science, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Michael Roberts
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
- Department of Medicine, University of Cambridge, Cambridge, UK
| | - Matthew J. Salganik
- Center for Information Technology Policy, Princeton University, Princeton, NJ 08544, USA
- Department of Sociology, Princeton University, Princeton, NJ 08544, USA
- Office of Population Research, Princeton University, Princeton, NJ 08544, USA
| | - Marta Serra-Garcia
- Rady School of Management, University of California, San Diego, La Jolla, CA 92093, USA
| | - Brandon M. Stewart
- Center for Information Technology Policy, Princeton University, Princeton, NJ 08544, USA
- Department of Sociology, Princeton University, Princeton, NJ 08544, USA
- Office of Population Research, Princeton University, Princeton, NJ 08544, USA
- Department of Politics, Princeton University, Princeton, NJ 08544, USA
| | - Gilles Vandewiele
- Department of Information Technology, Ghent University, Ghent, Belgium
| | - Arvind Narayanan
- Department of Computer Science, Princeton University, Princeton, NJ 08544, USA
- Center for Information Technology Policy, Princeton University, Princeton, NJ 08544, USA
| |
Collapse
|
17
|
Shara N, Mirabal-Beltran R, Talmadge B, Falah N, Ahmad M, Dempers R, Crovatt S, Eisenberg S, Anderson K. Use of Machine Learning for Early Detection of Maternal Cardiovascular Conditions: Retrospective Study Using Electronic Health Record Data. JMIR Cardio 2024; 8:e53091. [PMID: 38648629 DOI: 10.2196/53091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 03/08/2024] [Accepted: 03/11/2024] [Indexed: 04/25/2024] Open
Abstract
BACKGROUND Cardiovascular conditions (eg, cardiac and coronary conditions, hypertensive disorders of pregnancy, and cardiomyopathies) were the leading cause of maternal mortality between 2017 and 2019. The United States has the highest maternal mortality rate of any high-income nation, disproportionately impacting those who identify as non-Hispanic Black or Hispanic. Novel clinical approaches to the detection and diagnosis of cardiovascular conditions are therefore imperative. Emerging research is demonstrating that machine learning (ML) is a promising tool for detecting patients at increased risk for hypertensive disorders during pregnancy. However, additional studies are required to determine how integrating ML and big data, such as electronic health records (EHRs), can improve the identification of obstetric patients at higher risk of cardiovascular conditions. OBJECTIVE This study aimed to evaluate the capability and timing of a proprietary ML algorithm, Healthy Outcomes for all Pregnancy Experiences-Cardiovascular-Risk Assessment Technology (HOPE-CAT), to detect maternal-related cardiovascular conditions and outcomes. METHODS Retrospective data from the EHRs of a large health care system were investigated by HOPE-CAT in a virtual server environment. Deidentification of EHR data and standardization enabled HOPE-CAT to analyze data without pre-existing biases. The ML algorithm assessed risk factors selected by clinical experts in cardio-obstetrics, and the algorithm was iteratively trained using relevant literature and current standards of risk identification. After refinement of the algorithm's learned risk factors, risk profiles were generated for every patient including a designation of standard versus high risk. The profiles were individually paired with clinical outcomes pertaining to cardiovascular pregnancy conditions and complications, wherein a delta was calculated between the date of the risk profile and the actual diagnosis or intervention in the EHR. RESULTS In total, 604 pregnancies resulting in birth had records or diagnoses that could be compared against the risk profile; the majority of patients identified as Black (n=482, 79.8%) and aged between 21 and 34 years (n=509, 84.4%). Preeclampsia (n=547, 90.6%) was the most common condition, followed by thromboembolism (n=16, 2.7%) and acute kidney disease or failure (n=13, 2.2%). The average delta was 56.8 (SD 69.7) days between the identification of risk factors by HOPE-CAT and the first date of diagnosis or intervention of a related condition reported in the EHR. HOPE-CAT showed the strongest performance in early risk detection of myocardial infarction at a delta of 65.7 (SD 81.4) days. CONCLUSIONS This study provides additional evidence to support ML in obstetrical patients to enhance the early detection of cardiovascular conditions during pregnancy. ML can synthesize multiday patient presentations to enhance provider decision-making and potentially reduce maternal health disparities.
Collapse
Affiliation(s)
- Nawar Shara
- MedStar Health Research Institute, Hyattesville, MD, United States
- Georgetown-Howard Universities Center for Clinical and Translational Science, Washington, DC, DC, United States
| | | | | | - Noor Falah
- MedStar Health Research Institute, Hyattesville, MD, United States
| | - Maryam Ahmad
- MedStar Health Research Institute, Hyattesville, MD, United States
| | | | | | | | - Kelley Anderson
- School of Nursing, Georgetown University, Washington, DC, United States
| |
Collapse
|
18
|
Zhu J, Wu Y, Lin S, Duan S, Wang X, Fang Y. Identifying and predicting physical limitation and cognitive decline trajectory group of older adults in China: A data-driven machine learning analysis. J Affect Disord 2024; 350:590-599. [PMID: 38218258 DOI: 10.1016/j.jad.2024.01.095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/29/2023] [Revised: 11/24/2023] [Accepted: 01/07/2024] [Indexed: 01/15/2024]
Abstract
OBJECTIVE This study aimed to utilize data-driven machine learning methods to identify and predict potential physical and cognitive function trajectory groups of older adults and determine their crucial factors for promoting active ageing in China. METHODS Longitudinal data on 3026 older adults from the Chinese Longitudinal Healthy Longevity and Happy Family Survey was used to identify potential physical and cognitive function trajectory groups using a group-based multi-trajectory model (GBMTM). Predictors were selected from sociodemographic characteristics, lifestyle factors, and physical and mental conditions. The trajectory groups were predicted using data-driven machine learning models and dynamic nomogram. Model performance was evaluated by area under the receiver operating characteristics curve (AUROC), area under the precision-recall curve (PRAUC), and confusion matrix. RESULTS Two physical and cognitive function trajectory groups were determined, including a trajectory group with physical limitation and cognitive decline (14.18 %) and a normal trajectory group (85.82 %). Logistic regression performed well in predicting trajectory groups (AUROC = 0.881, PRAUC = 0.649). Older adults with lower baseline score of activities of daily living, older age, less frequent housework, and fewer actual teeth were more likely to experience physical limitation and cognitive decline trajectory group. LIMITATION This study didn't carry out external validation. CONCLUSIONS This study shows that GBMTM and machine learning models effectively identify and predict physical limitation and cognitive decline trajectory group. The identified predictors might be essential for developing targeted interventions to promote healthy ageing.
Collapse
Affiliation(s)
- Junmin Zhu
- School of Public Health, Xiamen University, Xiamen, Fujian, China; Key Laboratory of Health Technology Assessment of Fujian Province, School of Public Health, Xiamen University, Xiamen, Fujian, China
| | - Yafei Wu
- School of Public Health, Xiamen University, Xiamen, Fujian, China; Key Laboratory of Health Technology Assessment of Fujian Province, School of Public Health, Xiamen University, Xiamen, Fujian, China
| | - Shaowu Lin
- School of Public Health, Xiamen University, Xiamen, Fujian, China; Key Laboratory of Health Technology Assessment of Fujian Province, School of Public Health, Xiamen University, Xiamen, Fujian, China; National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, Fujian, China
| | - Siyu Duan
- School of Public Health, Xiamen University, Xiamen, Fujian, China; Key Laboratory of Health Technology Assessment of Fujian Province, School of Public Health, Xiamen University, Xiamen, Fujian, China
| | - Xing Wang
- School of Public Health, Xiamen University, Xiamen, Fujian, China; Key Laboratory of Health Technology Assessment of Fujian Province, School of Public Health, Xiamen University, Xiamen, Fujian, China
| | - Ya Fang
- School of Public Health, Xiamen University, Xiamen, Fujian, China; Key Laboratory of Health Technology Assessment of Fujian Province, School of Public Health, Xiamen University, Xiamen, Fujian, China; National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, Fujian, China.
| |
Collapse
|
19
|
Okada A, Kaneko H, Konishi M, Kamiya K, Sugimoto T, Matsuoka S, Yokota I, Suzuki Y, Yamaguchi S, Itoh H, Fujiu K, Michihata N, Jo T, Matsui H, Fushimi K, Takeda N, Morita H, Yasunaga H, Komuro I. A machine-learning-based prediction of non-home discharge among acute heart failure patients. Clin Res Cardiol 2024; 113:522-532. [PMID: 37131097 PMCID: PMC10955024 DOI: 10.1007/s00392-023-02209-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Accepted: 04/17/2023] [Indexed: 05/04/2023]
Abstract
BACKGROUND Scarce data on factors related to discharge disposition in patients hospitalized for acute heart failure (AHF) were available, and we sought to develop a parsimonious and simple predictive model for non-home discharge via machine learning. METHODS This observational cohort study using a Japanese national database included 128,068 patients admitted from home for AHF between April 2014 and March 2018. The candidate predictors for non-home discharge were patient demographics, comorbidities, and treatment performed within 2 days after hospital admission. We used 80% of the population to develop a model using all 26 candidate variables and using the variable selected by 1 standard-error rule of Lasso regression, which enhances interpretability, and 20% to validate the predictive ability. RESULTS We analyzed 128,068 patients, and 22,330 patients were not discharged to home; 7,879 underwent in-hospital death and 14,451 were transferred to other facilities. The machine-learning-based model consisted of 11 predictors, showing a discrimination ability comparable to that using all the 26 variables (c-statistic: 0.760 [95% confidence interval, 0.752-0.767] vs. 0.761 [95% confidence interval, 0.753-0.769]). The common 1SE-selected variables identified throughout all analyses were low scores in activities of daily living, advanced age, absence of hypertension, impaired consciousness, failure to initiate enteral alimentation within 2 days and low body weight. CONCLUSIONS The developed machine learning model using 11 predictors had a good predictive ability to identify patients at high risk for non-home discharge. Our findings would contribute to the effective care coordination in this era when HF is rapidly increasing in prevalence.
Collapse
Affiliation(s)
- Akira Okada
- Department of Prevention of Diabetes and Lifestyle-Related Diseases, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
| | - Hidehiro Kaneko
- The Department of Cardiovascular Medicine, The University of Tokyo Hospital, 7-3-1, Hongo, Bunkyo-Ku, Tokyo, 113-8655, Japan.
- The Department of Advanced Cardiology, The University of Tokyo, Tokyo, Japan.
| | - Masaaki Konishi
- Department of Medical Science and Cardiorenal Medicine, Yokohama City University Graduate School of Medicine, Kanagawa, Japan
| | - Kentaro Kamiya
- Department of Rehabilitation, School of Allied Health Sciences, Kitasato University, Kanagawa, Japan
| | - Tadafumi Sugimoto
- Department of Cardiology and Nephrology, Mie University Graduate School of Medicine, Tsu, Japan
| | - Satoshi Matsuoka
- The Department of Cardiovascular Medicine, The University of Tokyo Hospital, 7-3-1, Hongo, Bunkyo-Ku, Tokyo, 113-8655, Japan
| | - Isao Yokota
- Department of Biostatistics, Hokkaido University Graduate School of Medicine, Sapporo, Japan
| | - Yuta Suzuki
- The Department of Cardiovascular Medicine, The University of Tokyo Hospital, 7-3-1, Hongo, Bunkyo-Ku, Tokyo, 113-8655, Japan
- Center for Outcomes Research and Economic Evaluation for Health, National Institute of Public Health, Saitama, Japan
| | - Satoko Yamaguchi
- Department of Prevention of Diabetes and Lifestyle-Related Diseases, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
| | - Hidetaka Itoh
- The Department of Cardiovascular Medicine, The University of Tokyo Hospital, 7-3-1, Hongo, Bunkyo-Ku, Tokyo, 113-8655, Japan
| | - Katsuhito Fujiu
- The Department of Cardiovascular Medicine, The University of Tokyo Hospital, 7-3-1, Hongo, Bunkyo-Ku, Tokyo, 113-8655, Japan
- The Department of Advanced Cardiology, The University of Tokyo, Tokyo, Japan
| | - Nobuaki Michihata
- The Department of Health Services Research, The University of Tokyo, Tokyo, Japan
| | - Taisuke Jo
- The Department of Health Services Research, The University of Tokyo, Tokyo, Japan
| | - Hiroki Matsui
- Department of Clinical Epidemiology and Health Economics, The University of Tokyo, Tokyo, Japan
| | - Kiyohide Fushimi
- Department of Health Policy and Informatics, Tokyo Medical and Dental University, Tokyo, Japan
| | - Norifumi Takeda
- The Department of Cardiovascular Medicine, The University of Tokyo Hospital, 7-3-1, Hongo, Bunkyo-Ku, Tokyo, 113-8655, Japan
| | - Hiroyuki Morita
- The Department of Cardiovascular Medicine, The University of Tokyo Hospital, 7-3-1, Hongo, Bunkyo-Ku, Tokyo, 113-8655, Japan
| | - Hideo Yasunaga
- Department of Clinical Epidemiology and Health Economics, The University of Tokyo, Tokyo, Japan
| | - Issei Komuro
- The Department of Cardiovascular Medicine, The University of Tokyo Hospital, 7-3-1, Hongo, Bunkyo-Ku, Tokyo, 113-8655, Japan
| |
Collapse
|
20
|
Okita J, Nakata T, Uchida H, Kudo A, Fukuda A, Ueno T, Tanigawa M, Sato N, Shibata H. Development and validation of a machine learning model to predict time to renal replacement therapy in patients with chronic kidney disease. BMC Nephrol 2024; 25:101. [PMID: 38493099 PMCID: PMC10943785 DOI: 10.1186/s12882-024-03527-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Accepted: 02/28/2024] [Indexed: 03/18/2024] Open
Abstract
BACKGROUND Predicting time to renal replacement therapy (RRT) is important in patients at high risk for end-stage kidney disease. We developed and validated machine learning models for predicting the time to RRT and compared its accuracy with conventional prediction methods that uses the rate of estimated glomerular filtration rate (eGFR) decline. METHODS Data of adult chronic kidney disease (CKD) patients who underwent hemodialysis at Oita University Hospital from April 2016 to March 2021 were extracted from electronic medical records (N = 135). A new machine learning predictor was compared with the established prediction method that uses the eGFR decline rate and the accuracy of the prediction models was determined using the coefficient of determination (R2). The data were preprocessed and split into training and validation datasets. We created multiple machine learning models using the training data and evaluated their accuracy using validation data. Furthermore, we predicted the time to RRT using a conventional prediction method that uses the eGFR decline rate for patients who had measured eGFR three or more times in two years and evaluated its accuracy. RESULTS The least absolute shrinkage and selection operator regression model exhibited moderate accuracy with an R2 of 0.60. By contrast, the conventional prediction method was found to be extremely low with an R2 of -17.1. CONCLUSIONS The significance of this study is that it shows that machine learning can predict time to RRT moderately well with continuous values from data at a single time point. This approach outperforms the conventional prediction method that uses eGFR time series data and presents new avenues for CKD treatment.
Collapse
Affiliation(s)
- Jun Okita
- Department of Endocrinology, Metabolism, Rheumatology and Nephrology, Faculty of Medicine, Oita University, 8795593, 1-1 idaigaoka Hasama-cho, Yufu-shi, Oita-ken, Japan
| | - Takeshi Nakata
- Department of Endocrinology, Metabolism, Rheumatology and Nephrology, Faculty of Medicine, Oita University, 8795593, 1-1 idaigaoka Hasama-cho, Yufu-shi, Oita-ken, Japan.
| | - Hiroki Uchida
- Department of Endocrinology, Metabolism, Rheumatology and Nephrology, Faculty of Medicine, Oita University, 8795593, 1-1 idaigaoka Hasama-cho, Yufu-shi, Oita-ken, Japan
| | - Akiko Kudo
- Department of Endocrinology, Metabolism, Rheumatology and Nephrology, Faculty of Medicine, Oita University, 8795593, 1-1 idaigaoka Hasama-cho, Yufu-shi, Oita-ken, Japan
| | - Akihiro Fukuda
- Department of Endocrinology, Metabolism, Rheumatology and Nephrology, Faculty of Medicine, Oita University, 8795593, 1-1 idaigaoka Hasama-cho, Yufu-shi, Oita-ken, Japan
| | - Tamio Ueno
- Department of Medical Technology and Sciences, School of Health Sciences at Fukuoka, International University of Health and Welfare, Okawa, Japan
| | - Masato Tanigawa
- Department of Biophysics, Faculty of Medicine, Oita University, Oita, Japan
| | - Noboru Sato
- Department of Healthcare AI Data Science, Faculty of Medicine, Oita University, Oita, Japan
| | - Hirotaka Shibata
- Department of Endocrinology, Metabolism, Rheumatology and Nephrology, Faculty of Medicine, Oita University, 8795593, 1-1 idaigaoka Hasama-cho, Yufu-shi, Oita-ken, Japan
| |
Collapse
|
21
|
Paccoud I, Leist AK, Schwaninger I, van Kessel R, Klucken J. Socio-ethical challenges and opportunities for advancing diversity, equity, and inclusion in digital medicine. Digit Health 2024; 10:20552076241277705. [PMID: 39372817 PMCID: PMC11450794 DOI: 10.1177/20552076241277705] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2024] [Accepted: 08/08/2024] [Indexed: 10/08/2024] Open
Abstract
Digitalization in medicine offers a significant opportunity to transform healthcare systems by providing novel digital tools and services to guide personalized prevention, prediction, diagnosis, treatment and disease management. This transformation raises a number of novel socio-ethical considerations for individuals and society as a whole, which need to be appropriately addressed to ensure that digital medical devices (DMDs) are widely adopted and benefit all patients as well as healthcare service providers. In this narrative review, based on a broad literature search in PubMed, Web of Science, Google Scholar, we outline five core socio-ethical considerations in digital medicine that intersect with the notions of equity and digital inclusion: (i) access, use and engagement with DMDs, (ii) inclusiveness in DMD clinical trials, (iii) algorithm fairness, (iv) surveillance and datafication, and (v) data privacy and trust. By integrating literature from multidisciplinary fields, including social, medical, and computer sciences, we shed light on challenges and opportunities related to the development and adoption of DMDs. We begin with an overview of the different types of DMDs, followed by in-depth discussions of five socio-ethical implications associated with their deployment. Concluding our review, we provide evidence-based multilevel recommendations aimed at fostering a more inclusive digital landscape to ensure that the development and integration of DMDs in healthcare mitigate rather than cause, maintain or exacerbate health inequities.
Collapse
Affiliation(s)
- Ivana Paccoud
- Digital Medicine Group, Department of Population Health, Luxembourg Institute of Health (LIH), Strassen, Luxembourg
- Digital Medicine, Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Anja K. Leist
- Department of Social Sciences, Institute for Research on Socio-Economic Inequality (IRSEI)), University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Isabel Schwaninger
- Digital Medicine Group, Department of Population Health, Luxembourg Institute of Health (LIH), Strassen, Luxembourg
- Digital Medicine, Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Robin van Kessel
- LSE Health, Department of Health Policy, London School of Economics and Political Science, London, UK
- Mental Health Policy and Economics Group, Department of Psychiatry, University of Cambridge, Cambridge, UK
- Department of International Health, Care and Public Health Research Institute (CAPHRI), Maastricht University, Maastricht, Netherlands
- Digital Public Health Task Force, Association of Schools of Public Health in the European Region (ASPHER), Brussels, Belgium
| | - Jochen Klucken
- Digital Medicine Group, Department of Population Health, Luxembourg Institute of Health (LIH), Strassen, Luxembourg
- Digital Medicine, Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
- Centre Hospitalier de Luxembourg (CHL), Luxembourg, Luxembourg
| |
Collapse
|
22
|
Montorsi C, Fusco A, Van Kerm P, Bordas SPA. Predicting depression in old age: Combining life course data with machine learning. ECONOMICS AND HUMAN BIOLOGY 2024; 52:101331. [PMID: 38035653 DOI: 10.1016/j.ehb.2023.101331] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Revised: 09/29/2023] [Accepted: 11/22/2023] [Indexed: 12/02/2023]
Abstract
With ageing populations, understanding life course factors that raise the risk of depression in old age may help anticipate needs and reduce healthcare costs in the long run. We estimate the risk of depression in old age by combining adult life course trajectories and childhood conditions in supervised machine learning algorithms. Using data from the Survey of Health, Ageing and Retirement in Europe (SHARE), we implement and compare the performance of six alternative machine learning algorithms. We analyse the performance of the algorithms using different life-course data configurations. While we obtain similar predictive abilities between algorithms, we achieve the highest predictive performance when employing semi-structured representations of life courses using sequence data. We use the Shapley Additive Explanations method to extract the most decisive predictive patterns. Age, health, childhood conditions, and low education predict most depression risk later in life, but we identify new predictive patterns in indicators of life course instability and low utilization of dental care services.
Collapse
Affiliation(s)
- Carlotta Montorsi
- Department of Living Conditions, Luxembourg Institute of Socio-Economic Research (LISER), 11, Porte des Sciences L-4366, Esch-sur-Alzette, Luxembourg; Department of Social Sciences, University of Luxembourg, Esch-sur-Alzette, Luxembourg; Insubria University, Department of Economics, 71, via Monte Generoso 21100, Varese, Italy.
| | - Alessio Fusco
- Department of Living Conditions, Luxembourg Institute of Socio-Economic Research (LISER), 11, Porte des Sciences L-4366, Esch-sur-Alzette, Luxembourg
| | - Philippe Van Kerm
- Department of Living Conditions, Luxembourg Institute of Socio-Economic Research (LISER), 11, Porte des Sciences L-4366, Esch-sur-Alzette, Luxembourg; Department of Social Sciences, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Stéphane P A Bordas
- Department of Engineering, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| |
Collapse
|
23
|
Newby D, Orgeta V, Marshall CR, Lourida I, Albertyn CP, Tamburin S, Raymont V, Veldsman M, Koychev I, Bauermeister S, Weisman D, Foote IF, Bucholc M, Leist AK, Tang EYH, Tai XY, The Deep Dementia Phenotyping (DEMON) Network, Llewellyn DJ, Ranson JM. Artificial intelligence for dementia prevention. Alzheimers Dement 2023; 19:5952-5969. [PMID: 37837420 PMCID: PMC10843720 DOI: 10.1002/alz.13463] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Revised: 08/01/2023] [Accepted: 08/07/2023] [Indexed: 10/16/2023]
Abstract
INTRODUCTION A wide range of modifiable risk factors for dementia have been identified. Considerable debate remains about these risk factors, possible interactions between them or with genetic risk, and causality, and how they can help in clinical trial recruitment and drug development. Artificial intelligence (AI) and machine learning (ML) may refine understanding. METHODS ML approaches are being developed in dementia prevention. We discuss exemplar uses and evaluate the current applications and limitations in the dementia prevention field. RESULTS Risk-profiling tools may help identify high-risk populations for clinical trials; however, their performance needs improvement. New risk-profiling and trial-recruitment tools underpinned by ML models may be effective in reducing costs and improving future trials. ML can inform drug-repurposing efforts and prioritization of disease-modifying therapeutics. DISCUSSION ML is not yet widely used but has considerable potential to enhance precision in dementia prevention. HIGHLIGHTS Artificial intelligence (AI) is not widely used in the dementia prevention field. Risk-profiling tools are not used in clinical practice. Causal insights are needed to understand risk factors over the lifespan. AI will help personalize risk-management tools for dementia prevention. AI could target specific patient groups that will benefit most for clinical trials.
Collapse
Affiliation(s)
- Danielle Newby
- University of Oxford, Department of Psychiatry, Warneford Hospital, Oxford, OX3 7JX, UK
| | - Vasiliki Orgeta
- Division of Psychiatry, University College London, London, W1T 7BN, UK
| | - Charles R Marshall
- Preventive Neurology Unit, Wolfson Institute of Population Health, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, E1 4NS, UK
- Department of Neurology, Royal London Hospital, London, E1 1BB, UK
| | - Ilianna Lourida
- Population Health Sciences Institute, Newcastle University, Newcastle, NE2 4AX, UK
- University of Exeter Medical School, Exeter, EX1 2HZ, UK
| | - Christopher P Albertyn
- Department of Old Age Psychiatry, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, SE5 8AF, UK
| | - Stefano Tamburin
- Department of Neurosciences, Biomedicine and Movement Sciences, University of Verona, Verona, 37129, Italy
| | - Vanessa Raymont
- University of Oxford, Department of Psychiatry, Warneford Hospital, Oxford, OX3 7JX, UK
| | - Michele Veldsman
- Wellcome Centre for Integrative Neuroimaging, University of Oxford, Oxford, OX3 9DU, UK
- Department of Experimental Psychology, University of Oxford, Oxford, OX2 6GG, UK
| | - Ivan Koychev
- University of Oxford, Department of Psychiatry, Warneford Hospital, Oxford, OX3 7JX, UK
| | - Sarah Bauermeister
- University of Oxford, Department of Psychiatry, Warneford Hospital, Oxford, OX3 7JX, UK
| | - David Weisman
- Abington Neurological Associates, Abington, PA 19001, USA
| | - Isabelle F Foote
- Preventive Neurology Unit, Wolfson Institute of Population Health, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, E1 4NS, UK
- Institute for Behavioral Genetics, University of Colorado Boulder, Boulder, CO 80309, USA
| | - Magda Bucholc
- Cognitive Analytics Research Lab, School of Computing, Engineering & Intelligent Systems, Ulster University, Derry, BT48 7JL, UK
| | - Anja K Leist
- Institute for Research on Socio-Economic Inequality (IRSEI), Department of Social Sciences, University of Luxembourg, L-4365, Luxembourg
| | - Eugene Y H Tang
- Population Health Sciences Institute, Newcastle University, Newcastle, NE2 4AX, UK
| | - Xin You Tai
- Nuffield Department of Clinical Neuroscience, University of Oxford, Oxford, OX3 9DU, UK
- Division of Clinical Neurology, John Radcliffe Hospital, Oxford University Hospitals Trust, Oxford, OX3 9DU, UK
| | | | - David J. Llewellyn
- University of Exeter Medical School, Exeter, EX1 2HZ, UK
- The Alan Turing Institute, London, NW1 2DB, UK
| | | |
Collapse
|
24
|
Gu W, Xue F, Han W, Wang Z, Zhao J, Zhang L, Yang C, Jiang J. Assessment of the spatial association between multiple pollutants of surface water and digestive cancer incidence in China: A novel application of spatial machine learning. ECOLOGICAL INDICATORS 2023; 154:110897. [DOI: 10.1016/j.ecolind.2023.110897] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/07/2024]
|
25
|
Gao Y, Sharma T, Cui Y. Addressing the Challenge of Biomedical Data Inequality: An Artificial Intelligence Perspective. Annu Rev Biomed Data Sci 2023; 6:153-171. [PMID: 37104653 PMCID: PMC10529864 DOI: 10.1146/annurev-biodatasci-020722-020704] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/29/2023]
Abstract
Artificial intelligence (AI) and other data-driven technologies hold great promise to transform healthcare and confer the predictive power essential to precision medicine. However, the existing biomedical data, which are a vital resource and foundation for developing medical AI models, do not reflect the diversity of the human population. The low representation in biomedical data has become a significant health risk for non-European populations, and the growing application of AI opens a new pathway for this health risk to manifest and amplify. Here we review the current status of biomedical data inequality and present a conceptual framework for understanding its impacts on machine learning. We also discuss the recent advances in algorithmic interventions for mitigating health disparities arising from biomedical data inequality. Finally, we briefly discuss the newly identified disparity in data quality among ethnic groups and its potential impacts on machine learning.
Collapse
Affiliation(s)
- Yan Gao
- Department of Genetics, Genomics, and Informatics, University of Tennessee Health Science Center, Memphis, Tennessee, USA;
| | - Teena Sharma
- Department of Genetics, Genomics, and Informatics, University of Tennessee Health Science Center, Memphis, Tennessee, USA;
| | - Yan Cui
- Department of Genetics, Genomics, and Informatics, University of Tennessee Health Science Center, Memphis, Tennessee, USA;
| |
Collapse
|
26
|
Stephen TL, Korobkova L, Breningstall B, Nguyen K, Mehta S, Pachicano M, Jones KT, Hawes D, Cabeen RP, Bienkowski MS. Machine Learning Classification of Alzheimer's Disease Pathology Reveals Diffuse Amyloid as a Major Predictor of Cognitive Impairment in Human Hippocampal Subregions. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.31.543117. [PMID: 37333119 PMCID: PMC10274752 DOI: 10.1101/2023.05.31.543117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/20/2023]
Abstract
Analyzing Alzheimer's disease (AD) pathology within anatomical subregions is a significant challenge, often carried out by pathologists using a standardized, semi-quantitative approach. To augment traditional methods, a high-throughput, high-resolution pipeline was created to classify the distribution of AD pathology within hippocampal subregions. USC ADRC post-mortem tissue sections from 51 patients were stained with 4G8 for amyloid, Gallyas for neurofibrillary tangles (NFTs) and Iba1 for microglia. Machine learning (ML) techniques were utilized to identify and classify amyloid pathology (dense, diffuse and APP (amyloid precursor protein)), NFTs, neuritic plaques and microglia. These classifications were overlaid within manually segmented regions (aligned with the Allen Human Brain Atlas) to create detailed pathology maps. Cases were separated into low, intermediate, or high AD stages. Further data extraction enabled quantification of plaque size and pathology density alongside ApoE genotype, sex, and cognitive status. Our findings revealed that the increase in pathology burden across AD stages was driven mainly by diffuse amyloid. The pre and para-subiculum had the highest levels of diffuse amyloid while NFTs were highest in the A36 region in high AD cases. Moreover, different pathology types had distinct trajectories across disease stages. In a subset of AD cases, microglia were elevated in intermediate and high compared to low AD. Microglia also correlated with amyloid pathology in the Dentate Gyrus. The size of dense plaques, which may represent microglial function, was lower in ApoE4 carriers. In addition, individuals with memory impairment had higher levels of both dense and diffuse amyloid. Taken together, our findings integrating ML classification approaches with anatomical segmentation maps provide new insights on the complexity of disease pathology in AD progression. Specifically, we identified diffuse amyloid pathology as being a major driver of AD in our cohort, regions of interest and microglial responses that might advance AD diagnosis and treatment.
Collapse
|