Cited by Other Article(s)

Sadeghi P, Karimi H, Lavafian A, Rashedi R, Samieefar N, Shafiekhani S, Rezaei N. Machine learning and artificial intelligence within pediatric autoimmune diseases: applications, challenges, future perspective. Expert Rev Clin Immunol 2024:1-18. [PMID: 38771915 DOI: 10.1080/1744666x.2024.2359019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/19/2023] [Accepted: 05/20/2024] [Indexed: 05/23/2024]

Labib SM. Greenness, air pollution, and temperature exposure effects in predicting premature mortality and morbidity: A small-area study using spatial random forest model. The Science of the Total Environment 2024;928:172387. [PMID: 38608883 DOI: 10.1016/j.scitotenv.2024.172387] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/03/2024] [Revised: 04/08/2024] [Accepted: 04/08/2024] [Indexed: 04/14/2024]

Abstract

BACKGROUND

Although studies have documented negative impacts of air pollution and heat or cold exposure on mortality and morbidity, and positive effects of increased greenness in reducing them, few studies have explored the combined and synergistic effects of these exposures in predicting these health outcomes, and most have ignored spatial autocorrelation when analyzing their health effects. This study aims to investigate the health effects of air pollution, greenness, and temperature exposure on premature mortality and morbidity within a spatial machine-learning modeling framework.

METHODS

Years of potential life lost, reflecting premature mortality, and the comparative illness and disability ratio, reflecting chronic morbidity, were obtained for 1673 small areas covering Greater Manchester for the years 2008-2013. Average annual levels of NO2 concentration, normalized difference vegetation index (NDVI) representing greenness, and annual average air temperature were utilized to assess exposure in each area. These exposures were linked to health outcomes using non-spatial and spatial random forest (RF) models while accounting for spatial autocorrelation.

RESULTS

Spatial-RF models provided the best predictive accuracy when spatial autocorrelation was accounted for. Among the exposures considered, air pollution emerged as the most influential in predicting mortality and morbidity, followed by NDVI and temperature exposure. Nonlinear exposure-response relations were observed, and interactions between exposures illustrated specific ranges or sweet and sour spots of exposure thresholds where combined effects either exacerbate or moderate health conditions.

CONCLUSION

Air pollution exposure had a greater negative impact on health than greenness and temperature exposure. Combined exposure effects may have the highest influence on premature mortality and morbidity burden.

Ghazi L, Farhat K, Hoenig MP, Durant TJS, El-Khoury JM. Biomarkers vs Machines: The Race to Predict Acute Kidney Injury. Clin Chem 2024;70:805-819. [PMID: 38299927 DOI: 10.1093/clinchem/hvad217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/06/2023] [Accepted: 10/20/2023] [Indexed: 02/02/2024]

Kikuchi T, Hanaoka S, Nakao T, Takenaga T, Nomura Y, Mori H, Yoshikawa T. Synthesis of Hybrid Data Consisting of Chest Radiographs and Tabular Clinical Records Using Dual Generative Models for COVID-19 Positive Cases. Journal of Imaging Informatics in Medicine 2024;37:1217-1227. [PMID: 38351224 DOI: 10.1007/s10278-024-01015-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/26/2023] [Revised: 12/21/2023] [Accepted: 12/22/2023] [Indexed: 06/13/2024]

Sloan RA. Estimated Cardiorespiratory Fitness and Metabolic Risks. International Journal of Environmental Research and Public Health 2024;21:635. [PMID: 38791849 PMCID: PMC11120962 DOI: 10.3390/ijerph21050635] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/01/2024] [Revised: 05/14/2024] [Accepted: 05/14/2024] [Indexed: 05/26/2024]

Rafiee M, Jahangiri-Rad M, Mohseni-Bandpei A, Razmi E. Impacts of socioeconomic and environmental factors on neoplasms incidence rates using machine learning and GIS: a cross-sectional study in Iran. Sci Rep 2024;14:10604. [PMID: 38719879 PMCID: PMC11078954 DOI: 10.1038/s41598-024-61397-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/23/2024] [Accepted: 05/06/2024] [Indexed: 05/12/2024] Open

Abstract

Neoplasm is an umbrella term used to describe either benign or malignant conditions. The correlations between socioeconomic and environmental factors and the occurrence of new-onset neoplasms have already been demonstrated in a body of research. Nevertheless, few studies have specifically dealt with the nature of the relationship, the significance of risk factors, and their geographic variation, particularly in low- and middle-income communities. This study, thus, set out to (1) analyze spatiotemporal variations of the age-adjusted incidence rate (AAIR) of neoplasms in Iran throughout five time periods, (2) investigate relationships between a collection of environmental and socioeconomic indicators and the AAIR of neoplasms all over the country, and (3) evaluate geographical alterations in their relative importance. Our cross-sectional study design was based on county-level data from 2010 to 2020. AAIR of neoplasms data was acquired from the Institute for Health Metrics and Evaluation (IHME). HotSpot analyses and Anselin Local Moran's I indices were deployed to precisely identify AAIR of neoplasms high- and low-risk clusters. Multi-scale geographically weighted regression (MGWR) analysis was performed to evaluate the association between each explanatory variable and the AAIR of neoplasms. Utilizing random forests (RF), we also examined the relationships between environmental (e.g., UV index and PM2.5 concentration) and socioeconomic (e.g., Gini coefficient and literacy rate) factors and AAIR of neoplasms. AAIR of neoplasms displayed a significant increasing trend over the study period. According to the MGWR, the only factor that significantly varied spatially and was associated with the AAIR of neoplasms in Iran was the UV index. The RF model showed good accuracy on both training and testing data, with correlation coefficients (R²) greater than 0.91 and 0.92, respectively. UV index and Gini coefficient ranked as the most important variables in the prediction of AAIR of neoplasms, based on the relative influence of each variable. More research using machine learning approaches that consider all possible determinants is required to assess the outcomes of health strategies and to properly formulate policy planning.

Nawrin SS, Inada H, Momma H, Nagatomi R. Twenty-four-hour physical activity patterns associated with depressive symptoms: a cross-sectional study using big data-machine learning approach. BMC Public Health 2024;24:1254. [PMID: 38714982 PMCID: PMC11075341 DOI: 10.1186/s12889-024-18759-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/02/2024] [Accepted: 05/02/2024] [Indexed: 05/12/2024] Open

Abstract

BACKGROUND

Depression is a global burden with profound personal and economic consequences. Previous studies have reported that the amount of physical activity is associated with depression. However, the relationship between the temporal patterns of physical activity and depressive symptoms is poorly understood. In this exploratory study, we hypothesize that a particular temporal pattern of daily physical activity could be associated with depressive symptoms and might be a better marker than the total amount of physical activity.

METHODS

To address the hypothesis, we investigated the association between depressive symptoms and daily dominant activity behaviors based on 24-h temporal patterns of physical activity. We conducted a cross-sectional study on NHANES 2011-2012 data collected from the noninstitutionalized civilian resident population of the United States. A total of 6613 participants had a complete set of physical activity data collected by the accelerometer. Of these, 4242 also had complete demographic data and a completed Patient Health Questionnaire-9 (PHQ-9), a tool to quantify depressive symptoms. The association between activity-count behaviors and depressive symptoms was analyzed using multivariable logistic regression to adjust for confounding factors in sequential models.

RESULTS

We identified four physical activity-count behaviors based on five physical activity-counting patterns classified by unsupervised machine learning. Regarding PHQ-9 scores, we found that evening dominant behavior was positively associated with depressive symptoms compared to morning dominant behavior as the control group.

CONCLUSIONS

Our results might contribute to monitoring and identifying individuals with latent depressive symptoms, emphasizing the importance of nuanced activity patterns and their probability of assessing depressive symptoms effectively.

Kapoor S, Cantrell EM, Peng K, Pham TH, Bail CA, Gundersen OE, Hofman JM, Hullman J, Lones MA, Malik MM, Nanayakkara P, Poldrack RA, Raji ID, Roberts M, Salganik MJ, Serra-Garcia M, Stewart BM, Vandewiele G, Narayanan A. REFORMS: Consensus-based Recommendations for Machine-learning-based Science. Science Advances 2024;10:eadk3452. [PMID: 38691601 PMCID: PMC11092361 DOI: 10.1126/sciadv.adk3452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/16/2023] [Accepted: 03/29/2024] [Indexed: 05/03/2024]

Affiliation(s)

Sayash Kapoor: Department of Computer Science, Princeton University, Princeton, NJ 08544, USA; Center for Information Technology Policy, Princeton University, Princeton, NJ 08544, USA
Emily M. Cantrell: Department of Sociology, Princeton University, Princeton, NJ 08544, USA; School of Public and International Affairs, Princeton University, Princeton, NJ 08544, USA
Kenny Peng: Department of Computer Science, Cornell University, Ithaca, NY 14850, USA
Thanh Hien Pham: Department of Computer Science, Princeton University, Princeton, NJ 08544, USA; Center for Information Technology Policy, Princeton University, Princeton, NJ 08544, USA
Christopher A. Bail: Department of Sociology, Duke University, Durham, NC 27708, USA; Department of Political Science, Duke University, Durham, NC 27708, USA; Sanford School of Public Policy, Duke University, Durham, NC 27708, USA
Odd Erik Gundersen: Department of Computer Science, Norwegian University of Science and Technology, Trondheim, Norway; Aneo AS, Trondheim, Norway
Jake M. Hofman: Microsoft Research, New York, NY 10012, USA
Jessica Hullman: Department of Computer Science, Northwestern University, Evanston, IL 60208, USA
Michael A. Lones: School of Mathematical and Computer Sciences, Heriot-Watt University, Edinburgh, UK
Momin M. Malik: Center for Digital Health, Mayo Clinic, Rochester, MN 55905, USA; School of Social Policy & Practice, University of Pennsylvania, Philadelphia, PA 19104, USA; Institute in Critical Quantitative, Computational, & Mixed Methodologies, Johns Hopkins University, Baltimore, MD 21218, USA
Priyanka Nanayakkara: Department of Computer Science, Northwestern University, Evanston, IL 60208, USA; Department of Communication Studies, Northwestern University, Evanston, IL 60208, USA
Russell A. Poldrack: Department of Psychology, Stanford University, Stanford, CA 94305, USA
Inioluwa Deborah Raji: Department of Computer Science, University of California, Berkeley, Berkeley, CA 94720, USA
Michael Roberts: Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK; Department of Medicine, University of Cambridge, Cambridge, UK
Matthew J. Salganik: Center for Information Technology Policy, Princeton University, Princeton, NJ 08544, USA; Department of Sociology, Princeton University, Princeton, NJ 08544, USA; Office of Population Research, Princeton University, Princeton, NJ 08544, USA
Marta Serra-Garcia: Rady School of Management, University of California, San Diego, La Jolla, CA 92093, USA
Brandon M. Stewart: Center for Information Technology Policy, Princeton University, Princeton, NJ 08544, USA; Department of Sociology, Princeton University, Princeton, NJ 08544, USA; Office of Population Research, Princeton University, Princeton, NJ 08544, USA; Department of Politics, Princeton University, Princeton, NJ 08544, USA
Gilles Vandewiele: Department of Information Technology, Ghent University, Ghent, Belgium
Arvind Narayanan: Department of Computer Science, Princeton University, Princeton, NJ 08544, USA; Center for Information Technology Policy, Princeton University, Princeton, NJ 08544, USA

Nimmal Haribabu G, Basu B. Implementing Machine Learning approaches for accelerated prediction of bone strain in acetabulum of a hip joint. J Mech Behav Biomed Mater 2024;153:106495. [PMID: 38460455 DOI: 10.1016/j.jmbbm.2024.106495] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/05/2023] [Revised: 02/10/2024] [Accepted: 03/01/2024] [Indexed: 03/11/2024]

Abstract

Finite Element (FE) methods for biomechanical analysis involving implant design and subject parameters for musculoskeletal applications are extensively reported in the literature. Such an approach is manually intensive and computationally expensive, with long simulation times. Although Artificial Intelligence (AI) based approaches are implemented to a limited extent in biomechanics, such approaches to predict bone strain in the acetabulum of a hip joint are hardly explored. In this context, the primary objective of this paper is to evaluate machine learning (ML) models in tandem with high-fidelity FE analysis (FEA) data for the accelerated prediction of the biomechanical response in the acetabulum of the human hip joint during the walking gait. The parameters used in the FEA study included the subject weight, number and distribution of fins on the periphery of the acetabular shell, bone condition and phases of the gait cycle. The biomechanical response has also been evaluated using three different acetabular liners, including pre-clinically validated HDPE-20% HA-20% Al2O3, highly-crosslinked ultrahigh molecular weight polyethylene (HC-UHMWPE) and ZrO2-toughened Al2O3 (ZTA). Such parametric variation in FEA, involving 26 variables and a full factorial design, resulted in 10,752 datasets for spatially varying bone strains. The bone condition, as opposed to subject weight, was found to play a statistically significant role in determining the strain response in the periprosthetic bone of the acetabulum. Using hyperparameter tuning, K-fold cross validation and statistical learning approaches, a number of ML models were trained on the FEA dataset, and the Random Forest model performed the best with a coefficient of determination (R²) value of 0.99/0.97 and Root Mean Square Error (RMSE) of 0.02/0.01 on the training/test dataset. Taken together, this study establishes the potential of the ML approach as a fast surrogate of FEA for implant biomechanics analysis, in less than a minute.
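The two metrics quoted in this abstract, the coefficient of determination and the root mean square error, can be sketched in a few lines. This is a minimal illustration, not the authors' code: the function name `r2_and_rmse` and the sample numbers are hypothetical and are not taken from the study.

```python
import math


def r2_and_rmse(y_true, y_pred):
    """Return (R^2, RMSE) for paired lists of observed and predicted values."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(y_true)
    mean_true = sum(y_true) / n
    # Residual sum of squares: squared error of the model's predictions.
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    # Total sum of squares: squared error of always predicting the mean.
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    if ss_tot == 0:
        raise ValueError("R^2 is undefined when all observed values are equal")
    return 1 - ss_res / ss_tot, math.sqrt(ss_res / n)


# Made-up values for illustration only (not strain data from the paper).
observed = [1.0, 2.0, 3.0]
predicted = [1.0, 2.0, 4.0]
r2, rmse = r2_and_rmse(observed, predicted)
print(f"R^2 = {r2:.3f}, RMSE = {rmse:.3f}")
```

Reporting both values on the training and the test split, as the abstract does, gives a quick view of how far the fit degrades on unseen data.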

Saingam P, Jain T, Woicik A, Li B, Candry P, Redcorn R, Wang S, Himmelfarb J, Bryan A, Winkler MKH, Gattuso M. Integrating socio-economic vulnerability factors improves neighborhood-scale wastewater-based epidemiology for public health applications. Water Research 2024;254:121415. [PMID: 38479175 DOI: 10.1016/j.watres.2024.121415] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/23/2023] [Revised: 02/28/2024] [Accepted: 03/03/2024] [Indexed: 04/06/2024]

Abstract

Wastewater Based Epidemiology (WBE) of COVID-19 is a low-cost, non-invasive, and inclusive early warning tool for disease spread. Previously studied WBE focused on sampling at wastewater treatment plant scale, limiting the level at which demographic and geographic variations in disease dynamics can be incorporated into the analysis of certain neighborhoods. This study demonstrates the integration of demographic mapping to improve the WBE of COVID-19 and associated post-COVID disease prediction (here kidney disease) at the neighborhood level using machine learning. WBE was conducted at six neighborhoods in Seattle from October 2020 to February 2022. Wastewater processing and RT-qPCR were performed to obtain SARS-CoV-2 RNA concentration. Census data, clinical data of COVID-19, as well as patient data of acute kidney injury (AKI) cases reported during the study period were collected and the distribution across the city was studied using Geographic Information System (GIS) mapping. Further, we analyzed the data set to better understand socioeconomic impacts on disease prevalence of COVID-19 and AKI per neighborhood. The heterogeneity of eleven demographic factors (such as education and age among others) was observed within neighborhoods across the city of Seattle. Dynamics of COVID-19 clinical cases and wastewater SARS-CoV-2 varied across neighborhoods with different levels of demographics. Machine learning models trained with data from the earlier stages of the pandemic were able to predict both COVID-19 and AKI incidence in the later stages of the pandemic (Spearman correlation coefficient of 0.546-0.904), with the most predictive model trained on the combination of wastewater data and demographics. The integration of demographics strengthened machine learning models' capabilities to predict prevalence of COVID-19, and of AKI as a marker for post-COVID sequelae. Demographic-based WBE presents an effective tool to monitor and manage public health beyond COVID-19 at the neighborhood level.
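The abstract scores its predictions with the Spearman correlation coefficient, which is the Pearson correlation computed on ranks. A minimal sketch of that calculation follows, assuming tied values share their average rank; the helper names `average_ranks` and `spearman` and the sample series are hypothetical and do not come from the study.

```python
import math


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the two rank lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("inputs must have equal length of at least 2")
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in rx))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in ry))
    if sd_x == 0 or sd_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / (sd_x * sd_y)


# Made-up weekly values for illustration only.
predicted_incidence = [12, 15, 9, 30, 22]
observed_incidence = [10, 18, 8, 25, 27]
print(round(spearman(predicted_incidence, observed_incidence), 3))
```

Because only ranks enter the calculation, the coefficient rewards a model for ordering weeks correctly even when its absolute predictions are off in scale.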

Santana JEG, Oliveira-Tintino CDDM, Alencar GG, Siqueira GM, Almeida-Bezerra JW, Viana Rodrigues JP, Pinheiro Gonçalves VB, Nicolete R, Tintino SR, Coutinho HDM, Silva TGD. Liposomal nanoformulations with trans-caryophyllene and caryophyllene oxide: do they have an inhibitory action on the efflux pumps NorA, Tet(K), MsrA, and MepA? Chem Biol Interact 2024;393:110945. [PMID: 38460934 DOI: 10.1016/j.cbi.2024.110945] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/06/2023] [Revised: 02/09/2024] [Accepted: 03/06/2024] [Indexed: 03/11/2024]

Ran W, Yu Q. Data-driven clustering approach to identify novel clusters of high cognitive impairment risk among Chinese community-dwelling elderly people with normal cognition: A national cohort study. J Glob Health 2024;14:04088. [PMID: 38638099 PMCID: PMC11026990 DOI: 10.7189/jogh.14.04088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/20/2024] Open

Abstract

Background

Cognitive impairment is a highly heterogeneous disorder that necessitates further investigation into the distinct characteristics of populations at varying risk levels of cognitive impairment. Using a large-scale registry cohort of elderly individuals, we applied a data-driven approach to identify novel clusters based on diverse sociodemographic features.

Methods

A prospective cohort of 6398 elderly people from the Chinese Longitudinal Healthy Longevity Survey, followed from 2008 to 2014, was used to develop and validate the model. Participants who were aged ≥60 years, were community-dwelling, and had a score ≥18 on the Chinese version of the Mini-Mental State Examination (MMSE) were included. Sixty-nine sociodemographic features were included in the analysis. The total population was divided into two-thirds for the derivation cohort (n = 4265) and one-third for the validation cohort (n = 2133). In the derivation cohort, an unsupervised Gaussian mixture model was applied to categorise participants into distinct clusters. A classifier was developed based on the 10 most important factors and was applied to categorise participants into their corresponding clusters in the validation cohort. The difference in the three-year risk of cognitive impairment was compared across the clusters.

Results

We identified four clusters with distinct features in the derivation cohort. Cluster 1 was associated with the worst life independence, longest sleep duration, and the oldest age. Cluster 2 demonstrated the highest loneliness, characterised by non-marital status and living alone. Cluster 3 was characterised by the lowest sense of loneliness and the highest proportions in marital status and family co-residence. Cluster 4 demonstrated heightened engagement in exercise and leisure activity, along with independent decision-making, hygiene, and a diverse diet. In comparison to Cluster 4, Cluster 1 exhibited the highest three-year cognitive impairment risk (adjusted odds ratio (aOR) = 3.31; 95% confidence interval (CI) = 1.81-6.05), followed by Cluster 2 and Cluster 3 after adjustment for baseline MMSE, residence, sex, age, years of education, drinking, smoking, hypertension, diabetes, heart disease and stroke or cardiovascular diseases.

Conclusions

A data-driven approach can be instrumental in identifying individuals at high risk of cognitive impairment among cognitively normal elderly populations. Based on various sociodemographic features, these clusters can suggest individualised intervention plans.

Eijsbroek VC, Kjell K, Schwartz HA, Boehnke JR, Fried EI, Klein DN, Gustafsson P, Augenstein I, Bossuyt PMM, Kjell O. The LEADING Guideline: Reporting Standards for Expert Panel, Best-Estimate Diagnosis, and Longitudinal Expert All Data (LEAD) Studies. medRxiv: The Preprint Server for Health Sciences 2024:2024.03.19.24304526. [PMID: 38699296 PMCID: PMC11065032 DOI: 10.1101/2024.03.19.24304526] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/05/2024]

Yan Y, Schillemans T, Skantze V, Brunius C. Adjusting for covariates and assessing modeling fitness in machine learning using MUVR2. Bioinformatics Advances 2024;4:vbae051. [PMID: 38645717 PMCID: PMC11031361 DOI: 10.1093/bioadv/vbae051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/11/2023] [Revised: 03/05/2024] [Accepted: 04/03/2024] [Indexed: 04/23/2024]

Choo SM, Sartori D, Lee SC, Yang HC, Syed-Abdul S. Data-Driven Identification of Factors That Influence the Quality of Adverse Event Reports: 15-Year Interpretable Machine Learning and Time-Series Analyses of VigiBase and QUEST. JMIR Med Inform 2024;12:e49643. [PMID: 38568722 PMCID: PMC11024759 DOI: 10.2196/49643] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/05/2023] [Revised: 10/10/2023] [Accepted: 02/24/2024] [Indexed: 04/20/2024] Open

Abstract

BACKGROUND

The completeness of adverse event (AE) reports, crucial for assessing putative causal relationships, is measured using the vigiGrade completeness score in VigiBase, the World Health Organization global database of reported potential AEs. Malaysian reports have surpassed the global average score (approximately 0.44), achieving a 5-year average of 0.79 (SD 0.23) as of 2019 and approaching the benchmark for well-documented reports (0.80). However, the contributing factors to this relatively high report completeness score remain unexplored.

OBJECTIVE

This study aims to explore the main drivers influencing the completeness of Malaysian AE reports in VigiBase over a 15-year period using vigiGrade. A secondary objective was to understand the strategic measures taken by the Malaysian authorities leading to enhanced report completeness across different time frames.

METHODS

We analyzed 132,738 Malaysian reports (2005-2019) recorded in VigiBase up to February 2021 split into historical International Drug Information System (INTDIS; n=63,943, 48.17% in 2005-2016) and newer E2B (n=68,795, 51.83% in 2015-2019) format subsets. For machine learning analyses, we performed a 2-stage feature selection followed by a random forest classifier to identify the top features predicting well-documented reports. We subsequently applied tree Shapley additive explanations to examine the magnitude, prevalence, and direction of feature effects. In addition, we conducted time-series analyses to evaluate chronological trends and potential influences of key interventions on reporting quality.

RESULTS

Among the analyzed reports, 42.84% (56,877/132,738) were well documented, with an increase of 65.37% (53,929/82,497) since 2015. Over two-thirds (46,186/68,795, 67.14%) of the Malaysian E2B reports were well documented compared to INTDIS reports at 16.72% (10,691/63,943). For INTDIS reports, higher pharmacovigilance center staffing was the primary feature positively associated with being well documented. In recent E2B reports, the top positive features included reaction abated upon drug dechallenge, reaction onset or drug use duration of <1 week, dosing interval of <1 day, reports from public specialist hospitals, reports by pharmacists, and reaction duration between 1 and 6 days. In contrast, reports from product registration holders and other health care professionals and reactions involving product substitution issues negatively affected the quality of E2B reports. Multifaceted strategies and interventions comprising policy changes, continuity of education, and human resource development laid the groundwork for AE reporting in Malaysia, whereas advancements in technological infrastructure, pharmacovigilance databases, and reporting tools concurred with increases in both the quantity and quality of AE reports.
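The report counts quoted in the Methods and Results can be cross-checked with simple arithmetic. The sketch below only re-uses figures stated in the abstract; the variable names and the `pct` helper are illustrative choices, not part of the study.

```python
# Counts as stated in the abstract.
total_reports = 132_738
intdis_reports = 63_943
e2b_reports = 68_795
well_documented_total = 56_877
well_documented_intdis = 10_691
well_documented_e2b = 46_186


def pct(part, whole):
    """Percentage of `whole` represented by `part`, rounded to 2 decimals."""
    return round(100 * part / whole, 2)


# The two format subsets should add up to the full set of reports.
subsets_match_total = intdis_reports + e2b_reports == total_reports
# Well-documented reports per subset should add up to the overall count.
well_documented_match = (
    well_documented_intdis + well_documented_e2b == well_documented_total
)

print("subsets add up:", subsets_match_total)
print("well-documented counts add up:", well_documented_match)
print("E2B well documented (%):", pct(well_documented_e2b, e2b_reports))
print("INTDIS well documented (%):", pct(well_documented_intdis, intdis_reports))
```

The gap between the two subset percentages is the contrast that motivates the abstract's separate feature analyses for INTDIS and E2B reports.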

CONCLUSIONS

Through interpretable machine learning and time-series analyses, this study identified key features that positively or negatively influence the completeness of Malaysian AE reports and unveiled how Malaysia has developed its pharmacovigilance capacity via multifaceted strategies and interventions. These findings will guide future work in enhancing pharmacovigilance and public health.

Sun C, Fang R, Salemi M, Prosperi M, Rife Magalis B. DeepDynaForecast: Phylogenetic-informed graph deep learning for epidemic transmission dynamic prediction. PLoS Comput Biol 2024;20:e1011351. [PMID: 38598563 PMCID: PMC11034642 DOI: 10.1371/journal.pcbi.1011351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/14/2023] [Revised: 04/22/2024] [Accepted: 03/11/2024] [Indexed: 04/12/2024] Open

Wójcik Z, Dimitrova V, Warrington L, Velikova G, Absolom K. Using Machine Learning to Predict Unplanned Hospital Utilization and Chemotherapy Management From Patient-Reported Outcome Measures. JCO Clin Cancer Inform 2024;8:e2300264. [PMID: 38669610 PMCID: PMC11161248 DOI: 10.1200/cci.23.00264] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/15/2023] [Revised: 02/14/2024] [Accepted: 03/01/2024] [Indexed: 04/28/2024] Open

Abstract

PURPOSE

Adverse effects of chemotherapy often require hospital admissions or treatment management. Identifying factors contributing to unplanned hospital utilization may improve health care quality and patients' well-being. This study aimed to assess if patient-reported outcome measures (PROMs) improve performance of machine learning (ML) models predicting hospital admissions, triage events (contacting helpline or attending hospital), and changes to chemotherapy.

MATERIALS AND METHODS

Clinical trial data were used and contained responses to three PROMs (European Organisation for Research and Treatment of Cancer Core Quality of Life Questionnaire [QLQ-C30], EuroQol Five-Dimensional Visual Analogue Scale [EQ-5D], and Functional Assessment of Cancer Therapy-General [FACT-G]) and clinical information on 508 participants undergoing chemotherapy. Six feature sets (with following variables: [1] all available; [2] clinical; [3] PROMs; [4] clinical and QLQ-C30; [5] clinical and EQ-5D; [6] clinical and FACT-G) were applied in six ML models (logistic regression [LR], decision tree, adaptive boosting, random forest [RF], support vector machines [SVMs], and neural network) to predict admissions, triage events, and chemotherapy changes.

RESULTS

The comprehensive analysis of the predictive performance of the six ML models for each feature set, under three different methods for handling class imbalance, indicated that PROMs improved predictions of all outcomes. RF and SVMs had the highest performance for predicting admissions and changes to chemotherapy in balanced data sets, and LR had the highest performance in the imbalanced data set. Balancing the data led to the best performance compared with the imbalanced data set or the data set in which only the train set was balanced.
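The abstract does not name the three imbalance-handling methods it compared. As one common possibility, random oversampling of the minority class can be sketched as below; this is an assumed stand-in, not the authors' procedure, and the function name `random_oversample` and the toy labels are hypothetical.

```python
import random
from collections import Counter


def random_oversample(rows, labels, seed=0):
    """Duplicate randomly chosen minority-class rows until classes are equal.

    Assumption: this illustrates one generic balancing technique; the study
    may have used different methods.
    """
    rng = random.Random(seed)
    by_label = {}
    for row, label in zip(rows, labels):
        by_label.setdefault(label, []).append(row)
    target = max(len(group) for group in by_label.values())
    out_rows, out_labels = [], []
    for label, group in by_label.items():
        # Sample with replacement to fill the gap up to the largest class.
        extra = [rng.choice(group) for _ in range(target - len(group))]
        for row in group + extra:
            out_rows.append(row)
            out_labels.append(label)
    return out_rows, out_labels


# Toy data: 8 participants without an admission (0) and 2 with one (1).
toy_rows = list(range(10))
toy_labels = [0] * 8 + [1] * 2
balanced_rows, balanced_labels = random_oversample(toy_rows, toy_labels)
print(Counter(balanced_labels))
```

Whether such balancing is applied to the whole data set or to the train set only is exactly the distinction the Results paragraph draws, so the choice should be stated alongside any reported performance.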

CONCLUSION

These results endorsed the view that ML can be applied on PROM data to predict hospital utilization and chemotherapy management. If further explored, this study may contribute to health care planning and treatment personalization. Rigorous comparison of model performance affected by different imbalanced data handling methods shows best practice in ML research.

Zhu J, Wu Y, Lin S, Duan S, Wang X, Fang Y. Identifying and predicting physical limitation and cognitive decline trajectory group of older adults in China: A data-driven machine learning analysis. J Affect Disord 2024;350:590-599. [PMID: 38218258 DOI: 10.1016/j.jad.2024.01.095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/29/2023] [Revised: 11/24/2023] [Accepted: 01/07/2024] [Indexed: 01/15/2024]

Brooks JM, Chapman CG, Chen BK, Floyd SB, Hikmet N. Assessing the properties of patient-specific treatment effect estimates from causal forest algorithms under essential heterogeneity. BMC Med Res Methodol 2024;24:66. [PMID: 38481139 PMCID: PMC10935905 DOI: 10.1186/s12874-024-02187-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/28/2023] [Accepted: 02/21/2024] [Indexed: 03/17/2024] Open

Zafar F, Fakhare Alam L, Vivas RR, Wang J, Whei SJ, Mehmood S, Sadeghzadegan A, Lakkimsetti M, Nazir Z. The Role of Artificial Intelligence in Identifying Depression and Anxiety: A Comprehensive Literature Review. Cureus 2024;16:e56472. [PMID: 38638735 PMCID: PMC11025697 DOI: 10.7759/cureus.56472] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/18/2024] [Indexed: 04/20/2024] Open

Abstract

This narrative literature review undertakes a comprehensive examination of the burgeoning field, tracing the development of artificial intelligence (AI)-powered tools for depression and anxiety detection from the level of intricate algorithms to practical applications. Delivering essential mental health care services is now a significant public health priority. In recent years, AI has become a game-changer in the early identification and intervention of these pervasive mental health disorders. AI tools can potentially empower behavioral healthcare services by helping psychiatrists collect objective data on patients' progress and tasks. This study emphasizes the current understanding of AI, the different types of AI, its current use in multiple mental health disorders, advantages, disadvantages, and future potentials. As technology develops and the digitalization of the modern era increases, there will be a rise in the application of artificial intelligence in psychiatry; therefore, a comprehensive understanding will be needed. We searched PubMed, Google Scholar, and Science Direct using keywords for this. In a recent review of studies using electronic health records (EHR) with AI and machine learning techniques for diagnosing all clinical conditions, roughly 99 publications have been found. Out of these, 35 studies were identified for mental health disorders in all age groups, and among them, six studies utilized EHR data sources. By critically analyzing prominent scholarly works, we aim to illuminate the current state of this technology, exploring its successes, limitations, and future directions. In doing so, we hope to contribute to a nuanced understanding of AI's potential to revolutionize mental health diagnostics and pave the way for further research and development in this critically important domain.

Lee H, Hanson HA, Logan J, Maguire D, Kapadia A, Dewji S, Agasthya G. Evaluating county-level lung cancer incidence from environmental radiation exposure, PM2.5, and other exposures with regression and machine learning models. ENVIRONMENTAL GEOCHEMISTRY AND HEALTH 2024;46:82. [PMID: 38367080 PMCID: PMC10874317 DOI: 10.1007/s10653-023-01820-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 11/27/2023] [Indexed: 02/19/2024]

Abstract

Characterizing the interplay between exposures shaping the human exposome is vital for uncovering the etiology of complex diseases. For example, cancer risk is modified by a range of multifactorial external environmental exposures. Environmental, socioeconomic, and lifestyle factors all shape lung cancer risk. However, epidemiological studies of radon aimed at identifying populations at high risk for lung cancer often fail to consider multiple exposures simultaneously. For example, moderating factors, such as PM2.5, may affect the transport of radon progeny to lung tissue. This ecological analysis leveraged a population-level dataset from the National Cancer Institute's Surveillance, Epidemiology, and End-Results data (2013-17) to simultaneously investigate the effect of multiple sources of low-dose radiation (gross [Formula: see text] activity and indoor radon) and PM2.5 on lung cancer incidence rates in the USA. County-level factors (environmental, sociodemographic, lifestyle) were controlled for, and Poisson regression and random forest models were used to assess the association between radon exposure and lung and bronchus cancer incidence rates. The tree-based machine learning (ML) method performed better than traditional regression (Poisson regression: 6.29/7.13 (mean absolute percentage error, MAPE), 12.70/12.77 (root mean square error, RMSE); Poisson random forest regression: 1.22/1.16 (MAPE), 8.01/8.15 (RMSE)). The effect of PM2.5 increased with the concentration of environmental radon, thereby confirming findings from previous studies that investigated the possible synergistic effect of radon and PM2.5 on health outcomes. In summary, the results demonstrated a need to consider multiple environmental exposures when assessing radon exposure's association with lung cancer risk, thereby highlighting (1) the importance of an exposomics framework and (2) that employing ML models may capture the complex interplay between environmental exposures and health, as in the case of indoor radon exposure and lung cancer incidence.
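
The abstract above compares models by MAPE and RMSE. As a reminder of what those two metrics measure, here is a minimal sketch using their standard definitions. The function names and the toy incidence values are hypothetical and are not data from the study.

```python
import math

def mape(actual, predicted):
    """Mean absolute percentage error, in percent.
    Assumes no actual value is zero."""
    return 100.0 * sum(abs((a - p) / a) for a, p in zip(actual, predicted)) / len(actual)

def rmse(actual, predicted):
    """Root mean square error, in the same units as the outcome."""
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))

# Hypothetical county-level incidence rates and model predictions.
actual = [50.0, 60.0, 80.0]
predicted = [45.0, 66.0, 80.0]

print(f"MAPE = {mape(actual, predicted):.2f}%")
print(f"RMSE = {rmse(actual, predicted):.2f}")
```

MAPE is scale-free, whereas RMSE penalizes large errors more heavily, which is one reason the two metrics can rank models differently.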

Alkhamis MA, Al Jarallah M, Attur S, Zubaid M. Interpretable machine learning models for predicting in-hospital and 30 days adverse events in acute coronary syndrome patients in Kuwait. Sci Rep 2024;14:1243. [PMID: 38216605 PMCID: PMC10786865 DOI: 10.1038/s41598-024-51604-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Accepted: 01/07/2024] [Indexed: 01/14/2024] Open

Bednorz A, Mak JKL, Jylhävä J, Religa D. Use of Electronic Medical Records (EMR) in Gerontology: Benefits, Considerations and a Promising Future. Clin Interv Aging 2023;18:2171-2183. [PMID: 38152074 PMCID: PMC10752027 DOI: 10.2147/cia.s400887] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Accepted: 11/05/2023] [Indexed: 12/29/2023] Open

Ma Q, Cheng C, Chen Y, Wang Q, Li B, Wang P. Effect and prediction of physical exercise and diet on blood pressure control in patients with hypertension. Medicine (Baltimore) 2023;102:e36612. [PMID: 38115342 PMCID: PMC10727525 DOI: 10.1097/md.0000000000036612] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Revised: 08/07/2023] [Accepted: 11/21/2023] [Indexed: 12/21/2023] Open

Gharbi-Meliani A, Husson F, Vandendriessche H, Bayen E, Yaffe K, Bachoud-Lévi AC, Cleret de Langavant L. Identification of high likelihood of dementia in population-based surveys using unsupervised clustering: a longitudinal analysis. Alzheimers Res Ther 2023;15:209. [PMID: 38031083 PMCID: PMC10688099 DOI: 10.1186/s13195-023-01357-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2023] [Accepted: 11/21/2023] [Indexed: 12/01/2023]

Abstract

BACKGROUND

Dementia is defined as a cognitive decline that affects functional status. Longitudinal ageing surveys often lack a clinical diagnosis of dementia, though they measure cognition and daily function over time. We used unsupervised machine learning and longitudinal data to identify transition to probable dementia.

METHODS

Multiple Factor Analysis was applied to longitudinal function and cognitive data of 15,278 baseline participants (aged 50 years and more) from the Survey of Health, Ageing, and Retirement in Europe (SHARE) (waves 1, 2 and 4-7, between 2004 and 2017). Hierarchical Clustering on Principal Components discriminated three clusters at each wave. We estimated probable or "Likely Dementia" prevalence by sex and age, and assessed whether dementia risk factors increased the risk of being assigned probable dementia status using multistate models. Next, we compared the "Likely Dementia" cluster with self-reported dementia status and replicated our findings in the English Longitudinal Study of Ageing (ELSA) cohort (waves 1-9, between 2002 and 2019, 7840 participants at baseline).

RESULTS

Our algorithm identified a higher number of probable dementia cases compared with self-reported cases and showed good discriminative power across all waves (AUC ranged from 0.754 [0.722-0.787] to 0.830 [0.800-0.861]). "Likely Dementia" status was more prevalent in older people, displayed a 2:1 female/male ratio, and was associated with nine factors that increased risk of transition to dementia: low education, hearing loss, hypertension, drinking, smoking, depression, social isolation, physical inactivity, diabetes, and obesity. Results were replicated in ELSA cohort with good accuracy.

CONCLUSIONS

Machine learning clustering can be used to study dementia determinants and outcomes in longitudinal population ageing surveys in which dementia clinical diagnosis is lacking.

Affiliation(s)

Amin Gharbi-Meliani Neuropsychologie Interventionnelle, U955 E01, Institut Mondor de Recherche Biomédicale & Département d'études Cognitives, INSERM, Ecole Normale Supérieure, Université PSL, Université Paris-Est Créteil, Creteil, 94000, France
François Husson Institut Agro, Univ Rennes1, CNRS, IRMAR, Rennes, 35000, France
Henri Vandendriessche Laboratoire de Neurosciences Cognitives et Computationnelles, Département d'études Cognitives, Ecole Normale Supérieure, Université PSL, INSERM, Paris, 75005, France
Eleonore Bayen Département de Rééducation Neurologique, Sorbonne Université, Hôpital Pitié-Salpêtrière-Assistance Publique Hôpitaux de Paris, Paris, 75013, France Global Brain Health Institute, University of California, San Francisco, CA, 94143, USA
Kristine Yaffe Global Brain Health Institute, University of California, San Francisco, CA, 94143, USA Departments of Psychiatry, Neurology and Epidemiology and Biostatistics, University of California, San Francisco, CA, 94143, USA
Anne-Catherine Bachoud-Lévi Neuropsychologie Interventionnelle, U955 E01, Institut Mondor de Recherche Biomédicale & Département d'études Cognitives, INSERM, Ecole Normale Supérieure, Université PSL, Université Paris-Est Créteil, Creteil, 94000, France Service de Neurologie, Centre de référence maladie de Huntington, Hôpital Henri Mondor, Assistance Publique Hôpitaux de Paris, 1 rue Gustave Eiffel, Creteil, 94000, France
Laurent Cleret de Langavant Neuropsychologie Interventionnelle, U955 E01, Institut Mondor de Recherche Biomédicale & Département d'études Cognitives, INSERM, Ecole Normale Supérieure, Université PSL, Université Paris-Est Créteil, Creteil, 94000, France. Global Brain Health Institute, University of California, San Francisco, CA, 94143, USA. Service de Neurologie, Centre de référence maladie de Huntington, Hôpital Henri Mondor, Assistance Publique Hôpitaux de Paris, 1 rue Gustave Eiffel, Creteil, 94000, France.

Li Q, Zheng JX, Jia TW, Feng XY, Lv C, Zhang LJ, Yang GJ, Xu J, Zhou XN. Optimized strategy for schistosomiasis elimination: results from marginal benefit modeling. Parasit Vectors 2023;16:419. [PMID: 37968661 PMCID: PMC10652544 DOI: 10.1186/s13071-023-06001-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Accepted: 10/06/2023] [Indexed: 11/17/2023] Open

Affiliation(s)

Qin Li National Institute of Parasitic Diseases, Chinese Center for Disease Control and Prevention (Chinese Center for Tropical Diseases Research), National Health Commission Key Laboratory of Parasite and Vector Biology, WHO Collaborating Centre for Tropical Diseases, National Center for International Research on Tropical Diseases, Shanghai, 200025, China
Jin-Xin Zheng Ruijin Hospital Affiliated to The Shanghai Jiao Tong University Medical School, Shanghai, 200025, China
Tie-Wu Jia National Institute of Parasitic Diseases, Chinese Center for Disease Control and Prevention (Chinese Center for Tropical Diseases Research), National Health Commission Key Laboratory of Parasite and Vector Biology, WHO Collaborating Centre for Tropical Diseases, National Center for International Research on Tropical Diseases, Shanghai, 200025, China
Xin-Yu Feng National Institute of Parasitic Diseases, Chinese Center for Disease Control and Prevention (Chinese Center for Tropical Diseases Research), National Health Commission Key Laboratory of Parasite and Vector Biology, WHO Collaborating Centre for Tropical Diseases, National Center for International Research on Tropical Diseases, Shanghai, 200025, China
Chao Lv National Institute of Parasitic Diseases, Chinese Center for Disease Control and Prevention (Chinese Center for Tropical Diseases Research), National Health Commission Key Laboratory of Parasite and Vector Biology, WHO Collaborating Centre for Tropical Diseases, National Center for International Research on Tropical Diseases, Shanghai, 200025, China School of Global Health, Chinese Center for Tropical Diseases Research and Shanghai Jiao Tong University School of Medicine, One Health Center, Shanghai Jiao Tong University and The Edinburgh University, Shanghai, 200025, China
Li-Juan Zhang National Institute of Parasitic Diseases, Chinese Center for Disease Control and Prevention (Chinese Center for Tropical Diseases Research), National Health Commission Key Laboratory of Parasite and Vector Biology, WHO Collaborating Centre for Tropical Diseases, National Center for International Research on Tropical Diseases, Shanghai, 200025, China
Guo-Jing Yang School of Tropical Medicine, Hainan Medical University, Haikou, 571199, China
Jing Xu National Institute of Parasitic Diseases, Chinese Center for Disease Control and Prevention (Chinese Center for Tropical Diseases Research), National Health Commission Key Laboratory of Parasite and Vector Biology, WHO Collaborating Centre for Tropical Diseases, National Center for International Research on Tropical Diseases, Shanghai, 200025, China
Xiao-Nong Zhou National Institute of Parasitic Diseases, Chinese Center for Disease Control and Prevention (Chinese Center for Tropical Diseases Research), National Health Commission Key Laboratory of Parasite and Vector Biology, WHO Collaborating Centre for Tropical Diseases, National Center for International Research on Tropical Diseases, Shanghai, 200025, China. School of Global Health, Chinese Center for Tropical Diseases Research and Shanghai Jiao Tong University School of Medicine, One Health Center, Shanghai Jiao Tong University and The Edinburgh University, Shanghai, 200025, China.

Breeze F, Hossain RR, Mayo M, McKelvie J. Predicting ophthalmic clinic non-attendance using machine learning: Development and validation of models using nationwide data. Clin Exp Ophthalmol 2023;51:764-774. [PMID: 37885379 DOI: 10.1111/ceo.14310] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Revised: 09/04/2023] [Accepted: 10/08/2023] [Indexed: 10/28/2023]

Ma X, Mo C, Li Y, Chen X, Gui C. Prediction of the development of contrast‑induced nephropathy following percutaneous coronary artery intervention by machine learning. Acta Cardiol 2023;78:912-921. [PMID: 37052397 DOI: 10.1080/00015385.2023.2198937] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Accepted: 03/30/2023] [Indexed: 04/14/2023]

Lotfata A, Moosazadeh M, Helbich M, Hoseini B. Socioeconomic and environmental determinants of asthma prevalence: a cross-sectional study at the U.S. County level using geographically weighted random forests. Int J Health Geogr 2023;22:18. [PMID: 37563691 PMCID: PMC10413687 DOI: 10.1186/s12942-023-00343-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2023] [Accepted: 08/04/2023] [Indexed: 08/12/2023] Open

Abstract

BACKGROUND

Some studies have established associations between the prevalence of new-onset asthma and asthma exacerbation and socioeconomic and environmental determinants. However, research remains limited concerning the shape of these associations, the importance of the risk factors, and how these factors vary geographically.

OBJECTIVE

We aimed (1) to examine ecological associations between asthma prevalence and multiple socio-physical determinants in the United States; and (2) to assess geographic variations in their relative importance.

METHODS

Our study design is cross-sectional, based on county-level data for 2020 across the United States. We obtained self-reported asthma prevalence data of adults aged 18 years or older for each county. We applied conventional and geographically weighted random forest (GWRF) models to investigate the associations between asthma prevalence and socioeconomic (e.g., poverty) and environmental determinants (e.g., air pollution and green space). To enhance the interpretability of the GWRF, we (1) assessed the shape of the associations through partial dependence plots, (2) ranked the determinants according to their global importance scores, and (3) mapped the local variable importance spatially.

RESULTS

Of the 3059 counties, the average asthma prevalence was 9.9 (standard deviation ± 0.99). The GWRF outperformed the conventional random forest. We found an indication, for example, that temperature was inversely associated with asthma prevalence, while poverty showed positive associations. The partial dependence plots showed that these associations had a non-linear shape. Ranking the socio-physical environmental factors concerning their global importance showed that smoking prevalence and depression prevalence were most relevant, while green space and limited language were of minor relevance. The local variable importance measures showed striking geographical differences.

CONCLUSION

Our findings strengthen the evidence that socio-physical environments play a role in explaining asthma prevalence, but their relevance seems to vary geographically. The results are vital for implementing future asthma prevention programs that should be tailor-made for specific areas.

Hamidi F, Gilani N, Arabi Belaghi R, Yaghoobi H, Babaei E, Sarbakhsh P, Malakouti J. Identifying potential circulating miRNA biomarkers for the diagnosis and prediction of ovarian cancer using machine-learning approach: application of Boruta. Front Digit Health 2023;5:1187578. [PMID: 37621964 PMCID: PMC10445490 DOI: 10.3389/fdgth.2023.1187578] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Accepted: 07/20/2023] [Indexed: 08/26/2023] Open

Fan P, Miranda O, Qi X, Kofler J, Sweet RA, Wang L. Unveiling the Enigma: Exploring Risk Factors and Mechanisms for Psychotic Symptoms in Alzheimer's Disease through Electronic Medical Records with Deep Learning Models. Pharmaceuticals (Basel) 2023;16:911. [PMID: 37513822 PMCID: PMC10385983 DOI: 10.3390/ph16070911] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Revised: 06/14/2023] [Accepted: 06/16/2023] [Indexed: 07/30/2023] Open

Brinch ML, Hald T, Wainaina L, Merlotti A, Remondini D, Henri C, Njage PMK. Comparison of Source Attribution Methodologies for Human Campylobacteriosis. Pathogens 2023;12:786. [PMID: 37375476 DOI: 10.3390/pathogens12060786] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Revised: 05/09/2023] [Accepted: 05/10/2023] [Indexed: 06/29/2023] Open

Ross RK, Keil AP, Cole SR, Edwards JK, Stringer JSA. A WARNING ABOUT USING PREDICTED VALUES TO ESTIMATE DESCRIPTIVE MEASURES. Am J Epidemiol 2023;192:840-843. [PMID: 36708231 PMCID: PMC10893853 DOI: 10.1093/aje/kwad020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Revised: 01/11/2023] [Accepted: 01/25/2023] [Indexed: 01/29/2023] Open

Liu Y, Zhuang Y, Yu L, Li Q, Zhao C, Meng R, Zhu J, Guo X. A Machine Learning Framework Based on Extreme Gradient Boosting to Predict the Occurrence and Development of Infectious Diseases in Laying Hen Farms, Taking H9N2 as an Example. Animals (Basel) 2023;13:1494. [PMID: 37174531 PMCID: PMC10177545 DOI: 10.3390/ani13091494] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2023] [Revised: 04/26/2023] [Accepted: 04/26/2023] [Indexed: 05/15/2023] Open

Data driven contagion risk management in low-income countries using machine learning applications with COVID-19 in South Asia. Sci Rep 2023;13:3732. [PMID: 36878910 PMCID: PMC9987367 DOI: 10.1038/s41598-023-30348-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Accepted: 02/21/2023] [Indexed: 03/08/2023] Open

Gharbi-Meliani A, Husson F, Vandendriessche H, Eleonore Bayen F, Yaffe K, Bachoud-Lévi AC, de Langavant LC. Identification of High Likelihood of Dementia in Population-Based Surveys using Unsupervised Clustering: a Longitudinal Analysis. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.02.17.23286078. [PMID: 36865284 PMCID: PMC9980227 DOI: 10.1101/2023.02.17.23286078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/25/2023]

Abstract

Background

Dementia is defined by cognitive decline that affects functional status. Longitudinal ageing surveys often lack a clinical diagnosis of dementia, though they measure cognition and function over time. We used unsupervised machine learning and longitudinal data to identify transition to probable dementia.

Methods

Findings

Our algorithm identified a higher number of probable dementia cases compared with self-reported cases and showed good discriminative power across all waves (AUC ranged from 0.754 [0.722-0.787] to 0.830 [0.800-0.861]). "Likely Dementia" status was more prevalent in older people, displayed a 2:1 female/male ratio and was associated with nine factors that increased risk of transition to dementia: low education, hearing loss, hypertension, drinking, smoking, depression, social isolation, physical inactivity, diabetes, and obesity. Results were replicated in ELSA cohort with good accuracy.

Interpretation

Machine learning clustering can be used to study dementia determinants and outcomes in longitudinal population ageing surveys in which dementia clinical diagnosis is lacking.

Affiliation(s)

Amin Gharbi-Meliani Equipe neuropsychologie interventionnelle, Institut Mondor de Recherche Biomédicale, Département d'études cognitives, Ecole normale supérieure, Université PSL, Université Paris-Est Créteil, AP-HP Hôpital Henri Mondor-Albert Chenevier, Centre de référence Maladie de Huntington et Service de Neurologie, INSERM, 75005 Paris [ou 94000 Créteil], France
François Husson Institut Agro, Univ Rennes1, CNRS, IRMAR, 35000, Rennes, France
Henri Vandendriessche Laboratoire de Neurosciences Cognitives et Computationnelles, Département d'études cognitives, Ecole normale supérieure, Université PSL, INSERM, 75005 Paris, France
France Eleonore Bayen Global Brain Health Institute, University of California, San Francisco, CA, United States; Sorbonne Université, Hôpital Pitié-Salpêtrière-Assistance Publique Hôpitaux de Paris, Département de Rééducation Neurologique, Paris, France
Kristine Yaffe Global Brain Health Institute, University of California, San Francisco, CA, United States; Departments of Psychiatry, Neurology and Epidemiology and Biostatistics, University of California, San Francisco
Anne-Catherine Bachoud-Lévi Equipe neuropsychologie interventionnelle, Institut Mondor de Recherche Biomédicale, Département d'études cognitives, Ecole normale supérieure, Université PSL, Université Paris-Est Créteil, AP-HP Hôpital Henri Mondor-Albert Chenevier, Centre de référence Maladie de Huntington et Service de Neurologie, INSERM, 75005 Paris [ou 94000 Créteil], France
Laurent Cleret de Langavant Equipe neuropsychologie interventionnelle, Institut Mondor de Recherche Biomédicale, Département d'études cognitives, Ecole normale supérieure, Université PSL, Université Paris-Est Créteil, AP-HP Hôpital Henri Mondor-Albert Chenevier, Centre de référence Maladie de Huntington et Service de Neurologie, INSERM, 75005 Paris [ou 94000 Créteil], France; Global Brain Health Institute, University of California, San Francisco, CA, United States

Parhofer KG, Anastassopoulou A, Calver H, Becker C, Rathore AS, Dave R, Zamfir C. Estimating Prevalence and Characteristics of Statin Intolerance among High and Very High Cardiovascular Risk Patients in Germany (2017 to 2020). J Clin Med 2023;12:jcm12020705. [PMID: 36675634 PMCID: PMC9864390 DOI: 10.3390/jcm12020705] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Revised: 01/06/2023] [Accepted: 01/08/2023] [Indexed: 01/18/2023] Open

Barboza LA, Chou-Chen SW, Vásquez P, García YE, Calvo JG, Hidalgo HG, Sanchez F. Assessing dengue fever risk in Costa Rica by using climate variables and machine learning techniques. PLoS Negl Trop Dis 2023;17:e0011047. [PMID: 36638136 PMCID: PMC9879398 DOI: 10.1371/journal.pntd.0011047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Revised: 01/26/2023] [Accepted: 12/20/2022] [Indexed: 01/14/2023] Open

Improving the Accuracy of Diabetes Diagnosis Applications through a Hybrid Feature Selection Algorithm. Neural Process Lett 2023;55:153-169. [PMID: 33814965 PMCID: PMC7997791 DOI: 10.1007/s11063-021-10491-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/09/2021] [Indexed: 01/20/2023]

Hayakawa T, Nagashima T, Akimoto H, Minagawa K, Takahashi Y, Asai S. Benzodiazepine-related dementia risks and protopathic biases revealed by multiple-kernel learning with electronic medical records. Digit Health 2023;9:20552076231178577. [PMID: 37312937 PMCID: PMC10259140 DOI: 10.1177/20552076231178577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Accepted: 05/06/2023] [Indexed: 06/15/2023] Open

Ikram M, Shaikh NF, Vishwanatha JK, Sambamoorthi U. Leading Predictors of COVID-19-Related Poor Mental Health in Adult Asian Indians: An Application of Extreme Gradient Boosting and Shapley Additive Explanations. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022;20:775. [PMID: 36613095 PMCID: PMC9819341 DOI: 10.3390/ijerph20010775] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Revised: 12/22/2022] [Accepted: 12/27/2022] [Indexed: 06/17/2023]

Kirk D, Kok E, Tufano M, Tekinerdogan B, Feskens EJM, Camps G. Machine Learning in Nutrition Research. Adv Nutr 2022;13:2573-2589. [PMID: 36166846 PMCID: PMC9776646 DOI: 10.1093/advances/nmac103] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2022] [Revised: 08/02/2022] [Accepted: 09/22/2022] [Indexed: 01/29/2023] Open

Wang J. Mathematical Models for Cholera Dynamics-A Review. Microorganisms 2022;10:microorganisms10122358. [PMID: 36557611 PMCID: PMC9783556 DOI: 10.3390/microorganisms10122358] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2022] [Revised: 11/27/2022] [Accepted: 11/28/2022] [Indexed: 11/30/2022] Open

Wu Y, Jia M, Xiang C, Fang Y. Latent trajectories of frailty and risk prediction models among geriatric community dwellers: an interpretable machine learning perspective. BMC Geriatr 2022;22:900. [PMID: 36434518 PMCID: PMC9700973 DOI: 10.1186/s12877-022-03576-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2022] [Accepted: 11/01/2022] [Indexed: 11/27/2022] Open

Leist AK, Klee M, Kim JH, Rehkopf DH, Bordas SPA, Muniz-Terrera G, Wade S. Mapping of machine learning approaches for description, prediction, and causal inference in the social and health sciences. SCIENCE ADVANCES 2022;8:eabk1942. [PMID: 36260666 PMCID: PMC9581488 DOI: 10.1126/sciadv.abk1942] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Accepted: 09/01/2022] [Indexed: 05/20/2023]

Zheng D, Hao X, Khan M, Wang L, Li F, Xiang N, Kang F, Hamalainen T, Cong F, Song K, Qiao C. Comparison of machine learning and logistic regression as predictive models for adverse maternal and neonatal outcomes of preeclampsia: A retrospective study. Front Cardiovasc Med 2022;9:959649. [PMID: 36312231 PMCID: PMC9596815 DOI: 10.3389/fcvm.2022.959649] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Accepted: 09/12/2022] [Indexed: 12/05/2022] Open

Abstract

Introduction

Preeclampsia, one of the leading causes of maternal and fetal morbidity and mortality, demands accurate predictive models because effective treatment is lacking. Predictive models based on machine learning algorithms show promising potential, but it remains controversial whether machine learning methods should be preferred over traditional statistical models.

Methods

We employed both logistic regression and six machine learning methods as binary predictive models for a dataset containing 733 women diagnosed with preeclampsia. Participants were grouped by four different pregnancy outcomes. After the imputation of missing values, statistical description and comparison were first conducted to explore the characteristics of the 73 documented variables. Subsequently, correlation analysis and feature selection were performed as preprocessing steps to filter contributing variables for developing models. The models were evaluated by multiple criteria.

Results

First, we found that the influential variables screened by the preprocessing steps did not overlap with those identified by statistical differences. Second, the most accurate imputation method was K-Nearest Neighbor, and the imputation process had little effect on the performance of the developed models. Finally, the performance of the models was investigated. The random forest classifier, multi-layer perceptron, and support vector machine demonstrated better discriminative power, as evaluated by the area under the receiver operating characteristic curve, while the decision tree classifier, random forest, and logistic regression yielded better calibration, as verified by the calibration curve.
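As a rough illustration of the K-Nearest Neighbor imputation mentioned above, the sketch below fills each missing value with the mean of that feature over the k most similar records. It is a minimal standard-library sketch, not the authors' pipeline: the function name `knn_impute`, the toy records, and the distance rescaling are all assumptions, and real clinical features would normally be standardized before distances are computed.

```python
import math

def knn_impute(rows, k=3):
    """Fill None entries with the mean of that feature over the k nearest rows."""
    def distance(a, b):
        # Root-mean-square difference over the features observed in both rows,
        # so that rows sharing few features are not favoured.
        shared = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
        if not shared:
            return math.inf
        return math.sqrt(sum((x - y) ** 2 for x, y in shared) / len(shared))

    imputed = [list(r) for r in rows]  # copy so the input stays untouched
    for i, row in enumerate(rows):
        for j, value in enumerate(row):
            if value is not None:
                continue
            # Donors: other rows where feature j is observed and a distance exists.
            donors = [(distance(row, other), other[j])
                      for m, other in enumerate(rows)
                      if m != i and other[j] is not None]
            donors = [d for d in donors if d[0] != math.inf]
            donors.sort(key=lambda d: d[0])
            nearest = donors[:k]
            if nearest:
                imputed[i][j] = sum(v for _, v in nearest) / len(nearest)
    return imputed

# Hypothetical toy records (not from the study):
# systolic blood pressure, gestational week, proteinuria in g/24 h.
records = [
    [150.0, 34.0, 1.2],
    [152.0, 33.0, None],
    [170.0, 30.0, 3.5],
    [149.0, 35.0, 1.0],
]
filled = knn_impute(records, k=2)
```

With k=2, the missing proteinuria value in the second record is taken from the first and fourth records, which are its closest neighbours on the two observed features.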

Conclusion

Machine learning algorithms can accomplish prediction modeling and demonstrate superior discrimination, while logistic regression can be calibrated well. Statistical analysis and machine learning are two scientific domains sharing similar themes. The predictive abilities of such models vary with the characteristics of the dataset, and larger sample sizes and more influential predictors are still needed to accumulate evidence.

Affiliation(s)

Dongying Zheng State Key Laboratory of Fine Chemicals, Dalian R&D Center for Stem Cell and Tissue Engineering, Dalian University of Technology, Dalian, China; Department of Obstetrics and Gynecology, Second Affiliated Hospital of Dalian Medical University, Dalian, China; Faculty of Information Technology, University of Jyvaskyla, Jyväskylä, Finland
Xinyu Hao Faculty of Information Technology, University of Jyvaskyla, Jyväskylä, Finland; School of Biomedical Engineering, Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian, China
Muhanmmad Khan Institute of Zoology, University of Punjab, Lahore, Pakistan
Lixia Wang Department of Obstetrics and Gynecology, Second Affiliated Hospital of Dalian Medical University, Dalian, China
Fan Li Department of Obstetrics and Gynecology, Shengjing Hospital, China Medical University, Shenyang, China
Ning Xiang Department of Obstetrics and Gynecology, Jingzhou Hospital Affiliated to Yangtze University, Jingzhou, China
Fuli Kang Department of Obstetrics and Gynecology, Second Affiliated Hospital of Dalian Medical University, Dalian, China
Timo Hamalainen Faculty of Information Technology, University of Jyvaskyla, Jyväskylä, Finland
Fengyu Cong Faculty of Information Technology, University of Jyvaskyla, Jyväskylä, Finland; School of Biomedical Engineering, Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian, China; School of Artificial Intelligence, Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian, China; Key Laboratory of Integrated Circuit and Biomedical Electronic System, Dalian University of Technology, Dalian, China
Kedong Song State Key Laboratory of Fine Chemicals, Dalian R&D Center for Stem Cell and Tissue Engineering, Dalian University of Technology, Dalian, China (corresponding author)
Chong Qiao Department of Obstetrics and Gynecology, Shengjing Hospital, China Medical University, Shenyang, China

Yoshihara A, Yoshimura Noh J, Inoue K, Taguchi J, Hata K, Aizawa T, Taira Arai Y, Watanabe N, Fukushita M, Matsumoto M, Suzuki N, Hoshiyama A, Suzuki A, Mitsumatsu T, Kinoshita A, Mikura K, Yoshimura R, Sugino K, Ito K. Prediction model of Graves' disease in general clinical practice based on complete blood count and biochemistry profile. Endocr J 2022;69:1091-1100. [PMID: 35387949 DOI: 10.1507/endocrj.ej21-0741] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/23/2022]

Ru B, Kujawski S, Lee Afanador N, Baumgartner R, Pawaskar M, Das A. Predicting Measles Outbreaks in the United States: Evaluation of Machine Learning Approaches (Preprint). JMIR Form Res 2022;7:e42832. [PMID: 37014694 PMCID: PMC10131820 DOI: 10.2196/42832] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/20/2022] [Revised: 01/24/2023] [Accepted: 02/07/2023] [Indexed: 02/10/2023]

Abstract

BACKGROUND

Measles, a highly contagious viral infection, is resurging in the United States, driven by international importation and declining domestic vaccination coverage. Despite this resurgence, measles outbreaks are still rare events that are difficult to predict. Improved methods to predict outbreaks at the county level would facilitate the optimal allocation of public health resources.

OBJECTIVE

We aimed to validate and compare extreme gradient boosting (XGBoost) and logistic regression, 2 supervised learning approaches, to predict the US counties most likely to experience measles cases. We also aimed to assess the performance of hybrid versions of these models that incorporated additional predictors generated by 2 clustering algorithms, hierarchical density-based spatial clustering of applications with noise (HDBSCAN) and unsupervised random forest (uRF).

METHODS

We constructed a supervised machine learning model based on XGBoost and unsupervised models based on HDBSCAN and uRF. The unsupervised models were used to investigate clustering patterns among counties with measles outbreaks; these clustering data were also incorporated into hybrid XGBoost models as additional input variables. The machine learning models were then compared to logistic regression models with and without input from the unsupervised models.

RESULTS

Both HDBSCAN and uRF identified clusters that included a high percentage of counties with measles outbreaks. XGBoost and XGBoost hybrid models outperformed logistic regression and logistic regression hybrid models, with area under the receiver operating characteristic curve values of 0.920-0.926 versus 0.900-0.908, area under the precision-recall curve values of 0.522-0.532 versus 0.485-0.513, and F₂ scores of 0.595-0.601 versus 0.385-0.426. Logistic regression and logistic regression hybrid models had higher sensitivity than XGBoost and XGBoost hybrid models (0.837-0.857 vs 0.704-0.735) but lower positive predictive value (0.122-0.141 vs 0.340-0.367) and specificity (0.793-0.821 vs 0.952-0.958). The hybrid versions of the logistic regression and XGBoost models had slightly higher areas under the precision-recall curve, specificity, and positive predictive values than the respective models that did not include any unsupervised features.
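The F₂ scores reported above can be related to sensitivity (recall) and positive predictive value (precision) through the standard F-beta formula, F_β = (1 + β²)·P·R / (β²·P + R). The sketch below applies it to the logistic regression figures. The helper name `f_beta` is an assumption, and pairing the lower bounds of the reported ranges with each other (and the upper bounds with each other) is also an assumption, since the abstract does not say which values belong to the same model.

```python
def f_beta(precision, recall, beta=2.0):
    """F-beta score; beta > 1 weights recall more heavily than precision."""
    b2 = beta ** 2
    denom = b2 * precision + recall
    if denom == 0:
        return 0.0
    return (1 + b2) * precision * recall / denom

# Logistic regression ranges quoted in the abstract (PPV, sensitivity).
# Pairing low with low and high with high is an illustrative assumption.
lr_low = f_beta(precision=0.122, recall=0.837)
lr_high = f_beta(precision=0.141, recall=0.857)
print(round(lr_low, 3), round(lr_high, 3))
```

Under that pairing, the two values fall close to the reported logistic regression F₂ range of 0.385-0.426, which shows how a model with high sensitivity but low positive predictive value can still score poorly on F₂.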

CONCLUSIONS

XGBoost provided more accurate predictions of measles cases at the county level compared with logistic regression. The threshold of prediction in this model can be adjusted to align with each county's resources, priorities, and risk for measles. While clustering pattern data from unsupervised machine learning approaches improved some aspects of model performance in this imbalanced data set, the optimal approach for the integration of such approaches with supervised machine learning models requires further investigation.

Russo V, Lallo E, Munnia A, Spedicato M, Messerini L, D’Aurizio R, Ceroni EG, Brunelli G, Galvano A, Russo A, Landini I, Nobili S, Ceppi M, Bruzzone M, Cianchi F, Staderini F, Roselli M, Riondino S, Ferroni P, Guadagni F, Mini E, Peluso M. Artificial Intelligence Predictive Models of Response to Cytotoxic Chemotherapy Alone or Combined to Targeted Therapy for Metastatic Colorectal Cancer Patients: A Systematic Review and Meta-Analysis. Cancers (Basel) 2022;14:4012. [PMID: 36011003 PMCID: PMC9406544 DOI: 10.3390/cancers14164012] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 06/28/2022] [Revised: 07/26/2022] [Accepted: 08/12/2022] [Indexed: 12/24/2022]

Abstract

Tailored treatments for metastatic colorectal cancer (mCRC) have not yet completely evolved due to the variety in response to drugs. Therefore, artificial intelligence has been recently used to develop prognostic and predictive models of treatment response (either activity/efficacy or toxicity) to aid in clinical decision making. In this systematic review, we have examined the ability of learning methods to predict response to chemotherapy alone or combined with targeted therapy in mCRC patients by targeting specific narrative publications in Medline up to April 2022 to identify appropriate original scientific articles. After the literature search, 26 original articles met inclusion and exclusion criteria and were included in the study. Our results show that all investigations conducted in this field have provided generally promising results in predicting the response to therapy or toxic side-effects. Using a meta-analytic approach, we found that the overall weighted means of the area under the receiver operating characteristic (ROC) curve (AUC) were 0.90, 95% C.I. 0.80-0.95 and 0.83, 95% C.I. 0.74-0.89 in training and validation sets, respectively, indicating a good classification performance in discriminating response vs. non-response. The calculation of overall HR indicates that learning models have a strong ability to predict improved survival. Lastly, the delta-radiomics and the 74 gene signatures were able to discriminate response vs. non-response by correctly identifying up to 99% of mCRC patients who were responders and up to 100% of patients who were non-responders. Specifically, when we evaluated the predictive models with tests reaching 80% sensitivity (SE) and 90% specificity (SP), the delta radiomics showed an SE of 99% and an SP of 94% in the training set and an SE of 85% and an SP of 92% in the test set, whereas for the 74 gene signatures the SE was 97.6% and the SP 100% in the training set.
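For readers less familiar with the SE and SP figures quoted above, the sketch below shows how sensitivity and specificity follow from a confusion matrix, with responders treated as the positive class. The counts are hypothetical and were chosen only so that the result matches the 85% and 92% test-set values; they are not taken from the review, and the function name is an assumption.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)  # share of true responders correctly identified
    specificity = tn / (tn + fp)  # share of true non-responders correctly identified
    return sensitivity, specificity

# Hypothetical counts: 100 responders and 100 non-responders.
se, sp = sensitivity_specificity(tp=85, fn=15, tn=92, fp=8)
print(f"SE = {se:.0%}, SP = {sp:.0%}")
```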

Affiliation(s)

Valentina Russo Research and Development Branch, Regional Cancer Prevention Laboratory, ISPRO-Study, Prevention and Oncology Network Institute, 50139 Florence, Italy
Eleonora Lallo Research and Development Branch, Regional Cancer Prevention Laboratory, ISPRO-Study, Prevention and Oncology Network Institute, 50139 Florence, Italy
Armelle Munnia Research and Development Branch, Regional Cancer Prevention Laboratory, ISPRO-Study, Prevention and Oncology Network Institute, 50139 Florence, Italy
Miriana Spedicato Research and Development Branch, Regional Cancer Prevention Laboratory, ISPRO-Study, Prevention and Oncology Network Institute, 50139 Florence, Italy
Luca Messerini Department of Experimental and Clinical Medicine, University of Florence, 50134 Florence, Italy
Romina D’Aurizio Institute of Informatics and Telematics, National Research Council, 56124 Pisa, Italy
Elia Giuseppe Ceroni Institute of Informatics and Telematics, National Research Council, 56124 Pisa, Italy
Giulia Brunelli Institute of Informatics and Telematics, National Research Council, 56124 Pisa, Italy
Antonio Galvano Department of Surgical, Oncological and Oral Sciences, University of Palermo, 90127 Palermo, Italy
Antonio Russo Department of Surgical, Oncological and Oral Sciences, University of Palermo, 90127 Palermo, Italy
Ida Landini Department of Health Sciences, University of Florence, 50139 Florence, Italy
Stefania Nobili Department of Neurosciences, Imaging and Clinical Sciences, “G. D’Annunzio” Chieti-Pescara, 66100 Chieti, Italy
Marcello Ceppi Clinical Epidemiology Unit, IRCCS-Ospedale Policlinico San Martino, 16131 Genova, Italy
Marco Bruzzone Clinical Epidemiology Unit, IRCCS-Ospedale Policlinico San Martino, 16131 Genova, Italy
Fabio Cianchi Department of Experimental and Clinical Medicine, University of Florence, 50134 Florence, Italy
Fabio Staderini Department of Experimental and Clinical Medicine, University of Florence, 50134 Florence, Italy
Mario Roselli Medical Oncology Unit, Department of Systems Medicine, Tor Vergata University, 00133 Rome, Italy
Silvia Riondino Medical Oncology Unit, Department of Systems Medicine, Tor Vergata University, 00133 Rome, Italy
Patrizia Ferroni BioBIM (InterInstitutional Multidisciplinary Biobank), IRCCS San Raffaele Roma, 00166 Rome, Italy; Department of Human Sciences & Quality of Life Promotion, San Raffaele Roma Open University, 00166 Rome, Italy
Fiorella Guadagni BioBIM (InterInstitutional Multidisciplinary Biobank), IRCCS San Raffaele Roma, 00166 Rome, Italy; Department of Human Sciences & Quality of Life Promotion, San Raffaele Roma Open University, 00166 Rome, Italy
Enrico Mini Department of Health Sciences, University of Florence, 50139 Florence, Italy
Marco Peluso Research and Development Branch, Regional Cancer Prevention Laboratory, ISPRO-Study, Prevention and Oncology Network Institute, 50139 Florence, Italy

Wang S, Wang W, Li X, Liu Y, Wei J, Zheng J, Wang Y, Ye B, Zhao R, Huang Y, Peng S, Zheng Y, Zeng Y. Using machine learning algorithms for predicting cognitive impairment and identifying modifiable factors among Chinese elderly people. Front Aging Neurosci 2022;14:977034. [PMID: 36034140 PMCID: PMC9407018 DOI: 10.3389/fnagi.2022.977034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/24/2022] [Accepted: 07/19/2022] [Indexed: 11/18/2022]

Abstract

Objectives

This study first aimed to explore the prediction of cognitive impairment at an early stage using a large population-based longitudinal survey of elderly Chinese people. The second aim was to identify reversible factors that may help slow the rate of decline in cognitive function over 3 years in the community.

Methods

We included 12,280 elderly people from four waves of the Chinese Longitudinal Healthy Longevity Survey (CLHLS), followed from 2002 to 2014. The Chinese version of the Mini-Mental State Examination (MMSE) was used to examine cognitive function. Six machine learning algorithms (including a neural network model) and an ensemble method were trained on data split 2/3 for training and 1/3 for testing. Parameters were explored in the training data using 3-fold cross-validation, and models were evaluated on the test data. Model performance was measured by area under the curve (AUC), sensitivity, and specificity. In addition, due to its better interpretability, logistic regression (LR) was used to assess the association of life behavior and its change with cognitive impairment after 3 years.

Results

Support vector machine and multi-layer perceptron were found to be the best performing algorithms, with AUCs of 0.8267 and 0.8256, respectively. Fusing the results of all six single models further improved the AUC to 0.8269. Playing more Mahjong or cards (OR = 0.49, 95% CI: 0.38-0.64), doing more garden work (OR = 0.54, 95% CI: 0.43-0.68), and watching TV or listening to the radio more (OR = 0.67, 95% CI: 0.59-0.77) were associated with decreased risk of cognitive impairment after 3 years.

Conclusions

Machine learning algorithms, especially the SVM and the ensemble model, can be leveraged to identify the elderly at risk of cognitive impairment. Doing more leisure activities, doing more gardening work, and engaging in more activities combined were associated with decreased risk of cognitive impairment.
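The odds ratios above come from logistic regression, where a coefficient β maps to OR = exp(β). One common way to obtain a 95% confidence interval is the Wald interval, exp(β ± 1.96·SE). The sketch below recovers β and an approximate standard error from the reported Mahjong/cards result (OR = 0.49, 95% CI: 0.38-0.64) and rebuilds the interval. The abstract does not state how the intervals were computed, so the Wald form is an assumption here, as are the function names.

```python
import math

Z_95 = 1.959963984540054  # standard normal quantile for a two-sided 95% interval

def odds_ratio_ci(beta, se, z=Z_95):
    """Odds ratio with Wald confidence limits from a logistic coefficient."""
    return math.exp(beta), math.exp(beta - z * se), math.exp(beta + z * se)

def se_from_ci(lower, upper, z=Z_95):
    """Back out the coefficient's standard error from reported OR limits."""
    return (math.log(upper) - math.log(lower)) / (2 * z)

# Reported values for playing more Mahjong or cards.
beta = math.log(0.49)        # implied coefficient, about -0.71
se = se_from_ci(0.38, 0.64)  # implied standard error, about 0.13
odds_ratio, lower, upper = odds_ratio_ci(beta, se)
print(round(odds_ratio, 2), round(lower, 2), round(upper, 2))
```

The rebuilt limits agree with the reported interval to two decimal places, which is consistent with, but does not prove, a Wald-type interval on the log-odds scale.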
