1
|
Ji W, Wang C, Chen H, Liang Y, Wang S. Predicting post-stroke cognitive impairment using machine learning: A prospective cohort study. J Stroke Cerebrovasc Dis 2023; 32:107354. [PMID: 37716104 DOI: 10.1016/j.jstrokecerebrovasdis.2023.107354] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Revised: 08/27/2023] [Accepted: 09/11/2023] [Indexed: 09/18/2023] Open
Abstract
BACKGROUND Post-stroke cognitive impairment (PSCI) is a serious complication of stroke that warrants prompt detection and management. Consequently, the development of a diagnostic prediction model holds clinical significance. OBJECTIVE Machine learning algorithms were employed to identify crucial variables and forecast PSCI occurrence within 3-6 months following acute ischemic stroke (AIS). METHODS A prospective study was conducted on a developed cohort (331 patients) utilizing data from the Affiliated Zhongda Hospital of Southeast University between January 2022 and August 2022, as well as an external validation cohort (66 patients) from December 2022 to January 2023. The optimal model was determined by integrating nine machine learning classification models, and personalized risk assessment was facilitated by a Shapley Additive exPlanations (SHAP) interpretation. RESULTS Age, education, baseline National Institutes of Health Scale (NIHSS), Cerebral white matter degeneration (CWMD), Homocysteine (Hcy), and C-reactive protein (CRP) were identified as predictors of PSCI occurrence. Gaussian Naïve Bayes (GNB) model was determined to be the optimal model, surpassing other classifier models in the validation set (area under the curve [AUC]: 0.925, 95 % confidence interval [CI]: 0.861 - 0.988) and achieving the lowest Brier score. The GNB model performed well in the test sets (AUC: 0.919, accuracy: 0.864, sensitivity: 0.818, and specificity: 0.932). CONCLUSIONS The present study involved the development of a GNB model and its elucidation through employment of the SHAP method. These findings provide compelling evidence for preventing PSCI, which could serve as a guide for high-risk patients to undertake appropriate preventive measures.
Collapse
Affiliation(s)
- Wencan Ji
- Nanjing Medical University, Nanjing, China; Jiangsu Research Center for Primary Health Development and General Practice Education, Jiangsu, China; Department of General Practice, Zhongda Hospital, Southeast University, Nanjing, China
| | - Canjun Wang
- Center of Clinical Laboratory Medicine, Zhongda Hospital, Southeast University, Nanjing, China
| | - Hanqing Chen
- Department of General Practice, Zhongda Hospital, Southeast University, Nanjing, China
| | - Yan Liang
- Department of General Practice, Zhongda Hospital, Southeast University, Nanjing, China
| | - Shaohua Wang
- Nanjing Medical University, Nanjing, China; Department of Endocrinology, Affiliated Zhongda Hospital of Southeast University, Nanjing, China.
| |
Collapse
|
2
|
Solanes A, Mezquida G, Janssen J, Amoretti S, Lobo A, González-Pinto A, Arango C, Vieta E, Castro-Fornieles J, Bergé D, Albacete A, Giné E, Parellada M, Bernardo M, Bioque M, Morén C, Pina-Camacho L, Díaz-Caneja CM, Zorrilla I, Corres EG, De-la-Camara C, Barcones F, Escarti MJ, Aguilar EJ, Legido T, Martin M, Verdolini N, Martinez-Aran A, Baeza I, de la Serna E, Contreras F, Bobes J, García-Portilla MP, Sanchez-Pastor L, Rodriguez-Jimenez R, Usall J, Butjosa A, Salgado-Pineda P, Salvador R, Pomarol-Clotet E, Radua J, PEPs group (collaborators). Combining MRI and clinical data to detect high relapse risk after the first episode of psychosis. SCHIZOPHRENIA 2022; 8:100. [PMID: 36396933 PMCID: PMC9672064 DOI: 10.1038/s41537-022-00309-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Accepted: 10/28/2022] [Indexed: 11/18/2022]
Abstract
AbstractDetecting patients at high relapse risk after the first episode of psychosis (HRR-FEP) could help the clinician adjust the preventive treatment. To develop a tool to detect patients at HRR using their baseline clinical and structural MRI, we followed 227 patients with FEP for 18–24 months and applied MRIPredict. We previously optimized the MRI-based machine-learning parameters (combining unmodulated and modulated gray and white matter and using voxel-based ensemble) in two independent datasets. Patients estimated to be at HRR-FEP showed a substantially increased risk of relapse (hazard ratio = 4.58, P < 0.05). Accuracy was poorer when we only used clinical or MRI data. We thus show the potential of combining clinical and MRI data to detect which individuals are more likely to relapse, who may benefit from increased frequency of visits, and which are unlikely, who may be currently receiving unnecessary prophylactic treatments. We also provide an updated version of the MRIPredict software.
Collapse
|
3
|
Niu Z, Wu X, Zhu Y, Yang L, Shi Y, Wang Y, Qiu H, Gu W, Wu Y, Long X, Lu Z, Hu S, Yao Z, Yang H, Liu T, Xia Y, Chen Z, Chen J, Fang Y. Early Diagnosis of Bipolar Disorder Coming Soon: Application of an Oxidative Stress Injury Biomarker (BIOS) Model. Neurosci Bull 2022; 38:979-991. [PMID: 35590012 PMCID: PMC9468206 DOI: 10.1007/s12264-022-00871-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Accepted: 02/10/2022] [Indexed: 11/26/2022] Open
Abstract
Early distinction of bipolar disorder (BD) from major depressive disorder (MDD) is difficult since no tools are available to estimate the risk of BD. In this study, we aimed to develop and validate a model of oxidative stress injury for predicting BD. Data were collected from 1252 BD and 1359 MDD patients, including 64 MDD patients identified as converting to BD from 2009 through 2018. 30 variables from a randomly-selected subsample of 1827 (70%) patients were used to develop the model, including age, sex, oxidative stress markers (uric acid, bilirubin, albumin, and prealbumin), sex hormones, cytokines, thyroid and liver function, and glycolipid metabolism. Univariate analyses and the Least Absolute Shrinkage and Selection Operator were applied for data dimension reduction and variable selection. Multivariable logistic regression was used to construct a model for predicting bipolar disorder by oxidative stress biomarkers (BIOS) on a nomogram. Internal validation was assessed in the remaining 784 patients (30%), and independent external validation was done with data from 3797 matched patients from five other hospitals in China. 10 predictors, mainly oxidative stress markers, were shown on the nomogram. The BIOS model showed good discrimination in the training sample, with an AUC of 75.1% (95% CI: 72.9%-77.3%), sensitivity of 0.66, and specificity of 0.73. The discrimination was good both in internal validation (AUC 72.1%, 68.6%-75.6%) and external validation (AUC 65.7%, 63.9%-67.5%). In this study, we developed a nomogram centered on oxidative stress injury, which could help in the individualized prediction of BD. For better real-world practice, a set of measurements, especially on oxidative stress markers, should be emphasized using big data in psychiatry.
Collapse
Affiliation(s)
- Zhiang Niu
- Clinical Research Center and Division of Mood Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, 200030, China
| | - Xiaohui Wu
- Clinical Research Center and Division of Mood Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, 200030, China
| | - Yuncheng Zhu
- Clinical Research Center and Division of Mood Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, 200030, China
- Division of Mood Disorders, Shanghai Hongkou Mental Health Center, Shanghai, 200083, China
| | - Lu Yang
- Clinical Research Center and Division of Mood Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, 200030, China
| | - Yifan Shi
- Clinical Research Center and Division of Mood Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, 200030, China
| | - Yun Wang
- Clinical Research Center and Division of Mood Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, 200030, China
| | - Hong Qiu
- Information and Statistics Department, Shanghai Mental Health Center, Shanghai, 200030, China
| | - Wenjie Gu
- Information and Statistics Department, Shanghai Mental Health Center, Shanghai, 200030, China
| | - Yina Wu
- Clinical Research Center and Division of Mood Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, 200030, China
| | - Xiangyun Long
- Department of Psychiatry, Tongji Hospital, Tongji University School of Medicine, Shanghai, 200333, China
| | - Zheng Lu
- Department of Psychiatry, Tongji Hospital, Tongji University School of Medicine, Shanghai, 200333, China
| | - Shaohua Hu
- Department of Psychiatry, First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, 310003, China
| | - Zhijian Yao
- Nanjing Brain Hospital, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, 210029, China
| | - Haichen Yang
- Shenzhen Mental Health Center, Shenzhen, 518003, China
| | - Tiebang Liu
- Shenzhen Mental Health Center, Shenzhen, 518003, China
| | - Yong Xia
- Affiliated Mental Health Center, Zhejiang University School of Medicine, Hangzhou Seventh People's Hospital, Hangzhou, 310013, China
| | - Zhiyu Chen
- Affiliated Mental Health Center, Zhejiang University School of Medicine, Hangzhou Seventh People's Hospital, Hangzhou, 310013, China
| | - Jun Chen
- Clinical Research Center and Division of Mood Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, 200030, China.
| | - Yiru Fang
- Clinical Research Center and Division of Mood Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, 200030, China.
- CAS Center for Excellence in Brain Science and Intelligence Technology, Shanghai, 200031, China.
- Shanghai Key Laboratory of Psychotic Disorders, Shanghai, 201108, China.
| |
Collapse
|
4
|
Zhu T, Jiang J, Hu Y, Zhang W. Individualized prediction of psychiatric readmissions for patients with major depressive disorder: a 10-year retrospective cohort study. Transl Psychiatry 2022; 12:170. [PMID: 35461305 PMCID: PMC9035153 DOI: 10.1038/s41398-022-01937-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Revised: 04/12/2022] [Accepted: 04/14/2022] [Indexed: 11/09/2022] Open
Abstract
Patients with major depressive disorder (MDD) are at high risk of psychiatric readmission while the factors associated with such adverse illness trajectories and the impact of the same factor at different follow-up times remain unclear. Based on machine learning (ML) approaches and real-world electronic medical records (EMR), we aimed to predict individual psychiatric readmission within 30, 60, 90, 180, and 365 days of an initial major depression hospitalization. In addition, we examined to what extent our prediction model could be made interpretable by quantifying and visualizing the features that drive the predictions at different follow-up times. By identifying 13,177 individuals discharged from a hospital located in western China between 2009 and 2018 with a recorded diagnosis of MDD, we established five prediction-modeling cohorts with different follow-up times. Four different ML models were trained with features extracted from the EMR, and explainable methods (SHAP and Break Down) were utilized to analyze the contribution of each of the features at both population-level and individual-level. The model showed a performance on the holdout testing dataset that decreased over follow-up time after discharge: AUC 0.814 (0.758-0.87) within 30 days, AUC 0.780 (0.728-0.833) within 60 days, AUC 0.798 (0.75-0.846) within 90 days, AUC 0.740 (0.687-0.794) within 180 days, and AUC 0.711 (0.676-0.747) within 365 days. Results add evidence that markers of depression severity and symptoms (recurrence of the symptoms, combination of key symptoms, the number of core symptoms and physical symptoms), along with age, gender, type of payment, length of stay, comorbidity, treatment patterns such as the use of anxiolytics, antipsychotics, antidepressants (especially Fluoxetine, Clonazepam, Olanzapine, and Alprazolam), physiotherapy, and psychotherapy, and vital signs like pulse and SBP, may improve prediction of psychiatric readmission. Some features can drive the prediction towards readmission at one follow-up time and towards non-readmission at another. Using such a model for decision support gives the clinician dynamic information of the patient's risk of psychiatric readmission and the specific features pulling towards readmission. This finding points to the potential of establishing personalized interventions that change with follow-up time.
Collapse
Affiliation(s)
- Ting Zhu
- grid.13291.380000 0001 0807 1581West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China ,grid.13291.380000 0001 0807 1581Med-X Center for Informatics, Sichuan University, Chengdu, China
| | - Jingwen Jiang
- grid.13291.380000 0001 0807 1581West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China ,grid.13291.380000 0001 0807 1581Med-X Center for Informatics, Sichuan University, Chengdu, China
| | - Yao Hu
- grid.13291.380000 0001 0807 1581West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China ,grid.13291.380000 0001 0807 1581Med-X Center for Informatics, Sichuan University, Chengdu, China
| | - Wei Zhang
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China. .,Med-X Center for Informatics, Sichuan University, Chengdu, China. .,Mental Health Center of West China Hospital, Sichuan University, Chengdu, China.
| |
Collapse
|
5
|
Kong S, Niu Z, Lyu D, Cui L, Wu X, Yang L, Qiu H, Gu W, Fang Y. Analysis of Seasonal Clinical Characteristics in Patients With Bipolar or Unipolar Depression. Front Psychiatry 2022; 13:847485. [PMID: 35463511 PMCID: PMC9019079 DOI: 10.3389/fpsyt.2022.847485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/02/2022] [Accepted: 03/15/2022] [Indexed: 11/13/2022] Open
Abstract
This study was to investigate the characteristics of seasonal symptoms and non-enzymatic oxidative stress in the first hospitalized patients with bipolar and unipolar depression, aiming to differentiate bipolar depression from unipolar depression and reduce their misdiagnosis. A total of 450 patients with bipolar depression and 855 patients with depression were included in the present study. According to the season when the patients were admitted to the hospital due to the acute onset of depression, they were further divided into spring, summer, autumn and winter groups. According to the characteristics of symptoms of bipolar disorder in the DSM-5, the characteristic symptoms of bipolar disorder were collected from the medical record information, and clinical biochemical indicators that can reflect the oxidative stress were also recorded. The seasonal risk factors in patients with bipolar or unipolar depression were analyzed. The relationship of age and gender with the bipolar or unipolar depression which attacked in winter was explored. There were significant differences between groups in the melancholic features, atypical features and conjugated bilirubin in spring. In summer, there were significant differences between groups in the melancholic features, uric acid and conjugated bilirubin. In autumn, there were marked differences between groups in melancholic features, anxiety and pain, atypical features, uric acid, total bilirubin, conjugated bilirubin and albumin. In winter, the conjugated bilirubin and prealbumin were significantly different between two groups. The melancholic features and uric acid that in summer as well as melancholic features, uric acid and total bilirubin in autumn were the seasonal independent risk factors for the unipolar depression as compared to bipolar depression. In winter, significant difference was noted in the age between two groups. In conclusion, compared with patients with unipolar depression, patients with bipolar depression have seasonal characteristics. Clinical symptoms and indicators of oxidative stress may become factors for the differentiation of seasonal unipolar depression from bipolar depression. Young subjects aged 15-35 years are more likely to develop bipolar depression in winter.
Collapse
Affiliation(s)
- Shuqi Kong
- Clinical Research Center and Division of Mood Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Zhiang Niu
- Clinical Research Center and Division of Mood Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Dongbin Lyu
- Clinical Research Center and Division of Mood Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Lvchun Cui
- Clinical Research Center and Division of Mood Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Xiaohui Wu
- Clinical Research Center and Division of Mood Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Lu Yang
- Clinical Research Center and Division of Mood Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Hong Qiu
- Information and Statistical Department, Shanghai Mental Health Center, Shanghai, China
| | - Wenjie Gu
- Information and Statistical Department, Shanghai Mental Health Center, Shanghai, China
| | - Yiru Fang
- Clinical Research Center and Division of Mood Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China.,Shanghai Key Laboratory of Psychotic Disorders, Shanghai, China.,CAS Center for Excellence in Brain Science and Intelligence Technology, Shanghai, China
| |
Collapse
|
6
|
Corsico P. "It's all about delivery": researchers and health professionals' views on the moral challenges of accessing neurobiological information in the context of psychosis. BMC Med Ethics 2021; 22:11. [PMID: 33557813 PMCID: PMC7869514 DOI: 10.1186/s12910-020-00551-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Accepted: 10/21/2020] [Indexed: 12/05/2022] Open
Abstract
Background The convergence of neuroscience, genomics, and data science holds promise to unveil the neurobiology of psychosis and to produce new ways of preventing, diagnosing, and treating psychotic illness. Yet, moral challenges arise in neurobiological research and in the clinical translation of research findings. This article investigates the views of relevant actors in mental health on the moral challenges of accessing neurobiological information in the context of psychosis. Methods Semi-structured individual interviews with two groups: researchers employed in the National Health Service (NHS) or a university in England (n = 14), and mental health professionals employed in NHS mental health services (n = 14). This article compares results in the two groups (total n = 28). Results This article presents findings around three conceptual areas: (1) research ethics as mostly unproblematic, (2) psychosis, neurobiological information, and mental health care, and (3) identity, relationships, and the future. These areas are drawn from the themes and topics that emerged in the interviews across the two groups of participants. Researchers and health professionals provided similar accounts of the moral challenges of accessing—which includes acquisition, communication, and use of—neurobiological information in the context of psychosis. Acquiring neurobiological information was perceived as mostly unproblematic, provided ethical safeguards are put in place. Conversely, participants argued that substantive moral challenges arise from how neurobiological information is delivered—that is, communicated and used—in research and in clinical care. Neurobiological information was seen as a powerful tool in the process through which individuals define their identity and establish personal and clinical goals. The pervasiveness of this narrative tool may influence researchers and health professionals’ perception of ethical principles and moral obligations. Conclusions This study suggests that the moral challenges that arise from accessing neurobiological information in the context of psychosis go beyond traditional research and clinical ethics concerns. Reflecting on how accessing neurobiological information can influence individual self-narratives will be vital to ensure the ethical translation of neuroscience and genomics into mental health. Trial registration The study did not involve a health care intervention on human participants. It was retrospectively registered on 11 July 2018, registration number: researchregistry4255.
Collapse
Affiliation(s)
- Paolo Corsico
- Centre for Social Ethics and Policy, Department of Law, School of Social Sciences, The University of Manchester, Williamson Building, Oxford Road, Manchester, M13 9PL, UK.
| |
Collapse
|
7
|
Corsico P. Psychosis, vulnerability, and the moral significance of biomedical innovation in psychiatry. Why ethicists should join efforts. MEDICINE, HEALTH CARE, AND PHILOSOPHY 2020; 23:269-279. [PMID: 31773383 PMCID: PMC7260249 DOI: 10.1007/s11019-019-09932-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
The study of the neuroscience and genomics of mental illness are increasingly intertwined. This is mostly due to the translation of medical technologies into psychiatry and to technological convergence. This article focuses on psychosis. I argue that the convergence of neuroscience and genomics in the context of psychosis is morally problematic, and that ethics scholarship should go beyond the identification of a number of ethical, legal, and social issues. My argument is composed of two strands. First, I argue that we should respond to technological convergence by developing an integrated, patient-centred approach focused on the assessment of individual vulnerabilities. Responding to technological convergence requires that we (i) integrate insights from several areas of ethics, (ii) translate bioethical principles into the mental health context, and (iii) proactively try to anticipate future ethical concerns. Second, I argue that a nuanced understanding of the concept of vulnerability might help us to accomplish this task. I borrow Florencia Luna's notion of 'layers of vulnerability' to show how potential harms or wrongs to individuals who experience psychosis can be conceptualised as stemming from different sources, or layers, of vulnerability. I argue that a layered notion of vulnerability might serve as a common ground to achieve the ethical integration needed to ensure that biomedical innovation can truly benefit, and not harm, individuals who suffer from psychosis.
Collapse
Affiliation(s)
- Paolo Corsico
- Centre for Social Ethics and Policy, Department of Law, School of Social Sciences, The University of Manchester, Williamson Building, Oxford Road, Manchester, M13 9PL, UK.
| |
Collapse
|
8
|
Schmidt A, Borgwardt S. Implementing MR Imaging into Clinical Routine Screening in Patients with Psychosis? Neuroimaging Clin N Am 2020; 30:65-72. [DOI: 10.1016/j.nic.2019.09.004] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
|
9
|
Weng SF, Vaz L, Qureshi N, Kai J. Prediction of premature all-cause mortality: A prospective general population cohort study comparing machine-learning and standard epidemiological approaches. PLoS One 2019; 14:e0214365. [PMID: 30917171 PMCID: PMC6436798 DOI: 10.1371/journal.pone.0214365] [Citation(s) in RCA: 63] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2018] [Accepted: 03/12/2019] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND Prognostic modelling using standard methods is well-established, particularly for predicting risk of single diseases. Machine-learning may offer potential to explore outcomes of even greater complexity, such as premature death. This study aimed to develop novel prediction algorithms using machine-learning, in addition to standard survival modelling, to predict premature all-cause mortality. METHODS A prospective population cohort of 502,628 participants aged 40-69 years were recruited to the UK Biobank from 2006-2010 and followed-up until 2016. Participants were assessed on a range of demographic, biometric, clinical and lifestyle factors. Mortality data by ICD-10 were obtained from linkage to Office of National Statistics. Models were developed using deep learning, random forest and Cox regression. Calibration was assessed by comparing observed to predicted risks; and discrimination by area under the 'receiver operating curve' (AUC). FINDINGS 14,418 deaths (2.9%) occurred over a total follow-up time of 3,508,454 person-years. A simple age and gender Cox model was the least predictive (AUC 0.689, 95% CI 0.681-0.699). A multivariate Cox regression model significantly improved discrimination by 6.2% (AUC 0.751, 95% CI 0.748-0.767). The application of machine-learning algorithms further improved discrimination by 3.2% using random forest (AUC 0.783, 95% CI 0.776-0.791) and 3.9% using deep learning (AUC 0.790, 95% CI 0.783-0.797). These ML algorithms improved discrimination by 9.4% and 10.1% respectively from a simple age and gender Cox regression model. Random forest and deep learning achieved similar levels of discrimination with no significant difference. Machine-learning algorithms were well-calibrated, while Cox regression models consistently over-predicted risk. CONCLUSIONS Machine-learning significantly improved accuracy of prediction of premature all-cause mortality in this middle-aged population, compared to standard methods. This study illustrates the value of machine-learning for risk prediction within a traditional epidemiological study design, and how this approach might be reported to assist scientific verification.
Collapse
Affiliation(s)
- Stephen F. Weng
- NIHR School for Primary Care Research, University of Nottingham, Nottingham, United Kingdom
- Primary Care Stratified Medicine (PRISM), Division of Primary Care, School of Medicine, University of Nottingham, Nottingham United Kingdom
| | - Luis Vaz
- NIHR School for Primary Care Research, University of Nottingham, Nottingham, United Kingdom
| | - Nadeem Qureshi
- NIHR School for Primary Care Research, University of Nottingham, Nottingham, United Kingdom
- Primary Care Stratified Medicine (PRISM), Division of Primary Care, School of Medicine, University of Nottingham, Nottingham United Kingdom
| | - Joe Kai
- NIHR School for Primary Care Research, University of Nottingham, Nottingham, United Kingdom
- Primary Care Stratified Medicine (PRISM), Division of Primary Care, School of Medicine, University of Nottingham, Nottingham United Kingdom
| |
Collapse
|
10
|
Pyenson B, Alston M, Gomberg J, Han F, Khandelwal N, Dei M, Son M, Vora J. Applying Machine Learning Techniques to Identify Undiagnosed Patients with Exocrine Pancreatic Insufficiency. JOURNAL OF HEALTH ECONOMICS AND OUTCOMES RESEARCH 2019; 6:32-46. [PMID: 32685578 PMCID: PMC7299452 DOI: 10.36469/9727] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
BACKGROUND Exocrine pancreatic insufficiency (EPI) is a serious condition characterized by a lack of functional exocrine pancreatic enzymes and the resultant inability to properly digest nutrients. EPI can be caused by a variety of disorders, including chronic pancreatitis, pancreatic cancer, and celiac disease. EPI remains underdiagnosed because of the nonspecific nature of clinical symptoms, lack of an ideal diagnostic test, and the inability to easily identify affected patients using administrative claims data. OBJECTIVES To develop a machine learning model that identifies patients in a commercial medical claims database who likely have EPI but are undiagnosed. METHODS A machine learning algorithm was developed in Scikit-learn, a Python module. The study population, selected from the 2014 Truven MarketScan® Commercial Claims Database, consisted of patients with EPI-prone conditions. Patients were labeled with 290 condition category flags and split into actual positive EPI cases, actual negative EPI cases, and unlabeled cases. The study population was then randomly divided into a training subset and a testing subset. The training subset was used to determine the performance metrics of 27 models and to select the highest performing model, and the testing subset was used to evaluate performance of the best machine learning model. RESULTS The study population consisted of 2088 actual positive EPI cases, 1077 actual negative EPI cases, and 437 530 unlabeled cases. In the best performing model, the precision, recall, and accuracy were 0.91, 0.80, and 0.86, respectively. The best-performing model estimated that the number of patients likely to have EPI was about 12 times the number of patients directly identified as EPI-positive through a claims analysis in the study population. The most important features in assigning EPI probability were the presence or absence of diagnosis codes related to pancreatic and digestive conditions. CONCLUSIONS Machine learning techniques demonstrated high predictive power in identifying patients with EPI and could facilitate an enhanced understanding of its etiology and help to identify patients for possible diagnosis and treatment.
Collapse
Affiliation(s)
| | | | | | - Feng Han
- Milliman, New York, NY, during study
| | | | | | | | | |
Collapse
|
11
|
Kellmeyer P. Big Brain Data: On the Responsible Use of Brain Data from Clinical and Consumer-Directed Neurotechnological Devices. NEUROETHICS-NETH 2018. [DOI: 10.1007/s12152-018-9371-x] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
AbstractThe focus of this paper are the ethical, legal and social challenges for ensuring the responsible use of “big brain data”—the recording, collection and analysis of individuals’ brain data on a large scale with clinical and consumer-directed neurotechnological devices. First, I highlight the benefits of big data and machine learning analytics in neuroscience for basic and translational research. Then, I describe some of the technological, social and psychological barriers for securing brain data from unwarranted access. In this context, I then examine ways in which safeguards at the hardware and software level, as well as increasing “data literacy” in society, may enhance the security of neurotechnological devices and protect the privacy of personal brain data. Regarding ethical and legal ramifications of big brain data, I first discuss effects on the autonomy, the sense of agency and authenticity, as well as the self that may result from the interaction between users and intelligent, particularly closed-loop, neurotechnological devices. I then discuss the impact of the “datafication” in basic and clinical neuroscience research on the just distribution of resources and access to these transformative technologies. In the legal realm, I examine possible legal consequences that arises from the increasing abilities to decode brain states and their corresponding subjective phenomenological experiences on the hitherto inaccessible privacy of these information. Finally, I discuss the implications of big brain data for national and international regulatory policies and models of good data governance.
Collapse
|
12
|
Muhlestein WE, Akagi DS, Kallos JA, Morone PJ, Weaver KD, Thompson RC, Chambless LB. Using a Guided Machine Learning Ensemble Model to Predict Discharge Disposition following Meningioma Resection. J Neurol Surg B Skull Base 2018; 79:123-130. [PMID: 29868316 PMCID: PMC5978858 DOI: 10.1055/s-0037-1604393] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2017] [Accepted: 06/14/2017] [Indexed: 12/14/2022] Open
Abstract
Objective Machine learning (ML) algorithms are powerful tools for predicting patient outcomes. This study pilots a novel approach to algorithm selection and model creation using prediction of discharge disposition following meningioma resection as a proof of concept. Materials and Methods A diversity of ML algorithms were trained on a single-institution database of meningioma patients to predict discharge disposition. Algorithms were ranked by predictive power and top performers were combined to create an ensemble model. The final ensemble was internally validated on never-before-seen data to demonstrate generalizability. The predictive power of the ensemble was compared with a logistic regression. Further analyses were performed to identify how important variables impact the ensemble. Results Our ensemble model predicted disposition significantly better than a logistic regression (area under the curve of 0.78 and 0.71, respectively, p = 0.01). Tumor size, presentation at the emergency department, body mass index, convexity location, and preoperative motor deficit most strongly influence the model, though the independent impact of individual variables is nuanced. Conclusion Using a novel ML technique, we built a guided ML ensemble model that predicts discharge destination following meningioma resection with greater predictive power than a logistic regression, and that provides greater clinical insight than a univariate analysis. These techniques can be extended to predict many other patient outcomes of interest.
Collapse
Affiliation(s)
- Whitney E. Muhlestein
- Department of Neurosurgery, Vanderbilt University Medical Center, Vanderbilt University, Nashville, Tennessee, United States
| | | | - Justiss A. Kallos
- Department of Neurosurgery, Vanderbilt University Medical Center, Vanderbilt University, Nashville, Tennessee, United States
| | - Peter J. Morone
- Department of Neurosurgery, Vanderbilt University Medical Center, Vanderbilt University, Nashville, Tennessee, United States
| | - Kyle D. Weaver
- Department of Neurosurgery, Vanderbilt University Medical Center, Vanderbilt University, Nashville, Tennessee, United States
| | - Reid C. Thompson
- Department of Neurosurgery, Vanderbilt University Medical Center, Vanderbilt University, Nashville, Tennessee, United States
| | - Lola B. Chambless
- Department of Neurosurgery, Vanderbilt University Medical Center, Vanderbilt University, Nashville, Tennessee, United States
| |
Collapse
|
13
|
Realising stratified psychiatry using multidimensional signatures and trajectories. J Transl Med 2017; 15:15. [PMID: 28100276 PMCID: PMC5241978 DOI: 10.1186/s12967-016-1116-1] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2016] [Accepted: 12/27/2016] [Indexed: 12/21/2022] Open
Abstract
Background
Stratified or personalised medicine targets treatments for groups of individuals with a disorder based on individual heterogeneity and shared factors that influence the likelihood of response. Psychiatry has traditionally defined diagnoses by constellations of co-occurring signs and symptoms that are assigned a categorical label (e.g. schizophrenia). Trial methodology in psychiatry has evaluated interventions targeted at these categorical entities, with diagnoses being equated to disorders. Recent insights into both the nosology and neurobiology of psychiatric disorder reveal that traditional categorical diagnoses cannot be equated with disorders. We argue that current quantitative methodology (1) inherits these categorical assumptions, (2) allows only for the discovery of average treatment response, (3) relies on composite outcome measures and (4) sacrifices valuable predictive information for stratified and personalised treatment in psychiatry. Methods and findings To achieve a truly ‘stratified psychiatry’ we propose and then operationalise two necessary steps: first, a formal multi-dimensional representation of disorder definition and clinical state, and second, the similar redefinition of outcomes as multidimensional constructs that can expose within- and between-patient differences in response. We use the categorical diagnosis of schizophrenia—conceptualised as a label for heterogeneous disorders—as a means of introducing operational definitions of stratified psychiatry using principles from multivariate analysis. We demonstrate this framework by application to the Clinical Antipsychotic Trials of Intervention Effectiveness dataset, showing heterogeneity in both patient clinical states and their trajectories after treatment that are lost in the traditional categorical approach with composite outcomes. We then systematically review a decade of registered clinical trials for cognitive deficits in schizophrenia highlighting existing assumptions of categorical diagnoses and aggregate outcomes while identifying a small number of trials that could be reanalysed using our proposal. Conclusion We describe quantitative methods for the development of a multi-dimensional model of clinical state, disorders and trajectories which practically realises stratified psychiatry. We highlight the potential for recovering existing trial data, the implications for stratified psychiatry in trial design and clinical treatment and finally, describe different kinds of probabilistic reasoning tools necessary to implement stratification.
Collapse
|