201
|
Kim JO(R, Jeong YS, Kim JH, Lee JW, Park D, Kim HS. Machine Learning-Based Cardiovascular Disease Prediction Model: A Cohort Study on the Korean National Health Insurance Service Health Screening Database. Diagnostics (Basel) 2021; 11:diagnostics11060943. [PMID: 34070504 PMCID: PMC8229422 DOI: 10.3390/diagnostics11060943] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Revised: 05/21/2021] [Accepted: 05/24/2021] [Indexed: 11/17/2022] Open
Abstract
Background: This study proposes a cardiovascular diseases (CVD) prediction model using machine learning (ML) algorithms based on the National Health Insurance Service-Health Screening datasets. Methods: We extracted 4699 patients aged over 45 as the CVD group, diagnosed according to the international classification of diseases system (I20–I25). In addition, 4699 random subjects without CVD diagnosis were enrolled as a non-CVD group. Both groups were matched by age and gender. Various ML algorithms were applied to perform CVD prediction; then, the performances of all the prediction models were compared. Results: The extreme gradient boosting, gradient boosting, and random forest algorithms exhibited the best average prediction accuracy (area under receiver operating characteristic curve (AUROC): 0.812, 0.812, and 0.811, respectively) among all algorithms validated in this study. Based on AUROC, the ML algorithms improved the CVD prediction performance, compared to previously proposed prediction models. Preexisting CVD history was the most important factor contributing to the accuracy of the prediction model, followed by total cholesterol, low-density lipoprotein cholesterol, waist-height ratio, and body mass index. Conclusions: Our results indicate that the proposed health screening dataset-based CVD prediction model using ML algorithms is readily applicable, produces validated results and outperforms the previous CVD prediction models.
Collapse
Affiliation(s)
- Joung Ouk (Ryan) Kim
- Department of AI and Big Data, Swiss School of Management, 6500 Bellinzona, Switzerland; (J.O.K.); (J.H.K.)
| | - Yong-Suk Jeong
- Department of Cardiology, Brain and Vascular Center, Pohang Stroke and Spine Hospital, Pohang 37659, Korea;
| | - Jin Ho Kim
- Department of AI and Big Data, Swiss School of Management, 6500 Bellinzona, Switzerland; (J.O.K.); (J.H.K.)
| | - Jong-Weon Lee
- Department of Physical Medicine and Rehabilitation, National Health Insurance Service Ilsan Hospital, Goyang 10444, Korea;
| | - Dougho Park
- Department of Rehabilitation Medicine, Brain and Vascular Center, Pohang Stroke and Spine Hospital, Pohang 37659, Korea
- Correspondence: (D.P.); (H.-S.K.)
| | - Hyoung-Seop Kim
- Department of Physical Medicine and Rehabilitation, National Health Insurance Service Ilsan Hospital, Goyang 10444, Korea;
- Correspondence: (D.P.); (H.-S.K.)
| |
Collapse
|
202
|
Ocasio E, Duong TQ. Deep learning prediction of mild cognitive impairment conversion to Alzheimer's disease at 3 years after diagnosis using longitudinal and whole-brain 3D MRI. PeerJ Comput Sci 2021; 7:e560. [PMID: 34141888 PMCID: PMC8176545 DOI: 10.7717/peerj-cs.560] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2020] [Accepted: 05/03/2021] [Indexed: 06/12/2023]
Abstract
BACKGROUND While there is no cure for Alzheimer's disease (AD), early diagnosis and accurate prognosis of AD may enable or encourage lifestyle changes, neurocognitive enrichment, and interventions to slow the rate of cognitive decline. The goal of our study was to develop and evaluate a novel deep learning algorithm to predict mild cognitive impairment (MCI) to AD conversion at three years after diagnosis using longitudinal and whole-brain 3D MRI. METHODS This retrospective study consisted of 320 normal cognition (NC), 554 MCI, and 237 AD patients. Longitudinal data include T1-weighted 3D MRI obtained at initial presentation with diagnosis of MCI and at 12-month follow up. Whole-brain 3D MRI volumes were used without a priori segmentation of regional structural volumes or cortical thicknesses. MRIs of the AD and NC cohort were used to train a deep learning classification model to obtain weights to be applied via transfer learning for prediction of MCI patient conversion to AD at three years post-diagnosis. Two (zero-shot and fine tuning) transfer learning methods were evaluated. Three different convolutional neural network (CNN) architectures (sequential, residual bottleneck, and wide residual) were compared. Data were split into 75% and 25% for training and testing, respectively, with 4-fold cross validation. Prediction accuracy was evaluated using balanced accuracy. Heatmaps were generated. RESULTS The sequential convolutional approach yielded slightly better performance than the residual-based architecture, the zero-shot transfer learning approach yielded better performance than fine tuning, and CNN using longitudinal data performed better than CNN using a single timepoint MRI in predicting MCI conversion to AD. The best CNN model for predicting MCI conversion to AD at three years after diagnosis yielded a balanced accuracy of 0.793. Heatmaps of the prediction model showed regions most relevant to the network including the lateral ventricles, periventricular white matter and cortical gray matter. CONCLUSIONS This is the first convolutional neural network model using longitudinal and whole-brain 3D MRIs without extracting regional brain volumes or cortical thicknesses to predict future MCI to AD conversion at 3 years after diagnosis. This approach could lead to early prediction of patients who are likely to progress to AD and thus may lead to better management of the disease.
Collapse
Affiliation(s)
- Ethan Ocasio
- Department of Radiology, Montefiore Medical Center, Albert Einstein College of Medicine, Bronx, NY, United States of America
| | - Tim Q. Duong
- Department of Radiology, Montefiore Medical Center, Albert Einstein College of Medicine, Bronx, NY, United States of America
| |
Collapse
|
203
|
Abdulkareem M, Petersen SE. The Promise of AI in Detection, Diagnosis, and Epidemiology for Combating COVID-19: Beyond the Hype. Front Artif Intell 2021; 4:652669. [PMID: 34056579 PMCID: PMC8160471 DOI: 10.3389/frai.2021.652669] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2021] [Accepted: 04/13/2021] [Indexed: 12/24/2022] Open
Abstract
COVID-19 has created enormous suffering, affecting lives, and causing deaths. The ease with which this type of coronavirus can spread has exposed weaknesses of many healthcare systems around the world. Since its emergence, many governments, research communities, commercial enterprises, and other institutions and stakeholders around the world have been fighting in various ways to curb the spread of the disease. Science and technology have helped in the implementation of policies of many governments that are directed toward mitigating the impacts of the pandemic and in diagnosing and providing care for the disease. Recent technological tools, artificial intelligence (AI) tools in particular, have also been explored to track the spread of the coronavirus, identify patients with high mortality risk and diagnose patients for the disease. In this paper, areas where AI techniques are being used in the detection, diagnosis and epidemiological predictions, forecasting and social control for combating COVID-19 are discussed, highlighting areas of successful applications and underscoring issues that need to be addressed to achieve significant progress in battling COVID-19 and future pandemics. Several AI systems have been developed for diagnosing COVID-19 using medical imaging modalities such as chest CT and X-ray images. These AI systems mainly differ in their choices of the algorithms for image segmentation, classification and disease diagnosis. Other AI-based systems have focused on predicting mortality rate, long-term patient hospitalization and patient outcomes for COVID-19. AI has huge potential in the battle against the COVID-19 pandemic but successful practical deployments of these AI-based tools have so far been limited due to challenges such as limited data accessibility, the need for external evaluation of AI models, the lack of awareness of AI experts of the regulatory landscape governing the deployment of AI tools in healthcare, the need for clinicians and other experts to work with AI experts in a multidisciplinary context and the need to address public concerns over data collection, privacy, and protection. Having a dedicated team with expertise in medical data collection, privacy, access and sharing, using federated learning whereby AI scientists hand over training algorithms to the healthcare institutions to train models locally, and taking full advantage of biomedical data stored in biobanks can alleviate some of problems posed by these challenges. Addressing these challenges will ultimately accelerate the translation of AI research into practical and useful solutions for combating pandemics.
Collapse
Affiliation(s)
- Musa Abdulkareem
- Barts Heart Centre, Barts Health National Health Service (NHS) Trust, London, United Kingdom
- National Institute for Health Research (NIHR) Barts Biomedical Research Centre, William Harvey Research Institute, Queen Mary University of London, London, United Kingdom
- Health Data Research UK, London, United Kingdom
| | - Steffen E. Petersen
- Barts Heart Centre, Barts Health National Health Service (NHS) Trust, London, United Kingdom
- National Institute for Health Research (NIHR) Barts Biomedical Research Centre, William Harvey Research Institute, Queen Mary University of London, London, United Kingdom
- Health Data Research UK, London, United Kingdom
- The Alan Turing Institute, London, United Kingdom
| |
Collapse
|
204
|
Fahmy AS, Rowin EJ, Manning WJ, Maron MS, Nezafat R. Machine Learning for Predicting Heart Failure Progression in Hypertrophic Cardiomyopathy. Front Cardiovasc Med 2021; 8:647857. [PMID: 34055932 PMCID: PMC8155292 DOI: 10.3389/fcvm.2021.647857] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2020] [Accepted: 03/29/2021] [Indexed: 11/25/2022] Open
Abstract
Background: Development of advanced heart failure (HF) symptoms is the most common adverse pathway in hypertrophic cardiomyopathy (HCM) patients. Currently, there is a limited ability to identify HCM patients at risk of HF. Objectives: In this study, we present a machine learning (ML)-based model to identify individual HCM patients who are at high risk of developing advanced HF symptoms. Methods: From a consecutive cohort of HCM patients evaluated at the Tufts HCM Institute from 2001 to 2018, we extracted a set of 64 potential risk factors measured at baseline. Only patients with New York Heart Association (NYHA) functional class I/II and LV ejection fraction (LVEF) by echocardiography >35% were included. The study cohort (n = 1,427 patients) was split into three disjoint subsets: development (50%), model selection (10%), and independent validation (40%). The least absolute shrinkage and selection operator was used to select the most influential clinical variables. An ensemble of ML classifiers, including logistic regression, was used to identify patients with high risk of developing a HF outcome. Study outcomes were defined as progression to NYHA class III/IV, drop in LVEF below 35%, septal reduction procedure, and/or heart transplantation. Results: During a mean follow-up of 4.7 ± 3.7 years, advanced HF occurred in 283 (20% out of 1,427) patients. The model features included patients' sex, NYHA class (I or II), HCM type (i.e., obstructive or not), LV wall thickness, LVEF, presence of HF symptoms (e.g., dyspnea, presyncope), comorbidities (atrial fibrillation, hypertension, mitral regurgitation, and systolic anterior motion), and type of cardiac medications. The developed risk stratification model showed strong differentiation power to identify patients at advanced HF risk in the testing dataset (c-statistics = 0.81; 95% confidence interval [CI]: 0.76, 0.86). The model allowed correct identification of high-risk patients with accuracy 74% (CI: 0.70, 0.78), sensitivity 80% (CI: 0.77, 0.83), and specificity 72% (CI: 0.68, 0.76). The model performance was comparable among different sex and age groups. Conclusions: A 5-year risk prediction of progressive HF in HCM patients can be accurately estimated using ML analysis of patients' clinical and imaging parameters. A set of 17 clinical and imaging variables were identified as the most important predictors of progressive HF in HCM.
Collapse
Affiliation(s)
- Ahmed S Fahmy
- Cardiovascular Division, Department of Medicine, Beth Israel Deaconess Medical Center and Harvard Medical School, Boston, MA, United States
| | - Ethan J Rowin
- Division of Cardiology, Hypertrophic Cardiomyopathy Center, Tufts Medical Center, Boston, MA, United States
| | - Warren J Manning
- Cardiovascular Division, Department of Medicine, Beth Israel Deaconess Medical Center and Harvard Medical School, Boston, MA, United States.,Department of Radiology, Beth Israel Deaconess Medical Center and Harvard Medical School, Boston, MA, United States
| | - Martin S Maron
- Division of Cardiology, Hypertrophic Cardiomyopathy Center, Tufts Medical Center, Boston, MA, United States
| | - Reza Nezafat
- Cardiovascular Division, Department of Medicine, Beth Israel Deaconess Medical Center and Harvard Medical School, Boston, MA, United States
| |
Collapse
|
205
|
Shah AA, Devana SK, Lee C, Kianian R, van der Schaar M, SooHoo NF. Development of a Novel, Potentially Universal Machine Learning Algorithm for Prediction of Complications After Total Hip Arthroplasty. J Arthroplasty 2021; 36:1655-1662.e1. [PMID: 33478891 PMCID: PMC10371358 DOI: 10.1016/j.arth.2020.12.040] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Revised: 11/19/2020] [Accepted: 12/22/2020] [Indexed: 02/02/2023] Open
Abstract
BACKGROUND As the prevalence of hip osteoarthritis increases, the number of total hip arthroplasty (THA) procedures performed is also projected to increase. Accurately risk-stratifying patients who undergo THA would be of great utility, given the significant cost and morbidity associated with developing perioperative complications. We aim to develop a novel machine learning (ML)-based ensemble algorithm for the prediction of major complications after THA, as well as compare its performance against standard benchmark ML methods. METHODS This is a retrospective cohort study of 89,986 adults who underwent primary THA at any California-licensed hospital between 2015 and 2017. The primary outcome was major complications (eg infection, venous thromboembolism, cardiac complication, pulmonary complication). We developed a model predicting complication risk using AutoPrognosis, an automated ML framework that configures the optimally performing ensemble of ML-based prognostic models. We compared our model with logistic regression and standard benchmark ML models, assessing discrimination and calibration. RESULTS There were 545 patients who had major complications (0.61%). Our novel algorithm was well-calibrated and improved risk prediction compared to logistic regression, as well as outperformed the other four standard benchmark ML algorithms. The variables most important for AutoPrognosis (eg malnutrition, dementia, cancer) differ from those that are most important for logistic regression (eg chronic atherosclerosis, renal failure, chronic obstructive pulmonary disease). CONCLUSION We report a novel ensemble ML algorithm for the prediction of major complications after THA. It demonstrates superior risk prediction compared to logistic regression and other standard ML benchmark algorithms. By providing accurate prognostic information, this algorithm may facilitate more informed preoperative shared decision-making.
Collapse
Affiliation(s)
- Akash A Shah
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, CA
| | - Sai K Devana
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, CA
| | - Changhee Lee
- Department of Electrical and Computer Engineering, University of California, Los Angeles, CA
| | - Reza Kianian
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, CA
| | - Mihaela van der Schaar
- Department of Electrical and Computer Engineering, University of California, Los Angeles, CA; Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, United Kingdom
| | - Nelson F SooHoo
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, CA
| |
Collapse
|
206
|
Nawaz MS, Shoaib B, Ashraf MA. Intelligent Cardiovascular Disease Prediction Empowered with Gradient Descent Optimization. Heliyon 2021; 7:e06948. [PMID: 34013084 PMCID: PMC8113842 DOI: 10.1016/j.heliyon.2021.e06948] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2020] [Revised: 12/17/2020] [Accepted: 04/26/2021] [Indexed: 11/17/2022] Open
Abstract
Disorders of the heart and blood vessels are named cardiovascular disease. 'The heart's proper functionality is of an utmost necessity for the survival of life. The death rate due to heart disease, has been increased rapidly. Cardiovascular illness is believed the deadliest cause of death across the globe. From the facts and figures shared by the WHO (World Health Organization) 17.9 Million human lost their lives due to cardiovascular diseases. This research is carried out for the effective diagnosis of heart disease using the heart disease dataset available on the UCI Machine Repository. Heart disease diagnosis with an optimization algorithm can be fruitful in terms of higher accuracy and sensitivity. Finding an acceptable optimal solution among multiple solutions for a specific problem is known as optimization. Different machine learning algorithms have been applied as Support Machine Vector (SVM), K-Nearest Neighbor (KNN), Naïve Bayes (NB), Artificial Neural Network (ANN), Random Forest (RF), and Gradient Descent Optimization (GDO). Intelligent Cardiovascular Disease Prediction Empowered with Gradient Descent Optimization model produces the optimal results among under consideration classification algorithms. 98.54 % accuracy has been achieved by the GDO based model while performance evaluation it. 99.43% sensitivity (recall) and 97.76% precision have also been recorded. From the prediction results of the system, it's satisfactory to utilize it for cardiovascular disease diagnosis. The proposed system will be helpful for the analysis of cardiovascular disease.
Collapse
Affiliation(s)
- Muhammad Saqib Nawaz
- Department of Computer Science, Minhaj University Lahore, Lahore, 54000, Pakistan
| | - Bilal Shoaib
- Department of Computer Science, Minhaj University Lahore, Lahore, 54000, Pakistan
| | - Muhammad Adeel Ashraf
- Department of Computer Science, School of Systems and Technology, University of Management & Technology, Lahore, 54000, Pakistan
| |
Collapse
|
207
|
Migueles JH, Aadland E, Andersen LB, Brønd JC, Chastin SF, Hansen BH, Konstabel K, Kvalheim OM, McGregor DE, Rowlands AV, Sabia S, van Hees VT, Walmsley R, Ortega FB. GRANADA consensus on analytical approaches to assess associations with accelerometer-determined physical behaviours (physical activity, sedentary behaviour and sleep) in epidemiological studies. Br J Sports Med 2021; 56:376-384. [PMID: 33846158 PMCID: PMC8938657 DOI: 10.1136/bjsports-2020-103604] [Citation(s) in RCA: 61] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/20/2021] [Indexed: 02/06/2023]
Abstract
The inter-relationship between physical activity, sedentary behaviour and sleep (collectively defined as physical behaviours) is of interest to researchers from different fields. Each of these physical behaviours has been investigated in epidemiological studies, yet their codependency and interactions need to be further explored and accounted for in data analysis. Modern accelerometers capture continuous movement through the day, which presents the challenge of how to best use the richness of these data. In recent years, analytical approaches first applied in other scientific fields have been applied to physical behaviour epidemiology (eg, isotemporal substitution models, compositional data analysis, multivariate pattern analysis, functional data analysis and machine learning). A comprehensive description, discussion, and consensus on the strengths and limitations of these analytical approaches will help researchers decide which approach to use in different situations. In this context, a scientific workshop and meeting were held in Granada to discuss: (1) analytical approaches currently used in the scientific literature on physical behaviour, highlighting strengths and limitations, providing practical recommendations on their use and including a decision tree for assisting researchers’ decision-making; and (2) current gaps and future research directions around the analysis and use of accelerometer data. Advances in analytical approaches to accelerometer-determined physical behaviours in epidemiological studies are expected to influence the interpretation of current and future evidence, and ultimately impact on future physical behaviour guidelines.
Collapse
Affiliation(s)
- Jairo H Migueles
- PROFITH "PROmoting FITness and Health through physical activity" Research Group, Sport and Health University Research Institute (iMUDS), Department of Physical Education and Sports, Faculty of Sport Sciences, University of Granada, Granada, Spain .,Department of Health, Medicine and Caring Sciences, Linköping University, Linköping, Sweden
| | - Eivind Aadland
- Faculty of Education, Arts and Sports, Western Norway University of Applied Sciences, Sogndal, Norway
| | - Lars Bo Andersen
- Faculty of Education, Arts and Sports, Western Norway University of Applied Sciences, Sogndal, Norway
| | - Jan Christian Brønd
- Department of Sport Science and Biomechanics, University of Southern Denmark, Odense, Denmark
| | - Sebastien F Chastin
- School of Health and Life Science, Glasgow Caledonian University, Glasgow, UK.,Department of Movement and Sport Science, Ghent University, Ghent, Belgium
| | - Bjørge H Hansen
- Department of Sports Medicine, Norwegian School of Sport Sciences, Osloål, Norway.,Departement of Sport Science and Physical Education, University of Agder, Kristiansand, Norway
| | - Kenn Konstabel
- Department of Chronic Diseases, National Institute for Health Development, Tallinn, Estonia.,School of Natural Sciences and Health, Tallinn University, Tallinn, Estonia.,Institute of Psychology, University of Tartu, Tartu, Estonia
| | | | - Duncan E McGregor
- School of Health and Life Science, Glasgow Caledonian University, Glasgow, UK.,Biomathematics and Statistics Scotland, Edinburgh, UK
| | - Alex V Rowlands
- Diabetes Research Centre, University of Leicester, Leicester General Hospital, Leicester, UK.,NIHR Leicester Biomedical Research Centre, Leicester General Hospital, Leicester, UK.,Alliance for Research in Exercise, Nutrition and Activity (ARENA), Sansom Institute for Health Research, Division of Health Sciences, University of South Australia, Adelaide, SA, Australia
| | - Séverine Sabia
- Université de Paris, Inserm U1153, Epidemiology of Ageing and Neurodegenerative diseases, Paris, France.,Department of Epidemiology and Public Health, University College London, London, UK
| | - Vincent T van Hees
- Accelting, Almere, The Netherlands.,Amsterdam UMC, Vrije Universiteit Amsterdam, Department of Public and Occupational Health, Amsterdam Public Health research institute, Amsterdam, The Netherlands
| | - Rosemary Walmsley
- Nuffield Department of Population Health, University of Oxford, Oxford, UK.,Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK
| | - Francisco B Ortega
- PROFITH "PROmoting FITness and Health through physical activity" Research Group, Sport and Health University Research Institute (iMUDS), Department of Physical Education and Sports, Faculty of Sport Sciences, University of Granada, Granada, Spain .,Department of Biosciences and Nutrition, Karolinska Institutet, Huddinge, Sweden
| | | |
Collapse
|
208
|
From evidence to practice: development of web-based Dutch lipid reference values. Neth Heart J 2021; 29:441-450. [PMID: 33844162 PMCID: PMC8397806 DOI: 10.1007/s12471-021-01562-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/09/2021] [Indexed: 12/03/2022] Open
Abstract
Introduction In the Netherlands, the total number of yearly measured lipid profiles exceeds 500,000. While lipid values are strongly affected by age and sex, until recently, no up-to-date age- and sex-specific lipid reference values were available. We describe the translation of big-cohort lipid data into accessible reference values, which can be easily incorporated in daily clinical practice. Methods Lipid values (total cholesterol, LDL cholesterol, HDL cholesterol and triglycerides) from all healthy adults and children in the LifeLines cohort were used to generate age- and sex-specific percentiles. A combination of RStudio, Cascading Style Sheets and HyperText Markup Language was used to interactively display the percentiles in a responsive web layout. Results After exclusion of subjects reporting cardiovascular disease or lipid-lowering therapy at baseline, 141,611 subjects were included. On the website, input fields were created for age, sex and all main plasma lipids. Upon input of these values, corresponding percentiles are calculated, and output is displayed in a table and an interactive graph for each lipid. The website has been made available in both Dutch and English and can be accessed at www.lipidtools.com. Conclusion We constructed the first searchable, national lipid reference value tool with graphical display in the Netherlands to use in screening for dyslipidaemias and to reduce the underuse of lipid-lowering therapy in Dutch primary prevention. This study illustrates that data collected in big-cohort studies can be made easily accessible with modern digital techniques and preludes the digital health revolution yet to come.
Collapse
|
209
|
Banerjee A, Chen S, Fatemifar G, Zeina M, Lumbers RT, Mielke J, Gill S, Kotecha D, Freitag DF, Denaxas S, Hemingway H. Machine learning for subtype definition and risk prediction in heart failure, acute coronary syndromes and atrial fibrillation: systematic review of validity and clinical utility. BMC Med 2021; 19:85. [PMID: 33820530 PMCID: PMC8022365 DOI: 10.1186/s12916-021-01940-7] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Accepted: 02/12/2021] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND Machine learning (ML) is increasingly used in research for subtype definition and risk prediction, particularly in cardiovascular diseases. No existing ML models are routinely used for cardiovascular disease management, and their phase of clinical utility is unknown, partly due to a lack of clear criteria. We evaluated ML for subtype definition and risk prediction in heart failure (HF), acute coronary syndromes (ACS) and atrial fibrillation (AF). METHODS For ML studies of subtype definition and risk prediction, we conducted a systematic review in HF, ACS and AF, using PubMed, MEDLINE and Web of Science from January 2000 until December 2019. By adapting published criteria for diagnostic and prognostic studies, we developed a seven-domain, ML-specific checklist. RESULTS Of 5918 studies identified, 97 were included. Across studies for subtype definition (n = 40) and risk prediction (n = 57), there was variation in data source, population size (median 606 and median 6769), clinical setting (outpatient, inpatient, different departments), number of covariates (median 19 and median 48) and ML methods. All studies were single disease, most were North American (n = 61/97) and only 14 studies combined definition and risk prediction. Subtype definition and risk prediction studies respectively had limitations in development (e.g. 15.0% and 78.9% of studies related to patient benefit; 15.0% and 15.8% had low patient selection bias), validation (12.5% and 5.3% externally validated) and impact (32.5% and 91.2% improved outcome prediction; no effectiveness or cost-effectiveness evaluations). CONCLUSIONS Studies of ML in HF, ACS and AF are limited by number and type of included covariates, ML methods, population size, country, clinical setting and focus on single diseases, not overlap or multimorbidity. Clinical utility and implementation rely on improvements in development, validation and impact, facilitated by simple checklists. We provide clear steps prior to safe implementation of machine learning in clinical practice for cardiovascular diseases and other disease areas.
Collapse
Affiliation(s)
- Amitava Banerjee
- Institute of Health Informatics, University College London, 222 Euston Road, London, NW1 2DA, UK.
- Health Data Research UK, University College London, London, UK.
- University College London Hospitals NHS Trust, 235 Euston Road, London, UK.
- Barts Health NHS Trust, The Royal London Hospital, Whitechapel Rd, London, UK.
| | - Suliang Chen
- Institute of Health Informatics, University College London, 222 Euston Road, London, NW1 2DA, UK
- Health Data Research UK, University College London, London, UK
| | - Ghazaleh Fatemifar
- Institute of Health Informatics, University College London, 222 Euston Road, London, NW1 2DA, UK
- Health Data Research UK, University College London, London, UK
| | | | - R Thomas Lumbers
- Institute of Health Informatics, University College London, 222 Euston Road, London, NW1 2DA, UK
- Health Data Research UK, University College London, London, UK
- University College London Hospitals NHS Trust, 235 Euston Road, London, UK
| | - Johanna Mielke
- Bayer AG, Division Pharmaceuticals, Open Innovation & Digital Technologies, Wuppertal, Germany
| | - Simrat Gill
- University of Birmingham Institute of Cardiovascular Sciences and University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
| | - Dipak Kotecha
- University of Birmingham Institute of Cardiovascular Sciences and University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
- Department of Cardiology, University Medical Centre Utrecht, Utrecht, the Netherlands
| | - Daniel F Freitag
- Bayer AG, Division Pharmaceuticals, Open Innovation & Digital Technologies, Wuppertal, Germany
| | - Spiros Denaxas
- Institute of Health Informatics, University College London, 222 Euston Road, London, NW1 2DA, UK
- Health Data Research UK, University College London, London, UK
- The Alan Turing Institute, London, UK
| | - Harry Hemingway
- Institute of Health Informatics, University College London, 222 Euston Road, London, NW1 2DA, UK
- Health Data Research UK, University College London, London, UK
- University College London Hospitals Biomedical Research Centre (UCLH BRC), London, UK
| |
Collapse
|
210
|
Abstract
Hypertension remains the largest modifiable cause of mortality worldwide despite the availability of effective medications and sustained research efforts over the past 100 years. Hypertension requires transformative solutions that can help reduce the global burden of the disease. Artificial intelligence and machine learning, which have made a substantial impact on our everyday lives over the last decade may be the route to this transformation. However, artificial intelligence in health care is still in its nascent stages and realizing its potential requires numerous challenges to be overcome. In this review, we provide a clinician-centric perspective on artificial intelligence and machine learning as applied to medicine and hypertension. We focus on the main roadblocks impeding implementation of this technology in clinical care and describe efforts driving potential solutions. At the juncture, there is a critical requirement for clinical and scientific expertise to work in tandem with algorithmic innovation followed by rigorous validation and scrutiny to realize the promise of artificial intelligence-enabled health care for hypertension and other chronic diseases.
Collapse
Affiliation(s)
- Sandosh Padmanabhan
- BHF Glasgow Cardiovascular Research Centre, Institute of Cardiovascular and Medical Sciences, University of Glasgow
| | - Tran Quoc Bao Tran
- BHF Glasgow Cardiovascular Research Centre, Institute of Cardiovascular and Medical Sciences, University of Glasgow
| | - Anna F Dominiczak
- BHF Glasgow Cardiovascular Research Centre, Institute of Cardiovascular and Medical Sciences, University of Glasgow
| |
Collapse
|
211
|
Sajeev S, Champion S, Beleigoli A, Chew D, Reed RL, Magliano DJ, Shaw JE, Milne RL, Appleton S, Gill TK, Maeder A. Predicting Australian Adults at High Risk of Cardiovascular Disease Mortality Using Standard Risk Factors and Machine Learning. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:ijerph18063187. [PMID: 33808743 PMCID: PMC8003399 DOI: 10.3390/ijerph18063187] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/24/2021] [Revised: 03/15/2021] [Accepted: 03/16/2021] [Indexed: 12/23/2022]
Abstract
Effective cardiovascular disease (CVD) prevention relies on timely identification and intervention for individuals at risk. Conventional formula-based techniques have been demonstrated to over- or under-predict the risk of CVD in the Australian population. This study assessed the ability of machine learning models to predict CVD mortality risk in the Australian population and compare performance with the well-established Framingham model. Data is drawn from three Australian cohort studies: the North West Adelaide Health Study (NWAHS), the Australian Diabetes, Obesity, and Lifestyle study, and the Melbourne Collaborative Cohort Study (MCCS). Four machine learning models for predicting 15-year CVD mortality risk were developed and compared to the 2008 Framingham model. Machine learning models performed significantly better compared to the Framingham model when applied to the three Australian cohorts. Machine learning based models improved prediction by 2.7% to 5.2% across three Australian cohorts. In an aggregated cohort, machine learning models improved prediction by up to 5.1% (area-under-curve (AUC) 0.852, 95% CI 0.837–0.867). Net reclassification improvement (NRI) was up to 26% with machine learning models. Machine learning based models also showed improved performance when stratified by sex and diabetes status. Results suggest a potential for improving CVD risk prediction in the Australian population using machine learning models.
Collapse
Affiliation(s)
- Shelda Sajeev
- Flinders Digital Health Research Centre, College of Nursing & Health Sciences, Flinders University, Adelaide SA 5001, Australia; (S.C.); (A.B.); (A.M.)
- Chifley Business School, Torrens University, Australia, Adelaide, SA 5000, Australia
- Correspondence:
| | - Stephanie Champion
- Flinders Digital Health Research Centre, College of Nursing & Health Sciences, Flinders University, Adelaide SA 5001, Australia; (S.C.); (A.B.); (A.M.)
| | - Alline Beleigoli
- Flinders Digital Health Research Centre, College of Nursing & Health Sciences, Flinders University, Adelaide SA 5001, Australia; (S.C.); (A.B.); (A.M.)
- Caring Futures Institute, Flinders University, Adelaide, SA 5001, Australia
| | - Derek Chew
- College of Medicine and Public Health, Flinders University, Adelaide, SA 5001, Australia; (D.C.); (R.L.R.)
| | - Richard L. Reed
- College of Medicine and Public Health, Flinders University, Adelaide, SA 5001, Australia; (D.C.); (R.L.R.)
| | - Dianna J. Magliano
- Baker Heart and Diabetes Institute, Melbourne, VIC 3004, Australia; (D.J.M.); (J.E.S.)
- School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC 3004, Australia
| | - Jonathan E. Shaw
- Baker Heart and Diabetes Institute, Melbourne, VIC 3004, Australia; (D.J.M.); (J.E.S.)
- School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC 3004, Australia
- School of Life Sciences, La Trobe University, Melbourne, VIC 3086, Australia
| | - Roger L. Milne
- Cancer Epidemiology Division, Cancer Council Victoria, 615 St Kilda Road, Melbourne, VIC 3004, Australia;
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, 207 Bouverie Street, Melbourne, VIC 3010, Australia
- Precision Medicine, School of Clinical Sciences at Monash Health, Monash University, Clayton, VIC 3168, Australia
| | - Sarah Appleton
- Flinders Health and Medical Research Institute (Sleep Health)/Adelaide Institute for Sleep Health (AISH), College of Medicine and Public Health, Flinders University, Adelaide, SA 5042, Australia;
- Adelaide Medical School, The University of Adelaide, Adelaide, SA 5005, Australia;
| | - Tiffany K. Gill
- Adelaide Medical School, The University of Adelaide, Adelaide, SA 5005, Australia;
| | - Anthony Maeder
- Flinders Digital Health Research Centre, College of Nursing & Health Sciences, Flinders University, Adelaide SA 5001, Australia; (S.C.); (A.B.); (A.M.)
| |
Collapse
|
212
|
Allan S, Olaiya R, Burhan R. Reviewing the use and quality of machine learning in developing clinical prediction models for cardiovascular disease. Postgrad Med J 2021; 98:551-558. [PMID: 33692158 DOI: 10.1136/postgradmedj-2020-139352] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2020] [Revised: 02/10/2021] [Accepted: 02/12/2021] [Indexed: 12/23/2022]
Abstract
Cardiovascular disease (CVD) is one of the leading causes of death across the world. CVD can lead to angina, heart attacks, heart failure, strokes, and eventually, death; among many other serious conditions. The early intervention with those at a higher risk of developing CVD, typically with statin treatment, leads to better health outcomes. For this reason, clinical prediction models (CPMs) have been developed to identify those at a high risk of developing CVD so that treatment can begin at an earlier stage. Currently, CPMs are built around statistical analysis of factors linked to developing CVD, such as body mass index and family history. The emerging field of machine learning (ML) in healthcare, using computer algorithms that learn from a dataset without explicit programming, has the potential to outperform the CPMs available today. ML has already shown exciting progress in the detection of skin malignancies, bone fractures and many other medical conditions. In this review, we will analyse and explain the CPMs currently in use with comparisons to their developing ML counterparts. We have found that although the newest non-ML CPMs are effective, ML-based approaches consistently outperform them. However, improvements to the literature need to be made before ML should be implemented over current CPMs.
Collapse
Affiliation(s)
- Simon Allan
- Manchester Medical School, The University of Manchester, Manchester, UK
| | - Raphael Olaiya
- UCL Centre for Artificial Intelligence, University College London, London, UK
| | - Rasan Burhan
- St George's Healthcare NHS Trust, St George's Healthcare NHS Trust, London, UK
| |
Collapse
|
213
|
JAMTHIKAR AD, PUVVULA A, GUPTA D, JOHRI AM, NAMBI V, KHANNA NN, SABA L, MAVROGENI S, LAIRD JR, PAREEK G, MINER M, SFIKAKIS PP, PROTOGEROU A, KITAS GD, NICOLAIDES A, SHARMA AM, VISWANATHAN V, RATHORE VS, KOLLURI R, BHATT DL, SURI JS. Cardiovascular disease and stroke risk assessment in patients with chronic kidney disease using integration of estimated glomerular filtration rate, ultrasonic image phenotypes, and artificial intelligence: a narrative review. INT ANGIOL 2021; 40:150-164. [DOI: 10.23736/s0392-9590.20.04538-1] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
|
214
|
Brnabic A, Hess LM. Systematic literature review of machine learning methods used in the analysis of real-world data for patient-provider decision making. BMC Med Inform Decis Mak 2021; 21:54. [PMID: 33588830 PMCID: PMC7885605 DOI: 10.1186/s12911-021-01403-2] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2020] [Accepted: 01/20/2021] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND Machine learning is a broad term encompassing a number of methods that allow the investigator to learn from the data. These methods may permit large real-world databases to be more rapidly translated to applications to inform patient-provider decision making. METHODS This systematic literature review was conducted to identify published observational research of employed machine learning to inform decision making at the patient-provider level. The search strategy was implemented and studies meeting eligibility criteria were evaluated by two independent reviewers. Relevant data related to study design, statistical methods and strengths and limitations were identified; study quality was assessed using a modified version of the Luo checklist. RESULTS A total of 34 publications from January 2014 to September 2020 were identified and evaluated for this review. There were diverse methods, statistical packages and approaches used across identified studies. The most common methods included decision tree and random forest approaches. Most studies applied internal validation but only two conducted external validation. Most studies utilized one algorithm, and only eight studies applied multiple machine learning algorithms to the data. Seven items on the Luo checklist failed to be met by more than 50% of published studies. CONCLUSIONS A wide variety of approaches, algorithms, statistical software, and validation strategies were employed in the application of machine learning methods to inform patient-provider decision making. There is a need to ensure that multiple machine learning approaches are used, the model selection strategy is clearly defined, and both internal and external validation are necessary to be sure that decisions for patient care are being made with the highest quality evidence. Future work should routinely employ ensemble methods incorporating multiple machine learning algorithms.
Collapse
Affiliation(s)
| | - Lisa M Hess
- Eli Lilly and Company, Indianapolis, IN, USA.
| |
Collapse
|
215
|
Ravaut M, Sadeghi H, Leung KK, Volkovs M, Kornas K, Harish V, Watson T, Lewis GF, Weisman A, Poutanen T, Rosella L. Predicting adverse outcomes due to diabetes complications with machine learning using administrative health data. NPJ Digit Med 2021; 4:24. [PMID: 33580109 PMCID: PMC7881135 DOI: 10.1038/s41746-021-00394-8] [Citation(s) in RCA: 52] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2020] [Accepted: 01/11/2021] [Indexed: 02/07/2023] Open
Abstract
Across jurisdictions, government and health insurance providers hold a large amount of data from patient interactions with the healthcare system. We aimed to develop a machine learning-based model for predicting adverse outcomes due to diabetes complications using administrative health data from the single-payer health system in Ontario, Canada. A Gradient Boosting Decision Tree model was trained on data from 1,029,366 patients, validated on 272,864 patients, and tested on 265,406 patients. Discrimination was assessed using the AUC statistic and calibration was assessed visually using calibration plots overall and across population subgroups. Our model predicting three-year risk of adverse outcomes due to diabetes complications (hyper/hypoglycemia, tissue infection, retinopathy, cardiovascular events, amputation) included 700 features from multiple diverse data sources and had strong discrimination (average test AUC = 77.7, range 77.7-77.9). Through the design and validation of a high-performance model to predict diabetes complications adverse outcomes at the population level, we demonstrate the potential of machine learning and administrative health data to inform health planning and healthcare resource allocation for diabetes management.
Collapse
Affiliation(s)
- Mathieu Ravaut
- Layer 6 AI, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | | | | | | | - Kathy Kornas
- Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
| | - Vinyas Harish
- Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
- MD/PhD Program, Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
| | - Tristan Watson
- Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
- ICES, Toronto, ON, Canada
| | - Gary F Lewis
- Department of Medicine, Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
- Department of Physiology, Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
| | - Alanna Weisman
- Lunenfeld-Tanenbaum Research Institute, Mt. Sinai Hospital, Toronto, ON, Canada
- Division of Endocrinology and Metabolism, Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada
| | | | - Laura Rosella
- Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada.
- ICES, Toronto, ON, Canada.
- Vector Institute, Toronto, ON, Canada.
- Institute for Better Health, Trillium Health Partners, Mississauga, ON, Canada.
- Department of Laboratory Medicine & Pathology, Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada.
| |
Collapse
|
216
|
Lee C, Light A, Alaa A, Thurtle D, van der Schaar M, Gnanapragasam VJ. Application of a novel machine learning framework for predicting non-metastatic prostate cancer-specific mortality in men using the Surveillance, Epidemiology, and End Results (SEER) database. LANCET DIGITAL HEALTH 2021; 3:e158-e165. [PMID: 33549512 DOI: 10.1016/s2589-7500(20)30314-9] [Citation(s) in RCA: 56] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/13/2020] [Revised: 12/03/2020] [Accepted: 12/10/2020] [Indexed: 12/23/2022]
Abstract
BACKGROUND Accurate prognostication is crucial in treatment decisions made for men diagnosed with non-metastatic prostate cancer. Current models rely on prespecified variables, which limits their performance. We aimed to investigate a novel machine learning approach to develop an improved prognostic model for predicting 10-year prostate cancer-specific mortality and compare its performance with existing validated models. METHODS We derived and tested a machine learning-based model using Survival Quilts, an algorithm that automatically selects and tunes ensembles of survival models using clinicopathological variables. Our study involved a US population-based cohort of 171 942 men diagnosed with non-metastatic prostate cancer between Jan 1, 2000, and Dec 31, 2016, from the prospectively maintained Surveillance, Epidemiology, and End Results (SEER) Program. The primary outcome was prediction of 10-year prostate cancer-specific mortality. Model discrimination was assessed using the concordance index (c-index), and calibration was assessed using Brier scores. The Survival Quilts model was compared with nine other prognostic models in clinical use, and decision curve analysis was done. FINDINGS 647 151 men with prostate cancer were enrolled into the SEER database, of whom 171 942 were included in this study. Discrimination improved with greater granularity, and multivariable models outperformed tier-based models. The Survival Quilts model showed good discrimination (c-index 0·829, 95% CI 0·820-0·838) for 10-year prostate cancer-specific mortality, which was similar to the top-ranked multivariable models: PREDICT Prostate (0·820, 0·811-0·829) and Memorial Sloan Kettering Cancer Center (MSKCC) nomogram (0·787, 0·776-0·798). All three multivariable models showed good calibration with low Brier scores (Survival Quilts 0·036, 95% CI 0·035-0·037; PREDICT Prostate 0·036, 0·035-0·037; MSKCC 0·037, 0·035-0·039). Of the tier-based systems, the Cancer of the Prostate Risk Assessment model (c-index 0·782, 95% CI 0·771-0·793) and Cambridge Prognostic Groups model (0·779, 0·767-0·791) showed higher discrimination for predicting 10-year prostate cancer-specific mortality. c-indices for models from the National Comprehensive Cancer Care Network, Genitourinary Radiation Oncologists of Canada, American Urological Association, European Association of Urology, and National Institute for Health and Care Excellence ranged from 0·711 (0·701-0·721) to 0·761 (0·750-0·772). Discrimination for the Survival Quilts model was maintained when stratified by age and ethnicity. Decision curve analysis showed an incremental net benefit from the Survival Quilts model compared with the MSKCC and PREDICT Prostate models currently used in practice. INTERPRETATION A novel machine learning-based approach produced a prognostic model, Survival Quilts, with discrimination for 10-year prostate cancer-specific mortality similar to the top-ranked prognostic models, using only standard clinicopathological variables. Future integration of additional data will likely improve model performance and accuracy for personalised prognostics. FUNDING None.
Collapse
Affiliation(s)
- Changhee Lee
- Department of Electrical and Computer Engineering, University of California, Los Angeles, CA, USA
| | - Alexander Light
- Department of Surgery, Division of Urology, University of Cambridge, Cambridge, UK; Department of Urology, Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK
| | - Ahmed Alaa
- Department of Electrical and Computer Engineering, University of California, Los Angeles, CA, USA
| | - David Thurtle
- Department of Surgery, Division of Urology, University of Cambridge, Cambridge, UK; Department of Urology, Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK
| | - Mihaela van der Schaar
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK; Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
| | - Vincent J Gnanapragasam
- Department of Surgery, Division of Urology, University of Cambridge, Cambridge, UK; Cambridge Urology Translational Research and Clinical Trials Office, Cambridge Biomedical Campus, University of Cambridge, Cambridge, UK; Department of Urology, Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK.
| |
Collapse
|
217
|
Isgut M, Sun J, Quyyumi AA, Gibson G. Highly elevated polygenic risk scores are better predictors of myocardial infarction risk early in life than later. Genome Med 2021; 13:13. [PMID: 33509272 PMCID: PMC7845089 DOI: 10.1186/s13073-021-00828-8] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Accepted: 01/07/2021] [Indexed: 01/09/2023] Open
Abstract
Background Several polygenic risk scores (PRS) have been developed for cardiovascular risk prediction, but the additive value of including PRS together with conventional risk factors for risk prediction is questionable. This study assesses the clinical utility of including four PRS generated from 194, 46K, 1.5M, and 6M SNPs, along with conventional risk factors, to predict risk of ischemic heart disease (IHD), myocardial infarction (MI), and first MI event on or before age 50 (early MI). Methods A cross-validated logistic regression (LR) algorithm was trained either on ~ 440K European ancestry individuals from the UK Biobank (UKB), or the full UKB population, including as features different combinations of conventional established-at-birth risk factors (ancestry, sex) and risk factors that are non-fixed over an individual’s lifespan (age, BMI, hypertension, hyperlipidemia, diabetes, smoking, family history), with and without also including PRS. The algorithm was trained separately with IHD, MI, and early MI as prediction labels. Results When LR was trained using risk factors established-at-birth, adding the four PRS significantly improved the area under the curve (AUC) for IHD (0.62 to 0.67) and MI (0.67 to 0.73), as well as for early MI (0.70 to 0.79). When LR was trained using all risk factors, adding the four PRS only resulted in a significantly higher disease prevalence in the 98th and 99th percentiles of both the IHD and MI scores. Conclusions PRS improve cardiovascular risk stratification early in life when knowledge of later-life risk factors is unavailable. However, by middle age, when many risk factors are known, the improvement attributed to PRS is marginal for the general population. Supplementary Information The online version contains supplementary material available at 10.1186/s13073-021-00828-8.
Collapse
Affiliation(s)
- Monica Isgut
- School of Biological Sciences, Georgia Institute of Technology, 950 Atlantic Drive, EBB1 Suite 2115, Georgia Tech, Atlanta, GA, 30332, USA
| | - Jimeng Sun
- Department of Computer Science, University of Illinois Urbana-Champaign, Champaign, USA
| | - Arshed A Quyyumi
- Department of Medicine, Emory Clinical Cardiovascular Research Institute, Emory University School of Medicine, Atlanta, USA
| | - Greg Gibson
- School of Biological Sciences, Georgia Institute of Technology, 950 Atlantic Drive, EBB1 Suite 2115, Georgia Tech, Atlanta, GA, 30332, USA.
| |
Collapse
|
218
|
Plasma protein expression profiles, cardiovascular disease, and religious struggles among South Asians in the MASALA study. Sci Rep 2021; 11:961. [PMID: 33441605 PMCID: PMC7806901 DOI: 10.1038/s41598-020-79429-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Accepted: 12/07/2020] [Indexed: 11/12/2022] Open
Abstract
Blood protein concentrations are clinically useful, predictive biomarkers of cardiovascular disease (CVD). Despite a higher burden of CVD among U.S. South Asians, no CVD-related proteomics study has been conducted in this sub-population. The aim of this study is to investigate the associations between plasma protein levels and CVD incidence, and to assess the potential influence of religiosity/spirituality (R/S) on significant protein-CVD associations, in South Asians from the MASALA Study. We used a nested case–control design of 50 participants with incident CVD and 50 sex- and age-matched controls. Plasma samples were analyzed by SOMAscan for expression of 1305 proteins. Multivariable logistic regression models and model selection using Akaike Information Criteria were performed on the proteins and clinical covariates, with further effect modification analyses conducted to assess the influence of R/S measures on significant associations between proteins and incident CVD events. We identified 36 proteins that were significantly expressed differentially among CVD cases compared to matched controls. These proteins are involved in immune cell recruitment, atherosclerosis, endothelial cell differentiation, and vascularization. A final multivariable model found three proteins (Contactin-5 [CNTN5], Low affinity immunoglobulin gamma Fc region receptor II-a [FCGR2A], and Complement factor B [CFB]) associated with incident CVD after adjustment for diabetes (AUC = 0.82). Religious struggles that exacerbate the adverse impact of stressful life events, significantly modified the effect of Contactin-5 and Complement factor B on risk of CVD. Our research is this first assessment of the relationship between protein concentrations and risk of CVD in a South Asian sample. Further research is needed to understand patterns of proteomic profiles across diverse ethnic communities, and the influence of resources for resiliency on proteomic signatures and ultimately, risk of CVD.
Collapse
|
219
|
García-Del-Valle S, Arnal-Velasco D, Molina-Mendoza R, Gómez-Arnau JI. Update on early warning scores. Best Pract Res Clin Anaesthesiol 2021; 35:105-113. [PMID: 33742570 DOI: 10.1016/j.bpa.2020.12.013] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Accepted: 12/19/2020] [Indexed: 12/23/2022]
Abstract
Early warning scores (EWS) have the objective to provide a preventive approach for detecting those patients in general wards at risk of deterioration before it begins. Well implemented and combined with a tiered response, the EWS expect to be a relevant tool for patient safety. Most of the evidence for their use has been published for the general EWS. Their strengths, such as objectivity and systematic response, health provider training, universal applicability and automatization potential need to be highlighted to counterbalance the weakness and limitations that have also been described. The near future will probably increase availability of EWS, reliability and predictive value through the spread and acceptability of continuous monitoring in general ward, its integration in decision support algorithms with automatic alerts and the elaboration of temporal vital signs patterns that will finally allow to perform a personal modelling depending on individual patient characteristics.
Collapse
|
220
|
Morelli D, Dolezalova N, Ponzo S, Colombo M, Plans D. Development of Digitally Obtainable 10-Year Risk Scores for Depression and Anxiety in the General Population. Front Psychiatry 2021; 12:689026. [PMID: 34483986 PMCID: PMC8414584 DOI: 10.3389/fpsyt.2021.689026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Accepted: 07/13/2021] [Indexed: 11/29/2022] Open
Abstract
The burden of depression and anxiety in the world is rising. Identification of individuals at increased risk of developing these conditions would help to target them for prevention and ultimately reduce the healthcare burden. We developed a 10-year predictive algorithm for depression and anxiety using the full cohort of over 400,000 UK Biobank (UKB) participants without pre-existing depression or anxiety using digitally obtainable information. From the initial 167 variables selected from UKB, processed into 429 features, iterative backward elimination using Cox proportional hazards model was performed to select predictors which account for the majority of its predictive capability. Baseline and reduced models were then trained for depression and anxiety using both Cox and DeepSurv, a deep neural network approach to survival analysis. The baseline Cox model achieved concordance of 0.7772 and 0.7720 on the validation dataset for depression and anxiety, respectively. For the DeepSurv model, respective concordance indices were 0.7810 and 0.7728. After feature selection, the depression model contained 39 predictors and the concordance index was 0.7769 for Cox and 0.7772 for DeepSurv. The reduced anxiety model, with 53 predictors, achieved concordance of 0.7699 for Cox and 0.7710 for DeepSurv. The final models showed good discrimination and calibration in the test datasets. We developed predictive risk scores with high discrimination for depression and anxiety using the UKB cohort, incorporating predictors which are easily obtainable via smartphone. If deployed in a digital solution, it would allow individuals to track their risk, as well as provide some pointers to how to decrease it through lifestyle changes.
Collapse
Affiliation(s)
- Davide Morelli
- Huma Therapeutics Ltd., London, United Kingdom.,Department of Engineering Science, Institute of Biomedical Engineering, University of Oxford, Oxford, United Kingdom
| | | | - Sonia Ponzo
- Huma Therapeutics Ltd., London, United Kingdom
| | | | - David Plans
- Huma Therapeutics Ltd., London, United Kingdom.,Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom.,Initiative in the Digital Economy at Exeter (INDEX) Group, Department of Science, Innovation, Technology, and Entrepreneurship, University of Exeter, Exeter, United Kingdom
| |
Collapse
|
221
|
Renn BN, Schurr M, Zaslavsky O, Pratap A. Artificial Intelligence: An Interprofessional Perspective on Implications for Geriatric Mental Health Research and Care. Front Psychiatry 2021; 12:734909. [PMID: 34867524 PMCID: PMC8634654 DOI: 10.3389/fpsyt.2021.734909] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/01/2021] [Accepted: 10/07/2021] [Indexed: 11/26/2022] Open
Abstract
Artificial intelligence (AI) in healthcare aims to learn patterns in large multimodal datasets within and across individuals. These patterns may either improve understanding of current clinical status or predict a future outcome. AI holds the potential to revolutionize geriatric mental health care and research by supporting diagnosis, treatment, and clinical decision-making. However, much of this momentum is driven by data and computer scientists and engineers and runs the risk of being disconnected from pragmatic issues in clinical practice. This interprofessional perspective bridges the experiences of clinical scientists and data science. We provide a brief overview of AI with the main focus on possible applications and challenges of using AI-based approaches for research and clinical care in geriatric mental health. We suggest future AI applications in geriatric mental health consider pragmatic considerations of clinical practice, methodological differences between data and clinical science, and address issues of ethics, privacy, and trust.
Collapse
Affiliation(s)
- Brenna N Renn
- Department of Psychology, University of Nevada, Las Vegas, NV, United States
| | - Matthew Schurr
- Department of Psychology, University of Nevada, Las Vegas, NV, United States
| | - Oleg Zaslavsky
- Department of Biobehavioral Nursing and Health Informatics, University of Washington, Seattle, WA, United States
| | - Abhishek Pratap
- Krembil Centre for Neuroinformatics, Centre for Addiction and Mental Health, Toronto, ON, Canada.,Vector Institute for Artificial Intelligence, Toronto, ON, Canada.,Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, United States.,Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, United Kingdom
| |
Collapse
|
222
|
Chakravadhanula K. A smartphone-based test and predictive models for rapid, non-invasive, and point-of-care monitoring of ocular and cardiovascular complications related to diabetes. INFORMATICS IN MEDICINE UNLOCKED 2021. [DOI: 10.1016/j.imu.2020.100485] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
|
223
|
Kim M, Chae K, Lee S, Jang HJ, Kim S. Automated Classification of Online Sources for Infectious Disease Occurrences Using Machine-Learning-Based Natural Language Processing Approaches. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020; 17:ijerph17249467. [PMID: 33348764 PMCID: PMC7766498 DOI: 10.3390/ijerph17249467] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/11/2020] [Revised: 12/14/2020] [Accepted: 12/15/2020] [Indexed: 11/18/2022]
Abstract
Collecting valid information from electronic sources to detect the potential outbreak of infectious disease is time-consuming and labor-intensive. The automated identification of relevant information using machine learning is necessary to respond to a potential disease outbreak. A total of 2864 documents were collected from various websites and subsequently manually categorized and labeled by two reviewers. Accurate labels for the training and test data were provided based on a reviewer consensus. Two machine learning algorithms—ConvNet and bidirectional long short-term memory (BiLSTM)—and two classification methods—DocClass and SenClass—were used for classifying the documents. The precision, recall, F1, accuracy, and area under the curve were measured to evaluate the performance of each model. ConvNet yielded higher average, min, and max accuracies (87.6%, 85.2%, and 91.1%, respectively) than BiLSTM with DocClass, while BiLSTM performed better than ConvNet with SenClass with average, min, and max accuracies of 92.8%, 92.6%, and 93.3%, respectively. The performance of BiLSTM with SenClass yielded an overall accuracy of 92.9% in classifying infectious disease occurrences. Machine learning had a compatible performance with a human expert given a particular text extraction system. This study suggests that analyzing information from the website using machine learning can achieve significant accuracies in the presence of abundant articles/documents.
Collapse
Affiliation(s)
- Mira Kim
- Department of Preventive Medicine, College of Medicine, The Catholic University of Korea, Seoul 06591, Korea; (M.K.); (K.C.)
| | - Kyunghee Chae
- Department of Preventive Medicine, College of Medicine, The Catholic University of Korea, Seoul 06591, Korea; (M.K.); (K.C.)
| | - Seungwoo Lee
- Department of Data and HPC Science, University of Science and Technology, Daejeon 34113, Korea;
- Research Data Sharing Center, Korea Institute of Science and Technology Information, Daejeon 34141, Korea;
| | - Hong-Jun Jang
- Research Data Sharing Center, Korea Institute of Science and Technology Information, Daejeon 34141, Korea;
| | - Sukil Kim
- Department of Preventive Medicine, College of Medicine, The Catholic University of Korea, Seoul 06591, Korea; (M.K.); (K.C.)
- Correspondence: ; Tel.: +82-2-2258-7367
| |
Collapse
|
224
|
Taylor AK, Steeg S, Quinlivan L, Gunnell D, Hawton K, Kapur N. Accuracy of individual and combined risk-scale items in the prediction of repetition of self-harm: multicentre prospective cohort study. BJPsych Open 2020; 7:e2. [PMID: 33261707 PMCID: PMC7791570 DOI: 10.1192/bjo.2020.123] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/25/2020] [Revised: 09/18/2020] [Accepted: 09/29/2020] [Indexed: 11/25/2022] Open
Abstract
BACKGROUND Individuals attending emergency departments following self-harm have increased risks of future self-harm. Despite the common use of risk scales in self-harm assessment, there is growing evidence that combinations of risk factors do not accurately identify those at greatest risk of further self-harm and suicide. AIMS To evaluate and compare predictive accuracy in prediction of repeat self-harm from clinician and patient ratings of risk, individual risk-scale items and a scale constructed with top-performing items. METHOD We conducted secondary analysis of data from a five-hospital multicentre prospective cohort study of participants referred to psychiatric liaison services following self-harm. We tested predictive utility of items from five risk scales: Manchester Self-Harm Rule, ReACT Self-Harm Rule, SAD PERSONS, Modified SAD PERSONS, Barratt Impulsiveness Scale and clinician and patient risk estimates. Area under the curve (AUC), sensitivity, specificity, predictive values and likelihood ratios were used to evaluate predictive accuracy, with sensitivity analyses using classification-tree regression. RESULTS A total of 483 self-harm episodes were included, and 145 (30%) were followed by a repeat presentation within 6 months. AUC of individual items ranged from 0.43-0.65. Combining best performing items resulted in an AUC of 0.56. Some individual items outperformed the scale they originated from; no items were superior to clinician or patient risk estimations. CONCLUSIONS No individual or combination of items outperformed patients' or clinicians' ratings. This suggests there are limitations to combining risk factors to predict risk of self-harm repetition. Risk scales should have little role in the management of people who have self-harmed.
Collapse
Affiliation(s)
- Anna Kathryn Taylor
- Centre for Mental Health and Safety, Manchester Academic Health Science Centre, University of Manchester, UK
| | - Sarah Steeg
- Centre for Mental Health and Safety, Manchester Academic Health Science Centre, University of Manchester, UK
| | - Leah Quinlivan
- Centre for Mental Health and Safety, Manchester Academic Health Science Centre, University of Manchester, UK; and NIHR Greater Manchester Patient Safety Translational Research Centre, University of Manchester, UK
| | - David Gunnell
- Department of Population Health Sciences, University of Bristol, UK
| | - Keith Hawton
- Centre for Suicide Research University Department of Psychiatry, Warneford Hospital, UK; and Oxford Health NHS Foundation Trust, Warneford Hospital, UK
| | - Nav Kapur
- Centre for Mental Health and Safety, Manchester Academic Health Science Centre, University of Manchester, UK; and NIHR Greater Manchester Patient Safety Translational Research Centre, University of Manchester, UK
| |
Collapse
|
225
|
Albrecht K, Milatz F, Callhoff J, Redeker I, Minden K, Strangfeld A, Regierer A. [Perspectives for rheumatological health services research at the German Rheumatism Research Center]. Z Rheumatol 2020; 79:1003-1008. [PMID: 33258978 PMCID: PMC7705411 DOI: 10.1007/s00393-020-00907-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/03/2020] [Indexed: 11/26/2022]
Abstract
In diesem Übersichtsartikel werden aktuelle Projekte und Perspektiven der rheumatologischen Versorgungsforschung am Programmbereich Epidemiologie des DRFZ (Deutsches Rheuma-Forschungszentrum) zusammengefasst. Versorgungsforschung wird mithilfe verschiedener Datenquellen betrieben. Neben den klassischen rheumatologischen Krankheitsregistern werden zunehmend auch Krankenkassendaten und bevölkerungsbezogene Kohorten für Analysen verwendet. Von der Datenerfassung über das Monitoring bis zu Analysealgorithmen verändern digitale Anwendungen die Versorgungsforschung der nächsten Jahre. Kollaborative Analysen mit nationalen und internationalen Kooperationspartnern unter Einbindung von Biomarkern komplettieren die Forschung am Programmbereich Epidemiologie. Die Digitalisierung der Forschungsprojekte ist ein zentraler Baustein, der die Versorgungsforschung im kommenden Jahrzehnt weiter verändern wird.
Collapse
Affiliation(s)
- K Albrecht
- Programmbereich Epidemiologie und Versorgungsforschung, Deutsches Rheuma-Forschungszentrum Berlin, Charitéplatz 1, 10117, Berlin, Deutschland.
| | - F Milatz
- Programmbereich Epidemiologie und Versorgungsforschung, Deutsches Rheuma-Forschungszentrum Berlin, Charitéplatz 1, 10117, Berlin, Deutschland
| | - J Callhoff
- Programmbereich Epidemiologie und Versorgungsforschung, Deutsches Rheuma-Forschungszentrum Berlin, Charitéplatz 1, 10117, Berlin, Deutschland
| | - I Redeker
- Programmbereich Epidemiologie und Versorgungsforschung, Deutsches Rheuma-Forschungszentrum Berlin, Charitéplatz 1, 10117, Berlin, Deutschland
| | - K Minden
- Programmbereich Epidemiologie und Versorgungsforschung, Deutsches Rheuma-Forschungszentrum Berlin, Charitéplatz 1, 10117, Berlin, Deutschland
| | - A Strangfeld
- Programmbereich Epidemiologie und Versorgungsforschung, Deutsches Rheuma-Forschungszentrum Berlin, Charitéplatz 1, 10117, Berlin, Deutschland
| | - A Regierer
- Programmbereich Epidemiologie und Versorgungsforschung, Deutsches Rheuma-Forschungszentrum Berlin, Charitéplatz 1, 10117, Berlin, Deutschland
| |
Collapse
|
226
|
Zhu JS, Ge P, Jiang C, Zhang Y, Li X, Zhao Z, Zhang L, Duong TQ. Deep-learning artificial intelligence analysis of clinical variables predicts mortality in COVID-19 patients. J Am Coll Emerg Physicians Open 2020; 1:1364-1373. [PMID: 32838390 PMCID: PMC7405082 DOI: 10.1002/emp2.12205] [Citation(s) in RCA: 50] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2020] [Revised: 07/01/2020] [Accepted: 07/13/2020] [Indexed: 01/01/2023] Open
Abstract
Objective The large number of clinical variables associated with coronavirus disease 2019 (COVID-19) infection makes it challenging for frontline physicians to effectively triage COVID-19 patients during the pandemic. This study aimed to develop an efficient deep-learning artificial intelligence algorithm to identify top clinical variable predictors and derive a risk stratification score system to help clinicians triage COVID-19 patients. Methods This retrospective study consisted of 181 hospitalized patients with confirmed COVID-19 infection from January 29, 2020 to March 21, 2020 from a major hospital in Wuhan, China. The primary outcome was mortality. Demographics, comorbidities, vital signs, symptoms, and laboratory tests were collected at initial presentation, totaling 78 clinical variables. A deep-learning algorithm and a risk stratification score system were developed to predict mortality. Data were split into 85% training and 15% testing. Prediction performance was compared with those using COVID-19 severity score, CURB-65 score, and pneumonia severity index (PSI). Results Of the 181 COVID-19 patients, 39 expired and 142 survived. Five top predictors of mortality were D-dimer, O2 Index, neutrophil:lymphocyte ratio, C-reactive protein, and lactate dehydrogenase. The top 5 predictors and the resultant risk score yielded, respectively, an area under curve (AUC) of 0.968 (95% CI = 0.87-1.0) and 0.954 (95% CI = 0.80-0.99) for the testing dataset. Our models outperformed COVID-19 severity score (AUC = 0.756), CURB-65 score (AUC = 0.671), and PSI (AUC = 0.838). The mortality rates for our risk stratification scores (0-5) were 0%, 0%, 6.7%, 18.2%, 67.7%, and 83.3%, respectively. Conclusions Deep-learning prediction model and the resultant risk stratification score may prove useful in clinical decisionmaking under time-sensitive and resource-constrained environment.
Collapse
Affiliation(s)
- Jocelyn S Zhu
- Departments of Radiology, Renaissance School of MedicineStony Brook UniversityStony BrookNew YorkUSA
| | - Peilin Ge
- Departments of Radiology, Renaissance School of MedicineStony Brook UniversityStony BrookNew YorkUSA
| | - Chunguo Jiang
- Department of Respiratory and Critical Care Medicine, Beijing Institute of Respiratory Medicine, Beijing Chaoyang HospitalCapital Medical UniversityBeijingChina
| | - Yong Zhang
- Department of Hepatobiliary Surgery, Union Hospital, Tongji Medical CollegeHuazhong University of Science and TechnologyWuhanChina
| | - Xiaoran Li
- Departments of Radiology, Renaissance School of MedicineStony Brook UniversityStony BrookNew YorkUSA
| | - Zirun Zhao
- Departments of Radiology, Renaissance School of MedicineStony Brook UniversityStony BrookNew YorkUSA
| | - Liming Zhang
- Department of Respiratory and Critical Care Medicine, Beijing Institute of Respiratory Medicine, Beijing Chaoyang HospitalCapital Medical UniversityBeijingChina
| | - Tim Q. Duong
- Departments of Radiology, Renaissance School of MedicineStony Brook UniversityStony BrookNew YorkUSA
| |
Collapse
|
227
|
Qian Z, Alaa AM, van der Schaar M. CPAS: the UK's national machine learning-based hospital capacity planning system for COVID-19. Mach Learn 2020; 110:15-35. [PMID: 33250568 PMCID: PMC7685302 DOI: 10.1007/s10994-020-05921-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Revised: 09/05/2020] [Accepted: 09/26/2020] [Indexed: 12/20/2022]
Abstract
The coronavirus disease 2019 (COVID-19) global pandemic poses the threat of overwhelming healthcare systems with unprecedented demands for intensive care resources. Managing these demands cannot be effectively conducted without a nationwide collective effort that relies on data to forecast hospital demands on the national, regional, hospital and individual levels. To this end, we developed the COVID-19 Capacity Planning and Analysis System (CPAS)—a machine learning-based system for hospital resource planning that we have successfully deployed at individual hospitals and across regions in the UK in coordination with NHS Digital. In this paper, we discuss the main challenges of deploying a machine learning-based decision support system at national scale, and explain how CPAS addresses these challenges by (1) defining the appropriate learning problem, (2) combining bottom-up and top-down analytical approaches, (3) using state-of-the-art machine learning algorithms, (4) integrating heterogeneous data sources, and (5) presenting the result with an interactive and transparent interface. CPAS is one of the first machine learning-based systems to be deployed in hospitals on a national scale to address the COVID-19 pandemic—we conclude the paper with a summary of the lessons learned from this experience.
Collapse
Affiliation(s)
| | | | - Mihaela van der Schaar
- University of Cambridge, Cambridge, UK
- University of California, Los Angeles, USA
- The Alan Turing Institute, London, UK
| |
Collapse
|
228
|
Multiclass machine learning vs. conventional calculators for stroke/CVD risk assessment using carotid plaque predictors with coronary angiography scores as gold standard: a 500 participants study. Int J Cardiovasc Imaging 2020; 37:1171-1187. [DOI: 10.1007/s10554-020-02099-7] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Accepted: 11/03/2020] [Indexed: 02/07/2023]
|
229
|
Low-Cost Office-Based Cardiovascular Risk Stratification Using Machine Learning and Focused Carotid Ultrasound in an Asian-Indian Cohort. J Med Syst 2020; 44:208. [DOI: 10.1007/s10916-020-01675-7] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2020] [Accepted: 11/09/2020] [Indexed: 12/13/2022]
|
230
|
Li Y, Sperrin M, Ashcroft DM, van Staa TP. Consistency of variety of machine learning and statistical models in predicting clinical risks of individual patients: longitudinal cohort study using cardiovascular disease as exemplar. BMJ 2020; 371:m3919. [PMID: 33148619 PMCID: PMC7610202 DOI: 10.1136/bmj.m3919] [Citation(s) in RCA: 61] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
OBJECTIVE To assess the consistency of machine learning and statistical techniques in predicting individual level and population level risks of cardiovascular disease and the effects of censoring on risk predictions. DESIGN Longitudinal cohort study from 1 January 1998 to 31 December 2018. SETTING AND PARTICIPANTS 3.6 million patients from the Clinical Practice Research Datalink registered at 391 general practices in England with linked hospital admission and mortality records. MAIN OUTCOME MEASURES Model performance including discrimination, calibration, and consistency of individual risk prediction for the same patients among models with comparable model performance. 19 different prediction techniques were applied, including 12 families of machine learning models (grid searched for best models), three Cox proportional hazards models (local fitted, QRISK3, and Framingham), three parametric survival models, and one logistic model. RESULTS The various models had similar population level performance (C statistics of about 0.87 and similar calibration). However, the predictions for individual risks of cardiovascular disease varied widely between and within different types of machine learning and statistical models, especially in patients with higher risks. A patient with a risk of 9.5-10.5% predicted by QRISK3 had a risk of 2.9-9.2% in a random forest and 2.4-7.2% in a neural network. The differences in predicted risks between QRISK3 and a neural network ranged between -23.2% and 0.1% (95% range). Models that ignored censoring (that is, assumed censored patients to be event free) substantially underestimated risk of cardiovascular disease. Of the 223 815 patients with a cardiovascular disease risk above 7.5% with QRISK3, 57.8% would be reclassified below 7.5% when using another model. CONCLUSIONS A variety of models predicted risks for the same patients very differently despite similar model performances. The logistic models and commonly used machine learning models should not be directly applied to the prediction of long term risks without considering censoring. Survival models that consider censoring and that are explainable, such as QRISK3, are preferable. The level of consistency within and between models should be routinely assessed before they are used for clinical decision making.
Collapse
Affiliation(s)
- Yan Li
- Health e-Research Centre, Health Data Research UK North, School of Health Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, Manchester M13 9PL, UK
| | - Matthew Sperrin
- Health e-Research Centre, Health Data Research UK North, School of Health Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, Manchester M13 9PL, UK
| | - Darren M Ashcroft
- Centre for Pharmacoepidemiology and Drug Safety, School of Health Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
- NIHR Greater Manchester Patient Safety Translational Research Centre, School of Health Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
| | - Tjeerd Pieter van Staa
- Health e-Research Centre, Health Data Research UK North, School of Health Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, Manchester M13 9PL, UK
- Utrecht Institute for Pharmaceutical Sciences, Utrecht University, Utrecht, Netherlands
- Alan Turing Institute, Headquartered at the British Library, London, UK
| |
Collapse
|
231
|
Chaikijurajai T, Laffin LJ, Tang WHW. Artificial Intelligence and Hypertension: Recent Advances and Future Outlook. Am J Hypertens 2020; 33:967-974. [PMID: 32615586 PMCID: PMC7608522 DOI: 10.1093/ajh/hpaa102] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2020] [Accepted: 06/26/2020] [Indexed: 12/19/2022] Open
Abstract
Prevention and treatment of hypertension (HTN) are a challenging public health problem. Recent evidence suggests that artificial intelligence (AI) has potential to be a promising tool for reducing the global burden of HTN, and furthering precision medicine related to cardiovascular (CV) diseases including HTN. Since AI can stimulate human thought processes and learning with complex algorithms and advanced computational power, AI can be applied to multimodal and big data, including genetics, epigenetics, proteomics, metabolomics, CV imaging, socioeconomic, behavioral, and environmental factors. AI demonstrates the ability to identify risk factors and phenotypes of HTN, predict the risk of incident HTN, diagnose HTN, estimate blood pressure (BP), develop novel cuffless methods for BP measurement, and comprehensively identify factors associated with treatment adherence and success. Moreover, AI has also been used to analyze data from major randomized controlled trials exploring different BP targets to uncover previously undescribed factors associated with CV outcomes. Therefore, AI-integrated HTN care has the potential to transform clinical practice by incorporating personalized prevention and treatment approaches, such as determining optimal and patient-specific BP goals, identifying the most effective antihypertensive medication regimen for an individual, and developing interventions targeting modifiable risk factors. Although the role of AI in HTN has been increasingly recognized over the past decade, it remains in its infancy, and future studies with big data analysis and N-of-1 study design are needed to further demonstrate the applicability of AI in HTN prevention and treatment.
Collapse
Affiliation(s)
- Thanat Chaikijurajai
- Department of Cardiovascular Medicine, Heart, Vascular and Thoracic Institute, Cleveland Clinic, Cleveland, Ohio, USA
| | - Luke J Laffin
- Department of Cardiovascular Medicine, Heart, Vascular and Thoracic Institute, Cleveland Clinic, Cleveland, Ohio, USA
| | - Wai Hong Wilson Tang
- Department of Cardiovascular Medicine, Heart, Vascular and Thoracic Institute, Cleveland Clinic, Cleveland, Ohio, USA
| |
Collapse
|
232
|
Timmins IR, Zaccardi F, Nelson CP, Franks PW, Yates T, Dudbridge F. Genome-wide association study of self-reported walking pace suggests beneficial effects of brisk walking on health and survival. Commun Biol 2020; 3:634. [PMID: 33128006 PMCID: PMC7599247 DOI: 10.1038/s42003-020-01357-7] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2020] [Accepted: 10/01/2020] [Indexed: 12/19/2022] Open
Abstract
Walking is a simple form of exercise, widely promoted for its health benefits. Self-reported walking pace has been associated with a range of cardiorespiratory and cancer outcomes, and is a strong predictor of mortality. Here we perform a genome-wide association study of self-reported walking pace in 450,967 European ancestry UK Biobank participants. We identify 70 independent associated loci (P < 5 × 10-8), 11 of which are novel. We estimate the SNP-based heritability as 13.2% (s.e. = 0.21%), reducing to 8.9% (s.e. = 0.17%) with adjustment for body mass index. Significant genetic correlations are observed with cardiometabolic, respiratory and psychiatric traits, educational attainment and all-cause mortality. Mendelian randomization analyses suggest a potential causal link of increasing walking pace with a lower cardiometabolic risk profile. Given its low heritability and simple measurement, these findings suggest that self-reported walking pace is a pragmatic target for interventions aiming for general benefits on health.
Collapse
Affiliation(s)
- Iain R Timmins
- Department of Health Sciences, University of Leicester, Leicester, UK
| | | | - Christopher P Nelson
- Department of Cardiovascular Sciences, University of Leicester, Leicester, UK.,NIHR Leicester Biomedical Research Centre, University Hospitals of Leicester NHS Trust & University of Leicester, Leicester, UK
| | - Paul W Franks
- Department of Clinical Sciences, Lund University, Lund, Sweden
| | - Thomas Yates
- Diabetes Research Centre, University of Leicester, Leicester, UK.,NIHR Leicester Biomedical Research Centre, University Hospitals of Leicester NHS Trust & University of Leicester, Leicester, UK
| | - Frank Dudbridge
- Department of Health Sciences, University of Leicester, Leicester, UK.
| |
Collapse
|
233
|
Jamthikar AD, Gupta D, Saba L, Khanna NN, Viskovic K, Mavrogeni S, Laird JR, Sattar N, Johri AM, Pareek G, Miner M, Sfikakis PP, Protogerou A, Viswanathan V, Sharma A, Kitas GD, Nicolaides A, Kolluri R, Suri JS. Artificial intelligence framework for predictive cardiovascular and stroke risk assessment models: A narrative review of integrated approaches using carotid ultrasound. Comput Biol Med 2020; 126:104043. [PMID: 33065389 DOI: 10.1016/j.compbiomed.2020.104043] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Revised: 09/10/2020] [Accepted: 10/04/2020] [Indexed: 12/12/2022]
Abstract
RECENT FINDINGS Cardiovascular disease (CVD) is the leading cause of mortality and poses challenges for healthcare providers globally. Risk-based approaches for the management of CVD are becoming popular for recommending treatment plans for asymptomatic individuals. Several conventional predictive CVD risk models based do not provide an accurate CVD risk assessment for patients with different baseline risk profiles. Artificial intelligence (AI) algorithms have changed the landscape of CVD risk assessment and demonstrated a better performance when compared against conventional models, mainly due to its ability to handle the input nonlinear variations. Further, it has the flexibility to add risk factors derived from medical imaging modalities that image the morphology of the plaque. The integration of noninvasive carotid ultrasound image-based phenotypes with conventional risk factors in the AI framework has further provided stronger power for CVD risk prediction, so-called "integrated predictive CVD risk models." PURPOSE of the review: The objective of this review is (i) to understand several aspects in the development of predictive CVD risk models, (ii) to explore current conventional predictive risk models and their successes and challenges, and (iii) to refine the search for predictive CVD risk models using noninvasive carotid ultrasound as an exemplar in the artificial intelligence-based framework. CONCLUSION Conventional predictive CVD risk models are suboptimal and could be improved. This review examines the potential to include more noninvasive image-based phenotypes in the CVD risk assessment using powerful AI-based strategies.
Collapse
Affiliation(s)
- Ankush D Jamthikar
- Department of Electronics and Communication Engineering, Visvesvaraya National Institute of Technology, Nagpur, Maharashtra, India
| | - Deep Gupta
- Department of Electronics and Communication Engineering, Visvesvaraya National Institute of Technology, Nagpur, Maharashtra, India
| | - Luca Saba
- Department of Radiology, University of Cagliari, Italy
| | - Narendra N Khanna
- Department of Cardiology, Indraprastha APOLLO Hospitals, New Delhi, India
| | - Klaudija Viskovic
- Department of Radiology and Ultrasound, University Hospital for Infectious Diseases, Croatia
| | - Sophie Mavrogeni
- Cardiology Clinic, Onassis Cardiac Surgery Center, Athens, Greece
| | - John R Laird
- Heart and Vascular Institute, Adventist Health St. Helena, St Helena, CA, USA
| | - Naveed Sattar
- Institute of Cardiovascular & Medical Sciences, University of Glasgow, Scotland, UK
| | - Amer M Johri
- Department of Medicine, Division of Cardiology, Queen's University, Kingston, Ontario, Canada
| | - Gyan Pareek
- Minimally Invasive Urology Institute, Brown University, Providence, RI, USA
| | - Martin Miner
- Men's Health Center, Miriam Hospital Providence, Rhode Island, USA
| | - Petros P Sfikakis
- Rheumatology Unit, National Kapodistrian University of Athens, Greece
| | - Athanasios Protogerou
- Department of Cardiovascular Prevention & Research Unit Clinic & Laboratory of Pathophysiology, National and Kapodistrian Univ. of Athens, Greece
| | - Vijay Viswanathan
- MV Hospital for Diabetes and Professor M Viswanathan Diabetes Research Centre, Chennai, India
| | - Aditya Sharma
- Division of Cardiovascular Medicine, University of Virginia, Charlottesville, VA, USA
| | - George D Kitas
- R & D Academic Affairs, Dudley Group NHS Foundation Trust, Dudley, United Kingdom
| | - Andrew Nicolaides
- Vascular Screening and Diagnostic Centre and University of Nicosia Medical School, Nicosia, Cyprus
| | | | - Jasjit S Suri
- Stroke Monitoring and Diagnostic Division, AtheroPoint™, Roseville, CA, USA.
| |
Collapse
|
234
|
Sánchez-Cabo F, Rossello X, Fuster V, Benito F, Manzano JP, Silla JC, Fernández-Alvira JM, Oliva B, Fernández-Friera L, López-Melgar B, Mendiguren JM, Sanz J, Ordovás JM, Andrés V, Fernández-Ortiz A, Bueno H, Ibáñez B, García-Ruiz JM, Lara-Pezzi E. Machine Learning Improves Cardiovascular Risk Definition for Young, Asymptomatic Individuals. J Am Coll Cardiol 2020; 76:1674-1685. [DOI: 10.1016/j.jacc.2020.08.017] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/14/2020] [Revised: 07/27/2020] [Accepted: 08/03/2020] [Indexed: 12/21/2022]
|
235
|
Abstract
Our understanding of the development of congenital heart disease (CHD) across the lifespan has evolved. These include the evidence for the change in demographics of CHD, the observations that lifelong complications of CHD result in CHD as a lifespan disease, and the concept of long windows of exposure to risk that start in foetal life and magnify the expression of risk in adulthood. These observations set the stage for trajectories as an emerging construct to target health-service interventions. The lifelong cardiovascular and systemic complications of CHD make the long-term care of these patients challenging for cardiologists and internists alike. A life-course approach is thus required to facilitate our understanding of the natural history and to orient our clinical efforts. Three specific examples are illustrated: neurocognition; cancer resulting from exposure to low-dose ionizing radiation; and cardiovascular disease acquired in ageing adults. As patients grow, they do not just want to live longer, they want to live well. With the need to move beyond the mortality outcome, a shift in paradigm is needed. A life-course health development framework is developed for CHD. Trajectories are used as a complex construct to illustrate the patient's healthcare journey. There is a need to define disease trajectories, wellness trajectories and ageing trajectories in this population. Disease trajectories for repaired tetralogy of Fallot, transposition of the great arteries and the Fontan operation are hypothetically constructed. For clinicians, the life-course horizon helps to frame the patient's history and plan for the future. For researchers, life-course epidemiology offers a framework that will help increase the relevance of clinical enquiry and improve study design and analyses. A health-service policy framework is proposed for a growing number of conditions that start in the before birth and extend as long as contemporary survival now permits. Ultimately, the goal is the precision delivery of health services that enables lifelong health management, organization of developmental health services, and integration of vertical and horizontal health-service delivery.
Collapse
Affiliation(s)
- A Marelli
- McGill University Health Centre, RVH/Glen Site, Cardiology, McGill University Health Centre, Montreal, Québec, Canada
| |
Collapse
|
236
|
Nori VS, Hane CA, Sun Y, Crown WH, Bleicher PA. Deep neural network models for identifying incident dementia using claims and EHR datasets. PLoS One 2020; 15:e0236400. [PMID: 32970677 PMCID: PMC7514098 DOI: 10.1371/journal.pone.0236400] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Accepted: 07/06/2020] [Indexed: 01/28/2023] Open
Abstract
This study investigates the use of deep learning methods to improve the accuracy of a predictive model for dementia, and compares the performance to a traditional machine learning model. With sufficient accuracy the model can be deployed as a first round screening tool for clinical follow-up including neurological examination, neuropsychological testing, imaging and recruitment to clinical trials. Seven cohorts with two years of data, three to eight years prior to index date, and an incident cohort were created. Four trained models for each cohort, boosted trees, feed forward network, recurrent neural network and recurrent neural network with pre-trained weights, were constructed and their performance compared using validation and test data. The incident model had an AUC of 94.4% and F1 score of 54.1%. Eight years removed from index date the AUC and F1 scores were 80.7% and 25.6%, respectively. The results for the remaining cohorts were between these ranges. Deep learning models can result in significant improvement in performance but come at a cost in terms of run times and hardware requirements. The results of the model at index date indicate that this modeling can be effective at stratifying patients at risk of dementia. At this time, the inability to sustain this quality at longer lead times is more an issue of data availability and quality rather than one of algorithm choices.
Collapse
Affiliation(s)
- Vijay S. Nori
- OptumLabs, Boston, Massachusetts, United States of America
- * E-mail:
| | | | - Yezhou Sun
- OptumLabs, Boston, Massachusetts, United States of America
| | | | | |
Collapse
|
237
|
Perry BI, Upthegrove R, Crawford O, Jang S, Lau E, McGill I, Carver E, Jones PB, Khandaker GM. Cardiometabolic risk prediction algorithms for young people with psychosis: a systematic review and exploratory analysis. Acta Psychiatr Scand 2020; 142:215-232. [PMID: 32654119 DOI: 10.1111/acps.13212] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Accepted: 07/06/2020] [Indexed: 12/25/2022]
Abstract
OBJECTIVE Cardiometabolic risk prediction algorithms are common in clinical practice. Young people with psychosis are at high risk for developing cardiometabolic disorders. We aimed to examine whether existing cardiometabolic risk prediction algorithms are suitable for young people with psychosis. METHODS We conducted a systematic review and narrative synthesis of studies reporting the development and validation of cardiometabolic risk prediction algorithms for general or psychiatric populations. Furthermore, we used data from 505 participants with or at risk of psychosis at age 18 years in the ALSPAC birth cohort, to explore the performance of three algorithms (QDiabetes, QRISK3 and PRIMROSE) highlighted as potentially suitable. We repeated analyses after artificially increasing participant age to the mean age of the original algorithm studies to examine the impact of age on predictive performance. RESULTS We screened 7820 results, including 110 studies. All algorithms were developed in relatively older participants, and most were at high risk of bias. Three studies (QDiabetes, QRISK3 and PRIMROSE) featured psychiatric predictors. Age was more strongly weighted than other risk factors in each algorithm. In our exploratory analysis, calibration plots for all three algorithms implied a consistent systematic underprediction of cardiometabolic risk in the younger sample. After increasing participant age, calibration plots were markedly improved. CONCLUSION Existing cardiometabolic risk prediction algorithms cannot be recommended for young people with or at risk of psychosis. Existing algorithms may underpredict risk in young people, even in the face of other high-risk features. Recalibration of existing algorithms or a new tailored algorithm for the population is required.
Collapse
Affiliation(s)
- B I Perry
- Department of Psychiatry, University of Cambridge, Cambridge, UK.,Cambridgeshire and Peterborough NHS Foundation Trust, Cambridge, UK
| | - R Upthegrove
- Institute for Mental Health, University of Birmingham, Birmingham, UK
| | - O Crawford
- University of Cambridge School of Clinical Medicine, Cambridge, UK
| | - S Jang
- University of Cambridge School of Clinical Medicine, Cambridge, UK
| | - E Lau
- University of Cambridge School of Clinical Medicine, Cambridge, UK
| | - I McGill
- University of Cambridge School of Clinical Medicine, Cambridge, UK
| | - E Carver
- University of Cambridge School of Clinical Medicine, Cambridge, UK
| | - P B Jones
- Department of Psychiatry, University of Cambridge, Cambridge, UK.,Cambridgeshire and Peterborough NHS Foundation Trust, Cambridge, UK
| | - G M Khandaker
- Department of Psychiatry, University of Cambridge, Cambridge, UK.,Cambridgeshire and Peterborough NHS Foundation Trust, Cambridge, UK
| |
Collapse
|
238
|
Lee J, Lim JS, Chu Y, Lee CH, Ryu OH, Choi HH, Park YS, Kim C. Prediction of Coronary Artery Calcium Score Using Machine Learning in a Healthy Population. J Pers Med 2020; 10:jpm10030096. [PMID: 32825442 PMCID: PMC7565334 DOI: 10.3390/jpm10030096] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2020] [Revised: 08/12/2020] [Accepted: 08/17/2020] [Indexed: 01/05/2023] Open
Abstract
Background: Coronary artery calcium score (CACS) is a reliable predictor for future cardiovascular disease risk. Although deep learning studies using computed tomography (CT) images to predict CACS have been reported, no study has assessed the feasibility of machine learning (ML) algorithms to predict the CACS using clinical variables in a healthy general population. Therefore, we aimed to assess whether ML algorithms other than binary logistic regression (BLR) could predict high CACS in a healthy population with general health examination data. Methods: This retrospective observational study included participants who had regular health screening including coronary CT angiography. High CACS was defined by the Agatston score ≥ 100. Univariable and multivariable BLR was performed to assess predictors for high CACS in the entire dataset. When performing ML prediction for high CACS, the dataset was randomly divided into a training and test dataset with a 7:3 ratio. BLR, catboost, and xgboost algorithms with 5-fold cross-validation and grid search technique were used to find the best performing classifier. Performance comparison of each ML algorithm was evaluated with the area under the receiver operating characteristic (AUROC) curve. Results: A total of 2133 participants were included in the final analysis. Mean age and proportion of male sex were 55.4 ± 11.3 years and 1483 (69.5%), respectively. In multivariable BLR analysis, age (odds ratio [OR], 1.12; 95% confidence interval [CI], 1.10–1.15, p < 0.001), male sex (OR, 2.91; 95% CI, 1.57–5.38, p < 0.001), systolic blood pressure (OR, 1.02; 95% CI, 1.00–1.03, p = 0.019), and low-density lipoprotein cholesterol (OR, 1.00; 95% CI, 0.99–1.00, p = 0.047) were significant predictors for high CACS. Performance in predicting high CACS of xgboost was AUROC of 0.823, followed by catboost (0.750) and BLR (0.585). The comparison of AUROC between xgboost and BLR was significant (p for AUROC comparison < 0.001). Conclusions: Xgboost ML algorithm was found to be a more reliable predictor of CACS in healthy participants compared to the BLR algorithm. ML algorithms may be useful for predicting CACS with only laboratory data in healthy participants.
Collapse
Affiliation(s)
- Jongseok Lee
- School of Business Administration, Hallym University, Chuncheon 24252, Korea; (J.L.); (C.H.L.)
| | - Jae-Sung Lim
- Department of Neurology, Hallym University Sacred Heart Hospital, Anyang 14068, Korea;
| | - Younggi Chu
- Industry-University Cooperation Group, Hallym University, Chuncheon 24252, Korea;
| | - Chang Hee Lee
- School of Business Administration, Hallym University, Chuncheon 24252, Korea; (J.L.); (C.H.L.)
| | - Ohk-Hyun Ryu
- Department of Endocrinology, Chuncheon Sacred Heart Hospital, Chuncheon 24253, Korea;
| | - Hyun Hee Choi
- Department of Cardiology, Chuncheon Sacred Heart Hospital, Chuncheon 24253, Korea;
| | - Yong Soon Park
- Department of Family Medicine, Chuncheon Sacred Heart Hospital, Chuncheon 24253, Korea;
| | - Chulho Kim
- Department of Neurology, Chuncheon Sacred Heart Hospital, Chuncheon 24253, Korea
- Correspondence: ; Tel.: +82-33-240-5255; Fax: +82-33-255-6244
| |
Collapse
|
239
|
Jamthikar A, Gupta D, Saba L, Khanna NN, Araki T, Viskovic K, Mavrogeni S, Laird JR, Pareek G, Miner M, Sfikakis PP, Protogerou A, Viswanathan V, Sharma A, Nicolaides A, Kitas GD, Suri JS. Cardiovascular/stroke risk predictive calculators: a comparison between statistical and machine learning models. Cardiovasc Diagn Ther 2020; 10:919-938. [PMID: 32968651 DOI: 10.21037/cdt.2020.01.07] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Background Statistically derived cardiovascular risk calculators (CVRC) that use conventional risk factors, generally underestimate or overestimate the risk of cardiovascular disease (CVD) or stroke events primarily due to lack of integration of plaque burden. This study investigates the role of machine learning (ML)-based CVD/stroke risk calculators (CVRCML) and compares against statistically derived CVRC (CVRCStat) based on (I) conventional factors or (II) combined conventional with plaque burden (integrated factors). Methods The proposed study is divided into 3 parts: (I) statistical calculator: initially, the 10-year CVD/stroke risk was computed using 13 types of CVRCStat (without and with plaque burden) and binary risk stratification of the patients was performed using the predefined thresholds and risk classes; (II) ML calculator: using the same risk factors (without and with plaque burden), as adopted in 13 different CVRCStat, the patients were again risk-stratified using CVRCML based on support vector machine (SVM) and finally; (III) both types of calculators were evaluated using AUC based on ROC analysis, which was computed using combination of predicted class and endpoint equivalent to CVD/stroke events. Results An Institutional Review Board approved 202 patients (156 males and 46 females) of Japanese ethnicity were recruited for this study with a mean age of 69±11 years. The AUC for 13 different types of CVRCStat calculators were: AECRS2.0 (AUC 0.83, P<0.001), QRISK3 (AUC 0.72, P<0.001), WHO (AUC 0.70, P<0.001), ASCVD (AUC 0.67, P<0.001), FRScardio (AUC 0.67, P<0.01), FRSstroke (AUC 0.64, P<0.001), MSRC (AUC 0.63, P=0.03), UKPDS56 (AUC 0.63, P<0.001), NIPPON (AUC 0.63, P<0.001), PROCAM (AUC 0.59, P<0.001), RRS (AUC 0.57, P<0.001), UKPDS60 (AUC 0.53, P<0.001), and SCORE (AUC 0.45, P<0.001), while the AUC for the CVRCML with integrated risk factors (AUC 0.88, P<0.001), a 42% increase in performance. The overall risk-stratification accuracy for the CVRCML with integrated risk factors was 92.52% which was higher compared all the other CVRCStat. Conclusions ML-based CVD/stroke risk calculator provided a higher predictive ability of 10-year CVD/stroke compared to the 13 different types of statistically derived risk calculators including integrated model AECRS 2.0.
Collapse
Affiliation(s)
- Ankush Jamthikar
- Department of Electronics and Communication Engineering, Visvesvaraya National Institute of Technology, Nagpur, Maharashtra, India
| | - Deep Gupta
- Department of Electronics and Communication Engineering, Visvesvaraya National Institute of Technology, Nagpur, Maharashtra, India
| | - Luca Saba
- Department of Radiology, University of Cagliari, Cagliari, Italy
| | - Narendra N Khanna
- Department of Cardiology, Indraprastha APOLLO Hospitals, New Delhi, India
| | - Tadashi Araki
- Division of Cardiovascular Medicine, Toho University, Tokyo, Japan
| | - Klaudija Viskovic
- Department of Radiology and Ultrasound, University Hospital for Infectious Diseases, Zagreb, Croatia
| | - Sophie Mavrogeni
- Cardiology Clinic, Onassis Cardiac Surgery Center, Athens, Greece
| | - John R Laird
- Heart and Vascular Institute, Adventist Health St. Helena, St Helena, CA, USA
| | - Gyan Pareek
- Minimally Invasive Urology Institute, Brown University, Providence, Rhode Island, USA
| | - Martin Miner
- Men's Health Center, Miriam Hospital Providence, Rhode Island, USA
| | - Petros P Sfikakis
- Rheumatology Unit, National and Kapodistrian University of Athens, Athens, Greece
| | - Athanasios Protogerou
- Department of Cardiovascular Prevention & Research Unit Clinic & Laboratory of Pathophysiology, National and Kapodistrian University of Athens, Athens, Greece
| | - Vijay Viswanathan
- MV Hospital for Diabetes and Professor M Viswanathan Diabetes Research Centre, Chennai, India
| | - Aditya Sharma
- Division of Cardiovascular Medicine, University of Virginia, Charlottesville, VA, USA
| | - Andrew Nicolaides
- Vascular Screening and Diagnostic Centre and University of Nicosia Medical School, Nicosia, Cyprus
| | - George D Kitas
- R & D Academic Affairs, Dudley Group NHS Foundation Trust, Dudley, UK
| | - Jasjit S Suri
- Stroke Monitoring and Diagnostic Division, AtheroPoint™, Roseville, CA, USA
| |
Collapse
|
240
|
Kim S, Hahn JO, Youn BD. Detection and Severity Assessment of Peripheral Occlusive Artery Disease via Deep Learning Analysis of Arterial Pulse Waveforms: Proof-of-Concept and Potential Challenges. Front Bioeng Biotechnol 2020; 8:720. [PMID: 32714911 PMCID: PMC7340176 DOI: 10.3389/fbioe.2020.00720] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2020] [Accepted: 06/08/2020] [Indexed: 11/13/2022] Open
Abstract
Toward the ultimate goal of affordable and non-invasive screening of peripheral occlusive artery disease (PAD), the objective of this work is to investigate the potential of deep learning-based arterial pulse waveform analysis in detecting and assessing the severity of PAD. Using an established transmission line model of arterial hemodynamics, a large number of virtual patients associated with PAD of a wide range of severity and the corresponding arterial pulse waveform data were created. A deep convolutional neural network capable of detecting and assessing the severity of PAD based on the analysis of brachial and ankle arterial pulse waveforms was constructed, evaluated for efficacy, and compared with the state-of-the-art ankle-brachial index (ABI) using the virtual patients. The results suggested that deep learning may diagnose PAD more accurately and robustly than ABI. In sum, this work demonstrates the initial proof-of-concept of deep learning-based arterial pulse waveform analysis for affordable and convenient PAD screening as well as presents challenges that must be addressed for real-world clinical applications.
Collapse
Affiliation(s)
- Sooho Kim
- Department of Mechanical and Aerospace Engineering, Seoul National University, Seoul, South Korea
| | - Jin-Oh Hahn
- Department of Mechanical Engineering, University of Maryland, College Park, MD, United States
| | - Byeng Dong Youn
- Department of Mechanical and Aerospace Engineering, Seoul National University, Seoul, South Korea.,OnePredict, Inc., Seoul, South Korea
| |
Collapse
|
241
|
Time Series Analysis and Forecasting with Automated Machine Learning on a National ICD-10 Database. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020; 17:ijerph17144979. [PMID: 32664331 PMCID: PMC7400312 DOI: 10.3390/ijerph17144979] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/31/2020] [Revised: 06/29/2020] [Accepted: 07/07/2020] [Indexed: 12/22/2022]
Abstract
The application of machine learning (ML) for use in generating insights and making predictions on new records continues to expand within the medical community. Despite this progress to date, the application of time series analysis has remained underexplored due to complexity of the underlying techniques. In this study, we have deployed a novel ML, called automated time series (AutoTS) machine learning, to automate data processing and the application of a multitude of models to assess which best forecasts future values. This rapid experimentation allows for and enables the selection of the most accurate model in order to perform time series predictions. By using the nation-wide ICD-10 (International Classification of Diseases, Tenth Revision) dataset of hospitalized patients of Romania, we have generated time series datasets over the period of 2008–2018 and performed highly accurate AutoTS predictions for the ten deadliest diseases. Forecast results for the years 2019 and 2020 were generated on a NUTS 2 (Nomenclature of Territorial Units for Statistics) regional level. This is the first study to our knowledge to perform time series forecasting of multiple diseases at a regional level using automated time series machine learning on a national ICD-10 dataset. The deployment of AutoTS technology can help decision makers in implementing targeted national health policies more efficiently.
Collapse
|
242
|
Barriers and Recommendations for Developing a Data Commons for the Implementation and Application of Cardiovascular Disease and Diabetes Risk Scoring in the Philippines. CURR EPIDEMIOL REP 2020. [DOI: 10.1007/s40471-020-00232-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
243
|
Automated machine learning: Review of the state-of-the-art and opportunities for healthcare. Artif Intell Med 2020; 104:101822. [DOI: 10.1016/j.artmed.2020.101822] [Citation(s) in RCA: 197] [Impact Index Per Article: 39.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2019] [Revised: 01/17/2020] [Accepted: 02/17/2020] [Indexed: 12/13/2022]
|
244
|
Yang L, Wu H, Jin X, Zheng P, Hu S, Xu X, Yu W, Yan J. Study of cardiovascular disease prediction model based on random forest in eastern China. Sci Rep 2020; 10:5245. [PMID: 32251324 PMCID: PMC7090086 DOI: 10.1038/s41598-020-62133-5] [Citation(s) in RCA: 128] [Impact Index Per Article: 25.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2019] [Accepted: 02/28/2020] [Indexed: 12/13/2022] Open
Abstract
Cardiovascular disease (CVD) is the leading cause of death worldwide and a major public health concern. CVD prediction is one of the most effective measures for CVD control. In this study, 29930 subjects with high-risk of CVD were selected from 101056 people in 2014, regular follow-up was conducted using electronic health record system. Logistic regression analysis showed that nearly 30 indicators were related to CVD, including male, old age, family income, smoking, drinking, obesity, excessive waist circumference, abnormal cholesterol, abnormal low-density lipoprotein, abnormal fasting blood glucose and else. Several methods were used to build prediction model including multivariate regression model, classification and regression tree (CART), Naïve Bayes, Bagged trees, Ada Boost and Random Forest. We used the multivariate regression model as a benchmark for performance evaluation (Area under the curve, AUC = 0.7143). The results showed that the Random Forest was superior to other methods with an AUC of 0.787 and achieved a significant improvement over the benchmark. We provided a CVD prediction model for 3-year risk assessment of CVD. It was based on a large population with high risk of CVD in eastern China using Random Forest algorithm, which would provide reference for the work of CVD prediction and treatment in China.
Collapse
Affiliation(s)
- Li Yang
- Zhejiang Provincial Center for Cardiovascular Disease Control and Prevention, Zhejiang Hospital, Hangzhou, 310013, China
- Key Laboratory of Public Health Safety, Ministry of Education, Health Communication Institute, Fudan University, 138 Yixueyuan Road, Shanghai, 200032, China
| | - Haibin Wu
- Ewell Technology Co., Ltd, Tower D of Oriental Communication Technology City, Hangzhou, 310000, China
| | - Xiaoqing Jin
- Chinese Acupuncture Department, Zhejiang Hospital, Hangzhou, 310013, China
| | - Pinpin Zheng
- Key Laboratory of Public Health Safety, Ministry of Education, Health Communication Institute, Fudan University, 138 Yixueyuan Road, Shanghai, 200032, China
| | - Shiyun Hu
- Zhejiang Provincial Center for Cardiovascular Disease Control and Prevention, Zhejiang Hospital, Hangzhou, 310013, China
| | - Xiaoling Xu
- Zhejiang Provincial Center for Cardiovascular Disease Control and Prevention, Zhejiang Hospital, Hangzhou, 310013, China
| | - Wei Yu
- Zhejiang Provincial Center for Cardiovascular Disease Control and Prevention, Zhejiang Hospital, Hangzhou, 310013, China
| | - Jing Yan
- Zhejiang Provincial Center for Cardiovascular Disease Control and Prevention, Zhejiang Hospital, Hangzhou, 310013, China.
| |
Collapse
|
245
|
Abstract
Introduction: With the increase in the number of patients with cardiovascular diseases, better risk-prediction models for cardiovascular events are needed. Statistical-based risk-prediction models for cardiovascular events (CVEs) are available, but they lack the ability to predict individual-level risk. Machine learning (ML) methods are especially equipped to handle complex data and provide accurate risk-prediction models at the individual level.Areas covered: In this review, the authors summarize the literature comparing the performance of machine learning methods to that of traditional, statistical-based models in predicting CVEs. They provide a brief summary of ML methods and then discuss risk-prediction models for CVEs such as major adverse cardiovascular events, heart failure and arrhythmias.Expert opinion: Current evidence supports the superiority of ML methods over statistical-based models in predicting CVEs. Statistical models are applicable at the population level and are subject to overfitting, while ML methods can provide an individualized risk level for CVEs. Further prospective research on ML-guided treatments to prevent CVEs is needed.
Collapse
Affiliation(s)
- Brijesh Patel
- Heart and Vascular Institute, West Virginia University, Morgantown, WV, USA
| | - Partho Sengupta
- Heart and Vascular Institute, West Virginia University, Morgantown, WV, USA
| |
Collapse
|
246
|
Kwon O, Na W, Kim YH. Machine Learning: a New Opportunity for Risk Prediction. Korean Circ J 2019; 50:85-87. [PMID: 31854158 PMCID: PMC6923232 DOI: 10.4070/kcj.2019.0314] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2019] [Accepted: 10/07/2019] [Indexed: 01/16/2023] Open
Affiliation(s)
- Osung Kwon
- Division of Cardiology, Department of Internal Medicine, Eunpyeong St. Mary's Hospital, the Catholic University of Korea, Seoul, Korea
| | - Wonjun Na
- Division of Cardiology, Department of Internal Medicine, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Korea
| | - Young Hak Kim
- Division of Cardiology, Department of Internal Medicine, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Korea.
| |
Collapse
|
247
|
Cuocolo R, Perillo T, De Rosa E, Ugga L, Petretta M. Current applications of big data and machine learning in cardiology. J Geriatr Cardiol 2019; 16:601-607. [PMID: 31555327 PMCID: PMC6748901 DOI: 10.11909/j.issn.1671-5411.2019.08.002] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2019] [Revised: 07/24/2019] [Accepted: 07/26/2019] [Indexed: 12/14/2022] Open
Abstract
Machine learning (ML) is a software solution with the ability of making predictions without prior explicit programming, aiding in the analysis of large amounts of data. These algorithms can be trained through supervised or unsupervised learning. Cardiology is one of the fields of medicine with the highest interest in its applications. They can facilitate every step of patient care, reducing the margin of error and contributing to precision medicine. In particular, ML has been proposed for cardiac imaging applications such as automated computation of scores, differentiation of prognostic phenotypes, quantification of heart function and segmentation of the heart. These tools have also demonstrated the capability of performing early and accurate detection of anomalies in electrocardiographic exams. ML algorithms can also contribute to cardiovascular risk assessment in different settings and perform predictions of cardiovascular events. Another interesting research avenue in this field is represented by genomic assessment of cardiovascular diseases. Therefore, ML could aid in making earlier diagnosis of disease, develop patient-tailored therapies and identify predictive characteristics in different pathologic conditions, leading to precision cardiology.
Collapse
Affiliation(s)
- Renato Cuocolo
- Department of Advanced Biomedical Sciences, University of Naples “Federico II”, Naples, Italy
| | - Teresa Perillo
- Department of Advanced Biomedical Sciences, University of Naples “Federico II”, Naples, Italy
| | - Eliana De Rosa
- Department of Translational Medical Sciences, University of Naples “Federico II”, Naples, Italy
| | - Lorenzo Ugga
- Department of Advanced Biomedical Sciences, University of Naples “Federico II”, Naples, Italy
| | - Mario Petretta
- Department of Translational Medical Sciences, University of Naples “Federico II”, Naples, Italy
| |
Collapse
|