1
|
Chang W, Ji X, Wang L, Liu H, Zhang Y, Chen B, Zhou S. A Machine-Learning Method of Predicting Vital Capacity Plateau Value for Ventilatory Pump Failure Based on Data Mining. Healthcare (Basel) 2021; 9:1306. [PMID: 34682985 PMCID: PMC8544367 DOI: 10.3390/healthcare9101306] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Revised: 09/22/2021] [Accepted: 09/26/2021] [Indexed: 11/23/2022] Open
Abstract
Ventilatory pump failure is a common cause of death for patients with neuromuscular diseases. The vital capacity plateau value (VCPLAT) is an important indicator to judge the status of ventilatory pump failure for patients with congenital myopathy, Duchenne muscular dystrophy and spinal muscular atrophy. Due to the complex relationship between VCPLAT and the patient's own condition, it is difficult to predict the VCPLAT for pediatric disease from a medical perspective. We established a VCPLAT prediction model based on data mining and machine learning. We first performed the correlation analysis and recursive feature elimination with cross-validation (RFECV) to provide high-quality feature combinations. Based on this, the Light Gradient Boosting Machine (LightGBM) algorithm was to establish a prediction model with powerful performance. Finally, we verified the validity and superiority of the proposed method via comparison with other prediction models in similar works. After 10-fold cross-validation, the proposed prediction method had the best performance and its explained variance score (EVS), mean absolute error (MAE), mean squared error (MSE), root mean square error (RMSE), median absolute error (MedAE) and R2 were 0.949, 0.028, 0.002, 0.045, 0.015 and 0.948, respectively. It also performed well on test datasets. Therefore, it can accurately and effectively predict the VCPLAT, thereby determining the severity of the condition to provide auxiliary decision-making for doctors in clinical diagnosis and treatment.
Collapse
Affiliation(s)
- Wenbing Chang
- School of Reliability and Systems Engineering, Beihang University, Beijing 100191, China; (W.C.); (X.J.); (H.L.); (Y.Z.); (B.C.)
| | - Xinpeng Ji
- School of Reliability and Systems Engineering, Beihang University, Beijing 100191, China; (W.C.); (X.J.); (H.L.); (Y.Z.); (B.C.)
| | - Liping Wang
- Department of Neurology, Peking University Third Hospital, Beijing 100191, China;
| | - Houxiang Liu
- School of Reliability and Systems Engineering, Beihang University, Beijing 100191, China; (W.C.); (X.J.); (H.L.); (Y.Z.); (B.C.)
| | - Yue Zhang
- School of Reliability and Systems Engineering, Beihang University, Beijing 100191, China; (W.C.); (X.J.); (H.L.); (Y.Z.); (B.C.)
| | - Bang Chen
- School of Reliability and Systems Engineering, Beihang University, Beijing 100191, China; (W.C.); (X.J.); (H.L.); (Y.Z.); (B.C.)
| | - Shenghan Zhou
- School of Reliability and Systems Engineering, Beihang University, Beijing 100191, China; (W.C.); (X.J.); (H.L.); (Y.Z.); (B.C.)
| |
Collapse
|
2
|
Tran A, Walsh CJ, Batt J, Dos Santos CC, Hu P. A machine learning-based clinical tool for diagnosing myopathy using multi-cohort microarray expression profiles. J Transl Med 2020; 18:454. [PMID: 33256785 PMCID: PMC7708151 DOI: 10.1186/s12967-020-02630-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2020] [Accepted: 11/23/2020] [Indexed: 11/29/2022] Open
Abstract
BACKGROUND Myopathies are a heterogenous collection of disorders characterized by dysfunction of skeletal muscle. In practice, myopathies are frequently encountered by physicians and precise diagnosis remains a challenge in primary care. Molecular expression profiles show promise for disease diagnosis in various pathologies. We propose a novel machine learning-based clinical tool for predicting muscle disease subtypes using multi-cohort microarray expression data. MATERIALS AND METHODS Muscle tissue samples originating from 1260 patients with muscle weakness. Data was curated from 42 independent cohorts with expression profiles in public microarray gene expression repositories, which represent a broad range of patient ages and peripheral muscles. Cohorts were categorized into five muscle disease subtypes: immobility, inflammatory myopathies, intensive care unit acquired weakness (ICUAW), congenital, and chronic systemic disease. The data contains expression data on 34,099 genes. Data augmentation techniques were used to address class imbalances in the muscle disease subtypes. Support vector machine (SVM) models were trained on two-thirds of the 1260 samples based on the top selected gene signature using analysis of variance (ANOVA). The model was validated in the remaining samples using area under the receiver operator curve (AUC). Gene enrichment analysis was used to identify enriched biological functions in the gene signature. RESULTS The AUC ranges from 0.611 to 0.649 in the observed imbalanced data. Overall, using the augmented data, chronic systemic disease was the best predicted class with AUC 0.872 (95% confidence interval (CI): 0.824-0.920). The least discriminated classes were ICUAW with AUC 0.777 (95% CI: 0.668-0.887) and immobility with AUC 0.789 (95% CI: 0.716-0.861). Disease-specific gene set enrichment results showed that the gene signature was enriched in biological processes including neural precursor cell proliferation for ICUAW and aerobic respiration for congenital (false discovery rate q-value < 0.001). CONCLUSION Our results present a well-performing molecular classification tool with the selected gene markers for muscle disease classification. In practice, this tool addresses an important gap in the literature on myopathies and presents a potentially useful clinical tool for muscle disease subtype diagnosis.
Collapse
Affiliation(s)
- Andrew Tran
- Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
| | - Chris J Walsh
- Keenan Research Center for Biomedical Science, St. Michael's Hospital, Toronto, ON, Canada
- Institute of Medical Sciences and Department of Medicine, University of Toronto, Toronto, ON, Canada
| | - Jane Batt
- Keenan Research Center for Biomedical Science, St. Michael's Hospital, Toronto, ON, Canada
- Interdepartmental Division of Critical Care, St. Michael's Hospital, University of Toronto, 30 Bond Street, Room 4-008, Toronto, ON, M5B 1WB, Canada
| | - Claudia C Dos Santos
- Keenan Research Center for Biomedical Science, St. Michael's Hospital, Toronto, ON, Canada.
- Interdepartmental Division of Critical Care, St. Michael's Hospital, University of Toronto, 30 Bond Street, Room 4-008, Toronto, ON, M5B 1WB, Canada.
| | - Pingzhao Hu
- Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada.
- Department of Biochemistry and Medical Genetics, University of Manitoba, 745 Bannatyne Avenue, Winnipeg, MB, R3E 0J9, Canada.
- Research Institute in Oncology and Hematology, Winnipeg, MB, Canada.
| |
Collapse
|
3
|
Choi JY, Yoo TK, Seo JG, Kwak J, Um TT, Rim TH. Multi-categorical deep learning neural network to classify retinal images: A pilot study employing small database. PLoS One 2017; 12:e0187336. [PMID: 29095872 PMCID: PMC5667846 DOI: 10.1371/journal.pone.0187336] [Citation(s) in RCA: 110] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2017] [Accepted: 10/18/2017] [Indexed: 01/03/2023] Open
Abstract
Deep learning emerges as a powerful tool for analyzing medical images. Retinal disease detection by using computer-aided diagnosis from fundus image has emerged as a new method. We applied deep learning convolutional neural network by using MatConvNet for an automated detection of multiple retinal diseases with fundus photographs involved in STructured Analysis of the REtina (STARE) database. Dataset was built by expanding data on 10 categories, including normal retina and nine retinal diseases. The optimal outcomes were acquired by using a random forest transfer learning based on VGG-19 architecture. The classification results depended greatly on the number of categories. As the number of categories increased, the performance of deep learning models was diminished. When all 10 categories were included, we obtained results with an accuracy of 30.5%, relative classifier information (RCI) of 0.052, and Cohen's kappa of 0.224. Considering three integrated normal, background diabetic retinopathy, and dry age-related macular degeneration, the multi-categorical classifier showed accuracy of 72.8%, 0.283 RCI, and 0.577 kappa. In addition, several ensemble classifiers enhanced the multi-categorical classification performance. The transfer learning incorporated with ensemble classifier of clustering and voting approach presented the best performance with accuracy of 36.7%, 0.053 RCI, and 0.225 kappa in the 10 retinal diseases classification problem. First, due to the small size of datasets, the deep learning techniques in this study were ineffective to be applied in clinics where numerous patients suffering from various types of retinal disorders visit for diagnosis and treatment. Second, we found that the transfer learning incorporated with ensemble classifiers can improve the classification performance in order to detect multi-categorical retinal diseases. Further studies should confirm the effectiveness of algorithms with large datasets obtained from hospitals.
Collapse
Affiliation(s)
- Joon Yul Choi
- Department of Electrical and Computer Engineering, Seoul National University, Seoul, South Korea
| | - Tae Keun Yoo
- Institute of Vision Research, Department of Ophthalmology, Yonsei University College of Medicine, Seoul, South Korea
| | - Jeong Gi Seo
- Institute of Vision Research, Department of Ophthalmology, Yonsei University College of Medicine, Seoul, South Korea
| | - Jiyong Kwak
- Institute of Vision Research, Department of Ophthalmology, Yonsei University College of Medicine, Seoul, South Korea
| | - Terry Taewoong Um
- Department of Electrical & Computer Engineering, University of Waterloo, Waterloo, Ontario, Canada
| | - Tyler Hyungtaek Rim
- Institute of Vision Research, Department of Ophthalmology, Yonsei University College of Medicine, Seoul, South Korea
| |
Collapse
|
4
|
Optimal classification for the diagnosis of duchenne muscular dystrophy images using support vector machines. Int J Comput Assist Radiol Surg 2015; 11:1755-63. [PMID: 26476638 DOI: 10.1007/s11548-015-1312-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2015] [Accepted: 09/29/2015] [Indexed: 10/22/2022]
Abstract
BACKGROUND This study aimed to investigate the optimal support vector machines (SVM)-based classifier of duchenne muscular dystrophy (DMD) magnetic resonance imaging (MRI) images. METHODS T1-weighted (T1W) and T2-weighted (T2W) images of the 15 boys with DMD and 15 normal controls were obtained. Textural features of the images were extracted and wavelet decomposed, and then, principal features were selected. Scale transform was then performed for MRI images. Afterward, SVM-based classifiers of MRI images were analyzed based on the radical basis function and decomposition levels. The cost (C) parameter and kernel parameter [Formula: see text] were used for classification. Then, the optimal SVM-based classifier, expressed as [Formula: see text]), was identified by performance evaluation (sensitivity, specificity and accuracy). RESULTS Eight of 12 textural features were selected as principal features (eigenvalues [Formula: see text]). The 16 SVM-based classifiers were obtained using combination of (C, [Formula: see text]), and those with lower C and [Formula: see text] values showed higher performances, especially classifier of [Formula: see text]). The SVM-based classifiers of T1W images showed higher performance than T1W images at the same decomposition level. The T1W images in classifier of [Formula: see text]) at level 2 decomposition showed the highest performance of all, and its overall correct sensitivity, specificity, and accuracy reached 96.9, 97.3, and 97.1 %, respectively. CONCLUSION The T1W images in SVM-based classifier [Formula: see text] at level 2 decomposition showed the highest performance of all, demonstrating that it was the optimal classification for the diagnosis of DMD.
Collapse
|
5
|
Salvatore C, Cerasa A, Battista P, Gilardi MC, Quattrone A, Castiglioni I. Magnetic resonance imaging biomarkers for the early diagnosis of Alzheimer's disease: a machine learning approach. Front Neurosci 2015; 9:307. [PMID: 26388719 PMCID: PMC4555016 DOI: 10.3389/fnins.2015.00307] [Citation(s) in RCA: 123] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2015] [Accepted: 08/13/2015] [Indexed: 11/13/2022] Open
Abstract
Determination of sensitive and specific markers of very early AD progression is intended to aid researchers and clinicians to develop new treatments and monitor their effectiveness, as well as to lessen the time and cost of clinical trials. Magnetic Resonance (MR)-related biomarkers have been recently identified by the use of machine learning methods for the in vivo differential diagnosis of AD. However, the vast majority of neuroimaging papers investigating this topic are focused on the difference between AD and patients with mild cognitive impairment (MCI), not considering the impact of MCI patients who will (MCIc) or not convert (MCInc) to AD. Morphological T1-weighted MRIs of 137 AD, 76 MCIc, 134 MCInc, and 162 healthy controls (CN) selected from the Alzheimer's disease neuroimaging initiative (ADNI) cohort, were used by an optimized machine learning algorithm. Voxels influencing the classification between these AD-related pre-clinical phases involved hippocampus, entorhinal cortex, basal ganglia, gyrus rectus, precuneus, and cerebellum, all critical regions known to be strongly involved in the pathophysiological mechanisms of AD. Classification accuracy was 76% AD vs. CN, 72% MCIc vs. CN, 66% MCIc vs. MCInc (nested 20-fold cross validation). Our data encourage the application of computer-based diagnosis in clinical practice of AD opening new prospective in the early management of AD patients.
Collapse
Affiliation(s)
- Christian Salvatore
- Institute of Molecular Bioimaging and Physiology, National Research Council (IBFM-CNR) Milan, Italy
| | - Antonio Cerasa
- Neuroimaging Research Unit, Institute of Molecular Bioimaging and Physiology, National Research Council (IBFM-CNR) Catanzaro, Italy
| | - Petronilla Battista
- Institute of Molecular Bioimaging and Physiology, National Research Council (IBFM-CNR) Milan, Italy
| | - Maria C Gilardi
- Institute of Molecular Bioimaging and Physiology, National Research Council (IBFM-CNR) Milan, Italy
| | - Aldo Quattrone
- Department of Medical Sciences, Institute of Neurology, University "Magna Graecia" Catanzaro, Italy
| | - Isabella Castiglioni
- Institute of Molecular Bioimaging and Physiology, National Research Council (IBFM-CNR) Milan, Italy
| | | |
Collapse
|