1
|
Yang Y, Huang L, Gu Y, Wang Z, Liu S, Chen Q, Ning W, Hong G. Predicting cerebral infarction and transient ischemic attack in healthy individuals and those with dysmetabolism: a machine learning approach combined with routine blood tests. Sci Rep 2025; 15:13044. [PMID: 40240412 PMCID: PMC12003726 DOI: 10.1038/s41598-025-94682-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2024] [Accepted: 03/17/2025] [Indexed: 04/18/2025] Open
Abstract
Ischemic cerebral infarction is the most prevalent type of stroke, causing significant disability and death worldwide. Transient ischemic attack (TIA) is a strong predictor of subsequent stroke. Individuals with dysmetabolism, such as hypertension, hypercholesterolemia, and diabetes, are at increased risk for cerebral infarction (CI) and TIA. In resource-limited settings, diagnosing CI and TIA can be particularly difficult due to a lack of advanced imaging and specialized expertise. Therefore, we aim to develop a simple, convenient, blood-based approach that could assist clinicians in diagnosing CI and TIA, especially in regions where advanced imaging or stroke-specific expertise is limited. All study subjects were patients admitted to the First Hospital of Xiamen University and healthy check-up populations between January 2018 and September 2023. This study employed five machine learning methods alongside 21 blood routine indicators, 30 blood biochemical indicators, age, and gender to construct predictive models for CI and TIA in both healthy individuals and those with dysmetabolism. The Area Under the Receiver Operating Characteristic (ROC) Curve (AUC) served as the metric to assess the comprehensive predictive capability of the models. Subsequently, the SHAP package was employed for model interpretation. Extreme Gradient Boosting (XGBoost) outperforms other models in all predictive models. In the models predicting CI and TIA among healthy, the AUC is 0.9958 (0.9947-0.9969) and 0.9928 (0.9899-0.9951), respectively. Among the nine shared key features of the two models are indicators of glucose metabolism, lipid metabolism, and liver metabolism. In the models for predicting CI and TIA among patients with hypertension, hypercholesterolemia, diabetes, and those with all three metabolic disorders combined, the AUCs ranged from 0.6990 to 0.8591. We found that the indicators K significantly contributed to predict CI and TIA from those with dysmetabolism. Additionally, metabolic-related indicators, such as glucose (GLU) and high-density lipoprotein cholesterol (HDL-C), are ranked highly among the top ten contributing features. XGBoost performed the best in all models. It can effectively differentiate CI and TIA from healthy and dysmetabolic patients by combining blood routine and blood biochemical indicators. Moreover, it can also differentiate CI from TIA. Although any suspicious findings from this model would still require confirmatory imaging, the simplicity and low cost of blood-based testing may offer a practical adjunct for clinicians-particularly in areas lacking advanced imaging or extensive stroke expertise-and could facilitate earlier diagnostic decision-making.
Collapse
Affiliation(s)
- Yunyun Yang
- Department of Laboratory Medicine, Xiamen Key Laboratory of Genetic Testing, the First Affiliated Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen, 361003, Fujian, China
- Institute for Clinical Medical Research, the First Affiliated Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen, 361003, Fujian, China
| | - Lindan Huang
- Department of Laboratory Medicine, Xiamen Key Laboratory of Genetic Testing, the First Affiliated Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen, 361003, Fujian, China
- Institute for Clinical Medical Research, the First Affiliated Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen, 361003, Fujian, China
- School of Public Health, Xiamen University, Xiamen, 361003, Fujian, China
| | - Ying Gu
- Institute for Clinical Medical Research, the First Affiliated Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen, 361003, Fujian, China
| | - Zhicheng Wang
- Department of Laboratory Medicine, Xiamen Key Laboratory of Genetic Testing, the First Affiliated Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen, 361003, Fujian, China
- Department of Otolaryngology, School of Medicine, Xiamen University, Xiamen, 361003, Fujian, China
| | - Shuai Liu
- Institute for Clinical Medical Research, the First Affiliated Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen, 361003, Fujian, China
| | - Qun Chen
- Institute for Clinical Medical Research, the First Affiliated Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen, 361003, Fujian, China
| | - Wanshan Ning
- Institute for Clinical Medical Research, the First Affiliated Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen, 361003, Fujian, China.
| | - Guolin Hong
- Department of Laboratory Medicine, Xiamen Key Laboratory of Genetic Testing, the First Affiliated Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen, 361003, Fujian, China.
- School of Public Health, Xiamen University, Xiamen, 361003, Fujian, China.
| |
Collapse
|
2
|
Xu JS, Yang K, Quan B, Xie J, Zheng YS. A multicenter study on developing a prognostic model for severe fever with thrombocytopenia syndrome using machine learning. Front Microbiol 2025; 16:1557922. [PMID: 40177493 PMCID: PMC11962041 DOI: 10.3389/fmicb.2025.1557922] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2025] [Accepted: 03/05/2025] [Indexed: 04/05/2025] Open
Abstract
Background Severe Fever with Thrombocytopenia Syndrome (SFTS) is a disease caused by infection with the Severe Fever with Thrombocytopenia Syndrome virus (SFTSV), a novel Bunyavirus. Accurate prognostic assessment is crucial for developing individualized prevention and treatment strategies. However, machine learning prognostic models for SFTS are rare and need further improvement and clinical validation. Objective This study aims to develop and validate an interpretable prognostic model based on machine learning (ML) methods to enhance the understanding of SFTS progression. Methods This multicenter retrospective study analyzed patient data from two provinces in China. The derivation cohort included 292 patients treated at The Second Hospital of Nanjing from January 2022 to December 2023, with a 7:3 split for model training and internal validation. The external validation cohort consisted of 104 patients from The First Affiliated Hospital of Wannan Medical College during the same period. Twenty-four commonly available clinical features were selected, and the Boruta algorithm identified 12 candidate predictors, ranked by Z-scores, which were progressively incorporated into 10 machine learning models to develop prognostic models. Model performance was assessed using the area under the receiver-operating-characteristic curve (AUC), accuracy, recall, and F1 score. The clinical utility of the best-performing model was evaluated through decision curve analysis (DCA) based on net benefit. Robustness was tested with 10-fold cross-validation, and feature importance was explained using SHapley Additive exPlanation (SHAP) both globally and locally. Results Among the 10 machine learning models, the XGBoost model demonstrated the best overall discriminatory ability. Considering both AUC index and feature simplicity, a final interpretable XGBoost model with 7 key features was constructed. The model showed high predictive accuracy for patient outcomes in both internal (AUC = 0.911, 95% CI: 0.842-0.967) and external validations (AUC = 0.891, 95% CI: 0.786-0.977). A clinical tool based on this model has been developed and implemented using the Streamlit framework. Conclusion The interpretable XGBoost-based prognostic model for SFTS shows high predictive accuracy and has been translated into a clinical tool. The model's 7 key features serve as valuable indicators for early prognosis of SFTS, warranting close attention from healthcare professionals in clinical practice.
Collapse
Affiliation(s)
- Jian-She Xu
- School of Public Health, Nanjing Medical University, Nanjing, China
| | - Kai Yang
- Department of Intensive Care Unit, The Second Hospital of Nanjing, Affiliated of Nanjing University of Chinese Medicine, Nanjing, China
| | - Bin Quan
- Department of Infectious Disease, The First Affiliated Hospital of Wannan Medical College, Wuhu, China
| | - Jing Xie
- Department of Intensive Care Unit, The Second Hospital of Nanjing, Affiliated of Nanjing University of Chinese Medicine, Nanjing, China
| | - Yi-Shan Zheng
- School of Public Health, Nanjing Medical University, Nanjing, China
- Department of Intensive Care Unit, The Second Hospital of Nanjing, Affiliated of Nanjing University of Chinese Medicine, Nanjing, China
| |
Collapse
|
3
|
Kotoulas SC, Spyratos D, Porpodis K, Domvri K, Boutou A, Kaimakamis E, Mouratidou C, Alevroudis I, Dourliou V, Tsakiri K, Sakkou A, Marneri A, Angeloudi E, Papagiouvanni I, Michailidou A, Malandris K, Mourelatos C, Tsantos A, Pataka A. A Thorough Review of the Clinical Applications of Artificial Intelligence in Lung Cancer. Cancers (Basel) 2025; 17:882. [PMID: 40075729 PMCID: PMC11898928 DOI: 10.3390/cancers17050882] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2024] [Revised: 02/06/2025] [Accepted: 02/25/2025] [Indexed: 03/14/2025] Open
Abstract
According to data from the World Health Organization (WHO), lung cancer is becoming a global epidemic. It is particularly high in the list of the leading causes of death not only in developed countries, but also worldwide; furthermore, it holds the leading place in terms of cancer-related mortality. Nevertheless, many breakthroughs have been made the last two decades regarding its management, with one of the most prominent being the implementation of artificial intelligence (AI) in various aspects of disease management. We included 473 papers in this thorough review, most of which have been published during the last 5-10 years, in order to describe these breakthroughs. In screening programs, AI is capable of not only detecting suspicious lung nodules in different imaging modalities-such as chest X-rays, computed tomography (CT), and positron emission tomography (PET) scans-but also discriminating between benign and malignant nodules as well, with success rates comparable to or even better than those of experienced radiologists. Furthermore, AI seems to be able to recognize biomarkers that appear in patients who may develop lung cancer, even years before this event. Moreover, it can also assist pathologists and cytologists in recognizing the type of lung tumor, as well as specific histologic or genetic markers that play a key role in treating the disease. Finally, in the treatment field, AI can guide in the development of personalized options for lung cancer patients, possibly improving their prognosis.
Collapse
Affiliation(s)
- Serafeim-Chrysovalantis Kotoulas
- Adult ICU, General Hospital of Thessaloniki “Ippokrateio”, Konstantinoupoleos 49, 54642 Thessaloniki, Greece; (C.M.); (I.A.); (V.D.); (K.T.); (A.S.); (A.M.); (E.A.)
| | - Dionysios Spyratos
- Pulmonary Department, Unit of thoracic Malignancies Research, General Hospital of Thessaloniki “G. Papanikolaou”, Aristotle’s University of Thessaloniki, Leoforos Papanikolaou Municipality of Chortiatis, 57010 Thessaloniki, Greece; (D.S.); (K.P.); (K.D.)
| | - Konstantinos Porpodis
- Pulmonary Department, Unit of thoracic Malignancies Research, General Hospital of Thessaloniki “G. Papanikolaou”, Aristotle’s University of Thessaloniki, Leoforos Papanikolaou Municipality of Chortiatis, 57010 Thessaloniki, Greece; (D.S.); (K.P.); (K.D.)
| | - Kalliopi Domvri
- Pulmonary Department, Unit of thoracic Malignancies Research, General Hospital of Thessaloniki “G. Papanikolaou”, Aristotle’s University of Thessaloniki, Leoforos Papanikolaou Municipality of Chortiatis, 57010 Thessaloniki, Greece; (D.S.); (K.P.); (K.D.)
| | - Afroditi Boutou
- Pulmonary Department General, Hospital of Thessaloniki “Ippokrateio”, Konstantinoupoleos 49, 54642 Thessaloniki, Greece; (A.B.); (A.T.)
| | - Evangelos Kaimakamis
- 1st ICU, Medical Informatics Laboratory, General Hospital of Thessaloniki “G. Papanikolaou”, Aristotle’s University of Thessaloniki, Leoforos Papanikolaou Municipality of Chortiatis, 57010 Thessaloniki, Greece;
| | - Christina Mouratidou
- Adult ICU, General Hospital of Thessaloniki “Ippokrateio”, Konstantinoupoleos 49, 54642 Thessaloniki, Greece; (C.M.); (I.A.); (V.D.); (K.T.); (A.S.); (A.M.); (E.A.)
| | - Ioannis Alevroudis
- Adult ICU, General Hospital of Thessaloniki “Ippokrateio”, Konstantinoupoleos 49, 54642 Thessaloniki, Greece; (C.M.); (I.A.); (V.D.); (K.T.); (A.S.); (A.M.); (E.A.)
| | - Vasiliki Dourliou
- Adult ICU, General Hospital of Thessaloniki “Ippokrateio”, Konstantinoupoleos 49, 54642 Thessaloniki, Greece; (C.M.); (I.A.); (V.D.); (K.T.); (A.S.); (A.M.); (E.A.)
| | - Kalliopi Tsakiri
- Adult ICU, General Hospital of Thessaloniki “Ippokrateio”, Konstantinoupoleos 49, 54642 Thessaloniki, Greece; (C.M.); (I.A.); (V.D.); (K.T.); (A.S.); (A.M.); (E.A.)
| | - Agni Sakkou
- Adult ICU, General Hospital of Thessaloniki “Ippokrateio”, Konstantinoupoleos 49, 54642 Thessaloniki, Greece; (C.M.); (I.A.); (V.D.); (K.T.); (A.S.); (A.M.); (E.A.)
| | - Alexandra Marneri
- Adult ICU, General Hospital of Thessaloniki “Ippokrateio”, Konstantinoupoleos 49, 54642 Thessaloniki, Greece; (C.M.); (I.A.); (V.D.); (K.T.); (A.S.); (A.M.); (E.A.)
| | - Elena Angeloudi
- Adult ICU, General Hospital of Thessaloniki “Ippokrateio”, Konstantinoupoleos 49, 54642 Thessaloniki, Greece; (C.M.); (I.A.); (V.D.); (K.T.); (A.S.); (A.M.); (E.A.)
| | - Ioanna Papagiouvanni
- 4th Internal Medicine Department, General Hospital of Thessaloniki “Ippokrateio”, Aristotle’s University of Thessaloniki, Konstantinoupoleos 49, 54642 Thessaloniki, Greece;
| | - Anastasia Michailidou
- 2nd Propaedeutic Internal Medicine Department, General Hospital of Thessaloniki “Ippokrateio”, Aristotle’s University of Thessaloniki, Konstantinoupoleos 49, 54642 Thessaloniki, Greece;
| | - Konstantinos Malandris
- 2nd Internal Medicine Department, General Hospital of Thessaloniki “Ippokrateio”, Aristotle’s University of Thessaloniki, Konstantinoupoleos 49, 54642 Thessaloniki, Greece;
| | - Constantinos Mourelatos
- Biology and Genetics Laboratory, Aristotle’s University of Thessaloniki, 54624 Thessaloniki, Greece;
| | - Alexandros Tsantos
- Pulmonary Department General, Hospital of Thessaloniki “Ippokrateio”, Konstantinoupoleos 49, 54642 Thessaloniki, Greece; (A.B.); (A.T.)
| | - Athanasia Pataka
- Respiratory Failure Clinic and Sleep Laboratory, General Hospital of Thessaloniki “G. Papanikolaou”, Aristotle’s University of Thessaloniki, Leoforos Papanikolaou Municipality of Chortiatis, 57010 Thessaloniki, Greece;
| |
Collapse
|
4
|
Li S, Li M, Wu J, Li Y, Han J, Song Y, Cao W, Zhou X. Developing and validating a clinlabomics-based machine-learning model for early detection of retinal detachment in patients with high myopia. J Transl Med 2024; 22:405. [PMID: 38689321 PMCID: PMC11061938 DOI: 10.1186/s12967-024-05131-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2023] [Accepted: 03/26/2024] [Indexed: 05/02/2024] Open
Abstract
BACKGROUND Retinal detachment (RD) is a vision-threatening disorder of significant severity. Individuals with high myopia (HM) face a 2 to 6 times higher risk of developing RD compared to non-myopes. The timely identification of high myopia-related retinal detachment (HMRD) is crucial for effective treatment and prevention of additional vision impairment. Consequently, our objective was to streamline and validate a machine-learning model based on clinical laboratory omics (clinlabomics) for the early detection of RD in HM patients. METHODS We extracted clinlabomics data from the electronic health records for 24,440 HM and 5607 HMRD between 2015 and 2022. Lasso regression analysis assessed fifty-nine variables, excluding collinear variables (variance inflation factor > 10). Four models based on random forest, gradient boosting machine (GBM), generalized linear model, and Deep Learning Model were trained for HMRD diagnosis and employed for internal validation. An external test of the models was done. Three random data sets were further processed to validate the performance of the diagnostic model. The primary outcomes were the area under the receiver operating characteristic curve (AUC) and the area under the precision-recall curve (AUCPR) to diagnose HMRD. RESULTS Nine variables were selected by all models. Given the AUC and AUCPR values across the different sets, the GBM model was chosen as the final diagnostic model. The GBM model had an AUC of 0.8550 (95%CI = 0.8322-0.8967) and an AUCPR of 0.5584 (95%CI = 0.5250-0.5879) in the training set. The AUC and AUCPR in the internal validation were 0.8405 (95%CI = 0.8060-0.8966) and 0.5355 (95%CI = 0.4988-0.5732). During the external test evaluation, it reached an AUC of 0.7579 (95%CI = 0.7340-0.7840) and an AUCPR of 0.5587 (95%CI = 0.5345-0.5880). A similar discriminative capacity was observed in the three random data sets. The GBM model was well-calibrated across all the sets. The GBM-RD model was implemented into a web application that provides risk prediction for HM individuals. CONCLUSION GBM algorithms based on nine features successfully predicted the diagnosis of RD in patients with HM, which will help ophthalmologists to establish a preliminary diagnosis and to improve diagnostic accuracy in the clinic.
Collapse
Affiliation(s)
- Shengjie Li
- Department of Clinical Laboratory, Eye & ENT Hospital, Shanghai Medical College, Fudan University, Shanghai, China
| | - Meiyan Li
- Department of Ophthalmology and Optometry, Fudan University Eye Ear Nose and Throat Hospital, Shanghai, China
- NHC Key Laboratory of Myopia (Fudan University), Shanghai, China
- Key Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, 200031, China
- Shanghai Research Center of Ophthalmology and Optometry, Shanghai, China
- Shanghai Engineering Research Center of Laser and Autostereoscopic 3D for Vision Care, Shanghai, China
| | - Jianing Wu
- Department of Clinical Laboratory, Eye & ENT Hospital, Shanghai Medical College, Fudan University, Shanghai, China
| | - Yingzhu Li
- Department of Clinical Laboratory, Eye & ENT Hospital, Shanghai Medical College, Fudan University, Shanghai, China
| | - Jianping Han
- Department of Clinical Laboratory, Eye & ENT Hospital, Shanghai Medical College, Fudan University, Shanghai, China
| | - Yunxiao Song
- Department of Clinical Laboratory, Shanghai Xuhui Central Hospital, Fudan University, Shanghai, China.
| | - Wenjun Cao
- Department of Clinical Laboratory, Eye & ENT Hospital, Shanghai Medical College, Fudan University, Shanghai, China.
| | - Xingtao Zhou
- Department of Ophthalmology and Optometry, Fudan University Eye Ear Nose and Throat Hospital, Shanghai, China.
- NHC Key Laboratory of Myopia (Fudan University), Shanghai, China.
- Key Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, 200031, China.
- Shanghai Research Center of Ophthalmology and Optometry, Shanghai, China.
- Shanghai Engineering Research Center of Laser and Autostereoscopic 3D for Vision Care, Shanghai, China.
| |
Collapse
|
5
|
Nabeel SM, Bazai SU, Alasbali N, Liu Y, Ghafoor MI, Khan R, Ku CS, Yang J, Shahab S, Por LY. Optimizing lung cancer classification through hyperparameter tuning. Digit Health 2024; 10:20552076241249661. [PMID: 38698834 PMCID: PMC11064752 DOI: 10.1177/20552076241249661] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Accepted: 04/04/2024] [Indexed: 05/05/2024] Open
Abstract
Artificial intelligence is steadily permeating various sectors, including healthcare. This research specifically addresses lung cancer, the world's deadliest disease with the highest mortality rate. Two primary factors contribute to its onset: genetic predisposition and environmental factors, such as smoking and exposure to pollutants. Recognizing the need for more effective diagnosis techniques, our study embarked on devising a machine learning strategy tailored to boost precision in lung cancer detection. Our aim was to devise a diagnostic method that is both less invasive and cost-effective. To this end, we proposed four methods, benchmarking them against prevalent techniques using a universally recognized dataset from Kaggle. Among our methods, one emerged as particularly promising, outperforming the competition in accuracy, precision and sensitivity. This method utilized hyperparameter tuning, focusing on the Gamma and C parameters, which were set at a value of 10. These parameters influence kernel width and regularization strength, respectively. As a result, we achieved an accuracy of 99.16%, a precision of 98% and a sensitivity rate of 100%. In conclusion, our enhanced prediction mechanism has proven to surpass traditional and contemporary strategies in lung cancer detection.
Collapse
Affiliation(s)
- Syed Muhammad Nabeel
- Department of Computer Engineering, Balochistan University of Information Technology, Engineering, and Management Sciences (BUITEMS), Quetta, Balochistan, Pakistan
| | - Sibghat Ullah Bazai
- Department of Computer Engineering, Balochistan University of Information Technology, Engineering, and Management Sciences (BUITEMS), Quetta, Balochistan, Pakistan
| | - Nada Alasbali
- Department of Informatics and Computing Systems, College of Computer Science, King Khalid University, Abha, Saudi Arabia
| | - Yifan Liu
- Department of Electronic Science, Binhai College of Nankai University, Tianjing, China
| | | | - Rozi Khan
- Department of Computer Science, National University of Sciences and Technology (NUST) Balochistan Campus Quetta, Quetta, Balochistan, Pakistan
| | - Chin Soon Ku
- Department of Computer Science, Universiti Tunku Abdul Rahman, Kampar, Malaysia
| | - Jing Yang
- Department of Computer System and Technology, Faculty of Computer Science and Information Technology, Universiti Malaya, Kuala Lumpur, Malaysia
| | - Sana Shahab
- Department of Business Administration, College of Business Administration, Princess Nourah Bint Abdulrahman University, Riyadh, Saudi Arabia
| | - Lip Yee Por
- Department of Computer System and Technology, Faculty of Computer Science and Information Technology, Universiti Malaya, Kuala Lumpur, Malaysia
| |
Collapse
|
6
|
Bentick K, Runevic J, Akula S, Kyriacou T, Cool P, Andras P. Machine learning models based on routinely sampled blood tests can predict the presence of malignancy amongst patients with suspected musculoskeletal malignancy. Methods 2023; 220:55-60. [PMID: 37951558 DOI: 10.1016/j.ymeth.2023.10.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 10/09/2023] [Accepted: 10/26/2023] [Indexed: 11/14/2023] Open
Abstract
AIMS This study explores the possibility of using routinely taken blood tests in the diagnosis and triage of patients with suspected musculoskeletal malignancy. METHODS A retrospective study was performed on results of patients who had presented for assessment to a regional musculoskeletal tumour unit. Blood results of patients with a histologically confirmed diagnosis between 2010 and 2020 were retrieved. 33 distinct blood tests were available for model forming. Results were standardised by calculating z-scores. Data were split into a training set (70%) and a test set (30%). The training set was balanced by resampling underrepresented classes. The random forest algorithm performed best and was selected for model forming. Receiver operating characteristic curves were used to find the optimum threshold. Models were calibrated and performance metrics evaluated with confusion tables. RESULTS 2371 patients formed the study population. 1080 had a malignant diagnosis in one of three categories: sarcoma, metastasis, or haematological malignancy. 1291 had a benign condition. Metastasis could be predicted with an accuracy of 79% (AUC 87%, sensitivity 79%, specificity 80% NPV 91%). Haematological malignancy accuracy 79% (AUC 81%, sensitivity 77%, specificity 79%, NPV 97%). Sarcoma accuracy 64% (AUC 73%, sensitivity 76%, specificity 61%, NPV 88%) and all malignancy accuracy 74% (AUC 80%, sensitivity 72%, specificity 75%, NPV 76%). CONCLUSION Routinely performed blood tests can be useful in triage of musculoskeletal tumours and can be used to predict presence of musculoskeletal malignancy.
Collapse
Affiliation(s)
- Kieran Bentick
- Robert Jones and Agnes Hunt Orthopaedic Hospital, Oswestry SY10 7AG, United Kingdom
| | | | | | | | - Paul Cool
- Robert Jones and Agnes Hunt Orthopaedic Hospital, Oswestry SY10 7AG, United Kingdom; Keele University, Keele ST5 5BG, United Kingdom.
| | | |
Collapse
|
7
|
Choudhary A, Yu J, Kouznetsova VL, Kesari S, Tsigelny IF. Two-Stage Deep-Learning Classifier for Diagnostics of Lung Cancer Using Metabolites. Metabolites 2023; 13:1055. [PMID: 37887380 PMCID: PMC10609149 DOI: 10.3390/metabo13101055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 09/18/2023] [Accepted: 09/26/2023] [Indexed: 10/28/2023] Open
Abstract
We developed a machine-learning system for the selective diagnostics of adenocarcinoma (AD), squamous cell carcinoma (SQ), and small-cell carcinoma lung (SC) cancers based on their metabolomic profiles. The system is organized as two-stage binary classifiers. The best accuracy for classification is 92%. We used the biomarkers sets that contain mostly metabolites related to cancer development. Compared to traditional methods, which exclude hierarchical classification, our method splits a challenging multiclass task into smaller tasks. This allows a two-stage classifier, which is more accurate in the scenario of lung cancer classification. Compared to traditional methods, such a "divide and conquer strategy" gives much more accurate and explainable results. Such methods, including our algorithm, allow for the systematic tracking of each computational step.
Collapse
Affiliation(s)
- Ashvin Choudhary
- School of Life Science, University of California, Los Angeles, CA 90095, USA;
| | - Jianpeng Yu
- School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA;
| | - Valentina L. Kouznetsova
- San Diego Supercomputer Center, University of California, San Diego, CA 92093, USA;
- IUL, La Jolla, CA 92038, USA
- CureScience Institute, San Diego, CA 92121, USA
| | - Santosh Kesari
- Pacific Neuroscience Institute, Santa Monica, CA 90404, USA;
| | - Igor F. Tsigelny
- San Diego Supercomputer Center, University of California, San Diego, CA 92093, USA;
- IUL, La Jolla, CA 92038, USA
- CureScience Institute, San Diego, CA 92121, USA
- Department of Neurosciences, University of California, San Diego, CA 92093, USA
| |
Collapse
|
8
|
Zhai Y, Lin X, Wei Q, Pu Y, Pang Y. Interpretable prediction of cardiopulmonary complications after non-small cell lung cancer surgery based on machine learning and SHapley additive exPlanations. Heliyon 2023; 9:e17772. [PMID: 37483738 PMCID: PMC10359813 DOI: 10.1016/j.heliyon.2023.e17772] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2023] [Revised: 06/26/2023] [Accepted: 06/27/2023] [Indexed: 07/25/2023] Open
Abstract
Introduction Lung cancer is a prevalent malignancy globally, with approximately 20% of patients developing cardiopulmonary complications after lobectomy. In order to prevent complications, an accurate and personalized method based on machine learning (ML) is required. Methods During the period of 2017-2021, a retrospective analysis was conducted on the medical records of patients who had undergone lobectomy for non-small cell lung cancer (NSCLC). We performed logical regression, decision tree (DT), random forest (RF), gradient boost DT, and eXtreme gradient boosting analyses to establish an ML model. The ten-fold cross-validation was used to evaluate the performance of multiple ML models based on various evaluation metrics, including accuracy, precision, recall, F1 score, and area under the receiver operating (AUC). Additionally, we also calculated the Kappa value of these model. Each model used grid search to optimize hyper-parameters and then used the interpretability method to provide explanations for the model's Decisions. Results The study included 718 eligible patients, among whom the incidence of postoperative cardiopulmonary complications was 20.89%. The RF model showed the best comprehensive performance among all models, and its ten-fold cross-validation accuracy, precision, recall, F1 score, and AUC were (OR and 95% confidence interval [CI]) 0.786 (0.738-0.834), 0.803 (0.735-0.872), 0.738 (0.678-0.797), 0.766 (0.714-0.818), 0.856 (0.815-0.898), respectively. The kappa value of the RF model was 0.696 (0.617-0.768). The SHAP method showed that gender, age, and intraoperative blood loss were closely associated with postoperative cardiopulmonary complications. Conclusion The application of ML methods for predicting postoperative cardiopulmonary complications based on clinical data of patients with NSCLC showed a good performance. The results indicate that ML combined with the SHAP individualized interpretation method has practical clinical value.
Collapse
Affiliation(s)
- Yihai Zhai
- Guangxi Medical University Cancer Hospital, Department of Thoracic Surgery, Nanning, 530021, China
| | - Xue Lin
- The Second Affiliated Hospital of Guangxi Medical University, Department of Oncology, Nanning, 530000, China
| | - Qiaolin Wei
- Guangxi Medical University Cancer Hospital, Department of Interventional Therapy, Nanning, 530021, China
| | - Yuanjin Pu
- Guangxi Medical University Cancer Hospital, Department of Thoracic Surgery, Nanning, 530021, China
| | - Yonghui Pang
- Guangxi Medical University Cancer Hospital, Department of Thoracic Surgery, Nanning, 530021, China
| |
Collapse
|
9
|
Huang X, Xie B, Long J, Chen H, Zhang H, Fan L, Chen S, Chen K, Wei Y. Prediction of risk factors for scrub typhus from 2006 to 2019 based on random forest model in Guangzhou, China. Trop Med Int Health 2023. [PMID: 37230481 DOI: 10.1111/tmi.13896] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
OBJECTIVES Scrub typhus is an increasingly serious public health problem, which is becoming the most common vector-borne disease in Guangzhou. This study aimed to analyse the correlation between scrub typhus incidence and potential factors and rank the importance of influential factors. METHODS We collected monthly scrub typhus cases, meteorological variables, rodent density (RD), Normalised Difference Vegetation Index (NDVI), and land use type in Guangzhou from 2006 to 2019. Correlation analysis and a random forest model were used to identify the risk factors for scrub typhus and predict the importance rank of influencing factors related to scrub typhus incidence. RESULTS The epidemiological results of the scrub typhus cases in Guangzhou between 2006 and 2019 showed that the incidence rate was on the rise. The results of correlation analysis revealed that a positive relationship between scrub typhus incidence and meteorological factors of mean temperature (Tmean ), accumulative rainfall (RF), relative humidity (RH), sunshine hours (SH), and NDVI, RD, population density, and green land coverage area (all p < 0.001). Additionally, we tested the relationship between the incidence of scrub typhus and the lagging meteorological factors through cross-correlation function, and found that incidence was positively correlated with 1-month lag Tmean , 2-month lag RF, 2-month lag RH, and 6-month lag SH (all p < 0.001). Based on the random forest model, we found that the Tmean was the most important predictor among the influential factors, followed by NDVI. CONCLUSIONS Meteorological factors, NDVI, RD, and land use type jointly affect the incidence of scrub typhus in Guangzhou. Our results provide a better understanding of the influential factors correlated with scrub typus, which can improve our capacity for biological monitoring and help public health authorities to formulate disease control strategies.
Collapse
Affiliation(s)
- Xiaobin Huang
- School of Public Health, Sun Yat-Sen University, Guangzhou, China
- Department of Parasitic Disease and Endemic Disease Control and Prevention, Guangzhou Center for Disease Control and Prevention, Guangzhou, China
| | - Binbin Xie
- Department of Surveillance and Control, Hainan Tropical Diseases Research Center, Haikou, China
| | - Jiali Long
- Department of Parasitic Disease and Endemic Disease Control and Prevention, Guangzhou Center for Disease Control and Prevention, Guangzhou, China
| | - Haiyan Chen
- Department of Parasitic Disease and Endemic Disease Control and Prevention, Guangzhou Center for Disease Control and Prevention, Guangzhou, China
| | - Hao Zhang
- Department of Parasitic Disease and Endemic Disease Control and Prevention, Guangzhou Center for Disease Control and Prevention, Guangzhou, China
| | - Lirui Fan
- Department of Parasitic Disease and Endemic Disease Control and Prevention, Guangzhou Center for Disease Control and Prevention, Guangzhou, China
| | - Shouyi Chen
- Department of Parasitic Disease and Endemic Disease Control and Prevention, Guangzhou Center for Disease Control and Prevention, Guangzhou, China
| | - Kuncai Chen
- Department of Parasitic Disease and Endemic Disease Control and Prevention, Guangzhou Center for Disease Control and Prevention, Guangzhou, China
| | - Yuehong Wei
- Department of Parasitic Disease and Endemic Disease Control and Prevention, Guangzhou Center for Disease Control and Prevention, Guangzhou, China
| |
Collapse
|
10
|
Li S, Li M, Wu J, Li Y, Han J, Cao W, Zhou X. Development and validation of a routine blood parameters-based model for screening the occurrence of retinal detachment in high myopia in the context of PPPM. EPMA J 2023. [PMCID: PMC10015135 DOI: 10.1007/s13167-023-00319-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/17/2023]
Abstract
Background/aims Timely detection and treatment of retinal detachment (RD) could effectively save vision and reduce the risk of progressing visual field defects. High myopia (HM) is known to be associated with an increased risk of RD. Evidently, it should be clearly discriminated the individuals with high or low risk of RD in patients with HM. By using multi-parametric analysis, risk assessment, and other techniques, it is crucial to create cutting-edge screening programs that may be utilized to improve population eye health and develop person-specific, cost-effective preventative, and targeted therapeutic measures. Therefore, we propose a novel, routine blood parameters-based prediction model as a screening program to help distinguish who should offer detailed ophthalmic examinations for RD diagnosis, prevent visual field defect progression, and provide personalized, serial monitoring in the context of predictive, preventive, and personalized medicine (PPPM/3 PM). Methods This population-based study included 20,870 subjects (HM = 19,284, HMRD = 1586) who underwent detailed routine blood tests and ophthalmic evaluations. HMRD cases and HM controls were matched using a nested case-control design. Then, the HMRD cases and HM controls were randomly assigned to the discovery cohort, validation cohort 1, and validation cohort 2 maintaining a 6:2:2 ratio, and other subjects were assigned to the HM validation cohort. Receiver operating characteristic curve analysis was performed to select feature indexes. Feature indexes were integrated into seven algorithm models, and an optimal model was selected based on the highest area under the curve (AUC) and accuracy. Results Six feature indexes were selected: lymphocyte, basophil, mean platelet volume, platelet distribution width, neutrophil-to-lymphocyte ratio, and lymphocyte-to-monocyte ratio. Among the algorithm models, the algorithm of conditional probability (ACP) showed the best performance achieving an AUC of 0.79, a diagnostic accuracy of 0.72, a sensitivity of 0.71, and a specificity of 0.74 in the discovery cohort. A good performance of the ACP model was also observed in the validation cohort 1 (AUC = 0.81, accuracy = 0.72, sensitivity = 0.71, specificity = 0.73) and validation cohort 2 (AUC = 0.77, accuracy = 0.71, sensitivity = 0.70, specificity = 0.72). In addition, ACP model calibration was found to be good across three cohorts. In the HM validation cohort, the ACP model achieved a diagnostic accuracy of 0.81 for negative classification. Conclusion We have developed a routine blood parameters-based model with an ACP algorithm that could potentially be applied in the clinic with a PPPM approach for serial monitoring and predicting the occurrence of RD in HM and can facilitate the prevention of HM progression to RD. According to the current study, routine blood measures are essential in patient risk classification, predictive diagnosis, and targeted therapy. Therefore, for high-risk RD persons, novel screening programs and prompt treatment plans are essential to enhance individual outcomes and healthcare offered to the community with HM. Supplementary Information The online version contains supplementary material available at 10.1007/s13167-023-00319-3.
Collapse
Affiliation(s)
- Shengjie Li
- Department of Clinical Laboratory, Eye & ENT Hospital, Shanghai Medical College, Fudan University, Shanghai, China
- Eye Institute and Department of Ophthalmology, Eye & ENT Hospital of Fudan University, Shanghai, China
- NHC Key Laboratory of Myopia, Shanghai, China
- Shanghai Research Center of Ophthalmology and Optometry, Shanghai, China
| | - Meiyan Li
- Eye Institute and Department of Ophthalmology, Eye & ENT Hospital of Fudan University, Shanghai, China
- NHC Key Laboratory of Myopia, Shanghai, China
- Shanghai Research Center of Ophthalmology and Optometry, Shanghai, China
- Key Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, China
- Shanghai Engineering Research Center of Laser and Autostereoscopic 3D for Vision Care, Shanghai, China
| | - Jianing Wu
- Department of Clinical Laboratory, Eye & ENT Hospital, Shanghai Medical College, Fudan University, Shanghai, China
| | - Yingzhu Li
- Department of Clinical Laboratory, Eye & ENT Hospital, Shanghai Medical College, Fudan University, Shanghai, China
| | - Jianping Han
- Department of Clinical Laboratory, Eye & ENT Hospital, Shanghai Medical College, Fudan University, Shanghai, China
| | - Wenjun Cao
- Department of Clinical Laboratory, Eye & ENT Hospital, Shanghai Medical College, Fudan University, Shanghai, China
- Eye Institute and Department of Ophthalmology, Eye & ENT Hospital of Fudan University, Shanghai, China
- NHC Key Laboratory of Myopia, Shanghai, China
- Shanghai Research Center of Ophthalmology and Optometry, Shanghai, China
| | - Xingtao Zhou
- Eye Institute and Department of Ophthalmology, Eye & ENT Hospital of Fudan University, Shanghai, China
- NHC Key Laboratory of Myopia, Shanghai, China
- Shanghai Research Center of Ophthalmology and Optometry, Shanghai, China
- Key Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, China
- Shanghai Engineering Research Center of Laser and Autostereoscopic 3D for Vision Care, Shanghai, China
| |
Collapse
|
11
|
Wen X, Leng P, Wang J, Yang G, Zu R, Jia X, Zhang K, Mengesha BA, Huang J, Wang D, Luo H. Clinlabomics: leveraging clinical laboratory data by data mining strategies. BMC Bioinformatics 2022; 23:387. [PMID: 36153474 PMCID: PMC9509545 DOI: 10.1186/s12859-022-04926-1] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2021] [Accepted: 09/13/2022] [Indexed: 11/29/2022] Open
Abstract
The recent global focus on big data in medicine has been associated with the rise of artificial intelligence (AI) in diagnosis and decision-making following recent advances in computer technology. Up to now, AI has been applied to various aspects of medicine, including disease diagnosis, surveillance, treatment, predicting future risk, targeted interventions and understanding of the disease. There have been plenty of successful examples in medicine of using big data, such as radiology and pathology, ophthalmology cardiology and surgery. Combining medicine and AI has become a powerful tool to change health care, and even to change the nature of disease screening in clinical diagnosis. As all we know, clinical laboratories produce large amounts of testing data every day and the clinical laboratory data combined with AI may establish a new diagnosis and treatment has attracted wide attention. At present, a new concept of radiomics has been created for imaging data combined with AI, but a new definition of clinical laboratory data combined with AI has lacked so that many studies in this field cannot be accurately classified. Therefore, we propose a new concept of clinical laboratory omics (Clinlabomics) by combining clinical laboratory medicine and AI. Clinlabomics can use high-throughput methods to extract large amounts of feature data from blood, body fluids, secretions, excreta, and cast clinical laboratory test data. Then using the data statistics, machine learning, and other methods to read more undiscovered information. In this review, we have summarized the application of clinical laboratory data combined with AI in medical fields. Undeniable, the application of Clinlabomics is a method that can assist many fields of medicine but still requires further validation in a multi-center environment and laboratory.
Collapse
|
12
|
A Self-Activated CNN Approach for Multi-Class Chest-Related COVID-19 Detection. APPLIED SCIENCES-BASEL 2021. [DOI: 10.3390/app11199023] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Chest diseases can be dangerous and deadly. They include many chest infections such as pneumonia, asthma, edema, and, lately, COVID-19. COVID-19 has many similar symptoms compared to pneumonia, such as breathing hardness and chest burden. However, it is a challenging task to differentiate COVID-19 from other chest diseases. Several related studies proposed a computer-aided COVID-19 detection system for the single-class COVID-19 detection, which may be misleading due to similar symptoms of other chest diseases. This paper proposes a framework for the detection of 15 types of chest diseases, including the COVID-19 disease, via a chest X-ray modality. Two-way classification is performed in proposed Framework. First, a deep learning-based convolutional neural network (CNN) architecture with a soft-max classifier is proposed. Second, transfer learning is applied using fully-connected layer of proposed CNN that extracted deep features. The deep features are fed to the classical Machine Learning (ML) classification methods. However, the proposed framework improves the accuracy for COVID-19 detection and increases the predictability rates for other chest diseases. The experimental results show that the proposed framework, when compared to other state-of-the-art models for diagnosing COVID-19 and other chest diseases, is more robust, and the results are promising.
Collapse
|
13
|
Liu H, Wang X, Tang K, Peng E, Xia D, Chen Z. Machine learning-assisted decision-support models to better predict patients with calculous pyonephrosis. Transl Androl Urol 2021; 10:710-723. [PMID: 33718073 PMCID: PMC7947454 DOI: 10.21037/tau-20-1208] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
Background To develop a machine learning (ML)-assisted model capable of accurately identifying patients with calculous pyonephrosis before making treatment decisions by integrating multiple clinical characteristics. Methods We retrospectively collected data from patients with obstructed hydronephrosis who underwent retrograde ureteral stent insertion, percutaneous nephrostomy (PCN), or percutaneous nephrolithotomy (PCNL). The study cohort was divided into training and testing datasets in a 70:30 ratio for further analysis. We developed 5 ML-assisted models from 22 clinical features using logistic regression (LR), LR optimized by least absolute shrinkage and selection operator (Lasso) regularization (Lasso-LR), support vector machine (SVM), extreme gradient boosting (XGBoost), and random forest (RF). The area under the curve (AUC) was applied to determine the model with the highest discrimination. Decision curve analysis (DCA) was used to investigate the clinical net benefit associated with using the predictive models. Results A total of 322 patients were included, with 225 patients in the training dataset, and 97 patients in the testing dataset. The XGBoost model showed good discrimination with the AUC, accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) of 0.981, 0.991, 0.962, 1.000, 1.000, and 0.989, respectively, followed by SVM [AUC =0.985, 95% confidence interval (CI): 0.970–1.000], Lasso-LR (AUC =0.977, 95% CI: 0.958–0.996), LR (AUC =0.936, 95% CI: 0.905–0.968), and RF (AUC =0.920, 95% CI: 0.870–0.970). Validation of the model showed that SVM yielded the highest AUC (0.977, 95% CI: 0.952–1.000), followed by Lasso-LR (AUC =0.959, 95% CI: 0.921–0.997), XGBoost (AUC =0.958, 95% CI: 0.902–1.000), LR (AUC =0.932, 95% CI: 0.878–0.987), and RF (AUC =0.868, 95% CI: 0.779–0.958) in the testing dataset. Conclusions Our ML-based models had good discrimination in predicting patients with obstructed hydronephrosis at high risk of harboring pyonephrosis, and the use of these models may be greatly beneficial to urologists in treatment planning, patient selection, and decision-making.
Collapse
Affiliation(s)
- Hailang Liu
- Department of Urology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Xinguang Wang
- Department of Urology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Kun Tang
- Department of Urology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Ejun Peng
- Department of Urology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Ding Xia
- Department of Urology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China
| | - Zhiqiang Chen
- Department of Urology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China
| |
Collapse
|
14
|
Zhao J, Wu J, Wei J, Su X, Chai Y, Li S, Wang Z. Liq_ccRCC: Identification of Clear Cell Renal Cell Carcinoma Based on the Integration of Clinical Liquid Indices. Front Oncol 2021; 10:605769. [PMID: 33585225 PMCID: PMC7873977 DOI: 10.3389/fonc.2020.605769] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2020] [Accepted: 11/09/2020] [Indexed: 12/01/2022] Open
Abstract
Currently, preoperative diagnosis and differentiation of renal clear cell carcinoma and other subtypes remain a serious challenge for doctors. The liquid biopsy technique and artificial intelligence have inspired the pursuit of distinguishing clear cell renal cell carcinoma using clinically available test data. In this work, a method called liq_ccRCC based on the integration of clinical blood and urine indices through machine learning approaches was successfully designed to achieve this goal. Clinically available biochemical blood data and urine indices were collected from 306 patients with renal cell carcinoma. Finally, the integration of 18 top-ranked clinical liquid indices (13 blood samples and 5 urine samples) was proven to be able to distinguish renal clear cell carcinoma from other subtypes of renal carcinoma by cross-valuation with an AUC of 0.9372. The successful introduction of this identification method suggests that subtype differentiation of renal cell carcinoma can be accomplished based on clinical liquid test data, which is noninvasive and easy to perform. It has huge potential to be developed as a promising innovation strategy for preoperative subtype differentiation of renal cell carcinoma with the advantages of convenience and real-time testing. liq_ccRCC is available online for the free test of readers at http://lishuyan.lzu.edu.cn/liq_ccRCC.
Collapse
Affiliation(s)
- Jianhong Zhao
- Department of Radiology, Lanzhou University Second Hospital, Lanzhou, China
| | - Jiangpeng Wu
- Department of Chemistry and Chemical Engineering, Lanzhou University, Lanzhou, China
| | - Jinyan Wei
- Department of Radiology, Lanzhou University Second Hospital, Lanzhou, China
| | - Xiaolu Su
- Department of Pathology, Lanzhou University Second Hospital, Lanzhou, China
| | - Yanjun Chai
- Department of Radiology, Lanzhou University Second Hospital, Lanzhou, China
| | - Shuyan Li
- Department of Chemistry and Chemical Engineering, Lanzhou University, Lanzhou, China
| | - Zhiping Wang
- Institute of Urology, Lanzhou University Second Hospital, Key Laboratory of Gansu Province for Urological Diseases, Clinical Center of Gansu Province for Nephrourology, Lanzhou, China
| |
Collapse
|
15
|
Liu H, Tang K, Peng E, Wang L, Xia D, Chen Z. Predicting Prostate Cancer Upgrading of Biopsy Gleason Grade Group at Radical Prostatectomy Using Machine Learning-Assisted Decision-Support Models. Cancer Manag Res 2020; 12:13099-13110. [PMID: 33376402 PMCID: PMC7765752 DOI: 10.2147/cmar.s286167] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Accepted: 11/25/2020] [Indexed: 01/30/2023] Open
Affiliation(s)
- Hailang Liu
- Department of Urology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan430030, Hubei, People’s Republic of China
| | - Kun Tang
- Department of Urology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan430030, Hubei, People’s Republic of China
| | - Ejun Peng
- Department of Urology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan430030, Hubei, People’s Republic of China
| | - Liang Wang
- Department of Radiology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan430030, Hubei, People’s Republic of China
| | - Ding Xia
- Department of Urology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan430030, Hubei, People’s Republic of China
- Correspondence: Ding Xia; Zhiqiang Chen Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, No. 1095 Jiefang Avenue, Wuhan430030, Hubei, People’s Republic of China Email ;
| | - Zhiqiang Chen
- Department of Urology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan430030, Hubei, People’s Republic of China
| |
Collapse
|
16
|
Zhang P, Wu J, Zhai H, Li S. ABCModeller: an automatic data mining tool based on a consistent voting method with a user-friendly graphical interface. Brief Bioinform 2020; 22:5924101. [PMID: 33057581 DOI: 10.1093/bib/bbaa247] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2020] [Revised: 08/27/2020] [Accepted: 09/03/2020] [Indexed: 02/06/2023] Open
Abstract
In order to extract useful information from a huge amount of biological data nowadays, simple and convenient tools are urgently needed for data analysis and modeling. In this paper, an automatic data mining tool, termed as ABCModeller (Automatic Binary Classification Modeller), with a user-friendly graphical interface was developed here, which includes automated functions as data preprocessing, significant feature extraction, classification modeling, model evaluation and prediction. In order to enhance the generalization ability of the final model, a consistent voting method was built here in this tool with the utilization of three popular machine-learning algorithms, as artificial neural network, support vector machine and random forest. Besides, Fibonacci search and orthogonal experimental design methods were also employed here to automatically select significant features in the data space and optimal hyperparameters of the three algorithms to achieve the best model. The reliability of this tool has been verified through multiple benchmark data sets. In addition, with the advantage of a user-friendly graphical interface of this tool, users without any programming skills can easily obtain reliable models directly from original data, which can reduce the complexity of modeling and data mining, and contribute to the development of related research including but not limited to biology. The excitable file of this tool can be downloaded from http://lishuyan.lzu.edu.cn/ABCModeller.rar.
Collapse
|
17
|
Liang W, Liu D, Li M, Wang W, Qin Z, Zhang J, Zhang Y, Hu Y, Bao H, Xiang Y, Wang B, Wu J, Sun J, Hu C, Ye X, Zhang X, Xiao W, Yun C, Sun D, Wang W, Chang N, Zhang Y, Zhao J, Zhang X, Xu J, Wu D, Liu X, Guo Y, Zhang Q, Zhang W, Yang L, Li Z, Zhang X, Han B, Tong Z, He J, Qu J, Fan JB, Zhong N. Evaluating the diagnostic accuracy of a ctDNA methylation classifier for incidental lung nodules: protocol for a prospective, observational, and multicenter clinical trial of 10,560 cases. Transl Lung Cancer Res 2020; 9:2016-2026. [PMID: 33209621 PMCID: PMC7653103 DOI: 10.21037/tlcr-20-701] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Background Lung nodules are a diagnostic challenge. Current clinical management of lung nodule patients is inefficient and therefore causes patient misclassification, which increases healthcare expenses. However, a precise and robust lung nodule classifier to minimize discomfort for patients and healthcare costs is still lacking. The aim of the present protocol is to evaluate the effectiveness of using a liquid biopsy classifier to diagnose nodules compared to physician estimates and whether the classifier can reduce the number of unnecessary biopsies in benign cases. Methods A prospective cohort of 10,560 patients enrolled at 23 clinical centers in China with non-calcified pulmonary nodules, ranging from 0.5 to 3 cm in diameter, indicated by LDCT or CT will be included. After signed consent forms, the participants’ pulmonary nodules will be assessed using three evaluation tools: (I) physician cancer probability estimates (II) validated lung nodule risk models, including Mayo Clinic and Veteran’s Affairs models (III) ctDNA methylation classifier previously established. Each patient will undergo LDCT/CT follow-ups for 2 to 3 years and their information and one blood sample will be collected at baseline, 3, 6, 12, 24 and 36 months. The primary study outcomes will be the diagnostic accuracy of the methylation classifier in the cohort. Sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) will be used to compare the diagnostic value of each testing tool in differentiating benign and malignant pulmonary nodules. Discussion We are conducting an observational study to explore the accuracy of using a ctDNA methylation classifier for incidental lung nodules diagnosis Trial registration Clinicaltrials.gov NCT03651986.
Collapse
Affiliation(s)
- Wenhua Liang
- Department of Thoracic Surgery/Oncology, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou Institute of Respiratory Disease & Health, China State Key Laboratory and National Clinical Research Center for Respiratory Disease, Guangzhou, China
| | - Dan Liu
- Department of Respiratory Medicine, West China Hospital of Sichuan University, Chengdu, China
| | - Min Li
- Department of Respiratory Medicine, Xiangya Cancer Center, Xiangya Hospital, Central South University, Changsha, China
| | - Wei Wang
- School of Medicine, Shandong University, Jinan, China
| | - Zheng Qin
- Department of Respiratory Medicine, The First Hospital of China Medical University, Shenyang, China
| | - Jian Zhang
- Department of Pulmonary Medicine, Xijing Hospital, Air Force Medical University of PLA, Xi'an, China
| | - Yong Zhang
- Department of Pulmonary Medicine, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Yang Hu
- Department of Respiratory Medicine, Shanghai Pulmonary Hospital, Tongji University School of Medicine, Shanghai, China
| | - Hairong Bao
- Department of Gerontol Respiratory Medicine, The Frist Hospital of Lanzhou University, Lanzhou, China
| | - Yi Xiang
- Department of Pulmonary & Critical Care Medicine, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Bo Wang
- AnchorDx Medical Co., Ltd., Guangzhou, China
| | - Jing Wu
- AnchorDx Medical Co., Ltd., Guangzhou, China
| | - Jianyu Sun
- AnchorDx Medical Co., Ltd., Guangzhou, China
| | - Chengping Hu
- Department of Respiratory Medicine, Xiangya Cancer Center, Xiangya Hospital, Central South University, Changsha, China
| | - Xianwei Ye
- Department of Respiratory and Critical Care Medicine, Guizhou Provincial People's Hospital, Guiyang, China
| | - Xiangyan Zhang
- Department of Respiratory and Critical Care Medicine, Guizhou Provincial People's Hospital, Guiyang, China
| | - Wei Xiao
- Department of Respiratory Medicine, QILU Hospital, Shandong University, Jinan, China
| | - Chunmei Yun
- Department of Pulmonary and Critical Care Medicine, Inner Mongolia Autonomous Region People's Hospital, Hohhot, China
| | - Dejun Sun
- Department of Pulmonary and Critical Care Medicine, Inner Mongolia Autonomous Region People's Hospital, Hohhot, China
| | - Wei Wang
- Department of Respiratory Medicine, The First Hospital of China Medical University, Shenyang, China
| | - Ning Chang
- Department of Pulmonary Medicine, Xijing Hospital, Air Force Medical University of PLA, Xi'an, China
| | - Yunhui Zhang
- Department of Respiratory Medicine, The First People's Hospital of Yunnan Province, Kunming, China
| | - Jianping Zhao
- Department of Respiratory and Critical Care Medicine, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Xin Zhang
- Department of Pulmonary Medicine, Zhongshan Hospital, Fudan University, Shanghai, China
| | - Jinfu Xu
- Department of Respiratory Medicine, Shanghai Pulmonary Hospital, Tongji University School of Medicine, Shanghai, China
| | - Di Wu
- Department of Respiratory Medicine, Shenzhen People's Hospital, Shenzhen, China
| | - Xiaoju Liu
- Department of Gerontol Respiratory Medicine, The Frist Hospital of Lanzhou University, Lanzhou, China
| | - Yubiao Guo
- Department of Pulmonary Medicine, The First Affiliated Hospital of Sun Yat-Sen University, Guangzhou, China
| | - Qichuan Zhang
- Department of Respiratory Medicine, Shantou Central Hospital, Shantou, China
| | - Wei Zhang
- Department of Respiration, The First Affiliated Hospital of Nanchang University, Nanchang, China
| | - Lan Yang
- Department of Pulmonary and Critical Care Medicine, The First Affiliated Hospital, Xi'an Jiaotong University, Xi'an, China
| | - Zhanqing Li
- Department of Cardiothoracic Surgery, The Second Affiliated Hospital of Xiamen Medical College, Xiamen, China
| | - Xiaoju Zhang
- Department of Respiratory Medicine, Henan Provincial People's Hospital, Zhengzhou, China
| | - Baohui Han
- Department of Pulmonary Medicine, Shanghai Chest Hospital, Shanghai Jiaotong University, Shanghai, China
| | - Zhaohui Tong
- Department of Respiratory and Critical Care Medicine, Beijing Institute of Respiratory Medicine, Beijing Chaoyang Hospital, Capital Medical University, Beijing, China
| | - Jianxing He
- Department of Thoracic Surgery/Oncology, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou Institute of Respiratory Disease & Health, China State Key Laboratory and National Clinical Research Center for Respiratory Disease, Guangzhou, China
| | - Jieming Qu
- Department of Pulmonary & Critical Care Medicine, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Jian-Bing Fan
- AnchorDx Medical Co., Ltd., Guangzhou, China.,Department of Pathology, School of Basic Medical Science, Southern Medical University, Guangzhou, China
| | - Nanshan Zhong
- National Clinical Research Center for Respiratory Disease, State Key Laboratory of Respiratory Diseases, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| |
Collapse
|
18
|
Diagnostic significance of C-reactive protein and hematological parameters in acute toxoplasmosis. J Parasit Dis 2020; 44:785-793. [PMID: 32904402 PMCID: PMC7456360 DOI: 10.1007/s12639-020-01262-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Accepted: 08/17/2020] [Indexed: 01/08/2023] Open
Abstract
Abstract Toxoplasmosis is a zoonosis caused by Toxoplasma gondii, which can be acquired by oral contact and may cause severe health problems especially for pregnant (congenital toxoplasmosis) and immunocompromised patients. This study aimed to verify the diagnostic significance of hematological parameters and C-reactive protein (CRP) for toxoplasmosis acute detection. A case-control study was carried out between December 2017 and May 2018, in samples of convenience independent of age and sex. The case group was formed by 25 patients with positive anti-Toxoplasma gondii IgG/IgM antibody and the control group was formed by 21 patients with non-positive anti-Toxoplasma gondii IgG/IgM antibody. The results of the hematological parameters and CRP were analyzed in these patients. The patients with Toxoplasma gondii IgM antibody reagent showed higher lymphocytes counting and lower neutrophils counting than the control group. C-reactive protein levels were not different between the groups case and control. ROC curve analysis highlighted that the cut-off value of > 32.00% for lymphocytes and < 57.50% for neutrophils were able to produce specificity higher than 90% for IgM antibody detection. The Naïve Bayes classifier was considered suitable (AUC ≈ 0.700) to separate both groups according to their white cell counting. Changes in lymphocytes and neutrophils may be useful parameters for toxoplasmosis identification and may be used as a tool in the complementary diagnosis of toxoplasmosis. Graphic abstract ![]()
Collapse
|
19
|
Tarekegn A, Ricceri F, Costa G, Ferracin E, Giacobini M. Predictive Modeling for Frailty Conditions in Elderly People: Machine Learning Approaches. JMIR Med Inform 2020; 8:e16678. [PMID: 32442149 PMCID: PMC7303829 DOI: 10.2196/16678] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Revised: 01/07/2020] [Accepted: 02/16/2020] [Indexed: 12/15/2022] Open
Abstract
Background Frailty is one of the most critical age-related conditions in older adults. It is often recognized as a syndrome of physiological decline in late life, characterized by a marked vulnerability to adverse health outcomes. A clear operational definition of frailty, however, has not been agreed so far. There is a wide range of studies on the detection of frailty and their association with mortality. Several of these studies have focused on the possible risk factors associated with frailty in the elderly population while predicting who will be at increased risk of frailty is still overlooked in clinical settings. Objective The objective of our study was to develop predictive models for frailty conditions in older people using different machine learning methods based on a database of clinical characteristics and socioeconomic factors. Methods An administrative health database containing 1,095,612 elderly people aged 65 or older with 58 input variables and 6 output variables was used. We first identify and define six problems/outputs as surrogates of frailty. We then resolve the imbalanced nature of the data through resampling process and a comparative study between the different machine learning (ML) algorithms – Artificial neural network (ANN), Genetic programming (GP), Support vector machines (SVM), Random Forest (RF), Logistic regression (LR) and Decision tree (DT) – was carried out. The performance of each model was evaluated using a separate unseen dataset. Results Predicting mortality outcome has shown higher performance with ANN (TPR 0.81, TNR 0.76, accuracy 0.78, F1-score 0.79) and SVM (TPR 0.77, TNR 0.80, accuracy 0.79, F1-score 0.78) than predicting the other outcomes. On average, over the six problems, the DT classifier has shown the lowest accuracy, while other models (GP, LR, RF, ANN, and SVM) performed better. All models have shown lower accuracy in predicting an event of an emergency admission with red code than predicting fracture and disability. In predicting urgent hospitalization, only SVM achieved better performance (TPR 0.75, TNR 0.77, accuracy 0.73, F1-score 0.76) with the 10-fold cross validation compared with other models in all evaluation metrics. Conclusions We developed machine learning models for predicting frailty conditions (mortality, urgent hospitalization, disability, fracture, and emergency admission). The results show that the prediction performance of machine learning models significantly varies from problem to problem in terms of different evaluation metrics. Through further improvement, the model that performs better can be used as a base for developing decision-support tools to improve early identification and prediction of frail older adults.
Collapse
Affiliation(s)
- Adane Tarekegn
- Modeling and Data Science, Department of Mathematics, University of Turin, Turin, Italy
| | - Fulvio Ricceri
- Department of Clinical and Biological Sciences, University of Turin, Turin, Italy.,Unit of Epidemiology, Regional Health Service, Local Health Unit Torino 3, Turin, Italy
| | - Giuseppe Costa
- Department of Clinical and Biological Sciences, University of Turin, Turin, Italy.,Unit of Epidemiology, Regional Health Service, Local Health Unit Torino 3, Turin, Italy
| | - Elisa Ferracin
- Unit of Epidemiology, Regional Health Service, Local Health Unit Torino 3, Turin, Italy
| | - Mario Giacobini
- Data Analysis and Modeling Unit, Department of Veterinary Sciences, University of Turin, Turin, Italy
| |
Collapse
|