1
|
Pagano D, Barresi V, Tropea A, Galvano A, Bazan V, Caldarella A, Sani C, Pompeo G, Russo V, Liotta R, Scuderi C, Mercorillo S, Barbera F, Di Lorenzo N, Jukna A, Carradori V, Rizzo M, Gruttadauria S, Peluso M. Clinical Validation of a Machine Learning-Based Biomarker Signature to Predict Response to Cytotoxic Chemotherapy Alone or Combined with Targeted Therapy in Metastatic Colorectal Cancer Patients: A Study Protocol and Review. Life (Basel) 2025; 15:320. [PMID: 40003728 PMCID: PMC11857289 DOI: 10.3390/life15020320] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2024] [Revised: 02/12/2025] [Accepted: 02/14/2025] [Indexed: 02/27/2025] Open
Abstract
Metastatic colorectal cancer (mCRC) is a severe condition with high rates of illness and death. Current treatments are limited and not always effective because the cancer responds differently to drugs in different patients. This research aims to use artificial intelligence (AI) to improve treatment by predicting which therapies will work best for individual patients. By analyzing large sets of patient data and using machine learning, we hope to create a model that can identify which patients will respond to chemotherapy, either alone or combined with other targeted treatments. The study will involve dividing patients into training and validation sets to develop and test the models, avoiding overfitting. Various machine learning algorithms, like random survival forest and neural networks, will be integrated to develop a highly accurate and stable predictive model. The model's performance will be evaluated using statistical measures such as sensitivity, specificity, and the area under the curve (AUC). The aim is to personalize treatments, improve patient outcomes, reduce healthcare costs, and make the treatment process more efficient. If successful, this research could significantly impact the medical community by providing a new tool for better managing and treating mCRC, leading to more personalized and effective cancer care. In addition, we examine the applicability of learning methods to biomarker discovery and therapy prediction by considering recent narrative publications.
Collapse
Affiliation(s)
- Duilio Pagano
- Department for the Treatment and Study of Abdominal Diseases and Abdominal Transplantation, Istituto di Ricovero e Cura a Carattere Scientifico-Istituto Mediterraneo per i Trapianti e Terapie ad Alta Specializzazione (IRCCS-ISMETT), University of Pittsburgh Medical Center (UPMC), 90127 Palermo, Italy; (D.P.); (A.T.); (R.L.); (F.B.); (N.D.L.); (A.J.); (M.R.)
| | - Vincenza Barresi
- Department of Biomedical and Biotechnological Sciences, University of Catania, 95123 Catania, Italy; (V.B.); (C.S.); (S.M.)
| | - Alessandro Tropea
- Department for the Treatment and Study of Abdominal Diseases and Abdominal Transplantation, Istituto di Ricovero e Cura a Carattere Scientifico-Istituto Mediterraneo per i Trapianti e Terapie ad Alta Specializzazione (IRCCS-ISMETT), University of Pittsburgh Medical Center (UPMC), 90127 Palermo, Italy; (D.P.); (A.T.); (R.L.); (F.B.); (N.D.L.); (A.J.); (M.R.)
| | - Antonio Galvano
- Department of Surgical, Oncological and Oral Sciences, University of Palermo, 90127 Palermo, Italy; (A.G.); (V.B.)
| | - Viviana Bazan
- Department of Surgical, Oncological and Oral Sciences, University of Palermo, 90127 Palermo, Italy; (A.G.); (V.B.)
| | - Adele Caldarella
- Tuscany Cancer Registry, Clinical Epidemiology Unit, ISPRO-Study, Prevention and Oncology Network Institute, 50139 Florence, Italy;
| | - Cristina Sani
- Cancer Prevention Laboratory, ISPRO-Study, Prevention and Oncology Network Institute, 50139 Florence, Italy; (C.S.); (G.P.)
| | - Gianpaolo Pompeo
- Cancer Prevention Laboratory, ISPRO-Study, Prevention and Oncology Network Institute, 50139 Florence, Italy; (C.S.); (G.P.)
| | - Valentina Russo
- Research and Development Branch, Cancer Prevention Laboratory, ISPRO-Study, Prevention and Oncology Network Institute, 50139 Florence, Italy; (V.R.); (V.C.); (M.P.)
| | - Rosa Liotta
- Department for the Treatment and Study of Abdominal Diseases and Abdominal Transplantation, Istituto di Ricovero e Cura a Carattere Scientifico-Istituto Mediterraneo per i Trapianti e Terapie ad Alta Specializzazione (IRCCS-ISMETT), University of Pittsburgh Medical Center (UPMC), 90127 Palermo, Italy; (D.P.); (A.T.); (R.L.); (F.B.); (N.D.L.); (A.J.); (M.R.)
| | - Chiara Scuderi
- Department of Biomedical and Biotechnological Sciences, University of Catania, 95123 Catania, Italy; (V.B.); (C.S.); (S.M.)
| | - Simona Mercorillo
- Department of Biomedical and Biotechnological Sciences, University of Catania, 95123 Catania, Italy; (V.B.); (C.S.); (S.M.)
| | - Floriana Barbera
- Department for the Treatment and Study of Abdominal Diseases and Abdominal Transplantation, Istituto di Ricovero e Cura a Carattere Scientifico-Istituto Mediterraneo per i Trapianti e Terapie ad Alta Specializzazione (IRCCS-ISMETT), University of Pittsburgh Medical Center (UPMC), 90127 Palermo, Italy; (D.P.); (A.T.); (R.L.); (F.B.); (N.D.L.); (A.J.); (M.R.)
| | - Noemi Di Lorenzo
- Department for the Treatment and Study of Abdominal Diseases and Abdominal Transplantation, Istituto di Ricovero e Cura a Carattere Scientifico-Istituto Mediterraneo per i Trapianti e Terapie ad Alta Specializzazione (IRCCS-ISMETT), University of Pittsburgh Medical Center (UPMC), 90127 Palermo, Italy; (D.P.); (A.T.); (R.L.); (F.B.); (N.D.L.); (A.J.); (M.R.)
| | - Agita Jukna
- Department for the Treatment and Study of Abdominal Diseases and Abdominal Transplantation, Istituto di Ricovero e Cura a Carattere Scientifico-Istituto Mediterraneo per i Trapianti e Terapie ad Alta Specializzazione (IRCCS-ISMETT), University of Pittsburgh Medical Center (UPMC), 90127 Palermo, Italy; (D.P.); (A.T.); (R.L.); (F.B.); (N.D.L.); (A.J.); (M.R.)
| | - Valentina Carradori
- Research and Development Branch, Cancer Prevention Laboratory, ISPRO-Study, Prevention and Oncology Network Institute, 50139 Florence, Italy; (V.R.); (V.C.); (M.P.)
| | - Monica Rizzo
- Department for the Treatment and Study of Abdominal Diseases and Abdominal Transplantation, Istituto di Ricovero e Cura a Carattere Scientifico-Istituto Mediterraneo per i Trapianti e Terapie ad Alta Specializzazione (IRCCS-ISMETT), University of Pittsburgh Medical Center (UPMC), 90127 Palermo, Italy; (D.P.); (A.T.); (R.L.); (F.B.); (N.D.L.); (A.J.); (M.R.)
| | - Salvatore Gruttadauria
- Department for the Treatment and Study of Abdominal Diseases and Abdominal Transplantation, Istituto di Ricovero e Cura a Carattere Scientifico-Istituto Mediterraneo per i Trapianti e Terapie ad Alta Specializzazione (IRCCS-ISMETT), University of Pittsburgh Medical Center (UPMC), 90127 Palermo, Italy; (D.P.); (A.T.); (R.L.); (F.B.); (N.D.L.); (A.J.); (M.R.)
- Department of General Surgery and Medical-Surgical Specialties, University of Catania, 95123 Catania, Italy
| | - Marco Peluso
- Research and Development Branch, Cancer Prevention Laboratory, ISPRO-Study, Prevention and Oncology Network Institute, 50139 Florence, Italy; (V.R.); (V.C.); (M.P.)
| |
Collapse
|
2
|
Xu J, Gao Y, Lu Q, Zhang R, Gui J, Liu X, Yue Z. RiceSNP-BST: a deep learning framework for predicting biotic stress-associated SNPs in rice. Brief Bioinform 2024; 25:bbae599. [PMID: 39562160 PMCID: PMC11576077 DOI: 10.1093/bib/bbae599] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2024] [Revised: 10/07/2024] [Accepted: 11/04/2024] [Indexed: 11/21/2024] Open
Abstract
Rice consistently faces significant threats from biotic stresses, such as fungi, bacteria, pests, and viruses. Consequently, accurately and rapidly identifying previously unknown single-nucleotide polymorphisms (SNPs) in the rice genome is a critical challenge for rice research and the development of resistant varieties. However, the limited availability of high-quality rice genotype data has hindered this research. Deep learning has transformed biological research by facilitating the prediction and analysis of SNPs in biological sequence data. Convolutional neural networks are especially effective in extracting structural and local features from DNA sequences, leading to significant advancements in genomics. Nevertheless, the expanding catalog of genome-wide association studies provides valuable biological insights for rice research. Expanding on this idea, we introduce RiceSNP-BST, an automatic architecture search framework designed to predict SNPs associated with rice biotic stress traits (BST-associated SNPs) by integrating multidimensional features. Notably, the model successfully innovates the datasets, offering more precision than state-of-the-art methods while demonstrating good performance on an independent test set and cross-species datasets. Additionally, we extracted features from the original DNA sequences and employed causal inference to enhance the biological interpretability of the model. This study highlights the potential of RiceSNP-BST in advancing genome prediction in rice. Furthermore, a user-friendly web server for RiceSNP-BST (http://rice-snp-bst.aielab.cc) has been developed to support broader genome research.
Collapse
Affiliation(s)
- Jiajun Xu
- School of Information and Artificial Intelligence, Anhui Provincial Engineering Research Center for Beidou Precision Agriculture Information, Anhui Agricultural University, 130, Changjiang West Road, Hefei, Anhui Province 230036, China
| | - Yujia Gao
- School of Information and Artificial Intelligence, Anhui Provincial Engineering Research Center for Beidou Precision Agriculture Information, Anhui Agricultural University, 130, Changjiang West Road, Hefei, Anhui Province 230036, China
| | - Quan Lu
- School of Information and Artificial Intelligence, Anhui Provincial Engineering Research Center for Beidou Precision Agriculture Information, Anhui Agricultural University, 130, Changjiang West Road, Hefei, Anhui Province 230036, China
| | - Renyi Zhang
- School of Information and Artificial Intelligence, Anhui Provincial Engineering Research Center for Beidou Precision Agriculture Information, Anhui Agricultural University, 130, Changjiang West Road, Hefei, Anhui Province 230036, China
| | - Jianfeng Gui
- School of Information and Artificial Intelligence, Anhui Provincial Engineering Research Center for Beidou Precision Agriculture Information, Anhui Agricultural University, 130, Changjiang West Road, Hefei, Anhui Province 230036, China
| | - Xiaoshuang Liu
- Research Center for Biological Breeding Technology, Advance Academy, Anhui Agricultural University, 130, Changjiang West Road, Hefei, Anhui Province 230036, China
| | - Zhenyu Yue
- School of Information and Artificial Intelligence, Anhui Provincial Engineering Research Center for Beidou Precision Agriculture Information, Anhui Agricultural University, 130, Changjiang West Road, Hefei, Anhui Province 230036, China
- Research Center for Biological Breeding Technology, Advance Academy, Anhui Agricultural University, 130, Changjiang West Road, Hefei, Anhui Province 230036, China
| |
Collapse
|
3
|
Wang Y, Yao X, Wang D, Ye C, Xu L. A machine learning screening model for identifying the risk of high-frequency hearing impairment in a general population. BMC Public Health 2024; 24:1160. [PMID: 38664666 PMCID: PMC11044481 DOI: 10.1186/s12889-024-18636-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2023] [Accepted: 04/17/2024] [Indexed: 04/29/2024] Open
Abstract
BACKGROUND Hearing impairment (HI) has become a major public health issue in China. Currently, due to the limitations of primary health care, the gold standard for HI diagnosis (pure-tone hearing test) is not suitable for large-scale use in community settings. Therefore, the purpose of this study was to develop a cost-effective HI screening model for the general population using machine learning (ML) methods and data gathered from community-based scenarios, aiming to help improve the hearing-related health outcomes of community residents. METHODS This study recruited 3371 community residents from 7 health centres in Zhejiang, China. Sixty-eight indicators derived from questionnaire surveys and routine haematological tests were delivered and used for modelling. Seven commonly used ML models (the naive Bayes (NB), K-nearest neighbours (KNN), support vector machine (SVM), random forest (RF), eXtreme Gradient Boosting (XGBoost), boosting, and least absolute shrinkage and selection operator (LASSO regression)) were adopted and compared to develop the final high-frequency hearing impairment (HFHI) screening model for community residents. The model was constructed with a nomogram to obtain the risk score of the probability of individuals suffering from HFHI. According to the risk score, the population was divided into three risk stratifications (low, medium and high) and the risk factor characteristics of each dimension under different risk stratifications were identified. RESULTS Among all the algorithms used, the LASSO-based model achieved the best performance on the validation set by attaining an area under the curve (AUC) of 0.868 (95% confidence interval (CI): 0.847-0.889) and reaching precision, specificity and F-score values all greater than 80%. Five demographic indicators, 7 disease-related features, 5 behavioural factors, 2 environmental exposures, 2 hearing cognitive factors, and 13 blood test indicators were identified in the final screening model. A total of 91.42% (1235/1129) of the subjects in the high-risk group were confirmed to have HI by audiometry, which was 3.99 times greater than that in the low-risk group (22.91%, 301/1314). The high-risk population was mainly characterized as older, low-income and low-educated males, especially those with multiple chronic conditions, noise exposure, poor lifestyle, abnormal blood indices (e.g., red cell distribution width (RDW) and platelet distribution width (PDW)) and liver function indicators (e.g., triglyceride (TG), indirect bilirubin (IBIL), aspartate aminotransferase (AST) and low-density lipoprotein (LDL)). An HFHI nomogram was further generated to improve the operability of the screening model for community applications. CONCLUSIONS The HFHI risk screening model developed based on ML algorithms can more accurately identify residents with HFHI by categorizing them into the high-risk groups, which can further help to identify modifiable and immutable risk factors for residents at high risk of HI and promote their personalized HI prevention or intervention.
Collapse
Affiliation(s)
- Yi Wang
- Department of Epidemiology and Biostatistics, School of Public Health, Hangzhou Normal University, Hangzhou, 311121, Zhejiang, China
- Hangzhou Center for Disease Control and Prevention, Hangzhou, Zhejiang, China
| | - Xinmeng Yao
- Department of Epidemiology and Biostatistics, School of Public Health, Hangzhou Normal University, Hangzhou, 311121, Zhejiang, China
| | - Dahui Wang
- Department of Health Management, School of Public Health, Hangzhou Normal University, Hangzhou, Zhejiang, China
| | - Chengyin Ye
- Department of Health Management, School of Public Health, Hangzhou Normal University, Hangzhou, Zhejiang, China.
| | - Liangwen Xu
- Department of Epidemiology and Biostatistics, School of Public Health, Hangzhou Normal University, Hangzhou, 311121, Zhejiang, China.
| |
Collapse
|
4
|
Deng X, Luo Y, Lu M, Lin Y, Ma L. Identification of GMFG as a novel biomarker in IgA nephropathy based on comprehensive bioinformatics analysis. Heliyon 2024; 10:e28997. [PMID: 38601619 PMCID: PMC11004809 DOI: 10.1016/j.heliyon.2024.e28997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2023] [Revised: 03/22/2024] [Accepted: 03/27/2024] [Indexed: 04/12/2024] Open
Abstract
Background IgA nephropathy (IgAN) stands as the most prevalent form of glomerulonephritis and ranks among the leading causes of end-stage renal disease worldwide. Regrettably, we continue to grapple with the absence of dependable diagnostic markers and specific therapeutic agents for IgAN. Therefore, this study endeavors to explore novel biomarkers and potential therapeutic targets in IgAN, while also considering their relevance in the context of tumors. Methods We gathered IgAN datasets from the Gene Expression Omnibus (GEO) database. Subsequently, leveraging these datasets, we conducted an array of analyses, encompassing differential gene expression, weighted gene co-expression network analysis (WGCNA), machine learning, receiver operator characteristic (ROC) curve analysis, gene expression validation, clinical correlations, and immune infiltration. Finally, we carried out pan-cancer analysis based on hub gene. Results We obtained 1391 differentially expressed genes (DEGs) in GSE93798 and 783 DGEs in GSE14795, respectively. identifying 69 common genes for further investigation. Subsequently, GMFG was identified the hub gene based on machine learning. In the verification set and the training set, the GMFG was higher in the IgAN group than in the healthy group and all of the GMFG area under the curve (AUC) was more 0.8. In addition, GMFG has a close relationship with the prognosis of malignancies and a range of immune cells. Conclusions Our study suggests that GMFG could serve as a promising novel biomarker and potential therapeutic target for both IgAN and certain types of tumors.
Collapse
Affiliation(s)
- Xiaoqi Deng
- Department of Nephrology, Zigong Fourth People's Hospital, Zigong, 643000, Sichuan Province, China
| | - Yu Luo
- Chongqing Medical University, Chongqing, 400000, China
| | - Meiqi Lu
- School of Medicine, Xiamen University, Xiamen, 361000, China
| | - Yun Lin
- Department of Nephrology, Zigong Fourth People's Hospital, Zigong, 643000, Sichuan Province, China
| | - Li Ma
- Department of Nephrology, Zigong Fourth People's Hospital, Zigong, 643000, Sichuan Province, China
| |
Collapse
|
5
|
Mondello A, Dal Bo M, Toffoli G, Polano M. Machine learning in onco-pharmacogenomics: a path to precision medicine with many challenges. Front Pharmacol 2024; 14:1260276. [PMID: 38264526 PMCID: PMC10803549 DOI: 10.3389/fphar.2023.1260276] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Accepted: 12/26/2023] [Indexed: 01/25/2024] Open
Abstract
Over the past two decades, Next-Generation Sequencing (NGS) has revolutionized the approach to cancer research. Applications of NGS include the identification of tumor specific alterations that can influence tumor pathobiology and also impact diagnosis, prognosis and therapeutic options. Pharmacogenomics (PGx) studies the role of inheritance of individual genetic patterns in drug response and has taken advantage of NGS technology as it provides access to high-throughput data that can, however, be difficult to manage. Machine learning (ML) has recently been used in the life sciences to discover hidden patterns from complex NGS data and to solve various PGx problems. In this review, we provide a comprehensive overview of the NGS approaches that can be employed and the different PGx studies implicating the use of NGS data. We also provide an excursus of the ML algorithms that can exert a role as fundamental strategies in the PGx field to improve personalized medicine in cancer.
Collapse
Affiliation(s)
| | | | | | - Maurizio Polano
- Experimental and Clinical Pharmacology Unit, Centro di Riferimento Oncologico di Aviano (CRO), Istituto di Ricovero e Cura a Carattere Scientifico (IRCCS), Aviano, Italy
| |
Collapse
|
6
|
He J, Zhao Y, Zhou Z, Zhang M. Machine learning and integrative analysis identify the common pathogenesis of azoospermia complicated with COVID-19. Front Immunol 2023; 14:1114870. [PMID: 37283758 PMCID: PMC10239851 DOI: 10.3389/fimmu.2023.1114870] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Accepted: 05/05/2023] [Indexed: 06/08/2023] Open
Abstract
Background Although more recent evidence has indicated COVID-19 is prone to azoospermia, the common molecular mechanism of its occurrence remains to be elucidated. The aim of the present study is to further investigate the mechanism of this complication. Methods To discover the common differentially expressed genes (DEGs) and pathways of azoospermia and COVID-19, integrated weighted co-expression network (WGCNA), multiple machine learning analyses, and single-cell RNA-sequencing (scRNA-seq) were performed. Results Therefore, we screened two key network modules in the obstructive azoospermia (OA) and non-obstructive azoospermia (NOA) samples. The differentially expressed genes were mainly related to the immune system and infectious virus diseases. We then used multiple machine learning methods to detect biomarkers that differentiated OA from NOA. Enrichment analysis showed that azoospermia patients and COVID-19 patients shared a common IL-17 signaling pathway. In addition, GLO1, GPR135, DYNLL2, and EPB41L3 were identified as significant hub genes in these two diseases. Screening of two different molecular subtypes revealed that azoospermia-related genes were associated with clinicopathological characteristics of age, hospital-free-days, ventilator-free-days, charlson score, and d-dimer of patients with COVID-19 (P < 0.05). Finally, we used the Xsum method to predict potential drugs and single-cell sequencing data to further characterize whether azoospermia-related genes could validate the biological patterns of impaired spermatogenesis in cryptozoospermia patients. Conclusion Our study performs a comprehensive and integrated bioinformatics analysis of azoospermia and COVID-19. These hub genes and common pathways may provide new insights for further mechanism research.
Collapse
Affiliation(s)
- Jiarong He
- Department of Neurosurgery, The Second Xiangya Hospital, Central South University, Changsha, Hunan, PR, China
| | - Yuanqiao Zhao
- Department of Urology, The Second Xiangya Hospital, Central South University, Changsha, Hunan, PR, China
| | - Zhixian Zhou
- Department of Obstetrics and Gynecology, The Second Xiangya Hospital, Central South University, Changsha, Hunan, PR, China
| | - Mingming Zhang
- Department of Neurosurgery, The Second Xiangya Hospital, Central South University, Changsha, Hunan, PR, China
| |
Collapse
|
7
|
Yang S, Yang Z, Yang J. 4mCBERT: A computing tool for the identification of DNA N4-methylcytosine sites by sequence- and chemical-derived information based on ensemble learning strategies. Int J Biol Macromol 2023; 231:123180. [PMID: 36646347 DOI: 10.1016/j.ijbiomac.2023.123180] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2022] [Revised: 11/26/2022] [Accepted: 12/30/2022] [Indexed: 01/15/2023]
Abstract
N4-methylcytosine (4mC) is an important DNA chemical modification pattern which is a new methylation modification discovered in recent years and plays critical roles in gene expression regulation, defense against invading genetic elements, genomic imprinting, and so on. Identifying 4mC site from DNA sequence segment contributes to discovering more novel modification patterns. In this paper, we present a model called 4mCBERT that encodes DNA sequence segments by sequence characteristics including one-hot, electron-ion interaction pseudopotential, nucleotide chemical property, word2vec and chemical information containing physicochemical properties (PCP), chemical bidirectional encoder representations from transformers (chemical BERT) and employs ensemble learning framework to develop a prediction model. PCP and chemical BERT features are firstly constructed and applied to predict 4mC sites and show positive contributions to identifying 4mC. For the Matthew's Correlation Coefficient, 4mCBERT significantly outperformed other state-of-the-art models on six independent benchmark datasets including A. thaliana, C. elegans, D. melanogaster, E. coli, G. Pickering, and G. subterraneous by 4.32 % to 24.39 %, 2.52 % to 31.65 %, 2 % to 16.49 %, 6.63 % to 35.15, 8.59 % to 61.85 %, and 8.45 % to 34.45 %. Moreover, 4mCBERT is designed to allow users to predict 4mC sites and retrain 4mC prediction models. In brief, 4mCBERT shows higher performance on six benchmark datasets by incorporating sequence- and chemical-driven information and is available at http://cczubio.top/4mCBERT and https://github.com/abcair/4mCBERT.
Collapse
Affiliation(s)
- Sen Yang
- School of Computer Science and Artificial Intelligence, Aliyun School of Big Data, School of Software, Changzhou 213164, China; The Affiliated Changzhou No 2 People's Hospital of Nanjing Medical University, Changzhou 213164, China.
| | - Zexi Yang
- School of Computer Science and Artificial Intelligence, Aliyun School of Big Data, School of Software, Changzhou 213164, China
| | - Jun Yang
- School of Educational Sciences, Yili Normal University, Yining 835000, China
| |
Collapse
|
8
|
Liu H, Jiao R, Wang L, Feng F, Zhao X, Yang J. Machine-learning-based analysis of the sensitivity and specificity on lipid-lowering effect of one-month-administered statins. Medicine (Baltimore) 2023; 102:e33139. [PMID: 36862920 PMCID: PMC9981436 DOI: 10.1097/md.0000000000033139] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 03/04/2023] Open
Abstract
Few predictive studies have been reported on the efficacy of atorvastatin in reducing lipoprotein cholesterol to be qualified after 1-month course of treatment in different individuals. A total of 14,180 community-based residents aged ≥ 65 received health checkup, 1013 of whom had low-density lipoprotein (LDL) higher than 2.6mmol/L so that they were put on 1-month course of treatment with atorvastatin. At its completion, lipoprotein cholesterol was measured again. With < 2.6 mmol/L considered as the treatment standard, 411 individuals were judged as the qualified group, and 602, and as the unqualified group. The basic sociodemographic features covered 57 items. The data were randomly divided into train sets and test ones. The recursive random-forest algorithm was applied to predicting the patients response to atorvastatin, the recursive feature elimination method, to screening all the physical indicators. The overall accuracy, sensitivity and specificity were calculated, respectively, and so were the receiver operator characteristic curve and the area under the curve of the test set. In the prediction model on the efficacy of 1-month treatment of statins for LDL, the sensitivity, 86.86%; and the specificity, 94.83%. In the prediction model on the efficacy of the same treatment for triglyceride, the sensitivity, 71.21%; and the specificity, 73.46%. As to the prediction of total cholesterol, the sensitivity, 94.38%; and the specificity, 96.55%. And in the case of high-density lipoprotein (HDL), the sensitivity, 84.86%; and the specificity, 100%. recursive feature elimination analysis showed that total cholesterol was the most important feature of atorvastatin efficacy of reducing LDL; that HDL was the most important one of its efficacies of reducing triglycerides; that LDL was the most important one of its efficacies of reducing total cholesterol; and that triglyceride was the most important one of its efficacies of reducing HDL. Random-forest can help predict whether atorvastatin efficacy of reducing lipoprotein cholesterol to be qualified after 1-month course of treatment in different individuals.
Collapse
Affiliation(s)
- Huiqin Liu
- Department of Neurology, Shanghai Pudong New Area People’s Hospital, Shanghai, China
| | - Ronghong Jiao
- Department of Clinical Laboratory, Shanghai Pudong New Area People’s Hospital, Shanghai, China
| | - Lingling Wang
- Department of Neurology, Shanghai Pudong New Area People’s Hospital, Shanghai, China
| | - Fei Feng
- Department of Neurology, East Hospital Affiliated to Tongji University, Shanghai, China
| | - Xiaohui Zhao
- Department of Neurology, Shanghai Pudong New Area People’s Hospital, Shanghai, China
| | - Juan Yang
- Department of Neurology, Shanghai Pudong New Area People’s Hospital, Shanghai, China
- * Correspondence: Juan Yang, Department of Neurology, Shanghai Pudong New Area People’s Hospital, Shanghai 201299, China (e-mail: )
| |
Collapse
|
9
|
Wu Y, Zhu W, Wang J, Liu L, Zhang W, Wang Y, Shi J, Xia J, Gu Y, Qian Q, Hong Y. Using machine learning for mortality prediction and risk stratification in atezolizumab-treated cancer patients: Integrative analysis of eight clinical trials. Cancer Med 2023; 12:3744-3757. [PMID: 35871390 PMCID: PMC9939114 DOI: 10.1002/cam4.5060] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Revised: 05/25/2022] [Accepted: 07/13/2022] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND Few models exist to predict mortality in cancer patients receiving immunotherapy. Our aim was to build a machine learning-based risk stratification model for predicting mortality in atezolizumab-treated cancer patients. METHODS Data from 2538 patients in eight atezolizumab-treated cancer clinical trials across three cancer types (non-small-cell lung cancer, bladder transitional cell carcinoma, and renal cell carcinoma) were included. The whole cohort was randomly split into development and validation cohorts in a 7:3 ratio. Machine-learning algorithms (extreme gradient boosting, random forest, logistic regression with lasso regularization, support vector machine, and K-nearest neighbor) were applied to develop prediction models. Model performance was mainly assessed by area under the receiver operating characteristic curve (AUC) value, calibration plot, and decision curve analysis. The probability of death risk was then stratified. RESULTS One thousand and three hundred and seventy-nine (54.33%) patients died. The random forest (RF) model was overall the best in terms of predictive performance, with the AUC of 0.844 (95% confidence interval [CI]: 0.826-0.862) in the development cohort and 0.786 (95% CI: 0.754-0.818) in the validation cohort for predicting mortality. Twelve baseline variables contributing to mortality prediction in the RF model were C-reactive protein, PD-L1 level, cancer type, prior liver metastasis, derived neutrophil-to-lymphocyte ratio, alkaline phosphatase, albumin, hemoglobin, white blood cell count, number of metastatic sites, pulse rate, and Eastern Cooperative Oncology Group (ECOG) performance status. A total of 1782 (70.2%) patients were separated into the high-risk and 756 (29.8%) low-risk groups. Patients in the high-risk group were significantly more likely to die, experience disease progression, discontinue study, and discontinue treatment than patients in the low-risk group (all p values < 0.001). Risk groups were not associated with immune-related adverse events and grades 3-5 treatment-related adverse events (all p values > 0.05). CONCLUSION RF model has good performance in mortality prediction and risk stratification for cancer patients receiving atezolizumab monotherapy.
Collapse
Affiliation(s)
- Yougen Wu
- National Institute of Clinical Research, The Fifth People's Hospital of ShanghaiFudan UniversityShanghaiChina
| | - Wenyu Zhu
- Shanghai Long For Health Data Technology Co.ltdShanghaiChina
| | - Jing Wang
- Shanghai Long For Health Data Technology Co.ltdShanghaiChina
| | - Lvwen Liu
- Shanghai Long For Health Data Technology Co.ltdShanghaiChina
| | - Wei Zhang
- Department of BiostatisticsFudan University School of Public HealthShanghaiChina
| | - Yang Wang
- Department of UrologyThe Fifth People's Hospital of Shanghai, Fudan UniversityShanghaiChina
| | - Jindong Shi
- Department of Respiratory MedicineThe Fifth People's Hospital of Shanghai, Fudan UniversityShanghaiChina
| | - Ju Xia
- National Institute of Clinical Research, The Fifth People's Hospital of ShanghaiFudan UniversityShanghaiChina
| | - Yuting Gu
- National Institute of Clinical Research, The Fifth People's Hospital of ShanghaiFudan UniversityShanghaiChina
| | - Qingqing Qian
- National Institute of Clinical Research, The Fifth People's Hospital of ShanghaiFudan UniversityShanghaiChina
- Department of Pharmacy, The Fifth People's Hospital of ShanghaiFudan UniversityShanghaiChina
| | - Yang Hong
- National Institute of Clinical Research, The Fifth People's Hospital of ShanghaiFudan UniversityShanghaiChina
- Department of Osteology, The Fifth People's Hospital of ShanghaiFudan UniversityShanghaiChina
| |
Collapse
|
10
|
Xu D, Chen R, Jiang Y, Wang S, Liu Z, Chen X, Fan X, Zhu J, Li J. Application of machine learning in the prediction of deficient mismatch repair in patients with colorectal cancer based on routine preoperative characterization. Front Oncol 2022; 12:1049305. [PMID: 36620593 PMCID: PMC9814116 DOI: 10.3389/fonc.2022.1049305] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Accepted: 12/07/2022] [Indexed: 12/24/2022] Open
Abstract
Simple summary Detecting deficient mismatch repair (dMMR) in patients with colorectal cancer is essential for clinical decision-making, including evaluation of prognosis, guidance of adjuvant chemotherapy and immunotherapy, and primary screening for Lynch syndrome. However, outside of tertiary care centers, existing detection methods are not widely disseminated and highly depend on the experienced pathologist. Therefore, it is of great clinical significance to develop a broadly accessible and low-cost tool for dMMR prediction, particularly prior to surgery. In this study, we developed a convenient and reliable model for predicting dMMR status in CRC patients on routine preoperative characterization utilizing multiple machine learning algorithms. This model will work as an automated screening tool for identifying patients suitable for mismatch repair testing and consequently for improving the detection rate of dMMR, while reducing unnecessary labor and cost in patients with proficient mismatch repair. Background Deficient mismatch repair (dMMR) indicates a sustained anti-tumor immune response and has a favorable prognosis in patients with colorectal cancer (CRC). Although all CRC patients are recommended to undergo dMMR testing after surgery, current diagnostic approaches are not available for all country hospitals and patients. Therefore, efficient and low-cost predictive models for dMMR, especially for preoperative evaluations, are warranted. Methods A large scale of 5596 CRC patients who underwent surgical resection and mismatch repair testing were enrolled and randomly divided into training and validation cohorts. The clinical features exploited for predicting dMMR comprised the demographic characteristics, preoperative laboratory data, and tumor burden information. Machine learning (ML) methods involving eight basic algorithms, ensemble learning methods, and fusion algorithms were adopted with 10-fold cross-validation, and their performance was evaluated based on the area under the receiver operating characteristic curve (AUC) and calibration curves. The clinical net benefits were assessed using a decision curve analysis (DCA), and a nomogram was developed to facilitate model clinical practicality. Results All models achieved an AUC of nearly 0.80 in the validation cohort, with the stacking model exhibiting the best performance (AUC = 0.832). Logistical DCA revealed that the stacking model yielded more clinical net benefits than the conventional regression models. In the subgroup analysis, the stacking model also predicted dMMR regardless of the clinical stage. The nomogram showed a favorable consistence with the actual outcome in the calibration curve. Conclusion With the aid of ML algorithms, we developed a novel and robust model for predicting dMMR in CRC patients with satisfactory discriminative performance and designed a user-friendly and convenient nomogram.
Collapse
Affiliation(s)
- Dong Xu
- Division of Digestive Surgery, Xijing Hospital of Digestive Diseases, Air force Medical University, Xi’an, China,School of Clinical Medicine, Xi’an Medical University, Xi’an, China
| | - Rujie Chen
- Division of Digestive Surgery, Xijing Hospital of Digestive Diseases, Air force Medical University, Xi’an, China,Department of Neurosurgery, Xijing Hospital, Air Force Medical University, Xi’an, China,State Key Laboratory of Cancer Biology, Institute of Digestive Diseases, Xijing Hospital, The Fourth Military Medical University, Xi’an, China
| | - Yu Jiang
- Division of Digestive Surgery, Xijing Hospital of Digestive Diseases, Air force Medical University, Xi’an, China,School of Clinical Medicine, Xi’an Medical University, Xi’an, China
| | - Shuai Wang
- Xi’an Institute of Flight of the Air Force, Ming Gang Station Hospital, Minggang, China
| | - Zhiyu Liu
- Division of Digestive Surgery, Xijing Hospital of Digestive Diseases, Air force Medical University, Xi’an, China,School of Clinical Medicine, Xi’an Medical University, Xi’an, China
| | - Xihao Chen
- Division of Digestive Surgery, Xijing Hospital of Digestive Diseases, Air force Medical University, Xi’an, China,School of Clinical Medicine, Xi’an Medical University, Xi’an, China
| | - Xiaoyan Fan
- Department of Experiment Surgery, Xijing Hospital, Fourth Military Medical University, Xi’an, China
| | - Jun Zhu
- Department of General Surgery, The Southern Theater Air Force Hospital, Guangzhou, China,*Correspondence: Jipeng Li, ; Jun Zhu,
| | - Jipeng Li
- Division of Digestive Surgery, Xijing Hospital of Digestive Diseases, Air force Medical University, Xi’an, China,State Key Laboratory of Cancer Biology, Institute of Digestive Diseases, Xijing Hospital, The Fourth Military Medical University, Xi’an, China,Department of Experiment Surgery, Xijing Hospital, Fourth Military Medical University, Xi’an, China,*Correspondence: Jipeng Li, ; Jun Zhu,
| |
Collapse
|
11
|
Russo V, Lallo E, Munnia A, Spedicato M, Messerini L, D’Aurizio R, Ceroni EG, Brunelli G, Galvano A, Russo A, Landini I, Nobili S, Ceppi M, Bruzzone M, Cianchi F, Staderini F, Roselli M, Riondino S, Ferroni P, Guadagni F, Mini E, Peluso M. Artificial Intelligence Predictive Models of Response to Cytotoxic Chemotherapy Alone or Combined to Targeted Therapy for Metastatic Colorectal Cancer Patients: A Systematic Review and Meta-Analysis. Cancers (Basel) 2022; 14:4012. [PMID: 36011003 PMCID: PMC9406544 DOI: 10.3390/cancers14164012] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Revised: 07/26/2022] [Accepted: 08/12/2022] [Indexed: 12/24/2022] Open
Abstract
Tailored treatments for metastatic colorectal cancer (mCRC) have not yet completely evolved due to the variety in response to drugs. Therefore, artificial intelligence has been recently used to develop prognostic and predictive models of treatment response (either activity/efficacy or toxicity) to aid in clinical decision making. In this systematic review, we have examined the ability of learning methods to predict response to chemotherapy alone or combined with targeted therapy in mCRC patients by targeting specific narrative publications in Medline up to April 2022 to identify appropriate original scientific articles. After the literature search, 26 original articles met inclusion and exclusion criteria and were included in the study. Our results show that all investigations conducted on this field have provided generally promising results in predicting the response to therapy or toxic side-effects. By a meta-analytic approach we found that the overall weighted means of the area under the receiver operating characteristic (ROC) curve (AUC) were 0.90, 95% C.I. 0.80-0.95 and 0.83, 95% C.I. 0.74-0.89 in training and validation sets, respectively, indicating a good classification performance in discriminating response vs. non-response. The calculation of overall HR indicates that learning models have strong ability to predict improved survival. Lastly, the delta-radiomics and the 74 gene signatures were able to discriminate response vs. non-response by correctly identifying up to 99% of mCRC patients who were responders and up to 100% of patients who were non-responders. Specifically, when we evaluated the predictive models with tests reaching 80% sensitivity (SE) and 90% specificity (SP), the delta radiomics showed an SE of 99% and an SP of 94% in the training set and an SE of 85% and SP of 92 in the test set, whereas for the 74 gene signatures the SE was 97.6% and the SP 100% in the training set.
Collapse
Affiliation(s)
- Valentina Russo
- Research and Development Branch, Regional Cancer Prevention Laboratory, ISPRO-Study, Prevention and Oncology Network Institute, 50139 Florence, Italy
| | - Eleonora Lallo
- Research and Development Branch, Regional Cancer Prevention Laboratory, ISPRO-Study, Prevention and Oncology Network Institute, 50139 Florence, Italy
| | - Armelle Munnia
- Research and Development Branch, Regional Cancer Prevention Laboratory, ISPRO-Study, Prevention and Oncology Network Institute, 50139 Florence, Italy
| | - Miriana Spedicato
- Research and Development Branch, Regional Cancer Prevention Laboratory, ISPRO-Study, Prevention and Oncology Network Institute, 50139 Florence, Italy
| | - Luca Messerini
- Department of Experimental and Clinical Medicine, University of Florence, 50134 Florence, Italy
| | - Romina D’Aurizio
- Institute of Informatics and Telematics, National Research Council, 56124 Pisa, Italy
| | - Elia Giuseppe Ceroni
- Institute of Informatics and Telematics, National Research Council, 56124 Pisa, Italy
| | - Giulia Brunelli
- Institute of Informatics and Telematics, National Research Council, 56124 Pisa, Italy
| | - Antonio Galvano
- Department of Surgical, Oncological and Oral Sciences, University of Palermo, 90127 Palermo, Italy
| | - Antonio Russo
- Department of Surgical, Oncological and Oral Sciences, University of Palermo, 90127 Palermo, Italy
| | - Ida Landini
- Department of Health Sciences, University of Florence, 50139 Florence, Italy
| | - Stefania Nobili
- Department of Neurosciences, Imaging and Clinical Sciences, “G. D’Annunzio” Chieti-Pescara, 66100 Chieti, Italy
| | - Marcello Ceppi
- Clinical Epidemiology Unit, IRCCS-Ospedale Policlinico San Martino, 16131 Genova, Italy
| | - Marco Bruzzone
- Clinical Epidemiology Unit, IRCCS-Ospedale Policlinico San Martino, 16131 Genova, Italy
| | - Fabio Cianchi
- Department of Experimental and Clinical Medicine, University of Florence, 50134 Florence, Italy
| | - Fabio Staderini
- Department of Experimental and Clinical Medicine, University of Florence, 50134 Florence, Italy
| | - Mario Roselli
- Medical Oncology Unit, Department of Systems Medicine, Tor Vergata University, 00133 Rome, Italy
| | - Silvia Riondino
- Medical Oncology Unit, Department of Systems Medicine, Tor Vergata University, 00133 Rome, Italy
| | - Patrizia Ferroni
- BioBIM (InterInstitutional Multidisciplinary Biobank), IRCCS San Raffaele Roma, 00166 Rome, Italy
- Department of Human Sciences & Quality of Life Promotion, San Raffaele Roma Open University, 00166 Rome, Italy
| | - Fiorella Guadagni
- BioBIM (InterInstitutional Multidisciplinary Biobank), IRCCS San Raffaele Roma, 00166 Rome, Italy
- Department of Human Sciences & Quality of Life Promotion, San Raffaele Roma Open University, 00166 Rome, Italy
| | - Enrico Mini
- Department of Health Sciences, University of Florence, 50139 Florence, Italy
| | - Marco Peluso
- Research and Development Branch, Regional Cancer Prevention Laboratory, ISPRO-Study, Prevention and Oncology Network Institute, 50139 Florence, Italy
| |
Collapse
|
12
|
Li G, Liu J, Lu H, Hu W, Hu M, He J, Yang W, Zhu Z, Zhu J, Zhang H, Zhao H, Huang F. Multiple environmental exposures and obesity in eastern China: An individual exposure evaluation model. CHEMOSPHERE 2022; 298:134316. [PMID: 35302002 DOI: 10.1016/j.chemosphere.2022.134316] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/10/2021] [Revised: 03/09/2022] [Accepted: 03/12/2022] [Indexed: 06/14/2023]
Abstract
Obesity has caused a huge burden of disease. Few studies have explored individuals' environmental exposure level and the impact of multiple environmental exposures on obesity. The aim of this study was to explore individual air pollution exposure evaluation, and the association between and multiple environmental factors and obesity among adult residents in rural areas of China. In this study, 8400 residents of 14 districts and counties in eastern of China were selected by multistage stratified cluster sampling, and a total of 8377 residents were included in the final analysis. We adopted BMI (Body Mass Index) > 28 kg/m2 as the definition of obesity. First, an individual air pollution evaluation model was established based on the monitoring data of air pollution stations closest to residential address, different demographic characteristics of residents and daily living habits using generalized linear model and random forest model. Then, we used Bayesian Kernel Machine Regression (BKMR) and Quantile g-Computation (QgC) models to explore multiple environmental exposures on obesity. The results showed that six air pollutants were significantly positively associated with obesity, and green space had a significant protective effect on obesity. The BKMR model showed that the effects of different air pollutants on obesity were significantly enhanced by each other, while green space significantly reduced the positive effect of air pollution on obesity. The QgC model showed a significant positive association with obesity when all environmental factors were exposed as a whole, especially in males, higher household incomes and young people. It suggested that relevant authorities should improve regional air quality and green space to reduce the burden of disease caused by obesity.
Collapse
Affiliation(s)
- Guoao Li
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, 81 Meishan Road, Hefei, Anhui, China
| | - Jianjun Liu
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, 81 Meishan Road, Hefei, Anhui, China
| | - Huanhuan Lu
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, 81 Meishan Road, Hefei, Anhui, China
| | - Wenlei Hu
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, 81 Meishan Road, Hefei, Anhui, China
| | - Mingjun Hu
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, 81 Meishan Road, Hefei, Anhui, China
| | - Jialiu He
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, 81 Meishan Road, Hefei, Anhui, China
| | - Wanjun Yang
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, 81 Meishan Road, Hefei, Anhui, China
| | - Zhenyu Zhu
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, 81 Meishan Road, Hefei, Anhui, China
| | - Jinliang Zhu
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, 81 Meishan Road, Hefei, Anhui, China
| | - Hanshuang Zhang
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, 81 Meishan Road, Hefei, Anhui, China
| | - Huanhuan Zhao
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, 81 Meishan Road, Hefei, Anhui, China
| | - Fen Huang
- Department of Epidemiology and Biostatistics, School of Public Health, Anhui Medical University, 81 Meishan Road, Hefei, Anhui, China.
| |
Collapse
|
13
|
Quandt F, Flottmann F, Madai VI, Alegiani A, Küpper C, Kellert L, Hilbert A, Frey D, Liebig T, Fiehler J, Goyal M, Saver JL, Gerloff C, Thomalla G, Tiedt S. Machine Learning-Based Identification of Target Groups for Thrombectomy in Acute Stroke. Transl Stroke Res 2022; 14:311-321. [PMID: 35670996 PMCID: PMC10159968 DOI: 10.1007/s12975-022-01040-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2022] [Revised: 04/19/2022] [Accepted: 05/22/2022] [Indexed: 11/29/2022]
Abstract
Whether endovascular thrombectomy (EVT) improves functional outcome in patients with large-vessel occlusion (LVO) stroke that do not comply with inclusion criteria of randomized controlled trials (RCTs) but that are considered for EVT in clinical practice is uncertain. We aimed to systematically identify patients with LVO stroke underrepresented in RCTs who might benefit from EVT. Following the premises that (i) patients without reperfusion after EVT represent a non-treated control group and (ii) the level of reperfusion affects outcome in patients with benefit from EVT but not in patients without treatment benefit, we systematically assessed the importance of reperfusion level on functional outcome prediction using machine learning in patients with LVO stroke treated with EVT in clinical practice (N = 5235, German-Stroke-Registry) and in patients treated with EVT or best medical management from RCTs (N = 1488, Virtual-International-Stroke-Trials-Archive). The importance of reperfusion level on outcome prediction in an RCT-like real-world cohort equaled the importance of EVT treatment allocation for outcome prediction in RCT data and was higher compared to an unselected real-world population. The importance of reperfusion level was magnified in patient groups underrepresented in RCTs, including patients with lower NIHSS scores (0-10), M2 occlusions, and lower ASPECTS (0-5 and 6-8). Reperfusion level was equally important in patients with vertebrobasilar as with anterior LVO stroke. The importance of reperfusion level for outcome prediction identifies patient target groups who likely benefit from EVT, including vertebrobasilar stroke patients and among patients underrepresented in RCT patients with low NIHSS scores, low ASPECTS, and M2 occlusions.
Collapse
Affiliation(s)
- Fanny Quandt
- Department of Neurology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Fabian Flottmann
- Department of Diagnostic and Interventional Neuroradiology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Vince I Madai
- Charité Lab for Artificial Intelligence in Medicine-CLAIM, Charité - Universitätsmedizin Berlin, Berlin, Germany.,QUEST Center for Transforming Biomedical Research, Berlin Institute of Health, Charité - Universitätsmedizin Berlin, Berlin, Germany.,School of Computing and Digital Technology, Faculty of Computing, Engineering and the Built Environment, Birmingham City University, Birmingham, UK
| | - Anna Alegiani
- Department of Neurology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Clemens Küpper
- Department of Neurology, University Hospital, LMU Munich, Munich, Germany
| | - Lars Kellert
- Department of Neurology, University Hospital, LMU Munich, Munich, Germany
| | - Adam Hilbert
- Charité Lab for Artificial Intelligence in Medicine-CLAIM, Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Dietmar Frey
- Charité Lab for Artificial Intelligence in Medicine-CLAIM, Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Thomas Liebig
- Institute of Neuroradiology, University Hospital, LMU Munich, Munich, Germany
| | - Jens Fiehler
- Department of Diagnostic and Interventional Neuroradiology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Mayank Goyal
- Department of Radiology, Foothills Medical Centre, University of Calgary, Calgary, AB, Canada
| | - Jeffrey L Saver
- Department of Neurology and Comprehensive Stroke Center, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, USA
| | - Christian Gerloff
- Department of Neurology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Götz Thomalla
- Department of Neurology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Steffen Tiedt
- Institute for Stroke and Dementia Research, University Hospital, LMU Munich, Feodor-Lynen-Straße 17, 81377, Munich, Germany.
| | | |
Collapse
|
14
|
Murali V, Muralidhar YP, Königs C, Nair M, Madhu S, Nedungadi P, Srinivasa G, Athri P. Predicting clinical trial outcomes using drug bioactivities through graph database integration and machine learning. Chem Biol Drug Des 2022; 100:169-184. [PMID: 35587730 DOI: 10.1111/cbdd.14092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2022] [Revised: 04/24/2022] [Accepted: 05/15/2022] [Indexed: 11/29/2022]
Abstract
The ability to estimate the probability of a drug to receive approval in clinical trials provides natural advantages to optimizing pharmaceutical research workflows. Success rates of clinical trials have deep implications for costs, duration of development, and under pressure due to stringent regulatory approval processes. We propose a machine learning approach that can predict the outcome of the trial with reliable accuracies, using biological activities, physicochemical properties of the compounds, target-related features, and NLP-based compound representation. In the above list, biological activities have never been used as an independent variable towards the prediction of clinical trial outcomes. We have extracted the drug-disease pair from clinical trials and mapped target(s) to that pair using multiple data sources. Empirical results demonstrate that ensemble learning outperforms independently trained, small-data ML models. We report results and inferences derived from a Random forest classifier with an average accuracy of 93%, and an F1 score of 0.96 for the "Pass" class. "Pass" refers to one of the two classes (Pass/Fail) of all clinical trials, and the model performed well in predicting the "Pass" category. Through the analysis of feature contributions to predictive capability, we have demonstrated that bioactivity plays a statistically significant role in predicting clinical trial outcome. A significant effort has gone into the production of the dataset that, for the first time, integrates clinical trial information with protein targets. Cleaned, organized, integrated data and code to map these entities, created as a part of this work, are available open-source. This reproducibility and the freely available code ensure that researchers with access to deep curated and proprietary clinical trial databases (we only use open-source data in this study) can further expand the scope of the results.
Collapse
Affiliation(s)
- Vidhya Murali
- Department of Computer Science and Engineering, Amrita School of Engineering, Bengaluru, India
| | - Y Pradyumna Muralidhar
- PES Center for Pattern Recognition, Department of Computer Science and Engineering, PES University, Bengaluru, India
| | - Cassandra Königs
- Bioinformatics and Medical Informatics, Bielefeld University, Northrhine-Westphalia, Germany
| | - Meera Nair
- Amrita School of Biotechnology, Amrita Vishwa Vidyapeetham, Amritapuri, Kerala, India
| | - Sethulekshmi Madhu
- Amrita School of Biotechnology, Amrita Vishwa Vidyapeetham, Amritapuri, Kerala, India
| | - Prema Nedungadi
- Department of Computer Science and Engineering, Amrita School of Engineering, Kerala, India
| | - Gowri Srinivasa
- PES Center for Pattern Recognition, Department of Computer Science and Engineering, PES University, Bengaluru, India
| | - Prashanth Athri
- Department of Computer Science and Engineering, Amrita School of Engineering, Bengaluru, India
| |
Collapse
|
15
|
Zhang C, Mou M, Zhou Y, Zhang W, Lian X, Shi S, Lu M, Sun H, Li F, Wang Y, Zeng Z, Li Z, Zhang B, Qiu Y, Zhu F, Gao J. Biological activities of drug inactive ingredients. Brief Bioinform 2022; 23:6582006. [PMID: 35524477 DOI: 10.1093/bib/bbac160] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Revised: 04/01/2022] [Accepted: 04/09/2022] [Indexed: 02/06/2023] Open
Abstract
In a drug formulation (DFM), the major components by mass are not Active Pharmaceutical Ingredient (API) but rather Drug Inactive Ingredients (DIGs). DIGs can reach much higher concentrations than that achieved by API, which raises great concerns about their clinical toxicities. Therefore, the biological activities of DIG on physiologically relevant target are widely demanded by both clinical investigation and pharmaceutical industry. However, such activity data are not available in any existing pharmaceutical knowledge base, and their potentials in predicting the DIG-target interaction have not been evaluated yet. In this study, the comprehensive assessment and analysis on the biological activities of DIGs were therefore conducted. First, the largest number of DIGs and DFMs were systematically curated and confirmed based on all drugs approved by US Food and Drug Administration. Second, comprehensive activities for both DIGs and DFMs were provided for the first time to pharmaceutical community. Third, the biological targets of each DIG and formulation were fully referenced to available databases that described their pharmaceutical/biological characteristics. Finally, a variety of popular artificial intelligence techniques were used to assess the predictive potential of DIGs' activity data, which was the first evaluation on the possibility to predict DIG's activity. As the activities of DIGs are critical for current pharmaceutical studies, this work is expected to have significant implications for the future practice of drug discovery and precision medicine.
Collapse
Affiliation(s)
- Chenyang Zhang
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Minjie Mou
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Ying Zhou
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China.,State Key Laboratory for Diagnosis and Treatment of Infectious Disease, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, The First Affiliated Hospital, Zhejiang University, 79 QingChun Road, Hangzhou, Zhejiang 310000, China
| | - Wei Zhang
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Xichen Lian
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Shuiyang Shi
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Mingkun Lu
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Huaicheng Sun
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Fengcheng Li
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Yunxia Wang
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Zhenyu Zeng
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China
| | - Zhaorong Li
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China
| | - Bing Zhang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China
| | - Yunqing Qiu
- State Key Laboratory for Diagnosis and Treatment of Infectious Disease, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, The First Affiliated Hospital, Zhejiang University, 79 QingChun Road, Hangzhou, Zhejiang 310000, China
| | - Feng Zhu
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China.,Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China
| | - Jianqing Gao
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China.,Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, Zhejiang, China
| |
Collapse
|
16
|
He B, Wei C, Cai Q, Zhang P, Shi S, Peng X, Zhao Z, Yin W, Tu G, Peng W, Tao Y, Wang X. Switched alternative splicing events as attractive features in lung squamous cell carcinoma. Cancer Cell Int 2022; 22:5. [PMID: 34986865 PMCID: PMC8734344 DOI: 10.1186/s12935-021-02429-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2021] [Accepted: 12/23/2021] [Indexed: 11/10/2022] Open
Abstract
Background Alternative splicing (AS) plays important roles in transcriptome and proteome diversity. Its dysregulation has a close affiliation with oncogenic processes. This study aimed to evaluate AS-based biomarkers by machine learning algorithms for lung squamous cell carcinoma (LUSC) patients. Method The Cancer Genome Atlas (TCGA) database and TCGA SpliceSeq database were utilized. After data composition balancing, Boruta feature selection and Spearman correlation analysis were used for differentially expressed AS events. Random forests and a nested fivefold cross-validation were applied for lymph node metastasis (LNM) classifier building. Random survival forest combined with Cox regression model was performed for a prognostic model, based on which a nomogram was developed. Functional enrichment analysis and Spearman correlation analysis were also conducted to explore underlying mechanisms. The expression of some switch-involved AS events along with parent genes was verified by qRT-PCR with 20 pairs of normal and LUSC tissues. Results We found 16 pairs of splicing events from same parent genes which were strongly related to the splicing switch (intrapair correlation coefficient = − 1). Next, we built a reliable LNM classifier based on 13 AS events as well as a nice prognostic model, in which switched AS events behaved prominently. The qRT-PCR presented consistent results with previous bioinformatics analysis, and some AS events like ITIH5-10715-AT and QKI-78404-AT showed remarkable detection efficiency for LUSC. Conclusion AS events, especially switched ones from the same parent genes, could provide new insights into the molecular diagnosis and therapeutic drug design of LUSC. Supplementary Information The online version contains supplementary material available at 10.1186/s12935-021-02429-2.
Collapse
Affiliation(s)
- Boxue He
- Department of Thoracic Surgery, Second Xiangya Hospital, Central South University, Changsha, 410011, China.,Hunan Key Laboratory of Early Diagnosis and Precise Treatment of Lung Cancer, Second Xiangya Hospital, Central South University, Changsha, 410011, China.,Xiangya School of Medicine, Central South University, Changsha, 410008, China
| | - Cong Wei
- Xiangya School of Medicine, Central South University, Changsha, 410008, China
| | - Qidong Cai
- Department of Thoracic Surgery, Second Xiangya Hospital, Central South University, Changsha, 410011, China.,Hunan Key Laboratory of Early Diagnosis and Precise Treatment of Lung Cancer, Second Xiangya Hospital, Central South University, Changsha, 410011, China
| | - Pengfei Zhang
- Department of Thoracic Surgery, Second Xiangya Hospital, Central South University, Changsha, 410011, China.,Hunan Key Laboratory of Early Diagnosis and Precise Treatment of Lung Cancer, Second Xiangya Hospital, Central South University, Changsha, 410011, China
| | - Shuai Shi
- Department of Thoracic Surgery, Second Xiangya Hospital, Central South University, Changsha, 410011, China.,Hunan Key Laboratory of Early Diagnosis and Precise Treatment of Lung Cancer, Second Xiangya Hospital, Central South University, Changsha, 410011, China
| | - Xiong Peng
- Department of Thoracic Surgery, Second Xiangya Hospital, Central South University, Changsha, 410011, China.,Hunan Key Laboratory of Early Diagnosis and Precise Treatment of Lung Cancer, Second Xiangya Hospital, Central South University, Changsha, 410011, China
| | - Zhenyu Zhao
- Department of Thoracic Surgery, Second Xiangya Hospital, Central South University, Changsha, 410011, China.,Hunan Key Laboratory of Early Diagnosis and Precise Treatment of Lung Cancer, Second Xiangya Hospital, Central South University, Changsha, 410011, China
| | - Wei Yin
- Department of Thoracic Surgery, Second Xiangya Hospital, Central South University, Changsha, 410011, China.,Hunan Key Laboratory of Early Diagnosis and Precise Treatment of Lung Cancer, Second Xiangya Hospital, Central South University, Changsha, 410011, China
| | - Guangxu Tu
- Department of Thoracic Surgery, Second Xiangya Hospital, Central South University, Changsha, 410011, China.,Hunan Key Laboratory of Early Diagnosis and Precise Treatment of Lung Cancer, Second Xiangya Hospital, Central South University, Changsha, 410011, China
| | - Weilin Peng
- Department of Thoracic Surgery, Second Xiangya Hospital, Central South University, Changsha, 410011, China.,Hunan Key Laboratory of Early Diagnosis and Precise Treatment of Lung Cancer, Second Xiangya Hospital, Central South University, Changsha, 410011, China
| | - Yongguang Tao
- Department of Thoracic Surgery, Second Xiangya Hospital, Central South University, Changsha, 410011, China.,Hunan Key Laboratory of Early Diagnosis and Precise Treatment of Lung Cancer, Second Xiangya Hospital, Central South University, Changsha, 410011, China.,Key Laboratory of Carcinogenesis and Cancer Invasion, Ministry of Education, Department of Pathology, Xiangya Hospital, Central South University, Hunan, 410078, China.,NHC Key Laboratory of Carcinogenesis (Central South University), Cancer Research Institute and School of Basic Medicine, Central South University, Changsha, 410078, Hunan, China
| | - Xiang Wang
- Department of Thoracic Surgery, Second Xiangya Hospital, Central South University, Changsha, 410011, China. .,Hunan Key Laboratory of Early Diagnosis and Precise Treatment of Lung Cancer, Second Xiangya Hospital, Central South University, Changsha, 410011, China.
| |
Collapse
|
17
|
Santaolalla A, Hulsen T, Davis J, Ahmed HU, Moore CM, Punwani S, Attard G, McCartan N, Emberton M, Coolen A, Van Hemelrijck M. The ReIMAGINE Multimodal Warehouse: Using Artificial Intelligence for Accurate Risk Stratification of Prostate Cancer. Front Artif Intell 2021; 4:769582. [PMID: 34870187 PMCID: PMC8637844 DOI: 10.3389/frai.2021.769582] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2021] [Accepted: 10/12/2021] [Indexed: 02/05/2023] Open
Abstract
Introduction. Prostate cancer (PCa) is the most frequent cancer diagnosis in men worldwide. Our ability to identify those men whose cancer will decrease their lifespan and/or quality of life remains poor. The ReIMAGINE Consortium has been established to improve PCa diagnosis. Materials and methods. MRI will likely become the future cornerstone of the risk-stratification process for men at risk of early prostate cancer. We will, for the first time, be able to combine the underlying molecular changes in PCa with the state-of-the-art imaging. ReIMAGINE Screening invites men for MRI and PSA evaluation. ReIMAGINE Risk includes men at risk of prostate cancer based on MRI, and includes biomarker testing. Results. Baseline clinical information, genomics, blood, urine, fresh prostate tissue samples, digital pathology and radiomics data will be analysed. Data will be de-identified, stored with correlated mpMRI disease endotypes and linked with long term follow-up outcomes in an instance of the Philips Clinical Data Lake, consisting of cloud-based software. The ReIMAGINE platform includes application programming interfaces and a user interface that allows users to browse data, select cohorts, manage users and access rights, query data, and more. Connection to analytics tools such as Python allows statistical and stratification method pipelines to run profiling regression analyses. Discussion. The ReIMAGINE Multimodal Warehouse comprises a unique data source for PCa research, to improve risk stratification for PCa and inform clinical practice. The de-identified dataset characterized by clinical, imaging, genomics and digital pathology PCa patient phenotypes will be a valuable resource for the scientific and medical community.
Collapse
Affiliation(s)
- Aida Santaolalla
- King’s College London, School of Cancer and Pharmaceutical Sciences, Translational Oncology and Urology Research (TOUR), London, United Kingdom
| | - Tim Hulsen
- Philips Research, Department of Hospital Services and Informatics, Eindhoven, Netherlands
| | - Jenson Davis
- Philips, Data Science Services, Best, Netherlands
| | - Hashim U. Ahmed
- Imperial College London, Faculty of Medicine, Imperial Prostate, Department of Surgery and Cancer, London, United Kingdom
| | - Caroline M. Moore
- Division of Surgical and Interventional Science, University College London, London, United Kingdom
| | - Shonit Punwani
- Centre for Medical Imaging, University College London, London, United Kingdom
| | - Gert Attard
- Cancer Institute, University College London, London, United Kingdom
| | - Neil McCartan
- Division of Surgical and Interventional Science, University College London, London, United Kingdom
| | - Mark Emberton
- Division of Surgical and Interventional Science, University College London, London, United Kingdom
| | - Anthony Coolen
- King’s College London, School of Cancer and Pharmaceutical Sciences, Translational Oncology and Urology Research (TOUR), London, United Kingdom
- Department of Biophysics, Donders Institute, Radboud University Nijmegen, Nijmegen, Netherlands
| | - Mieke Van Hemelrijck
- King’s College London, School of Cancer and Pharmaceutical Sciences, Translational Oncology and Urology Research (TOUR), London, United Kingdom
| |
Collapse
|
18
|
Nicolet BP, Zandhuis ND, Lattanzio VM, Wolkers MC. Sequence determinants as key regulators in gene expression of T cells. Immunol Rev 2021; 304:10-29. [PMID: 34486113 PMCID: PMC9292449 DOI: 10.1111/imr.13021] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2021] [Revised: 08/09/2021] [Accepted: 08/17/2021] [Indexed: 12/12/2022]
Abstract
T cell homeostasis, T cell differentiation, and T cell effector function rely on the constant fine-tuning of gene expression. To alter the T cell state, substantial remodeling of the proteome is required. This remodeling depends on the intricate interplay of regulatory mechanisms, including post-transcriptional gene regulation. In this review, we discuss how the sequence of a transcript influences these post-transcriptional events. In particular, we review how sequence determinants such as sequence conservation, GC content, and chemical modifications define the levels of the mRNA and the protein in a T cell. We describe the effect of different forms of alternative splicing on mRNA expression and protein production, and their effect on subcellular localization. In addition, we discuss the role of sequences and structures as binding hubs for miRNAs and RNA-binding proteins in T cells. The review thus highlights how the intimate interplay of post-transcriptional mechanisms dictate cellular fate decisions in T cells.
Collapse
Affiliation(s)
- Benoit P. Nicolet
- Department of HematopoiesisSanquin Research and Landsteiner LaboratoryAmsterdam UMCUniversity of AmsterdamAmsterdamThe Netherlands
- Oncode InstituteUtrechtThe Netherlands
| | - Nordin D. Zandhuis
- Department of HematopoiesisSanquin Research and Landsteiner LaboratoryAmsterdam UMCUniversity of AmsterdamAmsterdamThe Netherlands
- Oncode InstituteUtrechtThe Netherlands
| | - V. Maria Lattanzio
- Department of HematopoiesisSanquin Research and Landsteiner LaboratoryAmsterdam UMCUniversity of AmsterdamAmsterdamThe Netherlands
- Oncode InstituteUtrechtThe Netherlands
| | - Monika C. Wolkers
- Department of HematopoiesisSanquin Research and Landsteiner LaboratoryAmsterdam UMCUniversity of AmsterdamAmsterdamThe Netherlands
- Oncode InstituteUtrechtThe Netherlands
| |
Collapse
|
19
|
Cheng Y, Chen C, Yang J, Yang H, Fu M, Zhong X, Wang B, He M, Hu Z, Zhang Z, Jin X, Kang Y, Wu Q. Using Machine Learning Algorithms to Predict Hospital Acquired Thrombocytopenia after Operation in the Intensive Care Unit: A Retrospective Cohort Study. Diagnostics (Basel) 2021; 11:diagnostics11091614. [PMID: 34573956 PMCID: PMC8466367 DOI: 10.3390/diagnostics11091614] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Revised: 08/25/2021] [Accepted: 09/01/2021] [Indexed: 02/05/2023] Open
Abstract
Hospital acquired thrombocytopenia (HAT) is a common hematological complication after surgery. This research aimed to develop and compare the performance of seven machine learning (ML) algorithms for predicting patients that are at risk of HAT after surgery. We conducted a retrospective cohort study which enrolled adult patients transferred to the intensive care unit (ICU) after surgery in West China Hospital of Sichuan University from January 2016 to December 2018. All subjects were randomly divided into a derivation set (70%) and test set (30%). ten-fold cross-validation was used to estimate the hyperparameters of ML algorithms during the training process in the derivation set. After ML models were developed, the sensitivity, specificity, area under the curve (AUC), and net benefit (decision analysis curve, DCA) were calculated to evaluate the performances of ML models in the test set. A total of 10,369 patients were included and in 1354 (13.1%) HAT occurred. The AUC of all seven ML models exceeded 0.7, the two highest were Gradient Boosting (GB) (0.834, 0.814-0.853, p < 0.001) and Random Forest (RF) (0.828, 0.807-0.848, p < 0.001). There was no difference between GB and RF (0.834 vs. 0.828, p = 0.293); however, these two were better than the remaining five models (p < 0.001). The DCA revealed that all ML models had high net benefits with a threshold probability approximately less than 0.6. In conclusion, we found that ML models constructed by multiple preoperative variables can predict HAT in patients transferred to ICU after surgery, which can improve risk stratification and guide management in clinical practice.
Collapse
Affiliation(s)
- Yisong Cheng
- Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.)
| | - Chaoyue Chen
- Department of Neurosurgery, West China Hospital, Sichuan University, Chengdu 610041, China;
| | - Jie Yang
- Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.)
| | - Hao Yang
- Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.)
| | - Min Fu
- Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.)
| | - Xi Zhong
- Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.)
| | - Bo Wang
- Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.)
| | - Min He
- Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.)
| | - Zhi Hu
- Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.)
| | - Zhongwei Zhang
- Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.)
| | - Xiaodong Jin
- Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.)
| | - Yan Kang
- Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.)
| | - Qin Wu
- Department of Critical Care Medicine, West China Hospital, Sichuan University, Chengdu 610041, China; (Y.C.); (J.Y.); (H.Y.); (M.F.); (X.Z.); (B.W.); (M.H.); (Z.H.); (Z.Z.); (X.J.); (Y.K.)
- Correspondence: ; Tel.: +86-028-8542-2506
| |
Collapse
|
20
|
Feng Y, Wang Z, Yang N, Liu S, Yan J, Song J, Yang S, Zhang Y. Identification of Biomarkers for Cervical Cancer Radiotherapy Resistance Based on RNA Sequencing Data. Front Cell Dev Biol 2021; 9:724172. [PMID: 34414195 PMCID: PMC8369412 DOI: 10.3389/fcell.2021.724172] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2021] [Accepted: 07/14/2021] [Indexed: 11/28/2022] Open
Abstract
Cervical cancer as a common gynecological malignancy threatens the health and lives of women. Resistance to radiotherapy is the primary cause of treatment failure and is mainly related to difference in the inherent vulnerability of tumors after radiotherapy. Here, we investigated signature genes associated with poor response to radiotherapy by analyzing an independent cervical cancer dataset from the Gene Expression Omnibus, including pre-irradiation and mid-irradiation information. A total of 316 differentially expressed genes were significantly identified. The correlations between these genes were investigated through the Pearson correlation analysis. Subsequently, random forest model was used in determining cancer-related genes, and all genes were ranked by random forest scoring. The top 30 candidate genes were selected for uncovering their biological functions. Functional enrichment analysis revealed that the biological functions chiefly enriched in tumor immune responses, such as cellular defense response, negative regulation of immune system process, T cell activation, neutrophil activation involved in immune response, regulation of antigen processing and presentation, and peptidyl-tyrosine autophosphorylation. Finally, the top 30 genes were screened and analyzed through literature verification. After validation, 10 genes (KLRK1, LCK, KIF20A, CD247, FASLG, CD163, ZAP70, CD8B, ZNF683, and F10) were to our objective. Overall, the present research confirmed that integrated bioinformatics methods can contribute to the understanding of the molecular mechanisms and potential therapeutic targets underlying radiotherapy resistance in cervical cancer.
Collapse
Affiliation(s)
- Yue Feng
- Department of Gynecological Radiotherapy, Harbin Medical University Cancer Hospital, Harbin, China
| | - Zhao Wang
- Department of Gynecological Radiotherapy, Harbin Medical University Cancer Hospital, Harbin, China
| | - Nan Yang
- Department of Gynecological Radiotherapy, Harbin Medical University Cancer Hospital, Harbin, China
| | - Sijia Liu
- Department of Gynecological Radiotherapy, Harbin Medical University Cancer Hospital, Harbin, China
| | - Jiazhuo Yan
- Department of Gynecological Radiotherapy, Harbin Medical University Cancer Hospital, Harbin, China
| | - Jiayu Song
- Department of Gynecological Radiotherapy, Harbin Medical University Cancer Hospital, Harbin, China
| | - Shanshan Yang
- Department of Gynecological Radiotherapy, Harbin Medical University Cancer Hospital, Harbin, China
| | - Yunyan Zhang
- Department of Gynecological Radiotherapy, Harbin Medical University Cancer Hospital, Harbin, China
| |
Collapse
|