1
|
Książek K, Masarczyk W, Głomb P, Romaszewski M, Stokłosa I, Ścisło P, Dębski P, Pudlo R, Buza K, Gorczyca P, Piegza M. Assessment of symptom severity in psychotic disorder patients based on heart rate variability and accelerometer mobility data. Comput Biol Med 2024; 176:108544. [PMID: 38723395 DOI: 10.1016/j.compbiomed.2024.108544] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Revised: 04/22/2024] [Accepted: 04/28/2024] [Indexed: 05/31/2024]
Abstract
BACKGROUND Advancement in mental health care requires easily accessible, efficient diagnostic and treatment assessment tools. Viable biomarkers could enable objectification and automation of the diagnostic and treatment process, currently dependent on a psychiatric interview. Available wearable technology and computational methods make it possible to incorporate heart rate variability (HRV), an indicator of autonomic nervous system (ANS) activity, into potential diagnostic and treatment assessment frameworks as a biomarker of disease severity in mental disorders, including schizophrenia and bipolar disorder (BD). METHOD We used a commercially available electrocardiography (ECG) chest strap with a built-in accelerometer, i.e. Polar H10, to record R-R intervals and physical activity of 30 hospitalized schizophrenia or BD patients and 30 control participants through ca. 1.5-2 h time periods. We validated a novel approach to data acquisition based on a flexible, patient-friendly and cost-effective setting. We analyzed the relationship between HRV and the Positive and Negative Syndrome Scale (PANSS) test scores, as well as the HRV and mobility coefficient. We also proposed a method of rest period selection based on R-R intervals and mobility data. The source code for reproducing all experiments is available on GitHub, while the dataset is published on Zenodo. RESULTS Mean HRV values were lower in the patient compared to the control group and negatively correlated with the results of the PANSS general subcategory. For the control group, we also discovered the inversely proportional dependency between the mobility coefficient, based on accelerometer data, and HRV. This relationship was less pronounced for the treatment group. CONCLUSIONS HRV value itself, as well as the relationship between HRV and mobility, may be promising biomarkers in disease diagnostics. These findings can be used to develop a flexible monitoring system for symptom severity assessment.
Collapse
Affiliation(s)
- Kamil Książek
- Institute of Theoretical and Applied Informatics, Polish Academy of Sciences, Bałtycka 5, Gliwice, 44-100, Poland.
| | - Wilhelm Masarczyk
- Department of Psychiatry, Faculty of Medical Sciences in Zabrze, Medical University of Silesia, Pyskowicka 49, Tarnowskie Góry, 42-612, Poland
| | - Przemysław Głomb
- Institute of Theoretical and Applied Informatics, Polish Academy of Sciences, Bałtycka 5, Gliwice, 44-100, Poland
| | - Michał Romaszewski
- Institute of Theoretical and Applied Informatics, Polish Academy of Sciences, Bałtycka 5, Gliwice, 44-100, Poland
| | - Iga Stokłosa
- Department of Psychiatry, Faculty of Medical Sciences in Zabrze, Medical University of Silesia, Pyskowicka 49, Tarnowskie Góry, 42-612, Poland
| | - Piotr Ścisło
- Psychiatric Department of the Multidisciplinary Hospital, Tarnowskie Góry, 42-612, Poland
| | - Paweł Dębski
- Institute of Psychology, Humanitas University in Sosnowiec, Kilińskiego 43, Sosnowiec, 41-200, Poland
| | - Robert Pudlo
- Department of Psychoprophylaxis, Faculty of Medical Sciences in Zabrze, Medical University of Silesia, Pyskowicka 49, Tarnowskie Góry, 42-612, Poland
| | - Krisztián Buza
- Budapest Business University, Buzogány utca 10-12, Budapest, 1149, Hungary; BioIntelligence Group, Department of Mathematics-Informatics, Sapientia Hungarian University of Transylvania, Târgu Mureş, Romania
| | - Piotr Gorczyca
- Department of Psychiatry, Faculty of Medical Sciences in Zabrze, Medical University of Silesia, Pyskowicka 49, Tarnowskie Góry, 42-612, Poland
| | - Magdalena Piegza
- Department of Psychiatry, Faculty of Medical Sciences in Zabrze, Medical University of Silesia, Pyskowicka 49, Tarnowskie Góry, 42-612, Poland
| |
Collapse
|
2
|
Ou M, Du Z, Jiang Y, Zhou Q, Yuan J, Tian L, Zhu H. Causal relationship between schizophrenia and sex hormone binding globulin: A Mendelian randomization study. Schizophr Res 2024; 267:528-533. [PMID: 37735050 DOI: 10.1016/j.schres.2023.09.030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Revised: 08/21/2023] [Accepted: 09/15/2023] [Indexed: 09/23/2023]
Abstract
OBJECTIVE The underlying etiology of schizophrenia is still not fully understood, and recent studies have pointed to a potential link between hormonal factors and the risk of developing this condition. Sex Hormone-Binding Globulin (SHBG) is a protein that regulates the bioavailability of sex hormones. However, the causal relationship between SHBG levels and schizophrenia remains unclear, this study aimed to investigate the causal relationship based on two-sample Mendelian randomization (MR) analysis. METHODS This study was based on the summary data of genome-wide association studies (GWAS) of schizophrenia and SHBG in European populations. Two-sample MR was applied, and genetic factors were used as instrumental variables, and the causal relationship between schizophrenia and SHBG was assessed. RESULTS We selected 79 single nucleotide polymorphisms with genome-wide significance from the schizophrenia GWAS as instrumental variables. The inverse variance weighting (IVW) method results showed that there is a causal relationship and a positive correlation between schizophrenia and female SHBG, with an odds ratio (OR) of 1.024 (95%CI: 1.007-1.042, P = 0.005), and this result was further confirmed by the Weighted median odds ratio (OR) of 1.032 (95%CI: 1.016-1.048, P = 5.58E-05) and the Weighted mode of 1.035 (95%CI: 1.004-1.067, P = 0.028). Schizophrenia and male SHBG also have a causal relationship and a positive correlation, with an odds ratio (OR) of 1.027 (95%CI: 1.007-1.047, P = 0.008). CONCLUSION This study found a positive correlation between schizophrenia and SHBG in both men and women through MR analysis, indicating that the level of SHBG may be elevated in patients with schizophrenia, regardless of gender.
Collapse
Affiliation(s)
- Mengmeng Ou
- Mental Health Center of Jiangnan University, Wuxi Central Rehabilitation Hospital, Wuxi, Jiangsu 214151, China
| | - Zhiqiang Du
- Mental Health Center of Jiangnan University, Wuxi Central Rehabilitation Hospital, Wuxi, Jiangsu 214151, China
| | - Ying Jiang
- Mental Health Center of Jiangnan University, Wuxi Central Rehabilitation Hospital, Wuxi, Jiangsu 214151, China
| | - Qin Zhou
- Mental Health Center of Jiangnan University, Wuxi Central Rehabilitation Hospital, Wuxi, Jiangsu 214151, China
| | - Jianming Yuan
- Mental Health Center of Jiangnan University, Wuxi Central Rehabilitation Hospital, Wuxi, Jiangsu 214151, China
| | - Lin Tian
- Mental Health Center of Jiangnan University, Wuxi Central Rehabilitation Hospital, Wuxi, Jiangsu 214151, China.
| | - Haohao Zhu
- Mental Health Center of Jiangnan University, Wuxi Central Rehabilitation Hospital, Wuxi, Jiangsu 214151, China.
| |
Collapse
|
3
|
Yang Q, Chen S, Jiang W, Mi L, Liu J, Hu Y, Ji X, Wang J, Zhu F. MultiClassMetabo: A Superior Classification Model Constructed Using Metabolic Markers in Multiclass Metabolomics. Anal Chem 2024; 96:1410-1418. [PMID: 38221713 DOI: 10.1021/acs.analchem.3c03212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2024]
Abstract
Multiclass metabolomics has become a popular technique for revealing the mechanisms underlying certain physiological processes, different tumor types, or different therapeutic responses. In multiclass metabolomics, it is highly important to uncover the underlying biological information on biosamples by identifying the metabolic markers with the most associations and classifying the different sample classes. The classification problem of multiclass metabolomics is more difficult than that of the binary problem. To date, various methods exist for constructing classification models and identifying metabolic markers consisting of well-established techniques and newly emerging machine learning algorithms. However, how to construct a superior classification model using these methods remains unclear for a given multiclass metabolomic data set. Herein, MultiClassMetabo has been developed for constructing a superior classification model using metabolic markers identified in multiclass metabolomics. MultiClassMetabo can enable online services, including (a) identifying metabolic markers by marker identification methods, (b) constructing classification models by classification methods, and (c) performing a comprehensive assessment from multiple perspectives to construct a superior classification model for multiclass metabolomics. In summary, MultiClassMetabo is distinguished for its capability to construct a superior classification model using the most appropriate method through a comprehensive assessment, which makes it an important complement to other available tools in multiclass metabolomics. MultiClassMetabo can be accessed at http://idrblab.cn/multiclassmetabo/.
Collapse
Affiliation(s)
- Qingxia Yang
- Zhejiang Provincial Key Laboratory of Precision Diagnosis and Therapy for Major Gynecological Diseases, Women's Hospital, Zhejiang University School of Medicine, Hangzhou 310058, China
- Department of Bioinformatics, School of Geographic and Biologic Information, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
| | - Shuman Chen
- Department of Bioinformatics, School of Geographic and Biologic Information, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
| | - Wenyu Jiang
- Department of Bioinformatics, School of Geographic and Biologic Information, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
| | - Lan Mi
- Department of Bioinformatics, School of Geographic and Biologic Information, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
| | - Jiarui Liu
- Department of Bioinformatics, School of Geographic and Biologic Information, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
| | - Yu Hu
- Department of Bioinformatics, School of Geographic and Biologic Information, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
| | - Xinglai Ji
- Department of Bioinformatics, School of Geographic and Biologic Information, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
| | - Jun Wang
- Department of Bioinformatics, School of Geographic and Biologic Information, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
| | - Feng Zhu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China
| |
Collapse
|
4
|
Fu M, He R, Zhang Z, Ma F, Shen L, Zhang Y, Duan M, Zhang Y, Wang Y, Zhu L, He J. Multinomial machine learning identifies independent biomarkers by integrated metabolic analysis of acute coronary syndrome. Sci Rep 2023; 13:20535. [PMID: 37996510 PMCID: PMC10667512 DOI: 10.1038/s41598-023-47783-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Accepted: 11/18/2023] [Indexed: 11/25/2023] Open
Abstract
A multi-class classification model for acute coronary syndrome (ACS) remains to be constructed based on multi-fluid metabolomics. Major confounders may exert spurious effects on the relationship between metabolism and ACS. The study aims to identify an independent biomarker panel for the multiclassification of HC, UA, and AMI by integrating serum and urinary metabolomics. We performed a liquid chromatography-tandem mass spectrometry (LC-MS/MS)-based metabolomics study on 300 serum and urine samples from 44 patients with unstable angina (UA), 77 with acute myocardial infarction (AMI), and 29 healthy controls (HC). Multinomial machine learning approaches, including multinomial adaptive least absolute shrinkage and selection operator (LASSO) regression and random forest (RF), and assessment of the confounders were applied to integrate a multi-class classification biomarker panel for HC, UA and AMI. Different metabolic landscapes were portrayed during the transition from HC to UA and then to AMI. Glycerophospholipid metabolism and arginine biosynthesis were predominant during the progression from HC to UA and then to AMI. The multiclass metabolic diagnostic model (MDM) dependent on ACS, including 2-ketobutyric acid, LysoPC(18:2(9Z,12Z)), argininosuccinic acid, and cyclic GMP, demarcated HC, UA, and AMI, providing a C-index of 0.84 (HC vs. UA), 0.98 (HC vs. AMI), and 0.89 (UA vs. AMI). The diagnostic value of MDM largely derives from the contribution of 2-ketobutyric acid, and LysoPC(18:2(9Z,12Z)) in serum. Higher 2-ketobutyric acid and cyclic GMP levels were positively correlated with ACS risk and atherosclerosis plaque burden, while LysoPC(18:2(9Z,12Z)) and argininosuccinic acid showed the reverse relationship. An independent multiclass biomarker panel for HC, UA, and AMI was constructed using the multinomial machine learning methods based on serum and urinary metabolite signatures.
Collapse
Affiliation(s)
- Meijiao Fu
- Ningxia Medical University, Yinchuan, 750004, Ningxia, China
| | - Ruhua He
- Department of Cardiology, General Hospital of Ningxia Medical University, Yinchuan, 750004, Ningxia, China
| | - Zhihan Zhang
- Department of Cardiology, Hanzhong Central Hospital, Hanzhong, 723200, Shanxi, China
| | - Fuqing Ma
- Department of Cardiology, The Fifth People's Hospital of Ningxia, Shizuishan, 753000, Ningxia, China
| | - Libo Shen
- Center for Cardiovascular Diseases, People's Hospital of Ningxia Hui Autonomous Region, Yinchuan, 750002, Ningxia, China
| | - Yu Zhang
- Ningxia Medical University, Yinchuan, 750004, Ningxia, China
| | - Mingyu Duan
- Ningxia Medical University, Yinchuan, 750004, Ningxia, China
| | - Yameng Zhang
- Department of Cardiology, The Second Affiliated Hospital of Henan University of Science and Technology, Luoyang, 471000, Henan, China
| | - Yifan Wang
- Department of Radiology, General Hospital of Ningxia Medical University, Yinchuan, 750004, Ningxia, China
| | - Li Zhu
- Department of Radiology, General Hospital of Ningxia Medical University, Yinchuan, 750004, Ningxia, China.
| | - Jun He
- Department of Cardiology, General Hospital of Ningxia Medical University, Yinchuan, 750004, Ningxia, China.
| |
Collapse
|
5
|
You P, Liu S, Li Q, Xie D, Yao L, Guo C, Guo Z, Wang T, Qiu H, Guo Y, Li J, Zhou H. Radiation-sensitive genetic prognostic model identifies individuals at risk for radiation resistance in head and neck squamous cell carcinoma. J Cancer Res Clin Oncol 2023; 149:15623-15640. [PMID: 37656244 DOI: 10.1007/s00432-023-05304-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2023] [Accepted: 08/15/2023] [Indexed: 09/02/2023]
Abstract
BACKGROUND The advantages of radiotherapy for head and neck squamous cell carcinoma (HNSCC) depend on the radiation sensitivity of the patient. Here, we established and verified radiological factor-related gene signature and built a prognostic risk model to predict whether radiotherapy would be beneficial. METHODS Data from The Cancer Genome Atlas, Gene Expression Omnibus, and RadAtlas databases were subjected to LASSO regression, univariate COX regression, and multivariate COX regression analyses to integrate genomic and clinical information from patients with HNSCC. HNSCC radiation-related prognostic genes were identified, and patients classified into high- and low-risk groups, based on risk scores. Variations in radiation sensitivity according to immunological microenvironment, functional pathways, and immunotherapy response were investigated. Finally, the expression of HNSCC radiation-related genes was verified by qRT-PCR. RESULTS We built a clinical risk prediction model comprising a 15-gene signature and used it to divide patients into two groups based on their susceptibility to radiation: radiation-sensitive and radiation-resistant. Overall survival was significantly greater in the radiation-sensitive than the radiation-resistant group. Further, our model was an independent predictor of radiotherapy response, outperforming other clinical parameters, and could be combined with tumor mutational burden, to identify the target population with good predictive value for prognosis at 1, 2, and 3 years. Additionally, the radiation-resistant group was more vulnerable to low levels of immune infiltration, which are significantly associated with DNA damage repair, hypoxia, and cell cycle regulation. Tumor Immune Dysfunction and Exclusion scores also suggested that the resistant group would respond less favorably to immunotherapy. CONCLUSIONS Our prognostic model based on a radiation-related gene signature has potential for application as a tool for risk stratification of radiation therapy for patients with HNSCC, helping to identify candidates for radiation therapy and overcome radiation resistance.
Collapse
Affiliation(s)
- Peimeng You
- Nanchang University, Nanchang, China
- Jiangxi Key Laboratory of Translational Cancer Research, Jiangxi Cancer Hospital, Nanchang, China
| | - Shengbo Liu
- Second Clinical College of Medicine, Southern Medical University, Guangzhou, China
- Department of Thoracic Surgery, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
| | - Qiaxuan Li
- Department of Thoracic Surgery, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
- Shantou University Medical College, Shantou, China
| | - Daipeng Xie
- Department of Thoracic Surgery, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
- Guangdong Cardiovascular Institute, Guangzhou, China
| | - Lintong Yao
- Department of Thoracic Surgery, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
- Shantou University Medical College, Shantou, China
| | - Chenguang Guo
- Department of Radiation Oncology, The First Affiliated Hospital of Guangdong Pharmaceutical University, Guangzhou, China
| | - Zefeng Guo
- Department of Radiation Oncology, The First Affiliated Hospital of Guangdong Pharmaceutical University, Guangzhou, China
| | - Ting Wang
- Department of Radiation Oncology, The First Affiliated Hospital of Guangdong Pharmaceutical University, Guangzhou, China
| | - Hongrui Qiu
- Department of Thoracic Surgery, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China
- Shantou University Medical College, Shantou, China
| | - Yangzhong Guo
- Jiangxi Key Laboratory of Translational Cancer Research, Jiangxi Cancer Hospital, Nanchang, China
| | - Junyu Li
- Jiangxi Key Laboratory of Translational Cancer Research, Jiangxi Cancer Hospital, Nanchang, China.
| | - Haiyu Zhou
- Nanchang University, Nanchang, China.
- Department of Thoracic Surgery, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, China.
- Jiangxi Lung Cancer Institute, Nanchang, China.
| |
Collapse
|
6
|
Malik MA, Faraone SV, Michoel T, Haavik J. Use of big data and machine learning algorithms to extract possible treatment targets in neurodevelopmental disorders. Pharmacol Ther 2023; 250:108530. [PMID: 37708996 DOI: 10.1016/j.pharmthera.2023.108530] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 08/30/2023] [Accepted: 09/11/2023] [Indexed: 09/16/2023]
Abstract
Neurodevelopmental disorders (NDDs) impact multiple aspects of an individual's functioning, including social interactions, communication, and behaviors. The underlying biological mechanisms of NDDs are not yet fully understood, and pharmacological treatments have been limited in their effectiveness, in part due to the complex nature of these disorders and the heterogeneity of symptoms across individuals. Identifying genetic loci associated with NDDs can help in understanding biological mechanisms and potentially lead to the development of new treatments. However, the polygenic nature of these complex disorders has made identifying new treatment targets from genome-wide association studies (GWAS) challenging. Recent advances in the fields of big data and high-throughput tools have provided radically new insights into the underlying biological mechanism of NDDs. This paper reviews various big data approaches, including classical and more recent techniques like deep learning, which can identify potential treatment targets from GWAS and other omics data, with a particular emphasis on NDDs. We also emphasize the increasing importance of explainable and causal machine learning (ML) methods that can aid in identifying genes, molecular pathways, and more complex biological processes that may be future targets of intervention in these disorders. We conclude that these new developments in genetics and ML hold promise for advancing our understanding of NDDs and identifying novel treatment targets.
Collapse
Affiliation(s)
- Muhammad Ammar Malik
- Computational Biology Unit, Department of Informatics, University of Bergen, PO BOX 7803, 5020 Bergen, Norway
| | - Stephen V Faraone
- Department of Psychiatry, Norton College of Medicine at SUNY Upstate Medical University, 13210, NY, USA
| | - Tom Michoel
- Computational Biology Unit, Department of Informatics, University of Bergen, PO BOX 7803, 5020 Bergen, Norway
| | - Jan Haavik
- Department of Biomedicine, University of Bergen, PO BOX 7804, 5020 Bergen, Norway; Bergen Center for Brain Plasticity, Division of Psychiatry, Haukeland University Hospital, PO BOX 1400, 5021 Bergen, Norway.
| |
Collapse
|
7
|
Yang Q, Gong Y, Zhu F. Critical Assessment of the Biomarker Discovery and Classification Methods for Multiclass Metabolomics. Anal Chem 2023; 95:5542-5552. [PMID: 36944135 DOI: 10.1021/acs.analchem.2c04402] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/23/2023]
Abstract
Multiclass metabolomics has been widely applied in clinical practice to understand pathophysiological processes involved in disease progression and diagnostic biomarkers of various disorders. In contrast to the binary problem, the multiclass classification problem is more difficult in terms of obtaining reliable and stable results due to the increase in the complexity of determining exact class decision boundaries. In particular, methods of biomarker discovery and classification have a significant effect on the multiclass model because different methods with significantly varied theories produce conflicting results even for the same dataset. However, a systematic assessment for selecting the most appropriate methods of biomarker discovery and classification for multiclass metabolomics is still lacking. Therefore, a comprehensive assessment is essential to measure the suitability of methods in multiclass classification models from multiple perspectives. In this study, five biomarker discovery methods and nine classification methods were assessed based on four benchmark datasets of multiclass metabolomics. The performance assessment of the biomarker discovery and classification methods was performed using three evaluation criteria: assessment a (cluster analysis of sample grouping), assessment b (biomarker consistency in multiple subgroups), and assessment c (accuracy in the classification model). As a result, 13 combining strategies with superior performance were selected under multiple criteria based on these benchmark datasets. In conclusion, superior strategies that performed consistently well are suggested for the discovery of biomarkers and the construction of a classification model for multiclass metabolomics.
Collapse
Affiliation(s)
- Qingxia Yang
- Department of Bioinformatics, School of Geographic and Biologic Information, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Yaguo Gong
- School of Pharmacy, Macau University of Science and Technology, Macau 999078, China
| | - Feng Zhu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| |
Collapse
|
8
|
Zheng W, Wang T, Liu C, Yan Q, Zhan S, Li G, Liu X, Jiang Y. Liver transcriptomics reveals microRNA features of the host response in a mouse model of dengue virus infection. Comput Biol Med 2022; 150:106057. [PMID: 36215851 DOI: 10.1016/j.compbiomed.2022.106057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2022] [Revised: 07/25/2022] [Accepted: 08/27/2022] [Indexed: 11/16/2022]
Abstract
BACKGROUND Organ dysfunction, especially liver injury, caused by dengue virus (DENV) infection has been associated with fatal cases in dengue patients around the world. However, the pathophysiological mechanisms of liver involvement in dengue remain unclear. There is accumulating evidence that miRNAs are playing an important role in regulating viral pathogenesis, and it can help in diagnostic and anti-viral therapies development. METHODS We collected liver tissues of DENV-infected for small RNA sequencing to identify significantly different express miRNAs during dengue virus infection, and the identified target genes of these miRNAs were annotated by biological function and pathway enrichment. RESULTS 31 significantly altered miRNAs were identified, including 16 up-regulated and 15 down-regulated miRNAs. By performing a series of miRNA prediction and signaling pathway enrichment analyses, the down-regulated miRNAs of mmu-miR-484, mmu-miR-1247-5p and mmu-miR-6538 were identified to be the crucial miRNAs. Further analysis revealed that the inflammation and immune responses involving Hippo, PI3K-Akt, MAPK, Wnt, mTOR, TGF-beta, Tight junction, and Platelet activation were modulated collectively by these three key miRNAs during DENV infection. These pathways are considered to be closely associated with the pathogenic mechanism and treatment strategy of dengue patients. CONCLUSION The miRNAs identified by sequencing, especially miR-484 may be the potential therapeutic targets for liver involvement in dengue patients which involves the regulation of vascular permeability and expression of inflammatory cytokines.
Collapse
Affiliation(s)
- Wenjiang Zheng
- The First Clinical Medical School, Guangzhou University of Chinese Medicine, China; The First Affiliated Hospital, Guangzhou University of Chinese Medicine, China; Animal Experiment Center, Guangzhou University of Chinese Medicine, China.
| | - Ting Wang
- The First Clinical Medical School, Guangzhou University of Chinese Medicine, China; The First Affiliated Hospital, Guangzhou University of Chinese Medicine, China.
| | - Chengxin Liu
- The First Clinical Medical School, Guangzhou University of Chinese Medicine, China; The First Affiliated Hospital, Guangzhou University of Chinese Medicine, China.
| | - Qian Yan
- The First Clinical Medical School, Guangzhou University of Chinese Medicine, China; The First Affiliated Hospital, Guangzhou University of Chinese Medicine, China.
| | - Shaofeng Zhan
- The First Affiliated Hospital, Guangzhou University of Chinese Medicine, China.
| | - Geng Li
- Animal Experiment Center, Guangzhou University of Chinese Medicine, China.
| | - Xiaohong Liu
- The First Affiliated Hospital, Guangzhou University of Chinese Medicine, China.
| | - Yong Jiang
- Shenzhen Hospital of Integrated Traditional Chinese and Western Medicine, China.
| |
Collapse
|
9
|
Zhou Z, Huang C, Fu P, Huang H, Zhang Q, Wu X, Yu Q, Sun Y. Prediction of in-hospital hypokalemia using machine learning and first hospitalization day records in patients with traumatic brain injury. CNS Neurosci Ther 2022; 29:181-191. [PMID: 36258296 PMCID: PMC9804086 DOI: 10.1111/cns.13993] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Revised: 09/18/2022] [Accepted: 09/23/2022] [Indexed: 02/06/2023] Open
Abstract
AIMS Hypokalemia is a common complication following traumatic brain injury, which may complicate treatment and lead to unfavorable outcomes. Identifying patients at risk of hypokalemia on the first day of admission helps to implement prophylactic treatment, reduce complications, and improve prognosis. METHODS This multicenter retrospective study was performed between January 2017 and December 2020 using the electronic medical records of patients admitted due to traumatic brain injury. A propensity score matching approach was adopted with a ratio of 1:1 to overcome overfitting and data imbalance during subgroup analyses. Five machine learning algorithms were applied to generate a best-performed prediction model for in-hospital hypokalemia. The internal fivefold cross-validation and external validation were performed to demonstrate the interpretability and generalizability. RESULTS A total of 4445 TBI patients were recruited for analysis and model generation. Hypokalemia occurred in 46.55% of recruited patients and the incidences of mild, moderate, and severe hypokalemia were 32.06%, 12.69%, and 1.80%, respectively. Hypokalemia was associated with increased mortality, while severe hypokalemia cast greater impacts. The logistic regression algorithm had the best performance in predicting decreased serum potassium and moderate-to-severe hypokalemia, with an AUC of 0.73 ± 0.011 and 0.74 ± 0.019, respectively. The prediction model was further verified using two external datasets, including our previous published data and the open-assessed Medical Information Mart for Intensive Care database. Linearized calibration curves showed no statistical difference (p > 0.05) with perfect predictions. CONCLUSIONS The occurrence of hypokalemia following traumatic brain injury can be predicted by first hospitalization day records and machine learning algorithms. The logistic regression algorithm showed an optimal predicting performance verified by both internal and external validation.
Collapse
Affiliation(s)
- Zhengyu Zhou
- Department of Anesthesia, Huashan HospitalFudan UniversityShanghaiChina
| | - Chiungwei Huang
- Health Consultation and Physical Examination Center, Zhongshan HospitalFudan UniversityShanghaiChina,Department of Neurosurgery, Huashan Hospital, Shanghai Medical CollegeFudan UniversityShanghaiChina
| | - Pengfei Fu
- Department of Neurosurgery, Huashan Hospital, Shanghai Medical CollegeFudan UniversityShanghaiChina
| | - Hong Huang
- Information Center, Huashan HospitalFudan UniversityShanghaiChina
| | - Qi Zhang
- Information Center, Huashan HospitalFudan UniversityShanghaiChina
| | - Xuehai Wu
- Department of Neurosurgery, Huashan Hospital, Shanghai Medical CollegeFudan UniversityShanghaiChina,National Center for Neurological DisordersShanghaiChina,Shanghai Key Laboratory of Brain Function Restoration and Neural RegenerationShanghaiChina,Neurosurgical Institute of Fudan UniversityShanghaiChina,Shanghai Clinical Medical Center of NeurosurgeryShanghaiChina
| | - Qiong Yu
- Department of Anesthesia, Huashan HospitalFudan UniversityShanghaiChina
| | - Yirui Sun
- Department of Neurosurgery, Huashan Hospital, Shanghai Medical CollegeFudan UniversityShanghaiChina,National Center for Neurological DisordersShanghaiChina,Shanghai Key Laboratory of Brain Function Restoration and Neural RegenerationShanghaiChina,Neurosurgical Institute of Fudan UniversityShanghaiChina,Shanghai Clinical Medical Center of NeurosurgeryShanghaiChina
| |
Collapse
|
10
|
Yang Q, Li Y, Li B, Gong Y. A novel multi-class classification model for schizophrenia, bipolar disorder and healthy controls using comprehensive transcriptomic data. Comput Biol Med 2022; 148:105956. [PMID: 35981456 DOI: 10.1016/j.compbiomed.2022.105956] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Revised: 07/30/2022] [Accepted: 08/06/2022] [Indexed: 01/01/2023]
Abstract
Two common psychiatric disorders, schizophrenia (SCZ) and bipolar disorder (BP), confer lifelong disability and collectively affect 2% of the world population. Because the diagnosis of psychiatry is based only on symptoms, developing more effective methods for the diagnosis of psychiatric disorders is a major international public health priority. Furthermore, SCZ and BP overlap considerably in terms of symptoms and risk genes. Therefore, the clarity of the underlying etiology and pathology remains lacking for these two disorders. Although many studies have been conducted, a classification model with higher accuracy and consistency was found to still be necessary for accurate diagnoses of SCZ and BP. In this study, a comprehensive dataset was combined from five independent transcriptomic studies. This dataset comprised 120 patients with SCZ, 101 patients with BP, and 149 healthy subjects. The partial least squares discriminant analysis (PLS-DA) method was applied to identify the gene signature among multiple groups, and 341 differentially expressed genes (DEGs) were identified. Then, the disease relevance of these DEGs was systematically performed, including (α) the great disease relevance of the identified signature, (β) the hub genes of the protein-protein interaction network playing a key role in psychiatric disorders, and (γ) gene ontology terms and enriched pathways playing a key role in psychiatric disorders. Finally, a popular multi-class classifier, support vector machine (SVM), was applied to construct a novel multi-class classification model using the identified signature for SCZ and BP. Using the independent test sets, the classification capacity of this multi-class model was assessed, which showed this model had a strong classification ability.
Collapse
Affiliation(s)
- Qingxia Yang
- Department of Bioinformatics, Smart Health Big Data Analysis and Location Services Engineering Lab of Jiangsu Province, School of Geographic and Biologic Information, Nanjing University of Posts and Telecommunications, Nanjing, 210023, China.
| | - Yi Li
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou, China
| | - Bo Li
- College of Life Sciences, Chongqing Normal University, Chongqing, Chongqing, 401331, China
| | - Yaguo Gong
- School of Pharmacy, Macau University of Science and Technology, Macau, China.
| |
Collapse
|
11
|
Babu G, Nobel FA. Identification of differentially expressed genes and their major pathways among the patient with COVID-19, cystic fibrosis, and chronic kidney disease. INFORMATICS IN MEDICINE UNLOCKED 2022; 32:101038. [PMID: 35966126 PMCID: PMC9357445 DOI: 10.1016/j.imu.2022.101038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2022] [Revised: 07/31/2022] [Accepted: 08/01/2022] [Indexed: 11/19/2022] Open
Abstract
The SARS-CoV-2 virus causes Coronavirus disease, an infectious disease. The majority of people who are infected with this virus will have mild to moderate respiratory symptoms. Multiple studies have proved that there is a substantial pathophysiological link between COVID-19 disease and patients having comorbidities such as cystic fibrosis and chronic kidney disease. In this study, we attempted to identify differentially expressed genes as well as genes that intersected among them in order to comprehend their compatibility. Gene expression profiling indicated that 849 genes were mutually exclusive and functional analysis was done within the context of gene ontology and key pathways involvement. Three genes (PRPF31, FOXN2, and RIOK3) were commonly upregulated in the analysed datasets of three disease categories. These genes could be potential biomarkers for patients with COVID-19 and cystic fibrosis, and COVID-19 and chronic kidney disease. Further extensive analyses have been performed to describe how these genes are regulated by various transcription factors and microRNAs. Then, our analyses revealed six hub genes (PRPF31, FOXN2, RIOK3, UBC, HNF4A, and ELAVL). As they were involved in the interaction between COVID-19 and the patient with CF and CKD, they could help researchers identify potential therapeutic molecules. Some drugs have been predicted based on the upregulated genes, which may have a significant impact on reducing the burden of these diseases in the future.
Collapse
Affiliation(s)
- Golap Babu
- Department of Biochemistry and Molecular Biology, Jahangirnagar University, Savar, Dhaka, 1342, Bangladesh
| | - Fahim Alam Nobel
- Department of Biochemistry and Molecular Biology, Mawlana Bhashani Science and Technology University, Santosh, Tangail, 1902, Bangladesh
| |
Collapse
|
12
|
Zhang S, Sun X, Mou M, Amahong K, Sun H, Zhang W, Shi S, Li Z, Gao J, Zhu F. REGLIV: Molecular regulation data of diverse living systems facilitating current multiomics research. Comput Biol Med 2022; 148:105825. [PMID: 35872412 DOI: 10.1016/j.compbiomed.2022.105825] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Revised: 06/29/2022] [Accepted: 07/03/2022] [Indexed: 12/24/2022]
Abstract
Multiomics is a powerful technique in molecular biology that facilitates the identification of new associations among different molecules (genes, proteins & metabolites). It has attracted tremendous research interest from the scientists worldwide and has led to an explosive number of published studies. Most of these studies are based on the regulation data provided in available databases. Therefore, it is essential to have molecular regulation data that are strictly validated in the living systems of various cell lines and in vivo models. However, no database has been developed yet to provide comprehensive molecular regulation information validated by living systems. Herein, a new database, Molecular Regulation Data of Living System Facilitating Multiomics Study (REGLIV) is introduced to describe various types of molecular regulation tested by the living systems. (1) A total of 2996 regulations describe the changes in 1109 metabolites triggered by alterations in 284 genes or proteins, and (2) 1179 regulations describe the variations in 926 proteins induced by 125 endogenous metabolites. Overall, REGLIV is unique in (a) providing the molecular regulation of a clearly defined regulatory direction other than simple correlation, (b) focusing on molecular regulations that are validated in a living system not simply in an in vitro test, and (c) describing the disease/tissue/species specific property underlying each regulation. Therefore, REGLIV has important implications for the future practice of not only multiomics, but also other fields relevant to molecular regulation. REGLIV is freely accessible at: https://idrblab.org/regliv/.
Collapse
Affiliation(s)
- Song Zhang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Xiuna Sun
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Minjie Mou
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Kuerbannisha Amahong
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Huaicheng Sun
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Wei Zhang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Shuiyang Shi
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Zhaorong Li
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, 330110, China
| | - Jianqing Gao
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China; Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, Zhejiang, China
| | - Feng Zhu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China; Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, 330110, China.
| |
Collapse
|
13
|
Artificial intelligence and machine-learning approaches in structure and ligand-based discovery of drugs affecting central nervous system. Mol Divers 2022; 27:959-985. [PMID: 35819579 DOI: 10.1007/s11030-022-10489-3] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Accepted: 06/21/2022] [Indexed: 12/11/2022]
Abstract
CNS disorders are indications with a very high unmet medical needs, relatively smaller number of available drugs, and a subpar satisfaction level among patients and caregiver. Discovery of CNS drugs is extremely expensive affair with its own unique challenges leading to extremely high attrition rates and low efficiency. With explosion of data in information age, there is hardly any aspect of life that has not been touched by data driven technologies such as artificial intelligence (AI) and machine learning (ML). Drug discovery is no exception, emergence of big data via genomic, proteomic, biological, and chemical technologies has driven pharmaceutical giants to collaborate with AI oriented companies to revolutionise drug discovery, with the goal of increasing the efficiency of the process. In recent years many examples of innovative applications of AI and ML techniques in CNS drug discovery has been reported. Research on therapeutics for diseases such as schizophrenia, Alzheimer's and Parkinsonism has been provided with a new direction and thrust from these developments. AI and ML has been applied to both ligand-based and structure-based drug discovery and design of CNS therapeutics. In this review, we have summarised the general aspects of AI and ML from the perspective of drug discovery followed by a comprehensive coverage of the recent developments in the applications of AI/ML techniques in CNS drug discovery.
Collapse
|
14
|
A radiation resistance related index for biochemical recurrence and tumor immune environment in prostate cancer patients. Comput Biol Med 2022; 146:105711. [PMID: 35701253 DOI: 10.1016/j.compbiomed.2022.105711] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Revised: 05/08/2022] [Accepted: 06/04/2022] [Indexed: 11/20/2022]
Abstract
PURPOSE To establish and verify a novel radiation resistance related index for predicting biochemical recurrence and tumor immune environment in prostate cancer (PCa) patients. MATERIALS AND METHODS The transcriptome information of PCa were obtained from GEO and TCGA portal. We identified radiation resistance related genes (RRGs) between radioresistant and radiosensitive PCa cells. We conducted multivariate Cox analysis to construct a novel radiation resistance related index for predicting biochemical recurrence (BCR)-free survival (BCRFS). Internal and external validations were conducted. Preliminary experimental verifications were performed. RESULTS We identified 194 differentially expressed RRGs and three radiation resistance related molecular clusters for PCa. Moreover, we established a novel radiation resistance related index and succeeded in conducting internal and external validations. High-risk populations meant significantly worse BCRFS in training, testing and validating cohort. The area under receiver operating characteristic curve were 0.809, 0.698, and 0.712 in training, testing, and validating cohort. The immune microenvironment was significantly different between high and low-risk score patients. Preliminary experiment identified and validated three potential biomarkers related to radiation resistance (ZNF695, TM4SF19, CCDC3) of PCa. CONCLUSIONS This study successfully established and verified a novel radiation resistance related index, which had an excellent performance in predicting BCR and tumor immune microenvironment in patients with PCa.
Collapse
|
15
|
Bagherzadeh S, Shahabi MS, Shalbaf A. Detection of schizophrenia using hybrid of deep learning and brain effective connectivity image from electroencephalogram signal. Comput Biol Med 2022; 146:105570. [DOI: 10.1016/j.compbiomed.2022.105570] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Revised: 04/14/2022] [Accepted: 04/25/2022] [Indexed: 02/06/2023]
|
16
|
Li F, Yin J, Lu M, Yang Q, Zeng Z, Zhang B, Li Z, Qiu Y, Dai H, Chen Y, Zhu F. ConSIG: consistent discovery of molecular signature from OMIC data. Brief Bioinform 2022; 23:6618243. [PMID: 35758241 DOI: 10.1093/bib/bbac253] [Citation(s) in RCA: 55] [Impact Index Per Article: 18.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2022] [Revised: 05/09/2022] [Accepted: 05/31/2022] [Indexed: 12/12/2022] Open
Abstract
The discovery of proper molecular signature from OMIC data is indispensable for determining biological state, physiological condition, disease etiology, and therapeutic response. However, the identified signature is reported to be highly inconsistent, and there is little overlap among the signatures identified from different biological datasets. Such inconsistency raises doubts about the reliability of reported signatures and significantly hampers its biological and clinical applications. Herein, an online tool, ConSIG, was constructed to realize consistent discovery of gene/protein signature from any uploaded transcriptomic/proteomic data. This tool is unique in a) integrating a novel strategy capable of significantly enhancing the consistency of signature discovery, b) determining the optimal signature by collective assessment, and c) confirming the biological relevance by enriching the disease/gene ontology. With the increasingly accumulated concerns about signature consistency and biological relevance, this online tool is expected to be used as an essential complement to other existing tools for OMIC-based signature discovery. ConSIG is freely accessible to all users without login requirement at https://idrblab.org/consig/.
Collapse
Affiliation(s)
- Fengcheng Li
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Jiayi Yin
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Mingkun Lu
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Qingxia Yang
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Zhenyu Zeng
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China
| | - Bing Zhang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China
| | - Zhaorong Li
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China
| | - Yunqing Qiu
- State Key Laboratory for Diagnosis and Treatment of Infectious Disease, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, The First Affiliated Hospital, Zhejiang University, 79 QingChun Road, Hangzhou, Zhejiang 310000, China
| | - Haibin Dai
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China
| | - Yuzong Chen
- State Key Laboratory of Chemical Oncogenomics, Key Laboratory of Chemical Biology, The Graduate School at Shenzhen, Tsinghua University, Shenzhen 518055, China.,Qian Xuesen Collaborative Research Center of Astrochemistry and Space Life Sciences, Institute of Drug Discovery Technology, Ningbo University, Ningbo 315211, China
| | - Feng Zhu
- College of Pharmaceutical Sciences, The Second Affiliated Hospital, Zhejiang University School of Medicine, Zhejiang University, Hangzhou 310058, China.,Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China
| |
Collapse
|
17
|
Application of explainable artificial intelligence in the identification of Squamous Cell Carcinoma biomarkers. Comput Biol Med 2022; 146:105505. [DOI: 10.1016/j.compbiomed.2022.105505] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2022] [Revised: 04/03/2022] [Accepted: 04/05/2022] [Indexed: 11/23/2022]
|
18
|
Yang Q, Gong Y. Construction of the Classification Model Using Key Genes Identified Between Benign and Malignant Thyroid Nodules From Comprehensive Transcriptomic Data. Front Genet 2022; 12:791349. [PMID: 35096008 PMCID: PMC8795894 DOI: 10.3389/fgene.2021.791349] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Accepted: 12/06/2021] [Indexed: 01/15/2023] Open
Abstract
Thyroid nodules are present in upto 50% of the population worldwide, and thyroid malignancy occurs in only 5–15% of nodules. Until now, fine-needle biopsy with cytologic evaluation remains the diagnostic choice to determine the risk of malignancy, yet it fails to discriminate as benign or malignant in one-third of cases. In order to improve the diagnostic accuracy and reliability, molecular testing based on transcriptomic data has developed rapidly. However, gene signatures of thyroid nodules identified in a plenty of transcriptomic studies are highly inconsistent and extremely difficult to be applied in clinical application. Therefore, it is highly necessary to identify consistent signatures to discriminate benign or malignant thyroid nodules. In this study, five independent transcriptomic studies were combined to discover the gene signature between benign and malignant thyroid nodules. This combined dataset comprises 150 malignant and 93 benign thyroid samples. Then, there were 279 differentially expressed genes (DEGs) discovered by the feature selection method (Student’s t test and fold change). And the weighted gene co-expression network analysis (WGCNA) was performed to identify the modules of highly co-expressed genes, and 454 genes in the gray module were discovered as the hub genes. The intersection between DEGs by the feature selection method and hub genes in the WGCNA model was identified as the key genes for thyroid nodules. Finally, four key genes (ST3GAL5, NRCAM, MT1F, and PROS1) participated in the pathogenesis of malignant thyroid nodules were validated using an independent dataset. Moreover, a high-performance classification model for discriminating thyroid nodules was constructed using these key genes. All in all, this study might provide a new insight into the key differentiation of benign and malignant thyroid nodules.
Collapse
Affiliation(s)
- Qingxia Yang
- Smart Health Big Data Analysis and Location Services Engineering Lab of Jiangsu Province, Department of Bioinformatics, School of Geographic and Biologic Information, Nanjing University of Posts and Telecommunications, Nanjing, China
| | - Yaguo Gong
- School of Pharmacy, Macau University of Science and Technology, Macau, China
- *Correspondence: Yaguo Gong,
| |
Collapse
|
19
|
Fu T, Li F, Zhang Y, Yin J, Qiu W, Li X, Liu X, Xin W, Wang C, Yu L, Gao J, Zheng Q, Zeng S, Zhu F. VARIDT 2.0: structural variability of drug transporter. Nucleic Acids Res 2021; 50:D1417-D1431. [PMID: 34747471 PMCID: PMC8728241 DOI: 10.1093/nar/gkab1013] [Citation(s) in RCA: 77] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Revised: 10/08/2021] [Accepted: 11/04/2021] [Indexed: 12/20/2022] Open
Abstract
The structural variability data of drug transporter (DT) are key for research on precision medicine and rational drug use. However, these valuable data are not sufficiently covered by the available databases. In this study, a major update of VARIDT (a database previously constructed to provide DTs' variability data) was thus described. First, the experimentally resolved structures of all DTs reported in the original VARIDT were discovered from PubMed and Protein Data Bank. Second, the structural variability data of each DT were collected by literature review, which included: (a) mutation-induced spatial variations in folded state, (b) difference among DT structures of human and model organisms, (c) outward/inward-facing DT conformations and (d) xenobiotics-driven alterations in the 3D complexes. Third, for those DTs without experimentally resolved structural variabilities, homology modeling was further applied as well-established protocol to enrich such valuable data. As a result, 145 mutation-induced spatial variations of 42 DTs, 1622 inter-species structures originating from 292 DTs, 118 outward/inward-facing conformations belonging to 59 DTs, and 822 xenobiotics-regulated structures in complex with 57 DTs were updated to VARIDT (https://idrblab.org/varidt/ and http://varidt.idrblab.net/). All in all, the newly collected structural variabilities will be indispensable for explaining drug sensitivity/selectivity, bridging preclinical research with clinical trial, revealing the mechanism underlying drug-drug interaction, and so on.
Collapse
Affiliation(s)
- Tingting Fu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.,Institute of Theoretical Chemistry, College of Chemistry, Jilin University, Changchun 130023, China
| | - Fengcheng Li
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Yang Zhang
- Department of Pharmacology, Hebei Medical University, Shijiazhuang 050017, China
| | - Jiayi Yin
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Wenqi Qiu
- Department of Surgery, HKU-SZH & Faculty of Medicine, The University of Hong Kong, Hong Kong, China
| | - Xuedong Li
- Department of Pharmacology, Hebei Medical University, Shijiazhuang 050017, China
| | - Xingang Liu
- Department of Pharmacology, Hebei Medical University, Shijiazhuang 050017, China
| | - Wenwen Xin
- Department of Pharmacology, Hebei Medical University, Shijiazhuang 050017, China
| | - Chengzhao Wang
- Department of Pharmacology, Hebei Medical University, Shijiazhuang 050017, China
| | - Lushan Yu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Jianqing Gao
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.,Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, Zhejiang, China
| | - Qingchuan Zheng
- Institute of Theoretical Chemistry, College of Chemistry, Jilin University, Changchun 130023, China
| | - Su Zeng
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Feng Zhu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.,Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China
| |
Collapse
|
20
|
Rema J, Novais F, Telles-Correia D. Precision Psychiatry: Machine learning as a tool to find new pharmacological targets. Curr Top Med Chem 2021; 22:1261-1269. [PMID: 34607546 DOI: 10.2174/1568026621666211004095917] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2021] [Revised: 07/20/2021] [Accepted: 08/19/2021] [Indexed: 12/18/2022]
Abstract
There is an increasing amount of data arising from neurobehavioral sciences and medical records that cannot be adequately analyzed by traditional research methods. New drugs develop at a slow rate and seem unsatisfactory for the majority of neurobehavioral disorders. Machine learning (ML) techniques, instead, can incorporate psychopathological, computational, cognitive, and neurobiological underpinning knowledge leading to a refinement of detection, diagnosis, prognosis, treatment, research, and support. Machine and deep learning methods are currently used to accelerate the process of discovering new pharmacological targets and drugs. OBJECTIVE The present work reviews current evidence regarding the contribution of machine learning to the discovery of new drug targets. METHODS Scientific articles from PubMed, SCOPUS, EMBASE, and Web of Science Core Collection published until May 2021 were included in this review. RESULTS The most significant areas of research are schizophrenia, depression and anxiety, Alzheimer´s disease, and substance use disorders. ML techniques have pinpointed target gene candidates and pathways, new molecular substances, and several biomarkers regarding psychiatric disorders. Drug repositioning studies using ML have identified multiple drug candidates as promising therapeutic agents. CONCLUSION Next-generation ML techniques and subsequent deep learning may power new findings regarding the discovery of new pharmacological agents by bridging the gap between biological data and chemical drug information.
Collapse
Affiliation(s)
- João Rema
- Faculdade de Medicina da Universidade de Lisboa. Portugal
| | - Filipa Novais
- Faculdade de Medicina da Universidade de Lisboa. Portugal
| | | |
Collapse
|
21
|
Subtype-selective mechanisms of negative allosteric modulators binding to group I metabotropic glutamate receptors. Acta Pharmacol Sin 2021; 42:1354-1367. [PMID: 33122823 PMCID: PMC8285414 DOI: 10.1038/s41401-020-00541-z] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Accepted: 09/15/2020] [Indexed: 02/06/2023]
Abstract
Group I metabotropic glutamate receptors (mGlu1 and mGlu5) are promising targets for multiple psychiatric and neurodegenerative disorders. Understanding the subtype selectivity of mGlu1 and mGlu5 allosteric sites is essential for the rational design of novel modulators with single- or dual-target mechanism of action. In this study, starting from the deposited mGlu1 and mGlu5 crystal structures, we utilized computational modeling approaches integrating docking, molecular dynamics simulation, and efficient post-trajectory analysis to reveal the subtype-selective mechanism of mGlu1 and mGlu5 to 10 diverse drug scaffolds representing known negative allosteric modulators (NAMs) in the literature. The results of modeling identified six pairs of non-conserved residues and four pairs of conserved ones as critical features to distinguish the selective NAMs binding to the corresponding receptors. In addition, nine pairs of residues are beneficial to the development of novel dual-target NAMs of group I metabotropic glutamate receptors. Furthermore, the binding modes of a reported dual-target NAM (VU0467558) in mGlu1 and mGlu5 were predicted to verify the identified residues that play key roles in the receptor selectivity and the dual-target binding. The results of this study can guide rational structure-based design of novel NAMs, and the approach can be generally applicable to characterize the features of selectivity for other G-protein-coupled receptors.
Collapse
|
22
|
Vatansever S, Schlessinger A, Wacker D, Kaniskan HÜ, Jin J, Zhou M, Zhang B. Artificial intelligence and machine learning-aided drug discovery in central nervous system diseases: State-of-the-arts and future directions. Med Res Rev 2021; 41:1427-1473. [PMID: 33295676 PMCID: PMC8043990 DOI: 10.1002/med.21764] [Citation(s) in RCA: 158] [Impact Index Per Article: 39.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Revised: 10/30/2020] [Accepted: 11/20/2020] [Indexed: 01/11/2023]
Abstract
Neurological disorders significantly outnumber diseases in other therapeutic areas. However, developing drugs for central nervous system (CNS) disorders remains the most challenging area in drug discovery, accompanied with the long timelines and high attrition rates. With the rapid growth of biomedical data enabled by advanced experimental technologies, artificial intelligence (AI) and machine learning (ML) have emerged as an indispensable tool to draw meaningful insights and improve decision making in drug discovery. Thanks to the advancements in AI and ML algorithms, now the AI/ML-driven solutions have an unprecedented potential to accelerate the process of CNS drug discovery with better success rate. In this review, we comprehensively summarize AI/ML-powered pharmaceutical discovery efforts and their implementations in the CNS area. After introducing the AI/ML models as well as the conceptualization and data preparation, we outline the applications of AI/ML technologies to several key procedures in drug discovery, including target identification, compound screening, hit/lead generation and optimization, drug response and synergy prediction, de novo drug design, and drug repurposing. We review the current state-of-the-art of AI/ML-guided CNS drug discovery, focusing on blood-brain barrier permeability prediction and implementation into therapeutic discovery for neurological diseases. Finally, we discuss the major challenges and limitations of current approaches and possible future directions that may provide resolutions to these difficulties.
Collapse
Affiliation(s)
- Sezen Vatansever
- Department of Genetics and Genomic SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Mount Sinai Center for Transformative Disease ModelingIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Icahn Institute for Data Science and Genomic TechnologyIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - Avner Schlessinger
- Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Mount Sinai Center for Therapeutics DiscoveryIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - Daniel Wacker
- Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Mount Sinai Center for Therapeutics DiscoveryIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Department of NeuroscienceIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - H. Ümit Kaniskan
- Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Mount Sinai Center for Therapeutics DiscoveryIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Department of Oncological Sciences, Tisch Cancer InstituteIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - Jian Jin
- Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Mount Sinai Center for Therapeutics DiscoveryIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Department of Oncological Sciences, Tisch Cancer InstituteIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - Ming‐Ming Zhou
- Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Department of Oncological Sciences, Tisch Cancer InstituteIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| | - Bin Zhang
- Department of Genetics and Genomic SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Mount Sinai Center for Transformative Disease ModelingIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Icahn Institute for Data Science and Genomic TechnologyIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
- Department of Pharmacological SciencesIcahn School of Medicine at Mount SinaiNew YorkNew YorkUSA
| |
Collapse
|
23
|
Fu J, Zhang Y, Liu J, Lian X, Tang J, Zhu F. Pharmacometabonomics: data processing and statistical analysis. Brief Bioinform 2021; 22:6236068. [PMID: 33866355 DOI: 10.1093/bib/bbab138] [Citation(s) in RCA: 47] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2021] [Revised: 02/09/2021] [Accepted: 03/23/2021] [Indexed: 12/14/2022] Open
Abstract
Individual variations in drug efficacy, side effects and adverse drug reactions are still challenging that cannot be ignored in drug research and development. The aim of pharmacometabonomics is to better understand the pharmacokinetic properties of drugs and monitor the drug effects on specific metabolic pathways. Here, we systematically reviewed the recent technological advances in pharmacometabonomics for better understanding the pathophysiological mechanisms of diseases as well as the metabolic effects of drugs on bodies. First, the advantages and disadvantages of all mainstream analytical techniques were compared. Second, many data processing strategies including filtering, missing value imputation, quality control-based correction, transformation, normalization together with the methods implemented in each step were discussed. Third, various feature selection and feature extraction algorithms commonly applied in pharmacometabonomics were described. Finally, the databases that facilitate current pharmacometabonomics were collected and discussed. All in all, this review provided guidance for researchers engaged in pharmacometabonomics and metabolomics, and it would promote the wide application of metabolomics in drug research and personalized medicine.
Collapse
Affiliation(s)
- Jianbo Fu
- College of Pharmaceutical Sciences in Zhejiang University, China
| | - Ying Zhang
- College of Pharmaceutical Sciences in Zhejiang University, China
| | - Jin Liu
- College of Pharmaceutical Sciences in Zhejiang University, China
| | - Xichen Lian
- College of Pharmaceutical Sciences in Zhejiang University, China
| | - Jing Tang
- Department of Bioinformatics in Chongqing Medical University, China
| | - Feng Zhu
- College of Pharmaceutical Sciences in Zhejiang University, China
| |
Collapse
|
24
|
Zhang S, Amahong K, Sun X, Lian X, Liu J, Sun H, Lou Y, Zhu F, Qiu Y. The miRNA: a small but powerful RNA for COVID-19. Brief Bioinform 2021; 22:1137-1149. [PMID: 33675361 PMCID: PMC7989616 DOI: 10.1093/bib/bbab062] [Citation(s) in RCA: 108] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2020] [Revised: 02/05/2021] [Accepted: 02/08/2021] [Indexed: 12/12/2022] Open
Abstract
Coronavirus disease 2019 (COVID-19) caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a severe and rapidly evolving epidemic. Now, although a few drugs and vaccines have been proved for its treatment and prevention, little systematic comments are made to explain its susceptibility to humans. A few scattered studies used bioinformatics methods to explore the role of microRNA (miRNA) in COVID-19 infection. Combining these timely reports and previous studies about virus and miRNA, we comb through the available clues and seemingly make the perspective reasonable that the COVID-19 cleverly exploits the interplay between the small miRNA and other biomolecules to avoid being effectively recognized and attacked from host immune protection as well to deactivate functional genes that are crucial for immune system. In detail, SARS-CoV-2 can be regarded as a sponge to adsorb host immune-related miRNA, which forces host fall into dysfunction status of immune system. Besides, SARS-CoV-2 encodes its own miRNAs, which can enter host cell and are not perceived by the host's immune system, subsequently targeting host function genes to cause illnesses. Therefore, this article presents a reasonable viewpoint that the miRNA-based interplays between the host and SARS-CoV-2 may be the primary cause that SARS-CoV-2 accesses and attacks the host cells.
Collapse
Affiliation(s)
- Song Zhang
- College of Pharmaceutical Sciences in Zhejiang University and the First Affiliated Hospital of Zhejiang University School of Medicine, China
| | | | - Xiuna Sun
- College of Pharmaceutical Sciences in Zhejiang University, China
| | - Xichen Lian
- College of Pharmaceutical Sciences in Zhejiang University, China
| | - Jin Liu
- College of Pharmaceutical Sciences in Zhejiang University, China
| | - Huaicheng Sun
- College of Pharmaceutical Sciences in Zhejiang University, China
| | - Yan Lou
- Key Laboratory for Diagnosis and Treatment of Infectious Diseases, Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, the First Affiliated Hospital, Zhejiang University School of Medicine, China
| | - Feng Zhu
- College of Pharmaceutical Sciences in Zhejiang University, China
| | - Yunqing Qiu
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, the First Affiliated Hospital, Zhejiang University School of Medicine, China
| |
Collapse
|
25
|
Yang Q, Li B, Chen S, Tang J, Li Y, Li Y, Zhang S, Shi C, Zhang Y, Mou M, Xue W, Zhu F. MMEASE: Online meta-analysis of metabolomic data by enhanced metabolite annotation, marker selection and enrichment analysis. J Proteomics 2021; 232:104023. [PMID: 33130111 DOI: 10.1016/j.jprot.2020.104023] [Citation(s) in RCA: 57] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2019] [Revised: 10/12/2020] [Accepted: 10/22/2020] [Indexed: 12/17/2022]
Abstract
Large-scale and long-term metabolomic studies have attracted widespread attention in the biomedical studies yet remain challenging despite recent technique progresses. In particular, the ineffective way of experiment integration and limited capacity in metabolite annotation are known issues. Herein, we constructed an online tool MMEASE enabling the integration of multiple analytical experiments with an enhanced metabolite annotation and enrichment analysis (https://idrblab.org/mmease/). MMEASE was unique in capable of (1) integrating multiple analytical blocks; (2) providing enriched annotation for >330 thousands of metabolites; (3) conducting enrichment analysis using various categories/sub-categories. All in all, MMEASE aimed at supplying a comprehensive service for large-scale and long-term metabolomics, which might provide valuable guidance to current biomedical studies. SIGNIFICANCE: To facilitate the studies of large-scale and long-term metabolomic analysis, MMEASE was developed to (1) achieve the online integration of multiple datasets from different analytical experiments, (2) provide the most diverse strategies for marker discovery, enabling performance assessment and (3) significantly amplify metabolite annotation and subsequent enrichment analysis. MMEASE aimed at supplying a comprehensive service for long-term and large-scale metabolomics, which might provide valuable guidance to current biomedical studies.
Collapse
Affiliation(s)
- Qingxia Yang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China; Department of Bioinformatics, Smart Health Big Data Analysis and Location Services Engineering Lab of Jiangsu Province, School of Geographic and Biologic Information, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
| | - Bo Li
- College of Life Sciences, Chongqing Normal University, Chongqing, Chongqing 401331, China
| | - Sijie Chen
- School of Pharmaceutical Sciences, School of Big Data and Software Engineering, Chongqing University, Chongqing, Chongqing 401331, China
| | - Jing Tang
- Department of Bioinformatics, Chongqing Medical University, Chongqing, Chongqing 400016, China
| | - Yinghong Li
- School of Pharmaceutical Sciences, School of Big Data and Software Engineering, Chongqing University, Chongqing, Chongqing 401331, China
| | - Yi Li
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China
| | - Song Zhang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China
| | - Cheng Shi
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China
| | - Ying Zhang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China
| | - Minjie Mou
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China
| | - Weiwei Xue
- School of Pharmaceutical Sciences, School of Big Data and Software Engineering, Chongqing University, Chongqing, Chongqing 401331, China
| | - Feng Zhu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China; School of Pharmaceutical Sciences, School of Big Data and Software Engineering, Chongqing University, Chongqing, Chongqing 401331, China.
| |
Collapse
|
26
|
A Method for Identifying Vesicle Transport Proteins Based on LibSVM and MRMD. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2020; 2020:8926750. [PMID: 33133228 PMCID: PMC7591939 DOI: 10.1155/2020/8926750] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Revised: 08/14/2020] [Accepted: 09/16/2020] [Indexed: 12/14/2022]
Abstract
With the development of computer technology, many machine learning algorithms have been applied to the field of biology, forming the discipline of bioinformatics. Protein function prediction is a classic research topic in this subject area. Though many scholars have made achievements in identifying protein by different algorithms, they often extract a large number of feature types and use very complex classification methods to obtain little improvement in the classification effect, and this process is very time-consuming. In this research, we attempt to utilize as few features as possible to classify vesicular transportation proteins and to simultaneously obtain a comparative satisfactory classification result. We adopt CTDC which is a submethod of the method of composition, transition, and distribution (CTD) to extract only 39 features from each sequence, and LibSVM is used as the classification method. We use the SMOTE method to deal with the problem of dataset imbalance. There are 11619 protein sequences in our dataset. We selected 4428 sequences to train our classification model and selected other 1832 sequences from our dataset to test the classification effect and finally achieved an accuracy of 71.77%. After dimension reduction by MRMD, the accuracy is 72.16%.
Collapse
|
27
|
Zhang P, Wu J, Zhai H, Li S. ABCModeller: an automatic data mining tool based on a consistent voting method with a user-friendly graphical interface. Brief Bioinform 2020; 22:5924101. [PMID: 33057581 DOI: 10.1093/bib/bbaa247] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2020] [Revised: 08/27/2020] [Accepted: 09/03/2020] [Indexed: 02/06/2023] Open
Abstract
In order to extract useful information from a huge amount of biological data nowadays, simple and convenient tools are urgently needed for data analysis and modeling. In this paper, an automatic data mining tool, termed as ABCModeller (Automatic Binary Classification Modeller), with a user-friendly graphical interface was developed here, which includes automated functions as data preprocessing, significant feature extraction, classification modeling, model evaluation and prediction. In order to enhance the generalization ability of the final model, a consistent voting method was built here in this tool with the utilization of three popular machine-learning algorithms, as artificial neural network, support vector machine and random forest. Besides, Fibonacci search and orthogonal experimental design methods were also employed here to automatically select significant features in the data space and optimal hyperparameters of the three algorithms to achieve the best model. The reliability of this tool has been verified through multiple benchmark data sets. In addition, with the advantage of a user-friendly graphical interface of this tool, users without any programming skills can easily obtain reliable models directly from original data, which can reduce the complexity of modeling and data mining, and contribute to the development of related research including but not limited to biology. The excitable file of this tool can be downloaded from http://lishuyan.lzu.edu.cn/ABCModeller.rar.
Collapse
|
28
|
Tang J, Wang Y, Luo Y, Fu J, Zhang Y, Li Y, Xiao Z, Lou Y, Qiu Y, Zhu F. Computational advances of tumor marker selection and sample classification in cancer proteomics. Comput Struct Biotechnol J 2020; 18:2012-2025. [PMID: 32802273 PMCID: PMC7403885 DOI: 10.1016/j.csbj.2020.07.009] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2020] [Revised: 07/06/2020] [Accepted: 07/08/2020] [Indexed: 12/11/2022] Open
Abstract
Cancer proteomics has become a powerful technique for characterizing the protein markers driving transformation of malignancy, tracing proteome variation triggered by therapeutics, and discovering the novel targets and drugs for the treatment of oncologic diseases. To facilitate cancer diagnosis/prognosis and accelerate drug target discovery, a variety of methods for tumor marker identification and sample classification have been developed and successfully applied to cancer proteomic studies. This review article describes the most recent advances in those various approaches together with their current applications in cancer-related studies. Firstly, a number of popular feature selection methods are overviewed with objective evaluation on their advantages and disadvantages. Secondly, these methods are grouped into three major classes based on their underlying algorithms. Finally, a variety of sample separation algorithms are discussed. This review provides a comprehensive overview of the advances on tumor maker identification and patients/samples/tissues separations, which could be guidance to the researches in cancer proteomics.
Collapse
Key Words
- ANN, Artificial Neural Network
- ANOVA, Analysis of Variance
- CFS, Correlation-based Feature Selection
- Cancer proteomics
- Computational methods
- DAPC, Discriminant Analysis of Principal Component
- DT, Decision Trees
- EDA, Estimation of Distribution Algorithm
- FC, Fold Change
- GA, Genetic Algorithms
- GR, Gain Ratio
- HC, Hill Climbing
- HCA, Hierarchical Cluster Analysis
- IG, Information Gain
- LDA, Linear Discriminant Analysis
- LIMMA, Linear Models for Microarray Data
- MBF, Markov Blanket Filter
- MWW, Mann–Whitney–Wilcoxon test
- OPLS-DA, Orthogonal Partial Least Squares Discriminant Analysis
- PCA, Principal Component Analysis
- PLS-DA, Partial Least Square Discriminant Analysis
- RF, Random Forest
- RF-RFE, Random Forest with Recursive Feature Elimination
- SA, Simulated Annealing
- SAM, Significance Analysis of Microarrays
- SBE, Sequential Backward Elimination
- SFS, and Sequential Forward Selection
- SOM, Self-organizing Map
- SU, Symmetrical Uncertainty
- SVM, Support Vector Machine
- SVM-RFE, Support Vector Machine with Recursive Feature Elimination
- Sample classification
- Tumor marker selection
- sPLSDA, Sparse Partial Least Squares Discriminant Analysis
- t-SNE, Student t Distribution
- χ2, Chi-square
Collapse
Affiliation(s)
- Jing Tang
- Department of Bioinformatics, Chongqing Medical University, Chongqing 400016, China.,College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Yunxia Wang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Yongchao Luo
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Jianbo Fu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Yang Zhang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.,School of Pharmaceutical Sciences and Innovative Drug Research Centre, Chongqing University, Chongqing 401331, China
| | - Yi Li
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Ziyu Xiao
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Yan Lou
- Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, The First Affiliated Hospital, Zhejiang University, Hangzhou 310000, China
| | - Yunqing Qiu
- Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, The First Affiliated Hospital, Zhejiang University, Hangzhou 310000, China
| | - Feng Zhu
- Department of Bioinformatics, Chongqing Medical University, Chongqing 400016, China.,College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| |
Collapse
|
29
|
Zhang S, Zhou Y, Wang Y, Wang Z, Xiao Q, Zhang Y, Lou Y, Qiu Y, Zhu F. The mechanistic, diagnostic and therapeutic novel nucleic acids for hepatocellular carcinoma emerging in past score years. Brief Bioinform 2020; 22:1860-1883. [PMID: 32249290 DOI: 10.1093/bib/bbaa023] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2020] [Revised: 02/09/2020] [Accepted: 02/12/2020] [Indexed: 02/07/2023] Open
Abstract
Despite The Central Dogma states the destiny of gene as 'DNA makes RNA and RNA makes protein', the nucleic acids not only store and transmit genetic information but also, surprisingly, join in intracellular vital movement as a regulator of gene expression. Bioinformatics has contributed to knowledge for a series of emerging novel nucleic acids molecules. For typical cases, microRNA (miRNA), long noncoding RNA (lncRNA) and circular RNA (circRNA) exert crucial role in regulating vital biological processes, especially in malignant diseases. Due to extraordinarily heterogeneity among all malignancies, hepatocellular carcinoma (HCC) has emerged enormous limitation in diagnosis and therapy. Mechanistic, diagnostic and therapeutic nucleic acids for HCC emerging in past score years have been systematically reviewed. Particularly, we have organized recent advances on nucleic acids of HCC into three facets: (i) summarizing diverse nucleic acids and their modification (miRNA, lncRNA, circRNA, circulating tumor DNA and DNA methylation) acting as potential biomarkers in HCC diagnosis; (ii) concluding different patterns of three key noncoding RNAs (miRNA, lncRNA and circRNA) in gene regulation and (iii) outlining the progress of these novel nucleic acids for HCC diagnosis and therapy in clinical trials, and discuss their possibility for clinical applications. All in all, this review takes a detailed look at the advances of novel nucleic acids from potential of biomarkers and elaboration of mechanism to early clinical application in past 20 years.
Collapse
Affiliation(s)
- Song Zhang
- State Key Laboratory for Diagnosis and Treatment of Infectious Disease, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, The First Affiliated Hospital in Zhejiang University, China.,College of Pharmaceutical Sciences in Zhejiang University, China
| | - Ying Zhou
- State Key Laboratory for Diagnosis and Treatment of Infectious Disease, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, The First Affiliated Hospital in Zhejiang University, China
| | - Yanan Wang
- School of Life Sciences in Nanchang University, China
| | - Zhengwen Wang
- College of Pharmaceutical Sciences in Zhejiang University, China
| | - Qitao Xiao
- College of Pharmaceutical Sciences in Zhejiang University, China
| | - Ying Zhang
- College of Pharmaceutical Sciences in Zhejiang University, China
| | - Yan Lou
- State Key Laboratory for Diagnosis and Treatment of Infectious Disease, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, The First Affiliated Hospital in Zhejiang University, China
| | - Yunqing Qiu
- State Key Laboratory for Diagnosis and Treatment of Infectious Disease, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, The First Affiliated Hospital in Zhejiang University, China
| | - Feng Zhu
- State Key Laboratory for Diagnosis and Treatment of Infectious Disease, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, The First Affiliated Hospital in Zhejiang University, China.,College of Pharmaceutical Sciences in Zhejiang University, China
| |
Collapse
|
30
|
Yang Q, Wang Y, Li F, Zhang S, Luo Y, Li Y, Tang J, Li B, Chen Y, Xue W, Zhu F. Identification of the gene signature reflecting schizophrenia's etiology by constructing artificial intelligence-based method of enhanced reproducibility. CNS Neurosci Ther 2019; 25:1054-1063. [PMID: 31350824 PMCID: PMC6698965 DOI: 10.1111/cns.13196] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2019] [Revised: 06/27/2019] [Accepted: 07/03/2019] [Indexed: 12/15/2022] Open
Abstract
AIMS As one of the most fundamental questions in modern science, "what causes schizophrenia (SZ)" remains a profound mystery due to the absence of objective gene markers. The reproducibility of the gene signatures identified by independent studies is found to be extremely low due to the incapability of available feature selection methods and the lack of measurement on validating signatures' robustness. These irreproducible results have significantly limited our understanding of the etiology of SZ. METHODS In this study, a new feature selection strategy was developed, and a comprehensive analysis was then conducted to ensure a reliable signature discovery. Particularly, the new strategy (a) combined multiple randomized sampling with consensus scoring and (b) assessed gene ranking consistency among different datasets, and a comprehensive analysis among nine independent studies was conducted. RESULTS Based on a first-ever evaluation of methods' reproducibility that was cross-validated by nine independent studies, the newly developed strategy was found to be superior to the traditional ones. As a result, 33 genes were consistently identified from multiple datasets by the new strategy as differentially expressed, which might facilitate our understanding of the mechanism underlying the etiology of SZ. CONCLUSION A new strategy capable of enhancing the reproducibility of feature selection in current SZ research was successfully constructed and validated. A group of candidate genes identified in this study should be considered as great potential for revealing the etiology of SZ.
Collapse
Affiliation(s)
- Qing‐Xia Yang
- College of Pharmaceutical SciencesZhejiang UniversityHangzhouChina
- School of Pharmaceutical SciencesChongqing UniversityChongqingChina
| | - Yun‐Xia Wang
- College of Pharmaceutical SciencesZhejiang UniversityHangzhouChina
| | - Feng‐Cheng Li
- College of Pharmaceutical SciencesZhejiang UniversityHangzhouChina
| | - Song Zhang
- College of Pharmaceutical SciencesZhejiang UniversityHangzhouChina
| | - Yong‐Chao Luo
- College of Pharmaceutical SciencesZhejiang UniversityHangzhouChina
| | - Yi Li
- College of Pharmaceutical SciencesZhejiang UniversityHangzhouChina
| | - Jing Tang
- College of Pharmaceutical SciencesZhejiang UniversityHangzhouChina
- School of Pharmaceutical SciencesChongqing UniversityChongqingChina
| | - Bo Li
- School of Pharmaceutical SciencesChongqing UniversityChongqingChina
| | - Yu‐Zong Chen
- Bioinformatics and Drug Design Group, Department of PharmacyNational University of SingaporeSingaporeSingapore
| | - Wei‐Wei Xue
- School of Pharmaceutical SciencesChongqing UniversityChongqingChina
| | - Feng Zhu
- College of Pharmaceutical SciencesZhejiang UniversityHangzhouChina
- School of Pharmaceutical SciencesChongqing UniversityChongqingChina
| |
Collapse
|