1
|
Liu C, Gershon ES. Endophenotype 2.0: updated definitions and criteria for endophenotypes of psychiatric disorders, incorporating new technologies and findings. Transl Psychiatry 2024; 14:502. [PMID: 39719446 PMCID: PMC11668880 DOI: 10.1038/s41398-024-03195-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/25/2024] [Revised: 11/28/2024] [Accepted: 12/05/2024] [Indexed: 12/26/2024] Open
Abstract
Recent genetic studies have linked numerous loci to psychiatric disorders. However, the biological pathways that connect these genetic associations to psychiatric disorders' specific pathophysiological processes are largely unclear. Endophenotypes, first defined over five decades ago, are heritable traits, independent of disease state that are associated with a disease, encompassing a broad range of neurophysiological, biochemical, endocrinological, neuroanatomical, cognitive, and neuropsychological characteristics. Considering the advancements in genetics and genomics over recent decades, we propose a revised definition of endophenotypes as 'genetically influenced phenotypes linked to disease or treatment characteristics and their related events.' We also updated endophenotype criteria to include (1) reliable measurement, (2) association with the disease or its related events, and (3) genetic mediation. 'Genetic mediation' is introduced to differentiate between causality and pleiotropic effects and allows non-linear relationships. Furthermore, this updated Endophenotype 2.0 framework expands to encompass genetically regulated responses to disease-related factors, including environmental risks, illness progression, treatment responses, and resilience phenotypes, which may be state-dependent. This broadened definition paves the way for developing new endophenotypes crucial for genetic analyses in psychiatric disorders. Integrating genetics, genomics, and diverse endophenotypes into multi-dimensional mechanistic models is vital for advancing our understanding of psychiatric disorders. Crucially, elucidating the biological underpinnings of endophenotypes will enhance our grasp of psychiatric genetics, thereby improving disease risk prediction and treatment approaches.
Collapse
Affiliation(s)
- Chunyu Liu
- Department of Psychiatry, SUNY Upstate Medical University, Syracuse, NY, USA.
- School of Life Sciences, Central South University, Changsha, China.
| | - Elliot S Gershon
- Departments of Psychiatry and Human Genetics, The University of Chicago, Chicago, IL, USA.
| |
Collapse
|
2
|
Chandrashekar PB, Alatkar S, Wang J, Hoffman GE, He C, Jin T, Khullar S, Bendl J, Fullard JF, Roussos P, Wang D. DeepGAMI: deep biologically guided auxiliary learning for multimodal integration and imputation to improve genotype-phenotype prediction. Genome Med 2023; 15:88. [PMID: 37904203 PMCID: PMC10617196 DOI: 10.1186/s13073-023-01248-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Accepted: 10/16/2023] [Indexed: 11/01/2023] Open
Abstract
BACKGROUND Genotypes are strongly associated with disease phenotypes, particularly in brain disorders. However, the molecular and cellular mechanisms behind this association remain elusive. With emerging multimodal data for these mechanisms, machine learning methods can be applied for phenotype prediction at different scales, but due to the black-box nature of machine learning, integrating these modalities and interpreting biological mechanisms can be challenging. Additionally, the partial availability of these multimodal data presents a challenge in developing these predictive models. METHOD To address these challenges, we developed DeepGAMI, an interpretable neural network model to improve genotype-phenotype prediction from multimodal data. DeepGAMI leverages functional genomic information, such as eQTLs and gene regulation, to guide neural network connections. Additionally, it includes an auxiliary learning layer for cross-modal imputation allowing the imputation of latent features of missing modalities and thus predicting phenotypes from a single modality. Finally, DeepGAMI uses integrated gradient to prioritize multimodal features for various phenotypes. RESULTS We applied DeepGAMI to several multimodal datasets including genotype and bulk and cell-type gene expression data in brain diseases, and gene expression and electrophysiology data of mouse neuronal cells. Using cross-validation and independent validation, DeepGAMI outperformed existing methods for classifying disease types, and cellular and clinical phenotypes, even using single modalities (e.g., AUC score of 0.79 for Schizophrenia and 0.73 for cognitive impairment in Alzheimer's disease). CONCLUSION We demonstrated that DeepGAMI improves phenotype prediction and prioritizes phenotypic features and networks in multiple multimodal datasets in complex brains and brain diseases. Also, it prioritized disease-associated variants, genes, and regulatory networks linked to different phenotypes, providing novel insights into the interpretation of gene regulatory mechanisms. DeepGAMI is open-source and available for general use.
Collapse
Affiliation(s)
- Pramod Bharadwaj Chandrashekar
- Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53076, USA
| | - Sayali Alatkar
- Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA
- Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI, 53076, USA
| | - Jiebiao Wang
- Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA, 15261, USA
| | - Gabriel E Hoffman
- Center for Disease Neurogenomics, Department of Psychiatry and Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| | - Chenfeng He
- Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53076, USA
| | - Ting Jin
- Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53076, USA
| | - Saniya Khullar
- Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53076, USA
| | - Jaroslav Bendl
- Center for Disease Neurogenomics, Department of Psychiatry and Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| | - John F Fullard
- Center for Disease Neurogenomics, Department of Psychiatry and Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
| | - Panos Roussos
- Center for Disease Neurogenomics, Department of Psychiatry and Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA
- Mental Illness Research, Education and Clinical Centers, James J. Peters VA Medical Center, Bronx, NY, 10468, USA
- Center for Dementia Research, Nathan Kline Institute for Psychiatric Research, Orangeburg, NY, 10962, USA
| | - Daifeng Wang
- Waisman Center, University of Wisconsin-Madison, Madison, WI, 53705, USA.
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, 53076, USA.
- Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI, 53076, USA.
| |
Collapse
|
3
|
Xue X, Wu X, Liu L, Liu L, Zhu F. ERVW-1 Activates ATF6-Mediated Unfolded Protein Response by Decreasing GANAB in Recent-Onset Schizophrenia. Viruses 2023; 15:1298. [PMID: 37376599 PMCID: PMC10304270 DOI: 10.3390/v15061298] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Revised: 05/27/2023] [Accepted: 05/29/2023] [Indexed: 06/29/2023] Open
Abstract
Schizophrenia, a mental disorder, afflicts 1% of the worldwide population. The dysregulation of homeostasis in the endoplasmic reticulum (ER) has been implicated in schizophrenia. Moreover, recent studies indicate that ER stress and the unfolded protein response (UPR) are linked to this mental disorder. Our previous research has verified that endogenous retrovirus group W member 1 envelope (ERVW-1), a risk factor for schizophrenia, is elevated in individuals with schizophrenia. Nevertheless, no literature is available regarding the underlying relationship between ER stress and ERVW-1 in schizophrenia. The aim of our research was to investigate the molecular mechanism connecting ER stress and ERVW-1 in schizophrenia. Here, we employed Gene Differential Expression Analysis to predict differentially expressed genes (DEGs) in the human prefrontal cortex of schizophrenic patients and identified aberrant expression of UPR-related genes. Subsequent research indicated that the UPR gene called XBP1 had a positive correlation with ATF6, BCL-2, and ERVW-1 in individuals with schizophrenia using Spearman correlation analysis. Furthermore, results from the enzyme-linked immunosorbent assay (ELISA) suggested increased serum protein levels of ATF6 and XBP1 in schizophrenic patients compared with healthy controls, exhibiting a strong correlation with ERVW-1 using median analysis and Mann-Whitney U analysis. However, serum GANAB levels were decreased in schizophrenic patients compared with controls and showed a significant negative correlation with ERVW-1, ATF6, and XBP1 in schizophrenic patients. Interestingly, in vitro experiments verified that ERVW-1 indeed increased ATF6 and XBP1 expression while decreasing GANAB expression. Additionally, the confocal microscope experiment suggested that ERVW-1 could impact the shape of the ER, leading to ER stress. GANAB was found to participate in ER stress regulated by ERVW-1. In conclusion, ERVW-1 induced ER stress by suppressing GANAB expression, thereby upregulating the expression of ATF6 and XBP1 and ultimately contributing to the development of schizophrenia.
Collapse
Affiliation(s)
- Xing Xue
- State Key Laboratory of Virology, Department of Medical Microbiology, School of Basic Medical Sciences, Wuhan University, Wuhan 430071, China; (X.X.); (X.W.); (L.L.)
| | - Xiulin Wu
- State Key Laboratory of Virology, Department of Medical Microbiology, School of Basic Medical Sciences, Wuhan University, Wuhan 430071, China; (X.X.); (X.W.); (L.L.)
| | - Lijuan Liu
- State Key Laboratory of Virology, Department of Medical Microbiology, School of Basic Medical Sciences, Wuhan University, Wuhan 430071, China; (X.X.); (X.W.); (L.L.)
- Hubei Province Key Laboratory of Allergy & Immunology, Wuhan University, Wuhan 430071, China
| | | | - Fan Zhu
- State Key Laboratory of Virology, Department of Medical Microbiology, School of Basic Medical Sciences, Wuhan University, Wuhan 430071, China; (X.X.); (X.W.); (L.L.)
- Hubei Province Key Laboratory of Allergy & Immunology, Wuhan University, Wuhan 430071, China
| |
Collapse
|
4
|
Ding Y, He W, Tang J, Zou Q, Guo F. Laplacian Regularized Sparse Representation Based Classifier for Identifying DNA N4-Methylcytosine Sites via L 2,1/2-Matrix Norm. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:500-511. [PMID: 34882559 DOI: 10.1109/tcbb.2021.3133309] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
N4-methylcytosine (4mC) is one of important epigenetic modifications in DNA sequences. Detecting 4mC sites is time-consuming. The computational method based on machine learning has provided effective help for identifying 4mC. To further improve the performance of prediction, we propose a Laplacian Regularized Sparse Representation based Classifier with L2,1/2-matrix norm (LapRSRC). We also utilize kernel trick to derive the kernel LapRSRC for nonlinear modeling. Matrix factorization technology is employed to solve the sparse representation coefficients of all test samples in the training set. And an efficient iterative algorithm is proposed to solve the objective function. We implement our model on six benchmark datasets of 4mC and eight UCI datasets to evaluate performance. The results show that the performance of our method is better or comparable.
Collapse
|
5
|
Evidence that the pituitary gland connects type 2 diabetes mellitus and schizophrenia based on large-scale trans-ethnic genetic analyses. J Transl Med 2022; 20:501. [PMID: 36329495 PMCID: PMC9632150 DOI: 10.1186/s12967-022-03704-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Revised: 10/05/2022] [Accepted: 10/11/2022] [Indexed: 11/06/2022] Open
Abstract
BACKGROUND Previous studies on European (EUR) samples have obtained inconsistent results regarding the genetic correlation between type 2 diabetes mellitus (T2DM) and Schizophrenia (SCZ). A large-scale trans-ethnic genetic analysis may provide additional evidence with enhanced power. OBJECTIVE We aimed to explore the genetic basis for both T2DM and SCZ based on large-scale genetic analyses of genome-wide association study (GWAS) data from both East Asian (EAS) and EUR subjects. METHODS A range of complementary approaches were employed to cross-validate the genetic correlation between T2DM and SCZ at the whole genome, autosomes (linkage disequilibrium score regression, LDSC), loci (Heritability Estimation from Summary Statistics, HESS), and causal variants (MiXeR and Mendelian randomization, MR) levels. Then, genome-wide and transcriptome-wide cross-trait/ethnic meta-analyses were performed separately to explore the effective shared organs, cells and molecular pathways. RESULTS A weak genome-wide negative genetic correlation between SCZ and T2DM was found for the EUR (rg = - 0.098, P = 0.009) and EAS (rg =- 0.053 and P = 0.032) populations, which showed no significant difference between the EUR and EAS populations (P = 0.22). After Bonferroni correction, the rg remained significant only in the EUR population. Similar results were obtained from analyses at the levels of autosomes, loci and causal variants. 25 independent variants were firstly identified as being responsible for both SCZ and T2DM. The variants associated with the two disorders were significantly correlated to the gene expression profiles in the brain (P = 1.1E-9) and pituitary gland (P = 1.9E-6). Then, 61 protein-coding and non-coding genes were identified as effective genes in the pituitary gland (P < 9.23E-6) and were enriched in metabolic pathways related to glutathione mediated arsenate detoxification and to D-myo-inositol-trisphosphate. CONCLUSION Here, we show that a negative genetic correlation exists between SCZ and T2DM at the whole genome, autosome, locus and causal variant levels. We identify pituitary gland as a common effective organ for both diseases, in which non-protein-coding effective genes, such as lncRNAs, may be responsible for the negative genetic correlation. This highlights the importance of molecular metabolism and neuroendocrine modulation in the pituitary gland, which may be responsible for the initiation of T2DM in SCZ patients.
Collapse
|
6
|
Cai L, He C, Liu Y, Sun Y, He L, Baranova A. Inflammation and immunity connect hypertension with adverse COVID-19 outcomes. Front Genet 2022; 13:933148. [PMID: 36160003 PMCID: PMC9493274 DOI: 10.3389/fgene.2022.933148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2022] [Accepted: 07/08/2022] [Indexed: 11/23/2022] Open
Abstract
Objectives: To explore the connection of hypertension and severe COVID-19 outcomes. Methods: A total of 68 observational studies recording mortality and/or general severity of COVID-19 were pooled for meta-analyses of the relationship of severe COVID-19 outcomes with hypertension as well as systolic and diastolic blood pressure. Genome-wide cross-trait meta-analysis (GWCTM) was performed to explore the genes linking between hypertension and COVID-19 severity. Results: The results of meta-analysis with the random effect model indicated that pooled risk ratios of hypertension on mortality and severity of COVID-19 were 1.80 [95% confidence interval (CI) 1.54-2.1] and 1.78 (95% confidence interval 1.56-2.04), respectively, although the apparent heterogeneity of the included studies was detected. In subgroup analysis, cohorts of severe and mild patients of COVID-19 assessed in Europe had a significant pooled weighted mean difference of 6.61 mmHg (95% CI 3.66-9.55) with no heterogeneity found (p = 0.26). The genes in the shared signature of hypertension and the COVID-19 severity were mostly expressed in lungs. Analysis of molecular networks commonly affected both by hypertension and by severe COVID-19 highlighted CCR1/CCR5 and IL10RB signaling, as well as Th1 and Th2 activation pathways, and also a potential for a shared regulation with multiple sclerosis. Conclusion: Hypertension is significantly associated with the severe course of COVID-19. Genetic variants within inflammation- and immunity-related genes may affect their expression in lungs and confer liability to both elevated blood pressure and to severe COVID-19.
Collapse
Affiliation(s)
- Lei Cai
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Collaborative Innovation Center for Genetics and Development, Shanghai Mental Health Center, Shanghai Jiaotong University, Shanghai, China
| | - Chuan He
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Collaborative Innovation Center for Genetics and Development, Shanghai Mental Health Center, Shanghai Jiaotong University, Shanghai, China
| | - Yonglin Liu
- Sanya Women and Children’s Hospital managed by Shanghai Children’s Medical Center, Sanya, China
| | - Yanlan Sun
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Collaborative Innovation Center for Genetics and Development, Shanghai Mental Health Center, Shanghai Jiaotong University, Shanghai, China
| | - Lin He
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Collaborative Innovation Center for Genetics and Development, Shanghai Mental Health Center, Shanghai Jiaotong University, Shanghai, China
- Shanghai Center for Women and Children’s Health, Shanghai, China
| | - Ancha Baranova
- School of Systems Biology, George Mason University, Fairfax, VA, United States
| |
Collapse
|
7
|
Placebo effect in the treatment of non-small cell lung cancer: a meta-analysis. JOURNAL OF BIO-X RESEARCH 2022. [DOI: 10.1097/jbr.0000000000000123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
|
8
|
Zhou M, Bian K, Hu F, Lai W. A New Strategy for Identification of Coal Miners With Abnormal Physical Signs Based on EN-mRMR. Front Bioeng Biotechnol 2022; 10:935481. [PMID: 35898648 PMCID: PMC9310099 DOI: 10.3389/fbioe.2022.935481] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Accepted: 06/06/2022] [Indexed: 11/21/2022] Open
Abstract
Coal miners’ occupational health is a key part of production safety in the coal mine. Accurate identification of abnormal physical signs is the key to preventing occupational diseases and improving miners’ working environment. There are many problems when evaluating the physical health status of miners manually, such as too many sign parameters, low diagnostic efficiency, missed diagnosis, and misdiagnosis. To solve these problems, the machine learning algorithm is used to identify miners with abnormal signs. We proposed a feature screening strategy of integrating elastic net (EN) and Max-Relevance and Min-Redundancy (mRMR) to establish the model to identify abnormal signs and obtain the key physical signs. First, the raw 21 physical signs were expanded to 25 by feature construction technology. Then, the EN was used to delete redundant physical signs. Finally, the mRMR combined with the support vector classification of intelligent optimization algorithm by Gravitational Search Algorithm (GSA-SVC) is applied to further simplify the rest of 12 relatively important physical signs and obtain the optimal model with data of six physical signs. At this time, the accuracy, precision, recall, specificity, G-mean, and MCC of the test set were 97.50%, 97.78%, 97.78%, 97.14%, 0.98, and 0.95. The experimental results show that the proposed strategy improves the model performance with the smallest features and realizes the accurate identification of abnormal coal miners. The conclusion could provide reference evidence for intelligent classification and assessment of occupational health in the early stage.
Collapse
Affiliation(s)
- Mengran Zhou
- School of Electrical and Information Engineering, Anhui University of Science and Technology, Huainan, China
| | - Kai Bian
- School of Electrical and Information Engineering, Anhui University of Science and Technology, Huainan, China
| | - Feng Hu
- School of Electrical and Information Engineering, Anhui University of Science and Technology, Huainan, China
| | - Wenhao Lai
- School of Electrical and Information Engineering, Anhui University of Science and Technology, Huainan, China
| |
Collapse
|
9
|
Integrative Analyses of Transcriptomes to Explore Common Molecular Effects of Antipsychotic Drugs. Int J Mol Sci 2022; 23:ijms23147508. [PMID: 35886854 PMCID: PMC9325239 DOI: 10.3390/ijms23147508] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Revised: 07/04/2022] [Accepted: 07/04/2022] [Indexed: 01/27/2023] Open
Abstract
There is little understanding of the underlying molecular mechanism(s) involved in the clinical efficacy of antipsychotics for schizophrenia. This study integrated schizophrenia-associated transcriptional perturbations with antipsychotic-induced gene expression profiles to detect potentially relevant therapeutic targets shared by multiple antipsychotics. Human neuronal-like cells (NT2-N) were treated for 24 h with one of the following antipsychotic drugs: amisulpride, aripiprazole, clozapine, risperidone, or vehicle controls. Drug-induced gene expression patterns were compared to schizophrenia-associated transcriptional data in post-mortem brain tissues. Genes regulated by each of four antipsychotic drugs in the reverse direction to schizophrenia were identified as potential therapeutic-relevant genes. A total of 886 genes were reversely expressed between at least one drug treatment (versus vehicle) and schizophrenia (versus healthy control), in which 218 genes were commonly regulated by all four antipsychotic drugs. The most enriched biological pathways include Wnt signaling and action potential regulation. The protein-protein interaction (PPI) networks found two main clusters having schizophrenia expression quantitative trait loci (eQTL) genes such as PDCD10, ANK2, and AKT3, suggesting further investigation on these genes as potential novel treatment targets.
Collapse
|
10
|
Sun KF, Sun LM, Zhou D, Chen YY, Hao XW, Liu HR, Liu X, Chen JJ. XGBG: A Novel Method for Identifying Ovarian Carcinoma Susceptible Genes Based on Deep Learning. Front Oncol 2022; 12:897503. [PMID: 35646648 PMCID: PMC9133413 DOI: 10.3389/fonc.2022.897503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Accepted: 04/08/2022] [Indexed: 11/30/2022] Open
Abstract
Ovarian carcinomas (OCs) represent a heterogeneous group of neoplasms consisting of several entities with pathogenesis, molecular profiles, multiple risk factors, and outcomes. OC has been regarded as the most lethal cancer among women all around the world. There are at least five main types of OCs classified by the fifth edition of the World Health Organization of tumors: high-/low-grade serous carcinoma, mucinous carcinoma, clear cell carcinoma, and endometrioid carcinoma. With the improved knowledge of genome-wide association study (GWAS) and expression quantitative trait locus (eQTL) analyses, the knowledge of genomic landscape of complex diseases has been uncovered in large measure. Moreover, pathway analyses also play an important role in exploring the underlying mechanism of complex diseases by providing curated pathway models and information about molecular dynamics and cellular processes. To investigate OCs deeper, we introduced a novel disease susceptible gene prediction method, XGBG, which could be used in identifying OC-related genes based on different omics data and deep learning methods. We first employed the graph convolutional network (GCN) to reconstruct the gene features based on both gene feature and network topological structure. Then, a boosting method is utilized to predict OC susceptible genes. As a result, our model achieved a high AUC of 0.7541 and an AUPR of 0.8051, which indicates the effectiveness of the XGPG. Based on the newly predicted OC susceptible genes, we gathered and researched related literatures to provide strong support to the results, which may help in understanding the pathogenesis and mechanisms of the disease.
Collapse
Affiliation(s)
- Ke Feng Sun
- Department of Obstetrics and Gynecology, First Affiliated Hospital, Heilongjiang University of Chinese Medicine, Harbin, China
| | - Li Min Sun
- Department of Oncology, The Second Affiliated Hospital of Dalian Medical University, Dalian, China
| | - Dong Zhou
- Department of Oncology, Affiliated Zhongshan Hospital of Dalian University, Dalian, China
| | - Ying Ying Chen
- Department of Nephrology, The First Affiliated Hospital of Heilongjiang University of Chinese Medical, Harbin, China
| | - Xi Wen Hao
- Heilongjiang University of Chinese Medicine, Harbin, China
| | - Hong Ruo Liu
- Department of Oncology, The Second Affiliated Hospital of Dalian Medical University, Dalian, China
| | - Xin Liu
- Department of Oncology, The Second Affiliated Hospital of Dalian Medical University, Dalian, China
| | - Jing Jing Chen
- Department of Rheumatology and Immunology, The First Hospital Affiliated to Army Medical University, Chongqing, China
| |
Collapse
|
11
|
Naseer S, Hussain W, Khan YD, Rasool N. iPhosS(Deep)-PseAAC: Identification of Phosphoserine Sites in Proteins Using Deep Learning on General Pseudo Amino Acid Compositions. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:1703-1714. [PMID: 33242308 DOI: 10.1109/tcbb.2020.3040747] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Among all the PTMs, the protein phosphorylation is pivotal for various pathological and physiological processes. About 30 percent of eukaryotic proteins undergo the phosphorylation modification, leading to various changes in conformation, function, stability, localization, and so forth. In eukaryotic proteins, phosphorylation occurs on serine (S), Threonine (T) and Tyrosine (Y) residues. Among these all, serine phosphorylation has its own importance as it is associated with various importance biological processes, including energy metabolism, signal transduction pathways, cell cycling, and apoptosis. Thus, its identification is important, however, the in vitro, ex vivo and in vivo identification can be laborious, time-taking and costly. There is a dire need of an efficient and accurate computational model to help researchers and biologists identifying these sites, in an easy manner. Herein, we propose a novel predictor for identification of Phosphoserine sites (PhosS) in proteins, by integrating the Chou's Pseudo Amino Acid Composition (PseAAC) with deep features. We used well-known DNNs for both the tasks of learning a feature representation of peptide sequences and performing classifications. Among different DNNs, the best score is shown by Covolutional Neural Network based model which renders CNN based prediction model the best for Phosphoserine prediction. Based on these results, it is concluded that the proposed model can help to identify PhosS sites in a very efficient and accurate manner which can help scientists understand the mechanism of this modification in proteins.
Collapse
|
12
|
Antunes ASLM, Saia-Cereda VM, Crunfli F, Martins-de-Souza D. 14-3-3 proteins at the crossroads of neurodevelopment and schizophrenia. World J Biol Psychiatry 2022; 23:14-32. [PMID: 33952049 DOI: 10.1080/15622975.2021.1925585] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
The 14-3-3 family comprises multifunctional proteins that play a role in neurogenesis, neuronal migration, neuronal differentiation, synaptogenesis and dopamine synthesis. 14-3-3 members function as adaptor proteins and impact a wide variety of cellular and physiological processes involved in the pathophysiology of neurological disorders. Schizophrenia is a psychiatric disorder and knowledge about its pathophysiology is still limited. 14-3-3 have been proven to be linked with the dopaminergic, glutamatergic and neurodevelopmental hypotheses of schizophrenia. Further, research using genetic models has demonstrated the role played by 14-3-3 proteins in neurodevelopment and neuronal circuits, however a more integrative and comprehensive approach is needed for a better understanding of their role in schizophrenia. For instance, we still lack an integrated assessment of the processes affected by 14-3-3 proteins in the dopaminergic and glutamatergic systems. In this context, it is also paramount to understand their involvement in the biology of brain cells other than neurons. Here, we present previous and recent research that has led to our current understanding of the roles 14-3-3 proteins play in brain development and schizophrenia, perform an assessment of their functional protein association network and discuss the use of protein-protein interaction modulators to target 14-3-3 as a potential therapeutic strategy.
Collapse
Affiliation(s)
- André S L M Antunes
- Laboratory of Neuroproteomics, Department of Biochemistry and Tissue Biology, Institute of Biology, State University of Campinas, Campinas, Brazil
| | - Verônica M Saia-Cereda
- Laboratory of Neuroproteomics, Department of Biochemistry and Tissue Biology, Institute of Biology, State University of Campinas, Campinas, Brazil
| | - Fernanda Crunfli
- Laboratory of Neuroproteomics, Department of Biochemistry and Tissue Biology, Institute of Biology, State University of Campinas, Campinas, Brazil
| | - Daniel Martins-de-Souza
- Laboratory of Neuroproteomics, Department of Biochemistry and Tissue Biology, Institute of Biology, State University of Campinas, Campinas, Brazil.,Experimental Medicine Research Cluster (EMRC), University of Campinas, Campinas, SP, Brazil.,D'Or Institute for Research and Education (IDOR), São Paulo, Brazil.,Instituto Nacional de Biomarcadores em Neuropsiquiatria (INBION), Conselho Nacional de Desenvolvimento Científico e Tecnológico, São Paulo, Brazil
| |
Collapse
|
13
|
Griswold AJ, Correa D, Kaplan LD, Best TM. Using Genomic Techniques in Sports and Exercise Science: Current Status and Future Opportunities. Curr Sports Med Rep 2021; 20:617-623. [PMID: 34752437 DOI: 10.1249/jsr.0000000000000908] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
ABSTRACT The past two decades have built on the successes of the Human Genome Project identifying the impact of genetics and genomics on human traits. Given the importance of exercise in the physical and psychological health of individuals across the lifespan, using genomics to understand the impact of genes in the sports medicine field is an emerging field. Given the complexity of the systems involved, high-throughput genomics is required to understand genetic variants, their functions, and ultimately their effect on the body. Consequently, genomic studies have been performed across several domains of sports medicine with varying degrees of success. While the breadth of these is great, they focus largely on the following three areas: 1) performance; 2) injury susceptibility; and 3) sports associated chronic conditions, such as osteoarthritis. Herein, we review literature on genetics and genomics in sports medicine, offer suggestions to bolster existing studies, and suggest ways to ideally impact clinical care.
Collapse
Affiliation(s)
| | | | - Lee D Kaplan
- Department of Orthopedic Surgery, UHealth Sports Medicine Institute, University of Miami, Miller School of Medicine, Miami, FL
| | - Thomas M Best
- Department of Orthopedic Surgery, UHealth Sports Medicine Institute, University of Miami, Miller School of Medicine, Miami, FL
| |
Collapse
|
14
|
Akmal MA, Hussain W, Rasool N, Khan YD, Khan SA, Chou KC. Using CHOU'S 5-Steps Rule to Predict O-Linked Serine Glycosylation Sites by Blending Position Relative Features and Statistical Moment. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:2045-2056. [PMID: 31985438 DOI: 10.1109/tcbb.2020.2968441] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Glycosylation of proteins in eukaryote cells is an important and complicated post-translation modification due to its pivotal role and association with crucial physiological functions within most of the proteins. Identification of glycosylation sites in a polypeptide chain is not an easy task due to multiple impediments. Analytical identification of these sites is expensive and laborious. There is a dire need to develop a reliable computational method for precise determination of such sites which can help researchers to save time and effort. Herein, we propose a novel predictor namely iGlycoS-PseAAC by integrating the Chou's Pseudo Amino Acid Composition (PseAAC) and relative/absolute position-based features. The self-consistency results show that the accuracy revealed by the model using the benchmark dataset for prediction of O-linked glycosylation having serine sites is 98.8 percent. The overall accuracy of predictor achieved through 10-fold cross validation by combining the positive and negative results is 97.2 percent. The overall accuracy achieved through Jackknife test is 96.195 percent by aggregating of all the prediction results. Thus the proposed predictor can help in predicting the O-linked glycosylated serine sites in an efficient and accurate way. The overall results show that the accuracy of the iGlycoS-PseAAC is higher than the existing tools.
Collapse
|
15
|
Khan YD, Khan NS, Naseer S, Butt AH. iSUMOK-PseAAC: prediction of lysine sumoylation sites using statistical moments and Chou's PseAAC. PeerJ 2021; 9:e11581. [PMID: 34430072 PMCID: PMC8349168 DOI: 10.7717/peerj.11581] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2021] [Accepted: 05/19/2021] [Indexed: 01/25/2023] Open
Abstract
Sumoylation is the post-translational modification that is involved in the adaption of the cells and the functional properties of a large number of proteins. Sumoylation has key importance in subcellular concentration, transcriptional synchronization, chromatin remodeling, response to stress, and regulation of mitosis. Sumoylation is associated with developmental defects in many human diseases such as cancer, Huntington's, Alzheimer's, Parkinson's, Spin cerebellar ataxia 1, and amyotrophic lateral sclerosis. The covalent bonding of Sumoylation is essential to inheriting part of the operative characteristics of some other proteins. For that reason, the prediction of the Sumoylation site has significance in the scientific community. A novel and efficient technique is proposed to predict the Sumoylation sites in proteins by incorporating Chou's Pseudo Amino Acid Composition (PseAAC) with statistical moments-based features. The outcomes from the proposed system using 10 fold cross-validation testing are 94.51%, 94.24%, 94.79% and 0.8903% accuracy, sensitivity, specificity and MCC, respectively. The performance of the proposed system is so far the best in comparison to the other state-of-the-art methods. The codes for the current study are available on the GitHub repository using the link: https://github.com/csbioinfopk/iSumoK-PseAAC.
Collapse
Affiliation(s)
- Yaser Daanial Khan
- Department of Computer Science, School of Systems and Technology, University of Management and Technology, Lahore, Punjab, Pakistan
| | - Nabeel Sabir Khan
- Department of Computer Science, School of Systems and Technology, University of Management and Technology, Lahore, Punjab, Pakistan
| | - Sheraz Naseer
- Department of Computer Science, School of Systems and Technology, University of Management and Technology, Lahore, Punjab, Pakistan
| | - Ahmad Hassan Butt
- Department of Computer Science, School of Systems and Technology, University of Management and Technology, Lahore, Punjab, Pakistan
| |
Collapse
|
16
|
Naseer S, Hussain W, Khan YD, Rasool N. NPalmitoylDeep-PseAAC: A Predictor of N-Palmitoylation Sites in Proteins Using Deep Representations of Proteins and PseAAC via Modified 5-Steps Rule. Curr Bioinform 2021. [DOI: 10.2174/1574893615999200605142828] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Background:
Among all the major Post-translational modification, lipid modifications
possess special significance due to their widespread functional importance in eukaryotic cells. There
exist multiple types of lipid modifications and Palmitoylation, among them, is one of the broader
types of modification, having three different types. The N-Palmitoylation is carried out by
attachment of palmitic acid to an N-terminal cysteine. Due to the association of N-Palmitoylation
with various biological functions and diseases such as Alzheimer’s and other neurodegenerative
diseases, its identification is very important.
Objective:
The in vitro, ex vivo and in vivo identification of Palmitoylation is laborious, time-taking
and costly. There is a dire need for an efficient and accurate computational model to help researchers
and biologists identify these sites, in an easy manner. Herein, we propose a novel prediction model
for the identification of N-Palmitoylation sites in proteins.
Method:
The proposed prediction model is developed by combining the Chou’s Pseudo Amino
Acid Composition (PseAAC) with deep neural networks. We used well-known deep neural
networks (DNNs) for both the tasks of learning a feature representation of peptide sequences and
developing a prediction model to perform classification.
Results:
Among different DNNs, Gated Recurrent Unit (GRU) based RNN model showed the
highest scores in terms of accuracy, and all other computed measures, and outperforms all the
previously reported predictors.
Conclusion:
The proposed GRU based RNN model can help to identify N-Palmitoylation in a very
efficient and accurate manner which can help scientists understand the mechanism of this
modification in proteins.
Collapse
Affiliation(s)
- Sheraz Naseer
- Department of Computer Science, School of Systems and Technology, University of Management and Technology, P.O. Box 10033, C-II, Johar Town, Lahore 54770, Pakistan
| | - Waqar Hussain
- National Center of Artificial Intelligence, Punjab University College of Information Technology, University of the Punjab, Lahore, Pakistan
| | - Yaser Daanial Khan
- Department of Computer Science, School of Systems and Technology, University of Management and Technology, P.O. Box 10033, C-II, Johar Town, Lahore 54770, Pakistan
| | - Nouman Rasool
- Dr Panjwani Center for Molecular Medicine and Drug Research, International Center for Chemical and Biological Sciences, University of Karachi, Karachi, 75270, Pakistan
| |
Collapse
|
17
|
Awais M, Hussain W, Khan YD, Rasool N, Khan SA, Chou KC. iPhosH-PseAAC: Identify Phosphohistidine Sites in Proteins by Blending Statistical Moments and Position Relative Features According to the Chou's 5-Step Rule and General Pseudo Amino Acid Composition. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:596-610. [PMID: 31144645 DOI: 10.1109/tcbb.2019.2919025] [Citation(s) in RCA: 48] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Protein phosphorylation is one of the key mechanism in prokaryotes and eukaryotes and is responsible for various biological functions such as protein degradation, intracellular localization, the multitude of cellular processes, molecular association, cytoskeletal dynamics, and enzymatic inhibition/activation. Phosphohistidine (PhosH) has a key role in a number of biological processes, including central metabolism to signalling in eukaryotes and bacteria. Thus, identification of phosphohistidine sites in a protein sequence is crucial, and experimental identification can be expensive, time-taking, and laborious. To address this problem, here, we propose a novel computational model namely iPhosH-PseAAC for prediction of phosphohistidine sites in a given protein sequence using pseudo amino acid composition (PseAAC), statistical moments, and position relative features. The results of the proposed predictor are validated through self-consistency testing, 10-fold cross-validation, and jackknife testing. The self-consistency validation gave the 100 percent accuracy, whereas, for cross-validation, the accuracy achieved is 94.26 percent. Moreover, jackknife testing gave 97.07 percent accuracy for the proposed model. Thus, the proposed model iPhosH-PseAAC for prediction of iPhosH site has the great ability to predict the PhosH sites in given proteins.
Collapse
|
18
|
Stevens SR, Rasband MN. Ankyrins and neurological disease. Curr Opin Neurobiol 2021; 69:51-57. [PMID: 33485190 DOI: 10.1016/j.conb.2021.01.002] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Revised: 01/06/2021] [Accepted: 01/08/2021] [Indexed: 12/11/2022]
Abstract
Ankyrins are scaffolding proteins widely expressed throughout the nervous system. Ankyrins recruit diverse membrane proteins, including ion channels and cell adhesion molecules, into specialized subcellular membrane domains. These domains are stabilized by ankyrins interacting with the spectrin cytoskeleton. Ankyrin genes are highly associated with a number of neurological disorders, including Alzheimer's disease, schizophrenia, autism spectrum disorders, and bipolar disorder. Here, we discuss ankyrin function and their role in neurological disease. We propose mutations in ankyrins contribute to disease through two primary mechanisms: 1) altered neuronal excitability by disrupting ion channel clustering at key excitable domains, and 2) altered neuronal connectivity via impaired stabilization of membrane proteins.
Collapse
Affiliation(s)
- Sharon R Stevens
- Department of Neuroscience, Baylor College of Medicine, Houston, TX, USA
| | - Matthew N Rasband
- Department of Neuroscience, Baylor College of Medicine, Houston, TX, USA.
| |
Collapse
|
19
|
Wu Z, Shou L, Wang J, Huang T, Xu X. The Methylation Pattern for Knee and Hip Osteoarthritis. Front Cell Dev Biol 2020; 8:602024. [PMID: 33240895 PMCID: PMC7677303 DOI: 10.3389/fcell.2020.602024] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Accepted: 10/22/2020] [Indexed: 01/08/2023] Open
Abstract
Osteoarthritis is one of the most prevalent chronic joint diseases for middle-aged and elderly people. But in recent years, the number of young people suffering from the disease increases quickly. It is known that osteoarthritis is a common degenerative disease caused by the combination and interaction of many factors such as natural and environmental factors. DNA methylations reflect the effects of environmental factors. Several researches on DNA methylation at specific genes in OA cartilage indicated the great potential roles of DNA methylation in OA. To systematically investigate the methylation pattern in knee and hip osteoarthritis, we analyzed the methylation profiles in cartilage of 16 OA hip samples, 19 control hip samples and 62 OA knee samples. 12 discriminative methylation sites were identified using advanced minimal Redundancy Maximal Relevance (mRMR) and Incremental Feature Selection (IFS) methods. The SVM classifier of these 12 methylation sites from genes like MEIS1, GABRG3, RXRA, and EN1, can perfectly classify the OA hip samples, control hip samples and OA knee samples evaluated with LOOCV (Leave-One Out-Cross Validation). These 12 methylation sites can not only serve as biomarker, but also provide underlying mechanism of OA.
Collapse
Affiliation(s)
- Zhen Wu
- Departmemt of Orthopaedics, Tongde Hospital of Zhejiang Province, Hangzhou, China
| | - Lu Shou
- Departmemt of Pneumology, Tongde Hospital of Zhejiang Province, Hangzhou, China
| | - Jian Wang
- Departmemt of Orthopaedics, Tongde Hospital of Zhejiang Province, Hangzhou, China
| | - Tao Huang
- Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences, Shanghai, China
| | - Xinwei Xu
- Departmemt of Orthopaedics, Tongde Hospital of Zhejiang Province, Hangzhou, China
| |
Collapse
|
20
|
Nguyen TTD, Le NQK, Ho QT, Phan DV, Ou YY. TNFPred: identifying tumor necrosis factors using hybrid features based on word embeddings. BMC Med Genomics 2020; 13:155. [PMID: 33087125 PMCID: PMC7579990 DOI: 10.1186/s12920-020-00779-w] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Background Cytokines are a class of small proteins that act as chemical messengers and play a significant role in essential cellular processes including immunity regulation, hematopoiesis, and inflammation. As one important family of cytokines, tumor necrosis factors have association with the regulation of a various biological processes such as proliferation and differentiation of cells, apoptosis, lipid metabolism, and coagulation. The implication of these cytokines can also be seen in various diseases such as insulin resistance, autoimmune diseases, and cancer. Considering the interdependence between this kind of cytokine and others, classifying tumor necrosis factors from other cytokines is a challenge for biological scientists. Methods In this research, we employed a word embedding technique to create hybrid features which was proved to efficiently identify tumor necrosis factors given cytokine sequences. We segmented each protein sequence into protein words and created corresponding word embedding for each word. Then, word embedding-based vector for each sequence was created and input into machine learning classification models. When extracting feature sets, we not only diversified segmentation sizes of protein sequence but also conducted different combinations among split grams to find the best features which generated the optimal prediction. Furthermore, our methodology follows a well-defined procedure to build a reliable classification tool. Results With our proposed hybrid features, prediction models obtain more promising performance compared to seven prominent sequenced-based feature kinds. Results from 10 independent runs on the surveyed dataset show that on an average, our optimal models obtain an area under the curve of 0.984 and 0.998 on 5-fold cross-validation and independent test, respectively. Conclusions These results show that biologists can use our model to identify tumor necrosis factors from other cytokines efficiently. Moreover, this study proves that natural language processing techniques can be applied reasonably to help biologists solve bioinformatics problems efficiently.
Collapse
Affiliation(s)
| | - Nguyen-Quoc-Khanh Le
- Professional Master Program in Artificial Intelligence in Medicine, College of Medicine, Taipei Medical University, Taipei City, 106, Taiwan.,Research Center for Artificial Intelligence in Medicine, Taipei Medical University, Taipei City, 106, Taiwan
| | - Quang-Thai Ho
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 32003, Taiwan
| | - Dinh-Van Phan
- University of Economics, The University of Danang, Danang, 550000, Vietnam
| | - Yu-Yen Ou
- Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, 32003, Taiwan.
| |
Collapse
|
21
|
Schizophrenia in a genomic era: a review from the pathogenesis, genetic and environmental etiology to diagnosis and treatment insights. Psychiatr Genet 2020; 30:1-9. [PMID: 31764709 DOI: 10.1097/ypg.0000000000000245] [Citation(s) in RCA: 42] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Schizophrenia is a common multigenic and debilitating neurological disorder characterized by chronic psychotic symptoms and psychosocial impairment. Complex interactions of genetics and environmental factors have been implicated in etiology of schizophrenia. There is no central pathophysiology mechanism, diagnostic neuropathology, or biological markers have been defined for schizophrenia. However, a number of different hypotheses including neurodevelopmental and neurochemical hypotheses have been proposed to explain the neuropathology of schizophrenia. This review provides an overview of pathogenesis, genetic and environmental etiologies to diagnosis and treatment insights in clinical management of schizophrenia in light of the recent discoveries of genetic loci associated with susceptibility to schizophrenia.
Collapse
|
22
|
Dysregulation of peripheral expression of the YWHA genes during conversion to psychosis. Sci Rep 2020; 10:9863. [PMID: 32555255 PMCID: PMC7299951 DOI: 10.1038/s41598-020-66901-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2019] [Accepted: 05/04/2020] [Indexed: 12/01/2022] Open
Abstract
The seven human 14-3-3 proteins are encoded by the YWHA-gene family. They are expressed in the brain where they play multiple roles including the modulation of synaptic plasticity and neuronal development. Previous studies have provided arguments for their involvement in schizophrenia, but their role during disease onset is unknown. We explored the peripheral-blood expression level of the seven YWHA genes in 92 young individuals at ultra-high risk for psychosis (UHR). During the study, 36 participants converted to psychosis (converters) while 56 did not (non-converters). YWHA genes expression was evaluated at baseline and after a mean follow-up of 10.3 months using multiplex quantitative PCR. Compared with non-converters, the converters had a significantly higher baseline expression levels for 5 YWHA family genes, and significantly different longitudinal changes in the expression of YWHAE, YWHAG, YWHAH, YWHAS and YWAHZ. A principal-component analysis also indicated that the YWHA expression was significantly different between converters and non-converters suggesting a dysregulation of the YWHA co-expression network. Although these results were obtained from peripheral blood which indirectly reflects brain chemistry, they indicate that this gene family may play a role in psychosis onset, opening the way to the identification of prognostic biomarkers or new drug targets.
Collapse
|
23
|
Polymorphism in the 3'-UTR of LIF but Not in the ATF6B Gene Associates with Schizophrenia Susceptibility: a Case-Control Study and In Silico Analyses. J Mol Neurosci 2020; 70:2093-2101. [PMID: 32504404 DOI: 10.1007/s12031-020-01616-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2020] [Accepted: 05/26/2020] [Indexed: 12/17/2022]
Abstract
Schizophrenia (SCZ) is a multifactorial disorder caused by environmental and genetic factors. Studies have shown that various single-nucleotide polymorphisms (SNPs) in the binding sites of microRNAs contribute to the risk of developing SCZ. We aimed to investigate whether the variants located in the 3'-UTR region of LIF (rs929271T>G) and ATF6B (rs8283G>A) were associated with increased susceptibility to SCZ in a population from the south-east of Iran. In this case-control study, a total of 396 subjects were recruited. SNPs were genotyped via polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) method. Genotyping results showed that the G allele of rs929271 significantly increased the risk of SCZ (OR = 1.58 95%CI = 1.19-2.10, p = 0.001). As for rs929271, the GG genotype of co-dominant (OR = 2.54 95%CI = 1.39-4.64, p = 0.002) and recessive (OR = 2.91 95%CI = 1.77-4.80, p < 0.001) models were strongly linked to SCZ. No significant differences were observed between rs8283 polymorphism and predisposition to SCZ. In silico analyses predicted that rs929271 might alter the binding sites of microRNAs, which was believed to have an unclear role in the development of SCZ. Moreover, rs929271 polymorphism changed the LIF-mRNA folding structure. These findings provide fine pieces of evidence regarding the possible effects of LIF polymorphism in the development of SCZ and regulation of the LIF gene targeted by microRNAs.
Collapse
|
24
|
Identification of prokaryotic promoters and their strength by integrating heterogeneous features. Genomics 2020; 112:1396-1403. [DOI: 10.1016/j.ygeno.2019.08.009] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Revised: 07/31/2019] [Accepted: 08/14/2019] [Indexed: 12/21/2022]
|
25
|
Systems biology and bioinformatics approach to identify gene signatures, pathways and therapeutic targets of Alzheimer's disease. INFORMATICS IN MEDICINE UNLOCKED 2020. [DOI: 10.1016/j.imu.2020.100439] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
|
26
|
Liu S, Rao S, Xu Y, Li J, Huang H, Zhang X, Fu H, Wang Q, Cao H, Baranova A, Jin C, Zhang F. Identifying common genome-wide risk genes for major psychiatric traits. Hum Genet 2019; 139:185-198. [DOI: 10.1007/s00439-019-02096-4] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2019] [Accepted: 11/24/2019] [Indexed: 10/25/2022]
|
27
|
iRSpot-DTS: Predict recombination spots by incorporating the dinucleotide-based spare-cross covariance information into Chou's pseudo components. Genomics 2019; 111:1760-1770. [DOI: 10.1016/j.ygeno.2018.11.031] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2018] [Revised: 11/29/2018] [Accepted: 11/30/2018] [Indexed: 12/16/2022]
|
28
|
Chou KC. Advances in Predicting Subcellular Localization of Multi-label Proteins and its Implication for Developing Multi-target Drugs. Curr Med Chem 2019; 26:4918-4943. [PMID: 31060481 DOI: 10.2174/0929867326666190507082559] [Citation(s) in RCA: 78] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2018] [Revised: 01/29/2019] [Accepted: 01/31/2019] [Indexed: 12/16/2022]
Abstract
The smallest unit of life is a cell, which contains numerous protein molecules. Most
of the functions critical to the cell’s survival are performed by these proteins located in its different
organelles, usually called ‘‘subcellular locations”. Information of subcellular localization
for a protein can provide useful clues about its function. To reveal the intricate pathways at the
cellular level, knowledge of the subcellular localization of proteins in a cell is prerequisite.
Therefore, one of the fundamental goals in molecular cell biology and proteomics is to determine
the subcellular locations of proteins in an entire cell. It is also indispensable for prioritizing
and selecting the right targets for drug development. Unfortunately, it is both timeconsuming
and costly to determine the subcellular locations of proteins purely based on experiments.
With the avalanche of protein sequences generated in the post-genomic age, it is highly
desired to develop computational methods for rapidly and effectively identifying the subcellular
locations of uncharacterized proteins based on their sequences information alone. Actually,
considerable progresses have been achieved in this regard. This review is focused on those
methods, which have the capacity to deal with multi-label proteins that may simultaneously
exist in two or more subcellular location sites. Protein molecules with this kind of characteristic
are vitally important for finding multi-target drugs, a current hot trend in drug development.
Focused in this review are also those methods that have use-friendly web-servers established so
that the majority of experimental scientists can use them to get the desired results without the
need to go through the detailed mathematics involved.
Collapse
Affiliation(s)
- Kuo-Chen Chou
- Gordon Life Science Institute, Boston, MA 02478, United States
| |
Collapse
|
29
|
Abstract
The smallest unit of life is a cell, which contains numerous protein molecules. Most
of the functions critical to the cell’s survival are performed by these proteins located in its different
organelles, usually called ‘‘subcellular locations”. Information of subcellular localization
for a protein can provide useful clues about its function. To reveal the intricate pathways at the
cellular level, knowledge of the subcellular localization of proteins in a cell is prerequisite.
Therefore, one of the fundamental goals in molecular cell biology and proteomics is to determine
the subcellular locations of proteins in an entire cell. It is also indispensable for prioritizing
and selecting the right targets for drug development. Unfortunately, it is both timeconsuming
and costly to determine the subcellular locations of proteins purely based on experiments.
With the avalanche of protein sequences generated in the post-genomic age, it is highly
desired to develop computational methods for rapidly and effectively identifying the subcellular
locations of uncharacterized proteins based on their sequences information alone. Actually,
considerable progresses have been achieved in this regard. This review is focused on those
methods, which have the capacity to deal with multi-label proteins that may simultaneously
exist in two or more subcellular location sites. Protein molecules with this kind of characteristic
are vitally important for finding multi-target drugs, a current hot trend in drug development.
Focused in this review are also those methods that have use-friendly web-servers established so
that the majority of experimental scientists can use them to get the desired results without the
need to go through the detailed mathematics involved.
Collapse
Affiliation(s)
- Kuo-Chen Chou
- Gordon Life Science Institute, Boston, MA 02478, United States
| |
Collapse
|
30
|
Cai L, He L. Placebo effects and the molecular biological components involved. Gen Psychiatr 2019; 32:e100089. [PMID: 31552390 PMCID: PMC6738668 DOI: 10.1136/gpsych-2019-100089] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/29/2019] [Revised: 07/18/2019] [Accepted: 07/30/2019] [Indexed: 12/20/2022] Open
Abstract
Pharmacologically inactive substances have been used in medicine for more than 700 years and can trigger beneficial responses in the human body, which is referred to as the placebo effects or placebo responses. This effect is robust enough to influence psychosocial and physiological responses to the placebo and to active treatments in many settings, which has led to increased interest from researchers. In this article, we summarise the history of placebo, the characteristics of placebo effects and recent advancements reported from the studies on placebo effects and highlight placebome studies to identify various molecular biological components associated with placebo effects. Although placebos have a long history, the placebome concept is still in its infancy. Although behavioural, neurobiological and genetic studies have identified that molecules in the dopamine, opioid, serotonin and endocannabinoid systems might be targets of the placebo effect, placebome studies with a no-treatment control (NTC) are necessary to identify whole-genome genetic targets. Although bioinformatics analysis has identified the molecular placebome module, placebome studies with NTCs are also required to validate the related findings.
Collapse
Affiliation(s)
- Lei Cai
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Collaborative Innovation Center of Genetics and Development, Shanghai Jiaotong University, Shanghai 200240, China
| | - Lin He
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Collaborative Innovation Center of Genetics and Development, Shanghai Jiaotong University, Shanghai 200240, China
| |
Collapse
|
31
|
Identifying Methylation Pattern and Genes Associated with Breast Cancer Subtypes. Int J Mol Sci 2019; 20:ijms20174269. [PMID: 31480430 PMCID: PMC6747348 DOI: 10.3390/ijms20174269] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2019] [Revised: 08/19/2019] [Accepted: 08/29/2019] [Indexed: 12/18/2022] Open
Abstract
Breast cancer is regarded worldwide as a severe human disease. Various genetic variations, including hereditary and somatic mutations, contribute to the initiation and progression of this disease. The diagnostic parameters of breast cancer are not limited to the conventional protein content and can include newly discovered genetic variants and even genetic modification patterns such as methylation and microRNA. In addition, breast cancer detection extends to detailed breast cancer stratifications to provide subtype-specific indications for further personalized treatment. One genome-wide expression–methylation quantitative trait loci analysis confirmed that different breast cancer subtypes have various methylation patterns. However, recognizing clinically applied (methylation) biomarkers is difficult due to the large number of differentially methylated genes. In this study, we attempted to re-screen a small group of functional biomarkers for the identification and distinction of different breast cancer subtypes with advanced machine learning methods. The findings may contribute to biomarker identification for different breast cancer subtypes and provide a new perspective for differential pathogenesis in breast cancer subtypes.
Collapse
|
32
|
Niu B, Liang C, Lu Y, Zhao M, Chen Q, Zhang Y, Zheng L, Chou KC. Glioma stages prediction based on machine learning algorithm combined with protein-protein interaction networks. Genomics 2019; 112:837-847. [PMID: 31150762 DOI: 10.1016/j.ygeno.2019.05.024] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2019] [Accepted: 05/25/2019] [Indexed: 12/18/2022]
Abstract
BACKGROUND Glioma is the most lethal nervous system cancer. Recent studies have made great efforts to study the occurrence and development of glioma, but the molecular mechanisms are still unclear. This study was designed to reveal the molecular mechanisms of glioma based on protein-protein interaction network combined with machine learning methods. Key differentially expressed genes (DEGs) were screened and selected by using the protein-protein interaction (PPI) networks. RESULTS As a result, 19 genes between grade I and grade II, 21 genes between grade II and grade III, and 20 genes between grade III and grade IV. Then, five machine learning methods were employed to predict the gliomas stages based on the selected key genes. After comparison, Complement Naive Bayes classifier was employed to build the prediction model for grade II-III with accuracy 72.8%. And Random forest was employed to build the prediction model for grade I-II and grade III-VI with accuracy 97.1% and 83.2%, respectively. Finally, the selected genes were analyzed by PPI networks, Gene Ontology (GO) terms and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways, and the results improve our understanding of the biological functions of select DEGs involved in glioma growth. We expect that the key genes expressed have a guiding significance for the occurrence of gliomas or, at the very least, that they are useful for tumor researchers. CONCLUSION Machine learning combined with PPI networks, GO and KEGG analyses of selected DEGs improve our understanding of the biological functions involved in glioma growth.
Collapse
Affiliation(s)
- Bing Niu
- School of Life Sciences, Shanghai University, Shanghai 200444, China; Gordon Life Science Institute, Boston, MA 02478, USA.
| | - Chaofeng Liang
- Department of Neurosurgery, The Third Affiliated Hospital of Sun Yat-sen University, Guangzhou, China
| | - Yi Lu
- School of Life Sciences, Shanghai University, Shanghai 200444, China
| | - Manman Zhao
- School of Life Sciences, Shanghai University, Shanghai 200444, China
| | - Qin Chen
- School of Life Sciences, Shanghai University, Shanghai 200444, China.
| | - Yuhui Zhang
- Renji Hospital, Medical School, Shanghai Jiaotong University, 160 Pujian Rd, New Pudong District, Shanghai 200127, China; Changhai Hospital, Second Military Medical University, Shanghai 200433, China.
| | - Linfeng Zheng
- Department of Radiology, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200080, China; Department of Radiology, Shanghai First People's Hospital, Baoshan Branch, Shanghai 200940, China.
| | - Kuo-Chen Chou
- Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China; Gordon Life Science Institute, Boston, MA 02478, USA.
| |
Collapse
|
33
|
iN6-methylat (5-step): identifying DNA N6-methyladenine sites in rice genome using continuous bag of nucleobases via Chou’s 5-step rule. Mol Genet Genomics 2019; 294:1173-1182. [DOI: 10.1007/s00438-019-01570-y] [Citation(s) in RCA: 51] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2019] [Accepted: 04/25/2019] [Indexed: 12/21/2022]
|
34
|
SPrenylC-PseAAC: A sequence-based model developed via Chou's 5-steps rule and general PseAAC for identifying S-prenylation sites in proteins. J Theor Biol 2019; 468:1-11. [DOI: 10.1016/j.jtbi.2019.02.007] [Citation(s) in RCA: 98] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2018] [Revised: 02/07/2019] [Accepted: 02/11/2019] [Indexed: 11/22/2022]
|
35
|
Kumar VS, Vellaichamy A. Sequence and structure‐based characterization of ubiquitination sites in human and yeast proteins using Chou's sample formulation. Proteins 2019; 87:646-657. [DOI: 10.1002/prot.25689] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2018] [Revised: 02/20/2019] [Accepted: 04/04/2019] [Indexed: 12/29/2022]
|
36
|
Zhao X, Zhang Y, Ning Q, Zhang H, Ji J, Yin M. Identifying N6-methyladenosine sites using extreme gradient boosting system optimized by particle swarm optimizer. J Theor Biol 2019; 467:39-47. [DOI: 10.1016/j.jtbi.2019.01.035] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2018] [Revised: 01/04/2019] [Accepted: 01/30/2019] [Indexed: 01/15/2023]
|
37
|
Le NQK, Yapp EKY, Ou YY, Yeh HY. iMotor-CNN: Identifying molecular functions of cytoskeleton motor proteins using 2D convolutional neural network via Chou's 5-step rule. Anal Biochem 2019; 575:17-26. [PMID: 30930199 DOI: 10.1016/j.ab.2019.03.017] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2019] [Revised: 03/24/2019] [Accepted: 03/25/2019] [Indexed: 12/12/2022]
Abstract
Motor proteins are the driving force behind muscle contraction and are responsible for the active transportation of most proteins and vesicles in the cytoplasm. There are three superfamilies of cytoskeletal motor proteins with various molecular functions and structures: dynein, kinesin, and myosin. The functional loss of a specific motor protein molecular function has linked to a variety of human diseases, e.g., Charcot-Marie-Tooth disease, kidney disease. Therefore, creating a precise model to classify motor proteins is essential for helping biologists understand their molecular functions and design drug targets according to their impact on human diseases. Here we attempt to classify cytoskeleton motor proteins using deep learning, which has been increasingly and widely used to address numerous problems in a variety of fields resulting in state-of-the-art results. Our effective deep convolutional neural network is able to achieve an independent test accuracy of 97.5%, 96.4%, and 96.1% for each superfamily, respectively. Compared to other state-of-the-art methods, our approach showed a significant improvement in performance across a range of evaluation metrics. Through the proposed study, we provide an effective model for classifying motor proteins and a basis for further research that can enhance the performance of protein function classification using deep learning.
Collapse
Affiliation(s)
- Nguyen Quoc Khanh Le
- Medical Humanities Research Cluster, School of Humanities, Nanyang Technological University, 48 Nanyang Ave, 639798, Singapore.
| | - Edward Kien Yee Yapp
- Singapore Institute of Manufacturing Technology, 2 Fusionopolis Way, #08-04, Innovis, 138634, Singapore
| | - Yu-Yen Ou
- Department of Computer Science and Engineering, Yuan Ze University, 32003, Taiwan
| | - Hui-Yuan Yeh
- Medical Humanities Research Cluster, School of Humanities, Nanyang Technological University, 48 Nanyang Ave, 639798, Singapore.
| |
Collapse
|
38
|
Han Q, Yang C, Lu J, Zhang Y, Li J. Metabolism of Oxalate in Humans: A Potential Role Kynurenine Aminotransferase/Glutamine Transaminase/Cysteine Conjugate Beta-lyase Plays in Hyperoxaluria. Curr Med Chem 2019; 26:4944-4963. [PMID: 30907303 DOI: 10.2174/0929867326666190325095223] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2018] [Revised: 02/17/2019] [Accepted: 02/22/2019] [Indexed: 11/22/2022]
Abstract
Hyperoxaluria, excessive urinary oxalate excretion, is a significant health problem worldwide. Disrupted oxalate metabolism has been implicated in hyperoxaluria and accordingly, an enzymatic disturbance in oxalate biosynthesis can result in the primary hyperoxaluria. Alanine glyoxylate aminotransferase-1 and glyoxylate reductase, the enzymes involving glyoxylate (precursor for oxalate) metabolism, have been related to primary hyperoxalurias. Some studies suggest that other enzymes such as glycolate oxidase and alanine glyoxylate aminotransferase-2 might be associated with primary hyperoxaluria as well, but evidence of a definitive link is not strong between the clinical cases and gene mutations. There are still some idiopathic hyperoxalurias, which require a further study for the etiologies. Some aminotransferases, particularly kynurenine aminotransferases, can convert glyoxylate to glycine. Based on biochemical and structural characteristics, expression level, subcellular localization of some aminotransferases, a number of them appear able to catalyze the transamination of glyoxylate to glycine more efficiently than alanine glyoxylate aminotransferase-1. The aim of this minireview is to explore other undermining causes of primary hyperoxaluria and stimulate research toward achieving a comprehensive understanding of underlying mechanisms leading to the disease. Herein, we reviewed all aminotransferases in the liver for their functions in glyoxylate metabolism. Particularly, kynurenine aminotransferase-I and III were carefully discussed regarding their biochemical and structural characteristics, cellular localization, and enzyme inhibition. Kynurenine aminotransferase-III is, so far, the most efficient putative mitochondrial enzyme to transaminate glyoxylate to glycine in mammalian livers, might be an interesting enzyme to look over in hyperoxaluria etiology of primary hyperoxaluria and should be carefully investigated for its involvement in oxalate metabolism.
Collapse
Affiliation(s)
- Qian Han
- Key Laboratory of Tropical Biological Resources of Ministry of Education, Hainan University, Haikou, Hainan 570228. China
| | - Cihan Yang
- Key Laboratory of Tropical Biological Resources of Ministry of Education, Hainan University, Haikou, Hainan 570228. China
| | - Jun Lu
- Central South University Xiangya School of Medicine Affiliated Haikou People's Hospital, Haikou, Hainan 570208. China
| | - Yinai Zhang
- Central South University Xiangya School of Medicine Affiliated Haikou People's Hospital, Haikou, Hainan 570208. China
| | - Jianyong Li
- Department of Biochemistry, Virginia Tech, Blacksburg, VA 24061. United States
| |
Collapse
|
39
|
Taju SW, Ou Y. DeepIon: Deep learning approach for classifying ion transporters and ion channels from membrane proteins. J Comput Chem 2019; 40:1521-1529. [DOI: 10.1002/jcc.25805] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2018] [Revised: 01/19/2019] [Accepted: 01/30/2019] [Indexed: 01/20/2023]
Affiliation(s)
- Semmy Wellem Taju
- Department of Computer Science and EngineeringYuan Ze University Chung‐Li 32003 Taiwan
| | - Yu‐Yen Ou
- Department of Computer Science and EngineeringYuan Ze University Chung‐Li 32003 Taiwan
| |
Collapse
|
40
|
SPalmitoylC-PseAAC: A sequence-based model developed via Chou's 5-steps rule and general PseAAC for identifying S-palmitoylation sites in proteins. Anal Biochem 2019; 568:14-23. [DOI: 10.1016/j.ab.2018.12.019] [Citation(s) in RCA: 93] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2018] [Revised: 12/19/2018] [Accepted: 12/22/2018] [Indexed: 02/06/2023]
|
41
|
Ahmad A, Shatabda S. EPAI-NC: Enhanced prediction of adenosine to inosine RNA editing sites using nucleotide compositions. Anal Biochem 2019; 569:16-21. [DOI: 10.1016/j.ab.2019.01.002] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2018] [Revised: 01/03/2019] [Accepted: 01/11/2019] [Indexed: 01/24/2023]
|
42
|
Le NQK, Yapp EKY, Ho QT, Nagasundaram N, Ou YY, Yeh HY. iEnhancer-5Step: Identifying enhancers using hidden information of DNA sequences via Chou's 5-step rule and word embedding. Anal Biochem 2019; 571:53-61. [PMID: 30822398 DOI: 10.1016/j.ab.2019.02.017] [Citation(s) in RCA: 77] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2019] [Revised: 02/17/2019] [Accepted: 02/19/2019] [Indexed: 12/22/2022]
Abstract
An enhancer is a short (50-1500bp) region of DNA that plays an important role in gene expression and the production of RNA and proteins. Genetic variation in enhancers has been linked to many human diseases, such as cancer, disorder or inflammatory bowel disease. Due to the importance of enhancers in genomics, the classification of enhancers has become a popular area of research in computational biology. Despite the few computational tools employed to address this problem, their resulting performance still requires improvements. In this study, we treat enhancers by the word embeddings, including sub-word information of its biological words, which then serve as features to be fed into a support vector machine algorithm to classify them. We present iEnhancer-5Step, a web server containing two-layer classifiers to identify enhancers and their strength. We are able to attain an independent test accuracy of 79% and 63.5% in the two layers, respectively. Compared to current predictors on the same dataset, our proposed method is able to yield superior performance as compared to the other methods. Moreover, this study provides a basis for further research that can enrich the field of applying natural language processing techniques in biological sequences. iEnhancer-5Step is freely accessible via http://biologydeep.com/fastenc/.
Collapse
Affiliation(s)
- Nguyen Quoc Khanh Le
- Medical Humanities Research Cluster, School of Humanities, Nanyang Technological University, 48 Nanyang Ave, 639798, Singapore.
| | - Edward Kien Yee Yapp
- Singapore Institute of Manufacturing Technology, 2 Fusionopolis Way, #08-04, Innovis, 138634, Singapore
| | - Quang-Thai Ho
- Department of Computer Science and Engineering, Yuan Ze University, 32003, Taiwan
| | - N Nagasundaram
- Medical Humanities Research Cluster, School of Humanities, Nanyang Technological University, 48 Nanyang Ave, 639798, Singapore
| | - Yu-Yen Ou
- Department of Computer Science and Engineering, Yuan Ze University, 32003, Taiwan
| | - Hui-Yuan Yeh
- Medical Humanities Research Cluster, School of Humanities, Nanyang Technological University, 48 Nanyang Ave, 639798, Singapore.
| |
Collapse
|
43
|
Khan YD, Jamil M, Hussain W, Rasool N, Khan SA, Chou KC. pSSbond-PseAAC: Prediction of disulfide bonding sites by integration of PseAAC and statistical moments. J Theor Biol 2019; 463:47-55. [DOI: 10.1016/j.jtbi.2018.12.015] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2018] [Revised: 12/05/2018] [Accepted: 12/11/2018] [Indexed: 02/08/2023]
|
44
|
MFSC: Multi-voting based feature selection for classification of Golgi proteins by adopting the general form of Chou's PseAAC components. J Theor Biol 2019; 463:99-109. [DOI: 10.1016/j.jtbi.2018.12.017] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2018] [Revised: 12/02/2018] [Accepted: 12/14/2018] [Indexed: 12/29/2022]
|
45
|
Deng M, Lv XD, Fang ZX, Xie XS, Chen WY. The blood transcriptional signature for active and latent tuberculosis. Infect Drug Resist 2019; 12:321-328. [PMID: 30787624 PMCID: PMC6363485 DOI: 10.2147/idr.s184640] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
BACKGROUND Although the incidence of tuberculosis (TB) has dropped substantially, it still is a serious threat to human health. And in recent years, the emergence of resistant bacilli and inadequate disease control and prevention has led to a significant rise in the global TB epidemic. It is known that the cause of TB is Mycobacterium tuberculosis infection. But it is not clear why some infected patients are active while others are latent. METHODS We analyzed the blood gene expression profiles of 69 latent TB patients and 54 active pulmonary TB patients from GEO (Transcript Expression Omnibus) database. RESULTS By applying minimal redundancy maximal relevance and incremental feature selection, we identified 24 signature genes which can predict the TB activation. The support vector machine predictor based on these 24 genes had a sensitivity of 0.907, specificity of 0.913, and accuracy of 0.911, respectively. Although they need to be validated in a large independent dataset, the biological analysis of these 24 genes showed great promise. CONCLUSION We found that cytokine production was a key process during TB activation and genes like CYBB, TSPO, CD36, and STAT1 worth further investigation.
Collapse
Affiliation(s)
- Min Deng
- Department of Infectious Diseases, The First Hospital of Jiaxing, The First Affiliated Hospital of Jiaxing University, Jiaxing 314000, China,
| | - Xiao-Dong Lv
- Department of Respiration, The First Hospital of Jiaxing, The First Affiliated Hospital of Jiaxing University, Jiaxing 314000, China
| | - Zhi-Xian Fang
- Department of Respiration, The First Hospital of Jiaxing, The First Affiliated Hospital of Jiaxing University, Jiaxing 314000, China
| | - Xin-Sheng Xie
- Department of Infectious Diseases, The First Hospital of Jiaxing, The First Affiliated Hospital of Jiaxing University, Jiaxing 314000, China,
| | - Wen-Yu Chen
- Department of Respiration, The First Hospital of Jiaxing, The First Affiliated Hospital of Jiaxing University, Jiaxing 314000, China
| |
Collapse
|
46
|
Jia J, Li X, Qiu W, Xiao X, Chou KC. iPPI-PseAAC(CGR): Identify protein-protein interactions by incorporating chaos game representation into PseAAC. J Theor Biol 2019; 460:195-203. [DOI: 10.1016/j.jtbi.2018.10.021] [Citation(s) in RCA: 78] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2018] [Revised: 09/16/2018] [Accepted: 10/08/2018] [Indexed: 01/11/2023]
|
47
|
iRNA-PseKNC(2methyl): Identify RNA 2'-O-methylation sites by convolution neural network and Chou's pseudo components. J Theor Biol 2018; 465:1-6. [PMID: 30590059 DOI: 10.1016/j.jtbi.2018.12.034] [Citation(s) in RCA: 55] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2018] [Revised: 12/15/2018] [Accepted: 12/23/2018] [Indexed: 11/21/2022]
Abstract
The 2'-O-methylation transferase is involved in the process of 2'-O-methylation. In catalytic processes, the 2-hydroxy group of the ribose moiety of a nucleotide accept a methyl group. This methylation process is a post-transcriptional modification, which occurs in various cellular RNAs and plays a vital role in regulation of gene expressions at the post-transcriptional level. Through biochemical experiments 2'-O-methylation sites produce good results but these biochemical process and exploratory techniques are very expensive. Thus, it is required to develop a computational method to identify 2'-O-methylation sites. In this work, we proposed a simple and precise convolution neural network method namely: iRNA-PseKNC(2methyl) to identify 2'-O-methylation sites. The existing techniques use handcrafted features, while the proposed method automatically extracts the features of 2'-O-methylation using the proposed convolution neural network model. The proposed prediction iRNA-PseKNC(2methyl) method obtained 98.27% of accuracy, 96.29% of sensitivity, 100% of specificity, and 0.965 of MCC on Home sapiens dataset. The reported outcomes present that our proposed method obtained better outcomes than existing method in terms of all evaluation parameters. These outcomes show that iRNA-PseKNC(2methyl) method might be beneficial for the academic research and drug design.
Collapse
|
48
|
Chen W, Liang X, Nong Z, Li Y, Pan X, Chen C, Huang L. The Multiple Applications and Possible Mechanisms of the Hyperbaric Oxygenation Therapy. Med Chem 2018; 15:459-471. [PMID: 30569869 DOI: 10.2174/1573406415666181219101328] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2018] [Revised: 10/23/2018] [Accepted: 12/12/2018] [Indexed: 12/18/2022]
Abstract
Hyperbaric Oxygenation Therapy (HBOT) is used as an adjunctive method for multiple diseases. The method meets the routine treating and is non-invasive, as well as provides 100% pure oxygen (O2), which is at above-normal atmospheric pressure in a specialized chamber. It is well known that in the condition of O2 deficiency, it will induce a series of adverse events. In order to prevent the injury induced by anoxia, the capability of offering pressurized O2 by HBOT seems involuntary and significant. In recent years, HBOT displays particular therapeutic efficacy in some degree, and it is thought to be beneficial to the conditions of angiogenesis, tissue ischemia and hypoxia, nerve system disease, diabetic complications, malignancies, Carbon monoxide (CO) poisoning and chronic radiation-induced injury. Single and combination HBOT are both applied in previous studies, and the manuscript is to review the current applications and possible mechanisms of HBOT. The applicability and validity of HBOT for clinical treatment remain controversial, even though it is regarded as an adjunct to conventional medical treatment with many other clinical benefits. There also exists a negative side effect of accepting pressurized O2, such as oxidative stress injury, DNA damage, cellular metabolic, activating of coagulation, endothelial dysfunction, acute neurotoxicity and pulmonary toxicity. Then it is imperative to comprehensively consider the advantages and disadvantages of HBOT in order to obtain a satisfying therapeutic outcome.
Collapse
Affiliation(s)
- Wan Chen
- Department of Emergency, the People's Hospital of Guangxi Zhuang Autonomous Region, Nanning, Guangxi 530021, China
| | - Xingmei Liang
- Department of Pharmacy, Guangxi Medical College, Nanning, Guangxi 530021, China
| | - Zhihuan Nong
- Department of Pharmacology, Guangxi Institute of Chinese Medicine and Pharmaceutical Science, Nanning 530022, China
| | - Yaoxuan Li
- Department of Neurology, the People's Hospital of Guangxi Zhuang Autonomous Region, Nanning 530022, China
| | - Xiaorong Pan
- Department of Hyperbaric oxygen, the People's Hospital of Guangxi Zhuang Autonomous Region, Nanning, Guangxi 530021, China
| | - Chunxia Chen
- Department of Hyperbaric oxygen, the People's Hospital of Guangxi Zhuang Autonomous Region, Nanning, Guangxi 530021, China
| | - Luying Huang
- Department of Respiratory Medicine, the People's Hospital of Guangxi Zhuang Autonomous Region, Nanning, Guangxi 530021, China
| |
Collapse
|
49
|
Cheng X, Xiao X, Chou KC. pLoc_bal-mGneg: Predict subcellular localization of Gram-negative bacterial proteins by quasi-balancing training dataset and general PseAAC. J Theor Biol 2018; 458:92-102. [DOI: 10.1016/j.jtbi.2018.09.005] [Citation(s) in RCA: 65] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2018] [Revised: 09/05/2018] [Accepted: 09/07/2018] [Indexed: 01/03/2023]
|
50
|
Pan Y, Wang S, Zhang Q, Lu Q, Su D, Zuo Y, Yang L. Analysis and prediction of animal toxins by various Chou's pseudo components and reduced amino acid compositions. J Theor Biol 2018; 462:221-229. [PMID: 30452961 DOI: 10.1016/j.jtbi.2018.11.010] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2018] [Revised: 11/06/2018] [Accepted: 11/15/2018] [Indexed: 01/19/2023]
Abstract
The animal toxin proteins are one of the disulfide rich small peptides that detected in venomous species. They are used as pharmacological tools and therapeutic agents in medicine for the high specificity of their targets. The successful analysis and prediction of toxin proteins may have important signification for the pharmacological and therapeutic researches of toxins. In this study, significant differences were found between the toxins and the non-toxins in amino acid compositions and several important biological properties. The random forest was firstly proposed to predict the animal toxin proteins by selecting 400 pseudo amino acid compositions and the dipeptide compositions of reduced amino acid alphabet as the input parameters. Based on dipeptide composition of reduced amino acid alphabet with 13 reduced amino acids, the best overall accuracy of 85.71% was obtained. These results indicated that our algorithm was an efficient tool for the animal toxin prediction.
Collapse
Affiliation(s)
- Yi Pan
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Shiyuan Wang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Qi Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Qianzi Lu
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Dongqing Su
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Yongchun Zuo
- The State key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot 010070, China.
| | - Lei Yang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China.
| |
Collapse
|