1
|
Alamri SH, Haque S, Alghamdi BS, Tayeb HO, Azhari S, Farsi RM, Elmokadem A, Alamri TA, Harakeh S, Prakash A, Kumar V. Comprehensive mapping of mutations in TDP-43 and α-Synuclein that affect stability and binding. J Biomol Struct Dyn 2025; 43:1818-1830. [PMID: 38126188 DOI: 10.1080/07391102.2023.2293258] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Accepted: 11/11/2023] [Indexed: 12/23/2023]
Abstract
Abnormal aggregation and amyloid inclusions of TAR DNA-binding protein 43 (TDP-43) and α-Synuclein (α-Syn) are frequently co-observed in amyotrophic lateral sclerosis, Parkinson's disease, and Alzheimer's disease. Several reports showed TDP-43 C-terminal domain (CTD) and α-Syn interact with each other and the aggregates of these two proteins colocalized together in different cellular and animal models. Molecular dynamics simulation was conducted to elucidate the stability of the TDP-43 and Syn complex structure. The interfacial mutations in protein complexes changes the stability and binding affinity of the protein that may cause diseases. Here, we have utilized the computational saturation mutagenesis approach including structure-based stability and binding energy calculations to compute the systemic effects of missense mutations of TDP-43 CTD and α-Syn on protein stability and binding affinity. Most of the interfacial mutations of CTD and α-Syn were found to destabilize the protein and reduced the protein binding affinity. The results thus shed light on the functional consequences of missense mutations observed in TDP-43 associated proteinopathies and may provide the mechanisms of co-morbidities involving these two proteins.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Sultan H Alamri
- Department of Family Medicine, Faculty of Medicine, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Shafiul Haque
- Research and Scientific Studies Unit, College of Nursing and Allied Health Sciences, Jazan University, Jazan, Saudi Arabia
- Gilbert and Rose-Marie Chagoury School of Medicine, Lebanese American University, Beirut, Lebanon
- Centre of Medical and Bio-Allied Health Sciences Research, Ajman University, Ajman, United Arab Emirates
| | - Badra S Alghamdi
- Department of Physiology, Neuroscience Unit, Faculty of Medicine, King Abdulaziz University, Jeddah, Saudi Arabia
- Pre-Clinical Research Unit, King Fahd Medical Research Center, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Haythum O Tayeb
- The Mind and Brain Studies Initiative, Neuroscience Research Unit, Department of Neurology, Faculty of Medicine, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Shereen Azhari
- Department of Biological Sciences, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Reem M Farsi
- Department of Biological Sciences, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Abear Elmokadem
- Department of Hematology/Pediatric Oncology, King Abdulaziz University Hospital, Faculty of Medicine, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Turki A Alamri
- Family and Community Medicine Department, Faculty of Medicine in Rabigh, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Steve Harakeh
- King Fahd Medical Research Center, King Abdulaziz University, Jeddah, Jeddah, Saudi Arabia
- Yousef Abdul Latif Jameel Scientific Chair of Prophetic Medicine Application, Faculty of Medicine, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Amresh Prakash
- Amity Institute of Integrative Sciences and Health (AIISH), Amity University Haryana, Gurgaon, India
| | - Vijay Kumar
- Amity Institute of Neuropsychology & Neurosciences, Amity University, Noida, India
| |
Collapse
|
2
|
Shorthouse D, Lister H, Freeman GS, Hall BA. Understanding large scale sequencing datasets through changes to protein folding. Brief Funct Genomics 2024; 23:517-524. [PMID: 38521964 PMCID: PMC11428155 DOI: 10.1093/bfgp/elae007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2023] [Revised: 02/26/2024] [Accepted: 03/01/2024] [Indexed: 03/25/2024] Open
Abstract
The expansion of high-quality, low-cost sequencing has created an enormous opportunity to understand how genetic variants alter cellular behaviour in disease. The high diversity of mutations observed has however drawn a spotlight onto the need for predictive modelling of mutational effects on phenotype from variants of uncertain significance. This is particularly important in the clinic due to the potential value in guiding clinical diagnosis and patient treatment. Recent computational modelling has highlighted the importance of mutation induced protein misfolding as a common mechanism for loss of protein or domain function, aided by developments in methods that make large computational screens tractable. Here we review recent applications of this approach to different genes, and how they have enabled and supported subsequent studies. We further discuss developments in the approach and the role for the approach in light of increasingly high throughput experimental approaches.
Collapse
Affiliation(s)
- David Shorthouse
- School of Pharmacy, University College London, 29-39 Brunswick Square, London WC1N 1AX, UK
| | - Harris Lister
- Department of Medical Physics and Biomedical Engineering, Malet Place Engineering Building, University College London, Gower Street, London WC1E 6BT, UK
| | - Gemma S Freeman
- Department of Medical Physics and Biomedical Engineering, Malet Place Engineering Building, University College London, Gower Street, London WC1E 6BT, UK
| | - Benjamin A Hall
- Department of Medical Physics and Biomedical Engineering, Malet Place Engineering Building, University College London, Gower Street, London WC1E 6BT, UK
| |
Collapse
|
3
|
Haque S, Mathkor DM, Alkhanani MF, Bantun F, Momenah AM, Faidah H, Jalal NA, Kumar V. Comprehensive deep mutational scanning reveals the pH induced stability and binding differences between SARS-CoV-2 spike RBD and human ACE2. J Biomol Struct Dyn 2023; 41:15207-15218. [PMID: 36995177 DOI: 10.1080/07391102.2023.2194007] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Accepted: 02/25/2023] [Indexed: 03/31/2023]
Abstract
The SARS-CoV-2 spike (S) glycoprotein with its mobile receptor-binding domain (RBD), binds to the human ACE2 receptor and thus facilitates virus entry through low-pH-endosomal pathways. The high degree of SARS-CoV-2 mutability has raised concern among scientists and medical professionals because it created doubt about the effectiveness of drugs and vaccinations designed specifically for COVID-19. In this study, we used computational saturation mutagenesis approach, including structure-based free energy calculations to analyse the effects of the missense mutations on the SARS-CoV-2 S-RBD stability and the S-RBD binding affinity with ACE2 at three different pH (pH 4.5, pH 6.5, and pH 7.4). A total of 3705 mutations in the S-RBD protein were analyzed, and we discovered that most of these mutations destabilize the RBD protein. Specifically, residues G404, G431, G447, A475, and G526 were important for RBD protein stability. In addition, RBD residues Y449, Y489, Y495, Q498, and N487 were critical for the RBD-ACE2 interaction. Next, we found that the distribution of the mean stability changes and mean binding energy changes of RBD due to mutations at both serological and endosomal pH correlated well, indicating the similar effects of mutations. Overall, this computational analysis is useful for understanding the effects of missense mutations in SARS-CoV-2 pathogenesis at different pH.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Shafiul Haque
- Research and Scientific Studies Unit, College of Nursing and Allied Health Sciences, Jazan University, Jazan-45142, Saudi Arabia
- Gilbert and Rose-Marie Chagoury School of Medicine, Lebanese American University, Beirut, Lebanon
- Centre of Medical and Bio-Allied Health Sciences Research, Ajman University, Ajman, United Arab Emirates
| | - Darin Mansor Mathkor
- Research and Scientific Studies Unit, College of Nursing and Allied Health Sciences, Jazan University, Jazan-45142, Saudi Arabia
| | - Mustfa Faisal Alkhanani
- Biology Department, College of Sciences, University of Hafr Al Batin, Hafr Al Batin, Saudi Arabia
| | - Farkad Bantun
- Department of Microbiology, Faculty of Medicine, Umm Al-Qura University, Makkah, Saudi Arabia
| | - Aiman M Momenah
- Department of Microbiology, Faculty of Medicine, Umm Al-Qura University, Makkah, Saudi Arabia
| | - Hani Faidah
- Department of Microbiology, Faculty of Medicine, Umm Al-Qura University, Makkah, Saudi Arabia
| | - Naif A Jalal
- Department of Microbiology, Faculty of Medicine, Umm Al-Qura University, Makkah, Saudi Arabia
| | - Vijay Kumar
- Amity Institute of Neuropsychology & Neurosciences, Amity University, Noida, Uttar Pradesh, India
| |
Collapse
|
4
|
Teng S, Sobitan A, Rhoades R, Liu D, Tang Q. Systemic effects of missense mutations on SARS-CoV-2 spike glycoprotein stability and receptor-binding affinity. Brief Bioinform 2020; 22:1239-1253. [PMID: 33006605 PMCID: PMC7665319 DOI: 10.1093/bib/bbaa233] [Citation(s) in RCA: 88] [Impact Index Per Article: 17.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Revised: 08/03/2020] [Accepted: 08/26/2020] [Indexed: 12/21/2022] Open
Abstract
The spike (S) glycoprotein of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is responsible for the binding to the permissive cells. The receptor-binding domain (RBD) of SARS-CoV-2 S protein directly interacts with the human angiotensin-converting enzyme 2 (ACE2) on the host cell membrane. In this study, we used computational saturation mutagenesis approaches, including structure-based energy calculations and sequence-based pathogenicity predictions, to quantify the systemic effects of missense mutations on SARS-CoV-2 S protein structure and function. A total of 18 354 mutations in S protein were analyzed, and we discovered that most of these mutations could destabilize the entire S protein and its RBD. Specifically, residues G431 and S514 in SARS-CoV-2 RBD are important for S protein stability. We analyzed 384 experimentally verified S missense variations and revealed that the dominant pandemic form, D614G, can stabilize the entire S protein. Moreover, many mutations in N-linked glycosylation sites can increase the stability of the S protein. In addition, we investigated 3705 mutations in SARS-CoV-2 RBD and 11 324 mutations in human ACE2 and found that SARS-CoV-2 neighbor residues G496 and F497 and ACE2 residues D355 and Y41 are critical for the RBD-ACE2 interaction. The findings comprehensively provide potential target sites in the development of drugs and vaccines against COVID-19.
Collapse
Affiliation(s)
- Shaolei Teng
- Corresponding authors: Shaolei Teng, Department of Biology, Howard University, 415 College St. NW, Washington, DC 20059. Tel.: +1 202-806-6933; E-mail: ; Qiyi Tang, Howard University College of Medicine, 520 W Street NW, Washington, DC 20059. Tel.: +1 202-806-3915; E-mail:
| | - Adebiyi Sobitan
- Department of Biology at the Howard University, 415 College St. NW, Washington, DC 20059
| | - Raina Rhoades
- Department of Biology at the Howard University, 415 College St. NW, Washington, DC 20059
| | - Dongxiao Liu
- Howard University College of Medicine, 520 W Street NW, Washington, DC 20059
| | - Qiyi Tang
- Corresponding authors: Shaolei Teng, Department of Biology, Howard University, 415 College St. NW, Washington, DC 20059. Tel.: +1 202-806-6933; E-mail: ; Qiyi Tang, Howard University College of Medicine, 520 W Street NW, Washington, DC 20059. Tel.: +1 202-806-3915; E-mail:
| |
Collapse
|
5
|
Bag AK, Mandloi S, Jarmalavicius S, Mondal S, Kumar K, Mandal C, Walden P, Chakrabarti S, Mandal C. Connecting signaling and metabolic pathways in EGF receptor-mediated oncogenesis of glioblastoma. PLoS Comput Biol 2019; 15:e1007090. [PMID: 31386654 PMCID: PMC6684045 DOI: 10.1371/journal.pcbi.1007090] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2018] [Accepted: 05/13/2019] [Indexed: 12/21/2022] Open
Abstract
As malignant transformation requires synchronization of growth-driving signaling (S) and metabolic (M) pathways, defining cancer-specific S-M interconnected networks (SMINs) could lead to better understanding of oncogenic processes. In a systems-biology approach, we developed a mathematical model for SMINs in mutated EGF receptor (EGFRvIII) compared to wild-type EGF receptor (EGFRwt) expressing glioblastoma multiforme (GBM). Starting with experimentally validated human protein-protein interactome data for S-M pathways, and incorporating proteomic data for EGFRvIII and EGFRwt GBM cells and patient transcriptomic data, we designed a dynamic model for EGFR-driven GBM-specific information flow. Key nodes and paths identified by in silico perturbation were validated experimentally when inhibition of signaling pathway proteins altered expression of metabolic proteins as predicted by the model. This demonstrated capacity of the model to identify unknown connections between signaling and metabolic pathways, explain the robustness of oncogenic SMINs, predict drug escape, and assist identification of drug targets and the development of combination therapies. Complex and highly dynamic interconnected networks allow cancer to take different routes and circumvent chemotherapy. Therefore, understanding these context-specific networks and their dynamics of molecular interactions driven by different oncogenic signaling and metabolic pathways is very much needed to predict drug targets and the effect of therapeutics. We incorporated high-throughput transcriptome and proteome data into mathematical models to deduce properties of cancer cells through systems biology approach. Here we report the development, testing and validation of an integrated systems biology model of information flow between signaling and metabolic pathways to understand the regulation of the interconnection between them in cancer. Our model efficiently identified unique connections and key nodes important in signaling-metabolic information flow. We predicted some potential novel targets before performing actual drug tests. We have successfully applied this model to identify the interconnections altered in the constitutive signaling of the mutated EGFR by comparing EGF-dependent and wild-type EGFR signaling in glioblastoma multiforme.
Collapse
Affiliation(s)
- Arup K. Bag
- Cancer Biology and Inflammatory Disorder Division, Indian Institute of Chemical Biology, Kolkata, India
| | - Sapan Mandloi
- Structural Biology and Bioinformatics Division, Indian Institute of Chemical Biology, Kolkata, India
| | - Saulius Jarmalavicius
- Department of Dermatology, Venerology and Allergology, Charité– Universitätsmedizin Berlin corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Berlin, Germany
| | - Susmita Mondal
- Cancer Biology and Inflammatory Disorder Division, Indian Institute of Chemical Biology, Kolkata, India
| | - Krishna Kumar
- Structural Biology and Bioinformatics Division, Indian Institute of Chemical Biology, Kolkata, India
| | - Chhabinath Mandal
- National Institute of Pharmaceutical Education and Research, Kolkata, India
| | - Peter Walden
- Department of Dermatology, Venerology and Allergology, Charité– Universitätsmedizin Berlin corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health, Berlin, Germany
- * E-mail: (PW); , (SC); , (CM)
| | - Saikat Chakrabarti
- Structural Biology and Bioinformatics Division, Indian Institute of Chemical Biology, Kolkata, India
- * E-mail: (PW); , (SC); , (CM)
| | - Chitra Mandal
- Cancer Biology and Inflammatory Disorder Division, Indian Institute of Chemical Biology, Kolkata, India
- * E-mail: (PW); , (SC); , (CM)
| |
Collapse
|
6
|
Barradas-Bautista D, Rosell M, Pallara C, Fernández-Recio J. Structural Prediction of Protein–Protein Interactions by Docking: Application to Biomedical Problems. PROTEIN-PROTEIN INTERACTIONS IN HUMAN DISEASE, PART A 2018; 110:203-249. [DOI: 10.1016/bs.apcsb.2017.06.003] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
|
7
|
Barradas-Bautista D, Fernández-Recio J. Docking-based modeling of protein-protein interfaces for extensive structural and functional characterization of missense mutations. PLoS One 2017; 12:e0183643. [PMID: 28841721 PMCID: PMC5571915 DOI: 10.1371/journal.pone.0183643] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2017] [Accepted: 08/08/2017] [Indexed: 01/23/2023] Open
Abstract
Next-generation sequencing (NGS) technologies are providing genomic information for an increasing number of healthy individuals and patient populations. In the context of the large amount of generated genomic data that is being generated, understanding the effect of disease-related mutations at molecular level can contribute to close the gap between genotype and phenotype and thus improve prevention, diagnosis or treatment of a pathological condition. In order to fully characterize the effect of a pathological mutation and have useful information for prediction purposes, it is important first to identify whether the mutation is located at a protein-binding interface, and second to understand the effect on the binding affinity of the affected interaction/s. Computational methods, such as protein docking are currently used to complement experimental efforts and could help to build the human structural interactome. Here we have extended the original pyDockNIP method to predict the location of disease-associated nsSNPs at protein-protein interfaces, when there is no available structure for the protein-protein complex. We have applied this approach to the pathological interaction networks of six diseases with low structural data on PPIs. This approach can almost double the number of nsSNPs that can be characterized and identify edgetic effects in many nsSNPs that were previously unknown. This can help to annotate and interpret genomic data from large-scale population studies, and to achieve a better understanding of disease at molecular level.
Collapse
Affiliation(s)
| | - Juan Fernández-Recio
- Life Sciences Department, Barcelona Supercomputing Center (BSC), Barcelona, Spain
- * E-mail:
| |
Collapse
|
8
|
Brunk E, Mih N, Monk J, Zhang Z, O’Brien EJ, Bliven SE, Chen K, Chang RL, Bourne PE, Palsson BO. Systems biology of the structural proteome. BMC SYSTEMS BIOLOGY 2016; 10:26. [PMID: 26969117 PMCID: PMC4787049 DOI: 10.1186/s12918-016-0271-6] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/17/2015] [Accepted: 02/16/2016] [Indexed: 12/19/2022]
Abstract
BACKGROUND The success of genome-scale models (GEMs) can be attributed to the high-quality, bottom-up reconstructions of metabolic, protein synthesis, and transcriptional regulatory networks on an organism-specific basis. Such reconstructions are biochemically, genetically, and genomically structured knowledge bases that can be converted into a mathematical format to enable a myriad of computational biological studies. In recent years, genome-scale reconstructions have been extended to include protein structural information, which has opened up new vistas in systems biology research and empowered applications in structural systems biology and systems pharmacology. RESULTS Here, we present the generation, application, and dissemination of genome-scale models with protein structures (GEM-PRO) for Escherichia coli and Thermotoga maritima. We show the utility of integrating molecular scale analyses with systems biology approaches by discussing several comparative analyses on the temperature dependence of growth, the distribution of protein fold families, substrate specificity, and characteristic features of whole cell proteomes. Finally, to aid in the grand challenge of big data to knowledge, we provide several explicit tutorials of how protein-related information can be linked to genome-scale models in a public GitHub repository ( https://github.com/SBRG/GEMPro/tree/master/GEMPro_recon/). CONCLUSIONS Translating genome-scale, protein-related information to structured data in the format of a GEM provides a direct mapping of gene to gene-product to protein structure to biochemical reaction to network states to phenotypic function. Integration of molecular-level details of individual proteins, such as their physical, chemical, and structural properties, further expands the description of biochemical network-level properties, and can ultimately influence how to model and predict whole cell phenotypes as well as perform comparative systems biology approaches to study differences between organisms. GEM-PRO offers insight into the physical embodiment of an organism's genotype, and its use in this comparative framework enables exploration of adaptive strategies for these organisms, opening the door to many new lines of research. With these provided tools, tutorials, and background, the reader will be in a position to run GEM-PRO for their own purposes.
Collapse
Affiliation(s)
- Elizabeth Brunk
- />Department of Bioengineering, University of California, La Jolla, San Diego, CA 92093 USA
- />Joint BioEnergy Institute, Emeryville, CA 94608 USA
| | - Nathan Mih
- />Bioinformatics and Systems Biology Program, University of California, La Jolla, San Diego, CA 92093 USA
| | - Jonathan Monk
- />Department of Bioengineering, University of California, La Jolla, San Diego, CA 92093 USA
| | - Zhen Zhang
- />Department of Bioengineering, University of California, La Jolla, San Diego, CA 92093 USA
| | - Edward J. O’Brien
- />Department of Bioengineering, University of California, La Jolla, San Diego, CA 92093 USA
| | - Spencer E. Bliven
- />Bioinformatics and Systems Biology Program, University of California, La Jolla, San Diego, CA 92093 USA
- />National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894 USA
| | - Ke Chen
- />Department of Bioengineering, University of California, La Jolla, San Diego, CA 92093 USA
| | - Roger L. Chang
- />Department of Systems Biology, Harvard Medical School, Boston, MA 02115 USA
| | - Philip E. Bourne
- />Office of the Director, National Institutes of Health, Bethesda, MD 20894 USA
| | - Bernhard O. Palsson
- />Department of Bioengineering, University of California, La Jolla, San Diego, CA 92093 USA
| |
Collapse
|
9
|
Brunk E, Rothlisberger U. Mixed Quantum Mechanical/Molecular Mechanical Molecular Dynamics Simulations of Biological Systems in Ground and Electronically Excited States. Chem Rev 2015; 115:6217-63. [PMID: 25880693 DOI: 10.1021/cr500628b] [Citation(s) in RCA: 319] [Impact Index Per Article: 31.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Elizabeth Brunk
- †Laboratory of Computational Chemistry and Biochemistry, Ecole Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland.,‡Joint BioEnergy Institute, Lawrence Berkeley National Laboratory, Emeryville, California 94618, United States
| | - Ursula Rothlisberger
- †Laboratory of Computational Chemistry and Biochemistry, Ecole Polytechnique Fédérale de Lausanne, 1015 Lausanne, Switzerland.,§National Competence Center of Research (NCCR) MARVEL-Materials' Revolution: Computational Design and Discovery of Novel Materials, 1015 Lausanne, Switzerland
| |
Collapse
|
10
|
Li B, Seligman C, Thusberg J, Miller JL, Auer J, Whirl-Carrillo M, Capriotti E, Klein TE, Mooney SD. In silico comparative characterization of pharmacogenomic missense variants. BMC Genomics 2014; 15 Suppl 4:S4. [PMID: 25057096 PMCID: PMC4092878 DOI: 10.1186/1471-2164-15-s4-s4] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Missense pharmacogenomic (PGx) variants refer to amino acid substitutions that potentially affect the pharmacokinetic (PK) or pharmacodynamic (PD) response to drug therapies. The PGx variants, as compared to disease-associated variants, have not been investigated as deeply. The ability to computationally predict future PGx variants is desirable; however, it is not clear what data sets should be used or what features are beneficial to this end. Hence we carried out a comparative characterization of PGx variants with annotated neutral and disease variants from UniProt, to test the predictive power of sequence conservation and structural information in discriminating these three groups. RESULTS 126 PGx variants of high quality from PharmGKB were selected and two data sets were created: one set contained 416 variants with structural and sequence information, and, the other set contained 1,265 variants with sequence information only. In terms of sequence conservation, PGx variants are more conserved than neutral variants and much less conserved than disease variants. A weighted random forest was used to strike a more balanced classification for PGx variants. Generally structural features are helpful in discriminating PGx variant from the other two groups, but still classification of PGx from neutral polymorphisms is much less effective than between disease and neutral variants. CONCLUSIONS We found that PGx variants are much more similar to neutral variants than to disease variants in the feature space consisting of residue conservation, neighboring residue conservation, number of neighbors, and protein solvent accessibility. Such similarity poses great difficulty in the classification of PGx variants and polymorphisms.
Collapse
|
11
|
Xie L, Ge X, Tan H, Xie L, Zhang Y, Hart T, Yang X, Bourne PE. Towards structural systems pharmacology to study complex diseases and personalized medicine. PLoS Comput Biol 2014; 10:e1003554. [PMID: 24830652 PMCID: PMC4022462 DOI: 10.1371/journal.pcbi.1003554] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Genome-Wide Association Studies (GWAS), whole genome sequencing, and high-throughput omics techniques have generated vast amounts of genotypic and molecular phenotypic data. However, these data have not yet been fully explored to improve the effectiveness and efficiency of drug discovery, which continues along a one-drug-one-target-one-disease paradigm. As a partial consequence, both the cost to launch a new drug and the attrition rate are increasing. Systems pharmacology and pharmacogenomics are emerging to exploit the available data and potentially reverse this trend, but, as we argue here, more is needed. To understand the impact of genetic, epigenetic, and environmental factors on drug action, we must study the structural energetics and dynamics of molecular interactions in the context of the whole human genome and interactome. Such an approach requires an integrative modeling framework for drug action that leverages advances in data-driven statistical modeling and mechanism-based multiscale modeling and transforms heterogeneous data from GWAS, high-throughput sequencing, structural genomics, functional genomics, and chemical genomics into unified knowledge. This is not a small task, but, as reviewed here, progress is being made towards the final goal of personalized medicines for the treatment of complex diseases.
Collapse
Affiliation(s)
- Lei Xie
- Department of Computer Science, Hunter College, The City University of New York, New York, New York, United States of America
- Ph.D. Program in Computer Science, Biology, and Biochemistry, The Graduate Center, The City University of New York, New York, New York, United States of America
- * E-mail:
| | - Xiaoxia Ge
- Department of Computer Science, Hunter College, The City University of New York, New York, New York, United States of America
| | - Hepan Tan
- Department of Computer Science, Hunter College, The City University of New York, New York, New York, United States of America
| | - Li Xie
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California, United States of America
| | - Yinliang Zhang
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California, United States of America
| | - Thomas Hart
- Department of Biological Sciences, Hunter College, The City University of New York, New York, New York, United States of America
| | - Xiaowei Yang
- School of Public Health, Hunter College, The City University of New York, New York, New York, United States of America
| | - Philip E. Bourne
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California, United States of America
| |
Collapse
|
12
|
Zhou HX, Bates PA. Modeling protein association mechanisms and kinetics. Curr Opin Struct Biol 2013; 23:887-93. [PMID: 23850142 PMCID: PMC3844007 DOI: 10.1016/j.sbi.2013.06.014] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2013] [Revised: 06/19/2013] [Accepted: 06/19/2013] [Indexed: 11/21/2022]
Abstract
Substantial advances have been made in modeling protein association mechanisms and in calculating association rate constants (ka). We now have a clear understanding of the physical factors underlying the wide range of experimental ka values. Half of the association problem, where ka is limited by diffusion, is perhaps solved, and for the other half, where conformational changes become rate-limiting, a number of promising methods are being developed for ka calculations. Notably, the binding kinetics of disordered proteins are receiving growing attention, with 'dock-and-coalesce' emerging as a general mechanism. Progress too has been made in the modeling of protein association kinetics under conditions mimicking the heterogeneous, crowded environments of cells, an endeavor that should ultimately lead to a better understanding of cellular functions.
Collapse
Affiliation(s)
- Huan-Xiang Zhou
- Department of Physics and Institute of Molecular Biophysics, Florida State University, Tallahassee, FL 32306, USA.
| | | |
Collapse
|
13
|
Antibacterial mechanisms identified through structural systems pharmacology. BMC SYSTEMS BIOLOGY 2013; 7:102. [PMID: 24112686 PMCID: PMC3853765 DOI: 10.1186/1752-0509-7-102] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/02/2013] [Accepted: 10/07/2013] [Indexed: 01/19/2023]
Abstract
BACKGROUND The growing discipline of structural systems pharmacology is applied prospectively in this study to predict pharmacological outcomes of antibacterial compounds in Escherichia coli K12. This work builds upon previously established methods for structural prediction of ligand binding pockets on protein molecules and utilizes and expands upon the previously developed genome scale model of metabolism integrated with protein structures (GEM-PRO) for E. coli, structurally accounting for protein complexes. Carefully selected case studies are demonstrated to display the potential for this structural systems pharmacology framework in discovery and development of antibacterial compounds. RESULTS The prediction framework for antibacterial activity of compounds was validated for a control set of well-studied compounds, recapitulating experimentally-determined protein binding interactions and deleterious growth phenotypes resulting from these interactions. The antibacterial activity of fosfomycin, sulfathiazole, and trimethoprim were accurately predicted, and as a negative control glucose was found to have no predicted antibacterial activity. Previously uncharacterized mechanisms of action were predicted for compounds with known antibacterial properties, including (1-hydroxyheptane-1,1-diyl)bis(phosphonic acid) and cholesteryl oleate. Five candidate inhibitors were predicted for a desirable target protein without any known inhibitors, tryptophan synthase β subunit (TrpB). In addition to the predictions presented, this effort also included significant expansion of the previously developed GEM-PRO to account for physiological assemblies of protein complex structures with activities included in the E. coli K12 metabolic network. CONCLUSIONS The structural systems pharmacology framework presented in this study was shown to be effective in the prediction of molecular mechanisms of antibacterial compounds. The study provides a promising proof of principle for such an approach to antibacterial development and raises specific molecular and systemic hypotheses about antibacterials that are amenable to experimental testing. This framework, and perhaps also the specific predictions of antibacterials, is extensible to developing antibacterial treatments for pathogenic E. coli and other bacterial pathogens.
Collapse
|
14
|
Agius R, Torchala M, Moal IH, Fernández-Recio J, Bates PA. Characterizing changes in the rate of protein-protein dissociation upon interface mutation using hotspot energy and organization. PLoS Comput Biol 2013; 9:e1003216. [PMID: 24039569 PMCID: PMC3764008 DOI: 10.1371/journal.pcbi.1003216] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2013] [Accepted: 07/25/2013] [Indexed: 12/21/2022] Open
Abstract
Predicting the effects of mutations on the kinetic rate constants of protein-protein interactions is central to both the modeling of complex diseases and the design of effective peptide drug inhibitors. However, while most studies have concentrated on the determination of association rate constants, dissociation rates have received less attention. In this work we take a novel approach by relating the changes in dissociation rates upon mutation to the energetics and architecture of hotspots and hotregions, by performing alanine scans pre- and post-mutation. From these scans, we design a set of descriptors that capture the change in hotspot energy and distribution. The method is benchmarked on 713 kinetically characterized mutations from the SKEMPI database. Our investigations show that, with the use of hotspot descriptors, energies from single-point alanine mutations may be used for the estimation of off-rate mutations to any residue type and also multi-point mutations. A number of machine learning models are built from a combination of molecular and hotspot descriptors, with the best models achieving a Pearson's Correlation Coefficient of 0.79 with experimental off-rates and a Matthew's Correlation Coefficient of 0.6 in the detection of rare stabilizing mutations. Using specialized feature selection models we identify descriptors that are highly specific and, conversely, broadly important to predicting the effects of different classes of mutations, interface regions and complexes. Our results also indicate that the distribution of the critical stability regions across protein-protein interfaces is a function of complex size more strongly than interface area. In addition, mutations at the rim are critical for the stability of small complexes, but consistently harder to characterize. The relationship between hotregion size and the dissociation rate is also investigated and, using hotspot descriptors which model cooperative effects within hotregions, we show how the contribution of hotregions of different sizes, changes under different cooperative effects.
Collapse
Affiliation(s)
- Rudi Agius
- Biomolecular Modelling Laboratory, Cancer Research UK London Research Institute, London, United Kingdom
| | - Mieczyslaw Torchala
- Biomolecular Modelling Laboratory, Cancer Research UK London Research Institute, London, United Kingdom
| | - Iain H. Moal
- Joint BSC-IRB Research Program in Computational Biology, Life Science Department, Barcelona Supercomputing Center, Barcelona, Spain
| | - Juan Fernández-Recio
- Joint BSC-IRB Research Program in Computational Biology, Life Science Department, Barcelona Supercomputing Center, Barcelona, Spain
| | - Paul A. Bates
- Biomolecular Modelling Laboratory, Cancer Research UK London Research Institute, London, United Kingdom
| |
Collapse
|
15
|
Gulati S, Cheng TMK, Bates PA. Cancer networks and beyond: interpreting mutations using the human interactome and protein structure. Semin Cancer Biol 2013; 23:219-26. [PMID: 23680723 DOI: 10.1016/j.semcancer.2013.05.002] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2013] [Revised: 04/30/2013] [Accepted: 05/03/2013] [Indexed: 01/08/2023]
Abstract
Over recent years, with the advances in next-generation sequencing, a large number of cancer mutations have been identified and accumulated in public repositories. Coupled to this is our increased ability to generate detailed interactome maps that help to enrich our knowledge of the biological implications of cancer mutations. As a result, network analysis approaches have become an invaluable tool to predict and interpret mutations that are associated with tumour survival and progression. Our understanding of cancer mechanisms is further enhanced by mapping protein structure information to such networks. Here we review the current methodologies for annotating the functional impacts of cancer mutations, which range from analysis of protein structures to protein-protein interaction network studies.
Collapse
Affiliation(s)
- Sakshi Gulati
- Biomolecular Modelling Laboratory, Cancer Research UK London Research Institute, London, United Kingdom
| | | | | |
Collapse
|
16
|
Chang RL, Andrews K, Kim D, Li Z, Godzik A, Palsson BO. Structural systems biology evaluation of metabolic thermotolerance in Escherichia coli. Science 2013; 340:1220-3. [PMID: 23744946 DOI: 10.1126/science.1234012] [Citation(s) in RCA: 89] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
Genome-scale network reconstruction has enabled predictive modeling of metabolism for many systems. Traditionally, protein structural information has not been represented in such reconstructions. Expansion of a genome-scale model of Escherichia coli metabolism by including experimental and predicted protein structures enabled the analysis of protein thermostability in a network context. This analysis allowed the prediction of protein activities that limit network function at superoptimal temperatures and mechanistic interpretations of mutations found in strains adapted to heat. Predicted growth-limiting factors for thermotolerance were validated through nutrient supplementation experiments and defined metabolic sensitivities to heat stress, providing evidence that metabolic enzyme thermostability is rate-limiting at superoptimal temperatures. Inclusion of structural information expanded the content and predictive capability of genome-scale metabolic networks that enable structural systems biology of metabolism.
Collapse
Affiliation(s)
- Roger L Chang
- Bioinformatics and Systems Biology Graduate Program, University of California San Diego, La Jolla, CA 92093-0412, USA
| | | | | | | | | | | |
Collapse
|
17
|
Duran-Frigola M, Mosca R, Aloy P. Structural Systems Pharmacology: The Role of 3D Structures in Next-Generation Drug Development. ACTA ACUST UNITED AC 2013; 20:674-84. [DOI: 10.1016/j.chembiol.2013.03.004] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2013] [Revised: 02/28/2013] [Accepted: 03/05/2013] [Indexed: 01/12/2023]
|