1
Liang H, Zhan X, Wang Y, Maegawa GHB, Zhang H. Development and validation of a new genotype-phenotype correlation for Niemann-Pick disease type C1. J Inherit Metab Dis 2024; 47:317-326. [PMID: 38131230 DOI: 10.1002/jimd.12705] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/27/2023] [Revised: 11/29/2023] [Accepted: 12/12/2023] [Indexed: 12/23/2023]
Abstract
Hundreds of NPC1 variants cause highly heterogeneous phenotypes. This study aims to explore the genotype-phenotype correlation of NPC1, especially for missense variants. In a well-characterized cohort, phenotypes were graded into three clinical forms: mild, intermediate, and severe. The structural location of each missense residue was stratified into three categories: surface, partially buried, and fully buried. The association of phenotypes with the topography of the amino acid substitution in the protein structure was investigated in our cohort and validated in two reported cohorts. One hundred six unrelated NPC1 patients were enrolled. A significant genotype-phenotype correlation was found in 81 classified individuals with one or two missense variants (where only one was present, the second variant was null) (p < 0.001): of 25 patients with at least one surface missense variant (group A), 19 (76%) were mild, six (24%) intermediate, and none severe; of 31 cases with at least one partially buried missense variant and no surface variants (group B), 11 (35%) were mild, 16 (52%) intermediate, and four (13%) severe; of the remaining 25 patients with one or two buried missense variants (group C), eight (32%) were mild, nine (36%) intermediate, and eight (32%) severe. Additionally, the biomarker 7-ketocholesterol was lower in group A than in group B (p = 0.024) and group C (p = 0.029). A model was proposed that accurately predicted the phenotypes of 72 of 90 (80%), 73 of 85 (86%), and 64 of 69 (93%) patients in our cohort, the Italian cohort, and the UK cohort, respectively. This study proposed a novel genotype-phenotype correlation in NPC1, linking the underlying molecular pathophysiology with clinical phenotype and aiding genetic counseling and evaluation in clinical practice.
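The grouping rule described in this abstract can be sketched in a few lines of Python. This is only an illustration of the stated rule (group A if any missense variant is on the surface, group B if the least buried variant is partially buried, group C otherwise); the function names and category labels are hypothetical, and the paper's actual prediction model is not specified in the abstract. The counts are the ones reported above.

```python
# Burial categories named in the abstract; the string labels are illustrative.
RANK = {"surface": 0, "partially_buried": 1, "buried": 2}
GROUP = {"surface": "A", "partially_buried": "B", "buried": "C"}

def assign_group(missense_locations):
    """Return 'A', 'B' or 'C' from the locations of one or two missense variants.

    A null second variant is simply left out of the list.
    """
    if not missense_locations:
        raise ValueError("at least one missense variant is required")
    # The least buried variant decides the group.
    least_buried = min(missense_locations, key=RANK.__getitem__)
    return GROUP[least_buried]

# Phenotype counts reported in the abstract: (mild, intermediate, severe).
REPORTED = {"A": (19, 6, 0), "B": (11, 16, 4), "C": (8, 9, 8)}

def percentages(counts):
    """Convert counts to whole-number percentages of the group total."""
    total = sum(counts)
    return [round(100 * c / total) for c in counts]

for group, counts in REPORTED.items():
    print(group, sum(counts), percentages(counts))
```

Running the loop reproduces the group sizes (25, 31, 25) and the rounded percentages quoted in the abstract, which is a quick consistency check on the reported numbers.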
Affiliation(s)
- Huan Liang
  - Pediatric Endocrinology and Genetics, Xinhua Hospital, Shanghai Institute for Pediatric Research, Shanghai Jiao Tong University School of Medicine, Shanghai, China
- Xia Zhan
  - Pediatric Endocrinology and Genetics, Xinhua Hospital, Shanghai Institute for Pediatric Research, Shanghai Jiao Tong University School of Medicine, Shanghai, China
- Yu Wang
  - Pediatric Endocrinology and Genetics, Xinhua Hospital, Shanghai Institute for Pediatric Research, Shanghai Jiao Tong University School of Medicine, Shanghai, China
- Gustavo H B Maegawa
  - Department of Pediatrics, Metabolism and Genetics, Vagelos College of Physicians and Surgeons, Columbia University Medical Center, New York, USA
- Huiwen Zhang
  - Pediatric Endocrinology and Genetics, Xinhua Hospital, Shanghai Institute for Pediatric Research, Shanghai Jiao Tong University School of Medicine, Shanghai, China
2
Mishra B, Kumar N, Shahid Mukhtar M. A Rice Protein Interaction Network Reveals High Centrality Nodes and Candidate Pathogen Effector Targets. Comput Struct Biotechnol J 2022; 20:2001-2012. [PMID: 35521542 PMCID: PMC9062363 DOI: 10.1016/j.csbj.2022.04.027] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Received: 01/13/2021] [Revised: 04/10/2022] [Accepted: 04/17/2022] [Indexed: 12/11/2022]
Abstract
Network science identifies key players in diverse biological systems including host-pathogen interactions. We demonstrated a scale-free network property for a comprehensive rice protein–protein interactome (RicePPInets) that exhibits nodes with increased centrality indices. While weighted k-shell decomposition was shown efficacious to predict pathogen effector targets in Arabidopsis, we improved its computational code for a broader implementation on large-scale networks including RicePPInets. We determined that nodes residing within the internal layers of RicePPInets are poised to be the most influential, central, and effective information spreaders. To identify central players and modules through network topology analyses, we integrated RicePPInets and co-expression networks representing susceptible and resistant responses to strains of the bacterial pathogens Xanthomonas oryzae pv. oryzae and X. oryzae pv. oryzicola (Xoc) and generated a RIce-Xanthomonas INteractome (RIXIN). This revealed that previously identified candidate targets of pathogen transcription activator-like (TAL) effectors are enriched in nodes with enhanced connectivity, bottlenecks, and information spreaders that are located in the inner layers of the network, and these nodes are involved in several important biological processes. Overall, our integrative multi-omics network-based platform provides a potentially useful approach to prioritizing candidate pathogen effector targets for functional validation, suggesting that this computational framework can be broadly translatable to other complex pathosystems.
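For readers unfamiliar with the layering idea mentioned in this abstract, the following is a minimal sketch of plain (unweighted) k-shell decomposition in Python. It is not the authors' weighted variant or their improved code, neither of which is described here; the function name and the toy graph are hypothetical, and the input is assumed to be a symmetric adjacency mapping.

```python
def k_shell(adjacency):
    """Map each node to its shell index using unweighted k-shell decomposition.

    `adjacency` maps every node to the set of its neighbours and is assumed
    to be symmetric (undirected graph).
    """
    # Work on a copy so the caller's graph is left untouched.
    remaining = {node: set(nbrs) for node, nbrs in adjacency.items()}
    shell = {}
    k = 0
    while remaining:
        # The shell index never decreases as outer layers are peeled away.
        k = max(k, min(len(nbrs) for nbrs in remaining.values()))
        peel = [n for n, nbrs in remaining.items() if len(nbrs) <= k]
        while peel:
            for node in peel:
                shell[node] = k
                for nbr in remaining.pop(node):
                    if nbr in remaining:
                        remaining[nbr].discard(node)
            # Removing nodes can push further nodes down to degree <= k.
            peel = [n for n, nbrs in remaining.items() if len(nbrs) <= k]
    return shell

# Toy graph: a triangle a-b-c with a pendant node d attached to a.
toy = {"a": {"b", "c", "d"}, "b": {"a", "c"}, "c": {"a", "b"}, "d": {"a"}}
print(k_shell(toy))
```

Nodes with the highest shell index form the innermost layer, which is the region the abstract associates with influential nodes.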
3
Ginsberg SD, Neubert TA, Sharma S, Digwal CS, Yan P, Timbus C, Wang T, Chiosis G. Disease-specific interactome alterations via epichaperomics: the case for Alzheimer's disease. FEBS J 2021; 289:2047-2066. [PMID: 34028172 DOI: 10.1111/febs.16031] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 01/18/2021] [Revised: 04/23/2021] [Accepted: 05/20/2021] [Indexed: 12/22/2022]
Abstract
The increasingly appreciated prevalence of complicated stressor-to-phenotype associations in human disease requires a greater understanding of how specific stressors affect systems or interactome properties. Many currently untreatable diseases arise due to variations in, and through a combination of, multiple stressors of genetic, epigenetic, and environmental nature. Unfortunately, how such stressors lead to a specific disease phenotype or inflict a vulnerability to some cells and tissues but not others remains largely unknown and unsatisfactorily addressed. Analysis of cell- and tissue-specific interactome networks may shed light on organization of biological systems and subsequently to disease vulnerabilities. However, deriving human interactomes across different cell and disease contexts remains a challenge. To this end, this opinion article links stressor-induced protein interactome network perturbations to the formation of pathologic scaffolds termed epichaperomes, revealing a viable and reproducible experimental solution to obtaining rigorous context-dependent interactomes. This article presents our views on how a specialized 'omics platform called epichaperomics may complement and enhance the currently available conventional approaches and aid the scientific community in defining, understanding, and ultimately controlling interactome networks of complex diseases such as Alzheimer's disease. Ultimately, this approach may aid the transition from a limited single-alteration perspective in disease to a comprehensive network-based mindset, which we posit will result in precision medicine paradigms for disease diagnosis and treatment.
Affiliation(s)
- Stephen D Ginsberg
  - Center for Dementia Research, Nathan Kline Institute, Orangeburg, NY, USA
  - Departments of Psychiatry, Neuroscience & Physiology, The NYU Neuroscience Institute, New York University Grossman School of Medicine, NY, USA
- Thomas A Neubert
  - Kimmel Center for Biology and Medicine at the Skirball Institute, NYU School of Medicine, New York, NY, USA
- Sahil Sharma
  - Program in Chemical Biology, Sloan Kettering Institute, New York, NY, USA
- Chander S Digwal
  - Program in Chemical Biology, Sloan Kettering Institute, New York, NY, USA
- Pengrong Yan
  - Program in Chemical Biology, Sloan Kettering Institute, New York, NY, USA
- Calin Timbus
  - Department of Mathematics, Technical University of Cluj-Napoca, CJ, Romania
- Tai Wang
  - Program in Chemical Biology, Sloan Kettering Institute, New York, NY, USA
- Gabriela Chiosis
  - Program in Chemical Biology, Sloan Kettering Institute, New York, NY, USA
  - Breast Cancer Medicine Service, Memorial Sloan Kettering Cancer Center, New York, NY, USA
4
Porras P, Barrera E, Bridge A, Del-Toro N, Cesareni G, Duesbury M, Hermjakob H, Iannuccelli M, Jurisica I, Kotlyar M, Licata L, Lovering RC, Lynn DJ, Meldal B, Nanduri B, Paneerselvam K, Panni S, Pastrello C, Pellegrini M, Perfetto L, Rahimzadeh N, Ratan P, Ricard-Blum S, Salwinski L, Shirodkar G, Shrivastava A, Orchard S. Towards a unified open access dataset of molecular interactions. Nat Commun 2020; 11:6144. [PMID: 33262342 PMCID: PMC7708836 DOI: 10.1038/s41467-020-19942-z] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Received: 04/17/2020] [Accepted: 11/09/2020] [Indexed: 12/16/2022]
Abstract
The International Molecular Exchange (IMEx) Consortium provides scientists with a single body of experimentally verified protein interactions curated in rich contextual detail to an internationally agreed standard. In this update to the work of the IMEx Consortium, we discuss how this initiative has been working in practice, how it has ensured database sustainability, and how it is meeting emerging annotation challenges through the introduction of new interactor types and data formats. Additionally, we provide examples of how IMEx data are being used by biomedical researchers and integrated in other bioinformatic tools and resources.
Affiliation(s)
- Pablo Porras
  - European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Campus, Hinxton, Cambridge, CB10 1SD, UK
- Elisabet Barrera
  - European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Campus, Hinxton, Cambridge, CB10 1SD, UK
- Alan Bridge
  - SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, 1 rue Michel Servet, CH-1211, Geneva, Switzerland
- Noemi Del-Toro
  - European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Campus, Hinxton, Cambridge, CB10 1SD, UK
- Gianni Cesareni
  - University of Rome Tor Vergata, Rome, Italy
  - IRCCS Fondazione Santa Lucia, 00143, Rome, Italy
- Margaret Duesbury
  - European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Campus, Hinxton, Cambridge, CB10 1SD, UK
  - UCLA-DOE Institute, University of California, Los Angeles, CA, 90095, USA
- Henning Hermjakob
  - European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Campus, Hinxton, Cambridge, CB10 1SD, UK
- Igor Jurisica
  - Osteoarthritis Research Program, Division of Orthopedic Surgery, Schroeder Arthritis Institute, and Krembil Research Institute, University Health Network, 60 Leonard Avenue, 5KD-407, Toronto, ON, M5T 0S8, Canada
  - Departments of Medical Biophysics, and Computer Science, University of Toronto, Toronto, ON, Canada
  - Institute of Neuroimmunology, Slovak Academy of Sciences, Bratislava, Slovakia
- Max Kotlyar
  - Osteoarthritis Research Program, Division of Orthopedic Surgery, Schroeder Arthritis Institute, and Krembil Research Institute, University Health Network, 60 Leonard Avenue, 5KD-407, Toronto, ON, M5T 0S8, Canada
- Ruth C Lovering
  - Functional Gene Annotation, Preclinical and Fundamental Science, UCL Institute of Cardiovascular Science, University College London, London, WC1E 6JF, UK
- David J Lynn
  - Computational and Systems Biology Program, Precision Medicine Theme, South Australian Health and Medical Research Institute, Adelaide, SA, 5000, Australia
  - College of Medicine and Public Health, Flinders University, Bedford Park, SA, 5042, Australia
- Birgit Meldal
  - European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Campus, Hinxton, Cambridge, CB10 1SD, UK
- Bindu Nanduri
  - Institute for Genomics, Biocomputing and Biotechnology, Mississippi State University, Starkville, MS, USA
- Kalpana Paneerselvam
  - European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Campus, Hinxton, Cambridge, CB10 1SD, UK
- Simona Panni
  - Università della Calabria, Dipartimento di Biologia, Ecologia e Scienze della Terra, Via Pietro Bucci Cubo 6/C, Rende, CS, Italy
- Chiara Pastrello
  - Osteoarthritis Research Program, Division of Orthopedic Surgery, Schroeder Arthritis Institute, and Krembil Research Institute, University Health Network, 60 Leonard Avenue, 5KD-407, Toronto, ON, M5T 0S8, Canada
- Matteo Pellegrini
  - Department of Molecular, Cell and Developmental Biology, UCLA, Box 951606, Los Angeles, CA, 90095-1606, USA
- Livia Perfetto
  - European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Campus, Hinxton, Cambridge, CB10 1SD, UK
- Negin Rahimzadeh
  - UCLA-DOE Institute, University of California, Los Angeles, CA, 90095, USA
- Prashansa Ratan
  - UCLA-DOE Institute, University of California, Los Angeles, CA, 90095, USA
- Sylvie Ricard-Blum
  - ICBMS, UMR 5246 University Lyon 1 - CNRS, Univ. Lyon, 69622, Villeurbanne, France
- Lukasz Salwinski
  - UCLA-DOE Institute, University of California, Los Angeles, CA, 90095, USA
- Gautam Shirodkar
  - UCLA-DOE Institute, University of California, Los Angeles, CA, 90095, USA
- Anjalia Shrivastava
  - European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Campus, Hinxton, Cambridge, CB10 1SD, UK
  - Open Targets, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
- Sandra Orchard
  - European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Campus, Hinxton, Cambridge, CB10 1SD, UK
5
Fragoza R, Das J, Wierbowski SD, Liang J, Tran TN, Liang S, Beltran JF, Rivera-Erick CA, Ye K, Wang TY, Yao L, Mort M, Stenson PD, Cooper DN, Wei X, Keinan A, Schimenti JC, Clark AG, Yu H. Extensive disruption of protein interactions by genetic variants across the allele frequency spectrum in human populations. Nat Commun 2019; 10:4141. [PMID: 31515488 PMCID: PMC6742646 DOI: 10.1038/s41467-019-11959-3] [Citation(s) in RCA: 38] [Impact Index Per Article: 7.6] [Received: 01/09/2019] [Accepted: 08/06/2019] [Indexed: 12/19/2022]
Abstract
Each human genome carries tens of thousands of coding variants. The extent to which this variation is functional and the mechanisms by which they exert their influence remains largely unexplored. To address this gap, we leverage the ExAC database of 60,706 human exomes to investigate experimentally the impact of 2009 missense single nucleotide variants (SNVs) across 2185 protein-protein interactions, generating interaction profiles for 4797 SNV-interaction pairs, of which 421 SNVs segregate at > 1% allele frequency in human populations. We find that interaction-disruptive SNVs are prevalent at both rare and common allele frequencies. Furthermore, these results suggest that 10.5% of missense variants carried per individual are disruptive, a higher proportion than previously reported; this indicates that each individual's genetic makeup may be significantly more complex than expected. Finally, we demonstrate that candidate disease-associated mutations can be identified through shared interaction perturbations between variants of interest and known disease mutations.
Affiliation(s)
- Robert Fragoza
  - Department of Computational Biology, Cornell University, Ithaca, NY, 14853, USA
  - Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY, 14853, USA
- Jishnu Das
  - Ragon Institute of MGH, MIT and Harvard, Cambridge, MA, 02139, USA
  - Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
- Shayne D Wierbowski
  - Department of Computational Biology, Cornell University, Ithaca, NY, 14853, USA
  - Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY, 14853, USA
- Jin Liang
  - Department of Computational Biology, Cornell University, Ithaca, NY, 14853, USA
  - Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY, 14853, USA
- Tina N Tran
  - Department of Biomedical Science, Cornell University, Ithaca, NY, 14853, USA
  - Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, 14853, USA
- Siqi Liang
  - Department of Computational Biology, Cornell University, Ithaca, NY, 14853, USA
  - Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY, 14853, USA
- Juan F Beltran
  - Department of Computational Biology, Cornell University, Ithaca, NY, 14853, USA
  - Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY, 14853, USA
- Christen A Rivera-Erick
  - Department of Computational Biology, Cornell University, Ithaca, NY, 14853, USA
  - Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY, 14853, USA
- Kaixiong Ye
  - Department of Computational Biology, Cornell University, Ithaca, NY, 14853, USA
- Ting-Yi Wang
  - Department of Computational Biology, Cornell University, Ithaca, NY, 14853, USA
  - Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY, 14853, USA
- Li Yao
  - Department of Computational Biology, Cornell University, Ithaca, NY, 14853, USA
  - Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY, 14853, USA
- Matthew Mort
  - Institute of Medical Genetics, Cardiff University, Heath Park, Cardiff, CF14 4XN, UK
- Peter D Stenson
  - Institute of Medical Genetics, Cardiff University, Heath Park, Cardiff, CF14 4XN, UK
- David N Cooper
  - Institute of Medical Genetics, Cardiff University, Heath Park, Cardiff, CF14 4XN, UK
- Xiaomu Wei
  - Department of Computational Biology, Cornell University, Ithaca, NY, 14853, USA
- Alon Keinan
  - Department of Computational Biology, Cornell University, Ithaca, NY, 14853, USA
- John C Schimenti
  - Department of Biomedical Science, Cornell University, Ithaca, NY, 14853, USA
- Andrew G Clark
  - Department of Computational Biology, Cornell University, Ithaca, NY, 14853, USA
  - Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, 14853, USA
- Haiyuan Yu
  - Department of Computational Biology, Cornell University, Ithaca, NY, 14853, USA
  - Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY, 14853, USA
6
Abstract
Human cancers often harbor large numbers of somatic mutations. However, only a small proportion of these mutations are expected to contribute to tumor growth and progression. Therefore, determining causal driver mutations and the genes they target is becoming an important challenge in cancer genomics. Here we describe an approach for mapping somatic mutations onto 3D structures of human proteins in complex to identify "driver interfaces." Our strategy relies on identifying protein-interaction interfaces that are unexpectedly biased toward nonsynonymous mutations, which suggests that these interfaces are subject to positive selection during tumorigenesis, implicating the interacting proteins as candidate drivers.
Affiliation(s)
- Kivilcim Ozturk
  - Division of Medical Genetics, Department of Medicine, University of California San Diego, La Jolla, CA, USA
  - Bioinformatics Program, University of California San Diego, La Jolla, CA, USA
- Hannah Carter
  - Division of Medical Genetics, Department of Medicine, University of California San Diego, La Jolla, CA, USA
  - Bioinformatics Program, University of California San Diego, La Jolla, CA, USA
  - Moores Cancer Center, University of California San Diego, La Jolla, CA, USA
  - Institute for Genomic Medicine, University of California San Diego, La Jolla, CA, USA
7
Capriotti E, Ozturk K, Carter H. Integrating molecular networks with genetic variant interpretation for precision medicine. Wiley Interdiscip Rev Syst Biol Med 2018; 11:e1443. [PMID: 30548534 PMCID: PMC6450710 DOI: 10.1002/wsbm.1443] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Received: 08/15/2018] [Revised: 10/23/2018] [Accepted: 10/30/2018] [Indexed: 02/01/2023]
Abstract
More reliable and cheaper sequencing technologies have revealed the vast mutational landscapes characteristic of many phenotypes. The analysis of such genetic variants has led to successful identification of altered proteins underlying many Mendelian disorders. Nevertheless the simple one‐variant one‐phenotype model valid for many monogenic diseases does not capture the complexity of polygenic traits and disorders. Although experimental and computational approaches have improved detection of functionally deleterious variants and important interactions between gene products, the development of comprehensive models relating genotype and phenotypes remains a challenge in the field of genomic medicine. In this context, a new view of the pathologic state as significant perturbation of the network of interactions between biomolecules is crucial for the identification of biochemical pathways associated with complex phenotypes. Seminal studies in systems biology combined the analysis of genetic variation with protein–protein interaction networks to demonstrate that even as biological systems evolve to be robust to genetic variation, their topologies create disease vulnerabilities. More recent analyses model the impact of genetic variants as changes to the “wiring” of the interactome to better capture heterogeneity in genotype–phenotype relationships. These studies lay the foundation for using networks to predict variant effects at scale using machine‐learning or algorithmic approaches. A wealth of databases and resources for the annotation of genotype–phenotype relationships have been developed to support developments in this area. This overview describes how study of the molecular interactome has generated insights linking the organization of biological systems to disease mechanism, and how this information can enable precision medicine. This article is categorized under:
- Translational, Genomic, and Systems Medicine > Translational Medicine
- Biological Mechanisms > Cell Signaling
- Models of Systems Properties and Processes > Mechanistic Models
- Analytical and Computational Methods > Computational Methods
Affiliation(s)
- Emidio Capriotti
  - Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Bologna, Italy
- Kivilcim Ozturk
  - Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, California
- Hannah Carter
  - Department of Medicine and Institute for Genomic Medicine, University of California, San Diego, La Jolla, California
8
Ahmed H, Howton TC, Sun Y, Weinberger N, Belkhadir Y, Mukhtar MS. Network biology discovers pathogen contact points in host protein-protein interactomes. Nat Commun 2018; 9:2312. [PMID: 29899369 PMCID: PMC5998135 DOI: 10.1038/s41467-018-04632-8] [Citation(s) in RCA: 71] [Impact Index Per Article: 11.8] [Received: 02/17/2018] [Accepted: 05/11/2018] [Indexed: 12/21/2022]
Abstract
In all organisms, major biological processes are controlled by complex protein-protein interactions networks (interactomes), yet their structural complexity presents major analytical challenges. Here, we integrate a compendium of over 4300 phenotypes with Arabidopsis interactome (AI-1MAIN). We show that nodes with high connectivity and betweenness are enriched and depleted in conditional and essential phenotypes, respectively. Such nodes are located in the innermost layers of AI-1MAIN and are preferential targets of pathogen effectors. We extend these network-centric analyses to Cell Surface Interactome (CSILRR) and predict its 35 most influential nodes. To determine their biological relevance, we show that these proteins physically interact with pathogen effectors and modulate plant immunity. Overall, our findings contrast with centrality-lethality rule, discover fast information spreading nodes, and highlight the structural properties of pathogen targets in two different interactomes. Finally, this theoretical framework could possibly be applicable to other inter-species interactomes to reveal pathogen contact points.
Affiliation(s)
- Hadia Ahmed
  - Department of Computer Science, University of Alabama at Birmingham, 115A Campbell Hall, 1300 University Boulevard, Birmingham, AL, 35294, USA
- T C Howton
  - Department of Biology, University of Alabama at Birmingham, 464 Campbell Hall, 1300 University Boulevard, Birmingham, AL, 35294, USA
- Yali Sun
  - Department of Biology, University of Alabama at Birmingham, 464 Campbell Hall, 1300 University Boulevard, Birmingham, AL, 35294, USA
- Natascha Weinberger
  - Gregor Mendel Institute (GMI), Austrian Academy of Sciences, Vienna Biocenter (VBC), Dr Bohr Gasse 3, 1030, Vienna, Austria
- Youssef Belkhadir
  - Gregor Mendel Institute (GMI), Austrian Academy of Sciences, Vienna Biocenter (VBC), Dr Bohr Gasse 3, 1030, Vienna, Austria
- M Shahid Mukhtar
  - Department of Biology, University of Alabama at Birmingham, 464 Campbell Hall, 1300 University Boulevard, Birmingham, AL, 35294, USA
  - Nutrition Obesity Research Center, University of Alabama at Birmingham, 1675 University Blvd, WEBB 568, Birmingham, AL, 35294, USA
9
Mishra B, Sun Y, Ahmed H, Liu X, Mukhtar MS. Global temporal dynamic landscape of pathogen-mediated subversion of Arabidopsis innate immunity. Sci Rep 2017; 7:7849. [PMID: 28798368 PMCID: PMC5552879 DOI: 10.1038/s41598-017-08073-z] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Received: 03/31/2017] [Accepted: 06/29/2017] [Indexed: 12/22/2022]
Abstract
The universal nature of networks’ structural and physical properties across diverse systems offers a better prospect to elucidate the interplay between a system and its environment. In the last decade, several large-scale transcriptome and interactome studies were conducted to understand the complex and dynamic nature of interactions between Arabidopsis and its bacterial pathogen, Pseudomonas syringae pv. tomato DC3000. We took advantage of these publicly available datasets and performed “-omics”-based integrative and network topology analyses to decipher the transcriptional and protein-protein interaction activities of effector targets. We demonstrated that effector targets exhibit shorter distance to differentially expressed genes (DEGs) and possess increased information centrality. Intriguingly, effector targets are differentially expressed in a sequential manner and account for 1% of the total DEGs at any time point of infection with virulent or defense-inducing DC3000 strains. We revealed that DC3000 significantly alters the expression levels of 71% of effector targets and their downstream physical interacting proteins in the Arabidopsis interactome. Our integrative “-omics”-based analyses identified dynamic complexes associated with MTI and disease susceptibility. Finally, we discovered five novel plant defense players using a systems biology-fueled top-to-bottom approach and demonstrated immune-related functions for them, further validating the power and resolution of our network analyses.
Affiliation(s)
- Bharat Mishra
  - Department of Biology, University of Alabama at Birmingham, Birmingham, USA
- Yali Sun
  - Department of Biology, University of Alabama at Birmingham, Birmingham, USA
- Hadia Ahmed
  - Department of Computer & Information Sciences, University of Alabama at Birmingham, Birmingham, USA
- Xiaoyu Liu
  - Department of Biology, University of Alabama at Birmingham, Birmingham, USA
- M Shahid Mukhtar
  - Department of Biology, University of Alabama at Birmingham, Birmingham, USA
  - Nutrition Obesity Research Center, University of Alabama at Birmingham, Birmingham, USA
10
Yi S, Lin S, Li Y, Zhao W, Mills GB, Sahni N. Functional variomics and network perturbation: connecting genotype to phenotype in cancer. Nat Rev Genet 2017; 18:395-410. [PMID: 28344341 DOI: 10.1038/nrg.2017.8] [Citation(s) in RCA: 70] [Impact Index Per Article: 10.0] [Indexed: 02/08/2023]
Abstract
Proteins interact with other macromolecules in complex cellular networks for signal transduction and biological function. In cancer, genetic aberrations have been traditionally thought to disrupt the entire gene function. It has been increasingly appreciated that each mutation of a gene could have a subtle but unique effect on protein function or network rewiring, contributing to diverse phenotypic consequences across cancer patient populations. In this Review, we discuss the current understanding of cancer genetic variants, including the broad spectrum of mutation classes and the wide range of mechanistic effects on gene function in the context of signalling networks. We highlight recent advances in computational and experimental strategies to study the diverse functional and phenotypic consequences of mutations at the base-pair resolution. Such information is crucial to understanding the complex pleiotropic effect of cancer genes and provides a possible link between genotype and phenotype in cancer.
Affiliation(s)
- Song Yi
  - Department of Systems Biology, University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA
- Shengda Lin
  - Department of Medicine, Stanford University School of Medicine, Stanford, California 94305, USA
- Yongsheng Li
  - Department of Systems Biology, University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA
- Wei Zhao
  - Department of Systems Biology, University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA
- Gordon B Mills
  - Department of Systems Biology, University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA
- Nidhi Sahni
  - Department of Systems Biology, University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA
  - Graduate Program in Structural and Computational Biology and Molecular Biophysics, Baylor College of Medicine, Houston, Texas 77030, USA
11
Wu MY, Zhang XF, Dai DQ, Ou-Yang L, Zhu Y, Yan H. Regularized logistic regression with network-based pairwise interaction for biomarker identification in breast cancer. BMC Bioinformatics 2016; 17:108. [PMID: 26921029 PMCID: PMC4769543 DOI: 10.1186/s12859-016-0951-7] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Received: 08/06/2015] [Accepted: 01/28/2016] [Indexed: 12/14/2022]
Abstract
BACKGROUND: To facilitate advances in personalized medicine, it is important to detect predictive, stable and interpretable biomarkers associated with different clinical characteristics. These clinical characteristics may be heterogeneous with respect to underlying interactions between genes. Usually, traditional methods just focus on detection of differentially expressed genes without taking the interactions between genes into account. Moreover, due to the typical low reproducibility of the selected biomarkers, it is difficult to give a clear biological interpretation for a specific disease. Therefore, it is necessary to design a robust biomarker identification method that can predict disease-associated interactions with high reproducibility. RESULTS: In this article, we propose a regularized logistic regression model. Different from previous methods which focus on individual genes or modules, our model takes gene pairs, which are connected in a protein-protein interaction network, into account. A line graph is constructed to represent the adjacencies between pairwise interactions. Based on this line graph, we incorporate the degree information in the model via an adaptive elastic net, which makes our model less dependent on the expression data. Experimental results on six publicly available breast cancer datasets show that our method can not only achieve competitive performance in classification, but also retain great stability in variable selection. Therefore, our model is able to identify the diagnostic and prognostic biomarkers in a more robust way. Moreover, most of the biomarkers discovered by our model have been verified in biochemical or biomedical research. CONCLUSIONS: The proposed method shows promise in the diagnosis of disease pathogenesis with different clinical characteristics. These advances lead to more accurate and stable biomarker discovery, which can monitor the functional changes that are perturbed by diseases. Based on these predictions, researchers may be able to provide suggestions for new therapeutic approaches.
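The line-graph step mentioned in this abstract can be illustrated with a minimal sketch. It assumes only the textbook definition of a line graph (each interaction becomes a node, and two interactions are adjacent when they share a gene); the function names and the toy network are hypothetical, and how the authors turn the resulting degrees into adaptive elastic-net weights is not stated in the abstract and is not reproduced here.

```python
from itertools import combinations


def line_graph(edges):
    """Build the line graph of an undirected interaction network.

    Each node of the line graph is one interaction (gene pair); two
    nodes are adjacent when the underlying interactions share a gene.
    """
    # Normalise every pair to a sorted tuple so (a, b) and (b, a) coincide.
    nodes = sorted({tuple(sorted(edge)) for edge in edges})
    adjacency = {node: set() for node in nodes}
    for first, second in combinations(nodes, 2):
        if set(first) & set(second):
            adjacency[first].add(second)
            adjacency[second].add(first)
    return adjacency


def degrees(adjacency):
    """Number of neighbouring interactions for each gene pair."""
    return {node: len(neighbours) for node, neighbours in adjacency.items()}


# Hypothetical toy network, used only for illustration.
ppi = [("TP53", "MDM2"), ("TP53", "BRCA1"), ("BRCA1", "BARD1"), ("EGFR", "GRB2")]
lg = line_graph(ppi)
deg = degrees(lg)
```

In this toy network the pair (BRCA1, TP53) has degree 2 because it shares a gene with two other interactions, while the isolated pair (EGFR, GRB2) has degree 0.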
Affiliation(s)
- Meng-Yun Wu
- School of Statistics and Management, Shanghai University of Finance and Economics, Guoding Road, Shanghai, 200433, China; Key Laboratory of Mathematical Economics SUFE, Ministry of Education, Guoding Road, Shanghai, 200433, China.
- Xiao-Fei Zhang
- School of Mathematics and Statistics & Hubei Key Laboratory of Mathematical Sciences, Central China Normal University, Luoyu Road, Wuhan, 430079, China.
- Dao-Qing Dai
- Intelligent Data Center and Department of Mathematics, Sun Yat-Sen University, Xingang West Road, Guangzhou, 510275, China.
- Le Ou-Yang
- College of Information Engineering, Shenzhen University, Nanhai Avenue, Shenzhen, 518060, China.
- Yuan Zhu
- School of Automation, China University of Geosciences, Lumo Road, Wuhan, 430074, China.
- Hong Yan
- Department of Electronic and Engineering, City University of Hong Kong, Tat Chee Avenue, Hong Kong, 999077, China.
12
Meyer MJ, Lapcevic R, Romero AE, Yoon M, Das J, Beltrán JF, Mort M, Stenson PD, Cooper DN, Paccanaro A, Yu H. mutation3D: Cancer Gene Prediction Through Atomic Clustering of Coding Variants in the Structural Proteome. Hum Mutat 2016; 37:447-56. [PMID: 26841357 DOI: 10.1002/humu.22963] [Citation(s) in RCA: 64] [Impact Index Per Article: 8.0] [Received: 10/27/2015] [Accepted: 01/14/2016] [Indexed: 12/20/2022]
Abstract
A new algorithm and Web server, mutation3D (http://mutation3d.org), proposes driver genes in cancer by identifying clusters of amino acid substitutions within tertiary protein structures. We demonstrate the feasibility of using a 3D clustering approach to implicate proteins in cancer based on explorations of single proteins using the mutation3D Web interface. On a large scale, we show that clustering with mutation3D is able to separate functional from nonfunctional mutations by analyzing a combination of 8,869 known inherited disease mutations and 2,004 SNPs overlaid together upon the same sets of crystal structures and homology models. Further, we present a systematic analysis of whole-genome and whole-exome cancer datasets to demonstrate that mutation3D identifies many known cancer genes as well as previously underexplored target genes. The mutation3D Web interface allows users to analyze their own mutation data in a variety of popular formats and provides seamless access to explore mutation clusters derived from over 975,000 somatic mutations reported by 6,811 cancer sequencing studies. The mutation3D Web interface is freely available with all major browsers supported.
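The general idea of finding clusters of substitutions in a tertiary structure can be sketched as follows. This is not mutation3D's actual algorithm or parameters, which the abstract does not specify; it is a simple single-linkage grouping under an assumed distance cutoff, and the residue numbers, coordinates, and cutoff value are all hypothetical.

```python
import math


def cluster_residues(coords, cutoff):
    """Group mutated residues that chain together within `cutoff`.

    `coords` maps a residue number to one representative (x, y, z)
    position, for example its alpha-carbon. Residues closer than the
    cutoff are merged with a small union-find structure.
    """
    ids = list(coords)
    parent = {residue: residue for residue in ids}

    def find(residue):
        # Follow parent links to the cluster root, shortening the path.
        while parent[residue] != residue:
            parent[residue] = parent[parent[residue]]
            residue = parent[residue]
        return residue

    for index, first in enumerate(ids):
        for second in ids[index + 1:]:
            if math.dist(coords[first], coords[second]) <= cutoff:
                parent[find(first)] = find(second)

    clusters = {}
    for residue in ids:
        clusters.setdefault(find(residue), []).append(residue)
    return sorted(sorted(members) for members in clusters.values())


# Hypothetical substituted positions with made-up coordinates.
mutated = {
    12: (0.0, 0.0, 0.0),
    13: (3.8, 0.0, 0.0),
    61: (5.0, 3.0, 0.0),
    146: (40.0, 40.0, 40.0),
}
clusters = cluster_residues(mutated, cutoff=6.0)
```

Here residues 12, 13, and 61 fall into one spatial cluster even though 61 is far away in sequence, which is the kind of signal a structure-based approach can pick up and a purely sequence-based one would miss.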
Affiliation(s)
- Michael J Meyer
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, 14853; Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, New York, 14853; Tri-Institutional Training Program in Computational Biology and Medicine, New York, New York, 10065
- Ryan Lapcevic
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, 14853; Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, New York, 14853
- Alfonso E Romero
- Department of Computer Science and Centre for Systems and Synthetic Biology, Royal Holloway, University of London, Egham TW20 0EX, UK
- Mark Yoon
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, 14853; Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, New York, 14853
- Jishnu Das
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, 14853; Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, New York, 14853
- Juan Felipe Beltrán
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, 14853; Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, New York, 14853
- Matthew Mort
- Institute of Medical Genetics, School of Medicine, Cardiff University, Heath Park, Cardiff CF14 4XN, UK
- Peter D Stenson
- Institute of Medical Genetics, School of Medicine, Cardiff University, Heath Park, Cardiff CF14 4XN, UK
- David N Cooper
- Institute of Medical Genetics, School of Medicine, Cardiff University, Heath Park, Cardiff CF14 4XN, UK
- Alberto Paccanaro
- Department of Computer Science and Centre for Systems and Synthetic Biology, Royal Holloway, University of London, Egham TW20 0EX, UK
- Haiyuan Yu
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, 14853; Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, New York, 14853
13
Theofilatos KA, Likothanassis S, Mavroudi S. Quo vadis computational analysis of PPI data or why the future isn't here yet. Front Genet 2015; 6:289. [PMID: 26442107 PMCID: PMC4584938 DOI: 10.3389/fgene.2015.00289] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Received: 06/02/2015] [Accepted: 08/31/2015] [Indexed: 11/13/2022] Open
Affiliation(s)
- Spiros Likothanassis
- InSyBio Ltd., London, UK; Pattern Recognition Laboratory, Department of Computer Engineering and Informatics, University of Patras, Patras, Greece
- Seferina Mavroudi
- InSyBio Ltd., London, UK; Pattern Recognition Laboratory, Department of Computer Engineering and Informatics, University of Patras, Patras, Greece; Department of Social Work, School of Sciences of Health and Care, Technological Educational Institute of Western Greece, Patras, Greece
14
Abstract
The acquisition of mutations that activate oncogenes or inactivate tumor suppressors is a primary feature of most cancers. Mutations that directly alter protein sequence and structure drive the development of tumors through aberrant expression and modification of proteins, in many cases directly impacting components of signal transduction pathways and cellular architecture. Cancer-associated mutations may have direct or indirect effects on proteins and their interactions and while the effects of mutations on signaling pathways have been widely studied, how mutations alter underlying protein-protein interaction networks is much less well understood. Systematic mapping of oncoprotein protein interactions using proteomics techniques as well as computational network analyses is revealing how oncoprotein mutations perturb protein-protein interaction networks and drive the cancer phenotype.
Affiliation(s)
- Emily Bowler
- Centre for Biological Sciences, University of Southampton, Southampton SO17 1BJ, UK
- Zhenghe Wang
- Department of Genetics and Genome Science, Case Western Reserve University, Cleveland, Ohio 44106, USA
- Rob M. Ewing
- Centre for Biological Sciences, University of Southampton, Southampton SO17 1BJ, UK
15
Cantor DI, Nice EC, Baker MS. Recent findings from the human proteome project: opening the mass spectrometry toolbox to advance cancer diagnosis, surveillance and treatment. Expert Rev Proteomics 2015; 12:279-93. [DOI: 10.1586/14789450.2015.1040770] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Indexed: 12/29/2022]
16
Das J, Gayvert KM, Bunea F, Wegkamp MH, Yu H. ENCAPP: elastic-net-based prognosis prediction and biomarker discovery for human cancers. BMC Genomics 2015; 16:263. [PMID: 25887568 PMCID: PMC4392808 DOI: 10.1186/s12864-015-1465-9] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Received: 10/03/2014] [Accepted: 03/13/2015] [Indexed: 02/08/2023] Open
Abstract
Background: With the explosion of genomic data over the last decade, there has been a tremendous amount of effort to understand the molecular basis of cancer using informatics approaches. However, this has proven to be extremely difficult primarily because of the varied etiology and vast genetic heterogeneity of different cancers and even within the same cancer. One particularly challenging problem is to predict prognostic outcome of the disease for different patients. Results: Here, we present ENCAPP, an elastic-net-based approach that combines the reference human protein interactome network with gene expression data to accurately predict prognosis for different human cancers. Our method identifies functional modules that are differentially expressed between patients with good and bad prognosis and uses these to fit a regression model that can be used to predict prognosis for breast, colon, rectal, and ovarian cancers. Using this model, ENCAPP can also identify prognostic biomarkers with a high degree of confidence, which can be used to generate downstream mechanistic and therapeutic insights. Conclusion: ENCAPP is a robust method that can accurately predict prognostic outcome and identify biomarkers for different human cancers. Electronic supplementary material: The online version of this article (doi:10.1186/s12864-015-1465-9) contains supplementary material, which is available to authorized users.
Affiliation(s)
- Jishnu Das
- Department of Biological Statistics and Computational Biology, Cornell University, 335 Weill Hall, Ithaca, NY, 14853, USA; Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY, 14853, USA.
- Kaitlyn M Gayvert
- Tri-Institutional Training Program in Computational Biology and Medicine, New York, NY, 10065, USA.
- Florentina Bunea
- Department of Statistical Science, Cornell University, Ithaca, NY, 14853, USA.
- Marten H Wegkamp
- Department of Statistical Science, Cornell University, Ithaca, NY, 14853, USA.
- Haiyuan Yu
- Department of Biological Statistics and Computational Biology, Cornell University, 335 Weill Hall, Ithaca, NY, 14853, USA; Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY, 14853, USA.
17
Vázquez M, Valencia A, Pons T. Structure-PPi: a module for the annotation of cancer-related single-nucleotide variants at protein-protein interfaces. Bioinformatics 2015; 31:2397-9. [PMID: 25765346 PMCID: PMC4495296 DOI: 10.1093/bioinformatics/btv142] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Received: 09/02/2014] [Accepted: 03/08/2015] [Indexed: 02/06/2023] Open
Abstract
Motivation: The interpretation of cancer-related single-nucleotide variants (SNVs) considering the protein features they affect, such as known functional sites, protein–protein interfaces, or relation with already annotated mutations, might complement the annotation of genetic variants in the analysis of NGS data. Current tools that annotate mutations fall short on several aspects, including the ability to use protein structure information or the interpretation of mutations in protein complexes. Results: We present the Structure–PPi system for the comprehensive analysis of coding SNVs based on 3D protein structures of protein complexes. The 3D repository used, Interactome3D, includes experimental and modeled structures for proteins and protein–protein complexes. Structure–PPi annotates SNVs with features extracted from UniProt, InterPro, APPRIS, dbNSFP and COSMIC databases. We illustrate the usefulness of Structure–PPi with the interpretation of 1 027 122 non-synonymous SNVs from COSMIC and the 1000G Project that provides a collection of ∼172 700 SNVs mapped onto the protein 3D structure of 8726 human proteins (43.2% of the 20 214 SwissProt-curated proteins in UniProtKB release 2014_06) and protein–protein interfaces with potential functional implications. Availability and implementation: Structure–PPi, along with a user manual and examples, is available at http://structureppi.bioinfo.cnio.es/Structure; the code for local installations is at https://github.com/Rbbt-Workflows. Contact: tpons@cnio.es. Supplementary Information: Supplementary data are available at Bioinformatics online.
Affiliation(s)
- Miguel Vázquez
- Structural Biology and BioComputing Programme, Spanish National Cancer Research Centre (CNIO), 28029 Madrid, Spain
- Alfonso Valencia
- Structural Biology and BioComputing Programme, Spanish National Cancer Research Centre (CNIO), 28029 Madrid, Spain
- Tirso Pons
- Structural Biology and BioComputing Programme, Spanish National Cancer Research Centre (CNIO), 28029 Madrid, Spain
18
Petrey D, Chen TS, Deng L, Garzon JI, Hwang H, Lasso G, Lee H, Silkov A, Honig B. Template-based prediction of protein function. Curr Opin Struct Biol 2015; 32:33-8. [PMID: 25678152 DOI: 10.1016/j.sbi.2015.01.007] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Received: 11/21/2014] [Revised: 01/13/2015] [Accepted: 01/19/2015] [Indexed: 12/11/2022]
Abstract
We discuss recent approaches for structure-based protein function annotation. We focus on template-based methods where the function of a query protein is deduced from that of a template for which both the structure and function are known. We describe the different ways of identifying a template. These are typically based on sequence analysis but new methods based on purely structural similarity are also being developed that allow function annotation based on structural relationships that cannot be recognized by sequence. The growing number of available structures of known function, improved homology modeling techniques and new developments in the use of structure allow template-based methods to be applied on a proteome-wide scale and in many different biological contexts. This progress significantly expands the range of applicability of structural information in function annotation to a level that previously was only achievable by sequence comparison.
Affiliation(s)
- Donald Petrey
- Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Department of Systems Biology, Center for Computational Biology and Bioinformatics, 1130 St. Nicholas Avenue, Room 815, New York, NY 10032, United States
- T Scott Chen
- Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Department of Systems Biology, Center for Computational Biology and Bioinformatics, 1130 St. Nicholas Avenue, Room 815, New York, NY 10032, United States
- Lei Deng
- Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Department of Systems Biology, Center for Computational Biology and Bioinformatics, 1130 St. Nicholas Avenue, Room 815, New York, NY 10032, United States
- Jose Ignacio Garzon
- Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Department of Systems Biology, Center for Computational Biology and Bioinformatics, 1130 St. Nicholas Avenue, Room 815, New York, NY 10032, United States
- Howook Hwang
- Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Department of Systems Biology, Center for Computational Biology and Bioinformatics, 1130 St. Nicholas Avenue, Room 815, New York, NY 10032, United States
- Gorka Lasso
- Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Department of Systems Biology, Center for Computational Biology and Bioinformatics, 1130 St. Nicholas Avenue, Room 815, New York, NY 10032, United States
- Hunjoong Lee
- Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Department of Systems Biology, Center for Computational Biology and Bioinformatics, 1130 St. Nicholas Avenue, Room 815, New York, NY 10032, United States
- Antonina Silkov
- Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Department of Systems Biology, Center for Computational Biology and Bioinformatics, 1130 St. Nicholas Avenue, Room 815, New York, NY 10032, United States
- Barry Honig
- Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Department of Systems Biology, Center for Computational Biology and Bioinformatics, 1130 St. Nicholas Avenue, Room 815, New York, NY 10032, United States
19
A massively parallel pipeline to clone DNA variants and examine molecular phenotypes of human disease mutations. PLoS Genet 2014; 10:e1004819. [PMID: 25502805 PMCID: PMC4263371 DOI: 10.1371/journal.pgen.1004819] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Received: 03/28/2014] [Accepted: 10/14/2014] [Indexed: 12/13/2022] Open
Abstract
Understanding the functional relevance of DNA variants is essential for all exome and genome sequencing projects. However, current mutagenesis cloning protocols require Sanger sequencing, and thus are prohibitively costly and labor-intensive. We describe a massively-parallel site-directed mutagenesis approach, "Clone-seq", leveraging next-generation sequencing to rapidly and cost-effectively generate a large number of mutant alleles. Using Clone-seq, we further develop a comparative interactome-scanning pipeline integrating high-throughput GFP, yeast two-hybrid (Y2H), and mass spectrometry assays to systematically evaluate the functional impact of mutations on protein stability and interactions. We use this pipeline to show that disease mutations on protein-protein interaction interfaces are significantly more likely than those away from interfaces to disrupt corresponding interactions. We also find that mutation pairs with similar molecular phenotypes in terms of both protein stability and interactions are significantly more likely to cause the same disease than those with different molecular phenotypes, validating the in vivo biological relevance of our high-throughput GFP and Y2H assays, and indicating that both assays can be used to determine candidate disease mutations in the future. The general scheme of our experimental pipeline can be readily expanded to other types of interactome-mapping methods to comprehensively evaluate the functional relevance of all DNA variants, including those in non-coding regions.
20
Hao D, Li C, Zhang S, Lu J, Jiang Y, Wang S, Zhou M. Network-based analysis of genotype-phenotype correlations between different inheritance modes. Bioinformatics 2014; 30:3223-31. [PMID: 25078399 DOI: 10.1093/bioinformatics/btu482] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Indexed: 01/04/2023]
Abstract
MOTIVATION: Recent studies on human disease have revealed that aberrant interaction between proteins probably underlies a substantial number of human genetic diseases. This suggests a need to investigate disease inheritance mode using protein interaction and, on that basis, to refresh our conceptual understanding of a series of properties regarding the inheritance mode of human disease. RESULTS: We observed a strong correlation between the number of protein interactions and the likelihood of a gene causing any dominant diseases or multiple dominant diseases, whereas no correlation was observed between protein interaction and the likelihood of a gene causing recessive diseases. We found that dominant diseases are more likely to be associated with disruption of important interactions. These findings suggest that inheritance mode should be understood using protein interaction. We therefore reviewed the previous studies and refined an interaction model of inheritance mode, and then confirmed that this model is largely reasonable using new evidence. With these findings, we found that the inheritance mode of human genetic diseases can be predicted using protein interaction. By integrating the systems biology perspectives with the classical disease genetics paradigm, our study provides some new insights into genotype-phenotype correlations. CONTACT: haodapeng@ems.hrbmu.edu.cn or biofomeng@hotmail.com SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Affiliation(s)
- Dapeng Hao
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, P.R. China and Institute for Systems Biology, Seattle 98109, USA
- Chuanxing Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, P.R. China and Institute for Systems Biology, Seattle 98109, USA
- Shaojun Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, P.R. China and Institute for Systems Biology, Seattle 98109, USA
- Jianping Lu
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, P.R. China and Institute for Systems Biology, Seattle 98109, USA
- Yongshuai Jiang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, P.R. China and Institute for Systems Biology, Seattle 98109, USA
- Shiyuan Wang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, P.R. China and Institute for Systems Biology, Seattle 98109, USA
- Meng Zhou
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, P.R. China and Institute for Systems Biology, Seattle 98109, USA
21
Azaiez H, Booth KT, Bu F, Huygen P, Shibata SB, Shearer AE, Kolbe D, Meyer N, Black-Ziegelbein EA, Smith RJH. TBC1D24 mutation causes autosomal-dominant nonsyndromic hearing loss. Hum Mutat 2014; 35:819-23. [PMID: 24729539 DOI: 10.1002/humu.22557] [Citation(s) in RCA: 71] [Impact Index Per Article: 7.1] [Received: 01/11/2014] [Accepted: 03/26/2014] [Indexed: 11/08/2022]
Abstract
Hereditary hearing loss is extremely heterogeneous. Over 70 genes have been identified to date, and with the advent of massively parallel sequencing, the pace of novel gene discovery has accelerated. In a family segregating progressive autosomal-dominant nonsyndromic hearing loss (NSHL), we used OtoSCOPE® to exclude mutations in known deafness genes and then performed segregation mapping and whole-exome sequencing to identify a unique variant, p.Ser178Leu, in TBC1D24 that segregates with the hearing loss phenotype. TBC1D24 encodes a GTPase-activating protein expressed in the cochlea. Ser178 is highly conserved across vertebrates and its change is predicted to be damaging. Other variants in TBC1D24 have been associated with a panoply of clinical symptoms including autosomal recessive NSHL, syndromic hearing impairment associated with onychodystrophy, osteodystrophy, mental retardation, and seizures (DOORS syndrome), and a wide range of epileptic disorders.
Affiliation(s)
- Hela Azaiez
- Molecular Otolaryngology & Renal Research Laboratories, Department of Otolaryngology-Head and Neck Surgery, University of Iowa Hospitals and Clinics, Iowa City, Iowa
22
Garbutt CC, Bangalore PV, Kannar P, Mukhtar MS. Getting to the edge: protein dynamical networks as a new frontier in plant-microbe interactions. Front Plant Sci 2014; 5:312. [PMID: 25071795 PMCID: PMC4074768 DOI: 10.3389/fpls.2014.00312] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Received: 04/16/2014] [Accepted: 06/11/2014] [Indexed: 05/18/2023]
Abstract
A systems perspective on diverse phenotypes, mechanisms of infection, and responses to environmental stresses can lead to considerable advances in agriculture and medicine. A significant promise of systems biology within plants is the development of disease-resistant crop varieties, which would maximize yield output for food, clothing, building materials, and biofuel production. A systems or "-omics" perspective frames the next frontier in the search for enhanced knowledge of plant network biology. The functional understanding of network structure and dynamics is vital to expanding our knowledge of how the intercellular communication processes are executed. This review article will systematically discuss various levels of organization of systems biology beginning with the building blocks termed "-omes" and ending with complex transcriptional and protein-protein interaction networks. We will also highlight the prevailing computational modeling approaches of biological regulatory network dynamics. The latest developments in the "-omics" approach will be reviewed and discussed to underline and highlight novel technologies and research directions in plant network biology.
Affiliation(s)
- Cassandra C. Garbutt
- Department of Biology, The University of Alabama at Birmingham, Birmingham, AL, USA
- Purushotham V. Bangalore
- Department of Computer and Information Sciences, The University of Alabama at Birmingham, Birmingham, AL, USA
- Pegah Kannar
- Department of Biology, The University of Alabama at Birmingham, Birmingham, AL, USA
- M. S. Mukhtar
- Department of Biology, The University of Alabama at Birmingham, Birmingham, AL, USA
- Nutrition Obesity Research Center, The University of Alabama at Birmingham, Birmingham, AL, USA
- *Correspondence: M. S. Mukhtar, Department of Biology, The University of Alabama at Birmingham, Campbell Hall 369, 1300 University Boulevard, Birmingham, AL 35294-1170, USA