1
|
Khurana V, Peng J, Chung CY, Auluck PK, Fanning S, Tardiff DF, Bartels T, Koeva M, Eichhorn SW, Benyamini H, Lou Y, Nutter-Upham A, Baru V, Freyzon Y, Tuncbag N, Costanzo M, San Luis BJ, Schöndorf DC, Barrasa MI, Ehsani S, Sanjana N, Zhong Q, Gasser T, Bartel DP, Vidal M, Deleidi M, Boone C, Fraenkel E, Berger B, Lindquist S. Genome-Scale Networks Link Neurodegenerative Disease Genes to α-Synuclein through Specific Molecular Pathways. Cell Syst 2017; 4:157-170.e14. [PMID: 28131822 DOI: 10.1016/j.cels.2016.12.011] [Citation(s) in RCA: 101] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2016] [Revised: 08/05/2016] [Accepted: 12/14/2016] [Indexed: 02/02/2023]
Abstract
Numerous genes and molecular pathways are implicated in neurodegenerative proteinopathies, but their inter-relationships are poorly understood. We systematically mapped molecular pathways underlying the toxicity of alpha-synuclein (α-syn), a protein central to Parkinson's disease. Genome-wide screens in yeast identified 332 genes that impact α-syn toxicity. To "humanize" this molecular network, we developed a computational method, TransposeNet. This integrates a Steiner prize-collecting approach with homology assignment through sequence, structure, and interaction topology. TransposeNet linked α-syn to multiple parkinsonism genes and druggable targets through perturbed protein trafficking and ER quality control as well as mRNA metabolism and translation. A calcium signaling hub linked these processes to perturbed mitochondrial quality control and function, metal ion transport, transcriptional regulation, and signal transduction. Parkinsonism gene interaction profiles spatially opposed in the network (ATP13A2/PARK9 and VPS35/PARK17) were highly distinct, and network relationships for specific genes (LRRK2/PARK8, ATXN2, and EIF4G1/PARK18) were confirmed in patient induced pluripotent stem cell (iPSC)-derived neurons. This cross-species platform connected diverse neurodegenerative genes to proteinopathy through specific mechanisms and may facilitate patient stratification for targeted therapy.
Collapse
Affiliation(s)
- Vikram Khurana
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA; Ann Romney Center for Neurologic Disease, Department of Neurology, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA; Harvard Stem Cell Institute, Cambridge, MA 02138, USA.
| | - Jian Peng
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA; Computer Science and Artificial Intelligence Laboratory and Department of Mathematics, MIT, Cambridge, MA 02139, USA
| | - Chee Yeun Chung
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
| | - Pavan K Auluck
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
| | - Saranna Fanning
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
| | - Daniel F Tardiff
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
| | - Theresa Bartels
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
| | - Martina Koeva
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA; Department of Biological Engineering, MIT, Cambridge, MA 02139, USA
| | | | - Hadar Benyamini
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
| | - Yali Lou
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
| | - Andy Nutter-Upham
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
| | - Valeriya Baru
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
| | - Yelena Freyzon
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
| | - Nurcan Tuncbag
- Department of Biological Engineering, MIT, Cambridge, MA 02139, USA
| | - Michael Costanzo
- Banting and Best Department of Medical Research, University of Toronto, Toronto, ON M5G 1L6, Canada
| | - Bryan-Joseph San Luis
- Banting and Best Department of Medical Research, University of Toronto, Toronto, ON M5G 1L6, Canada
| | - David C Schöndorf
- Department of Neurodegenerative Diseases, German Center for Neurodegenerative Diseases (DZNE), and Hertie-Institute for Clinical Brain Research, University of Tübingen, Tübingen, 72076, Germany
| | | | - Sepehr Ehsani
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
| | - Neville Sanjana
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; New York Genome Center and Department of Biology, New York University, New York, NY 10013, USA
| | - Quan Zhong
- Department of Biological Sciences, Wright State University, Dayton, OH 45435, USA
| | - Thomas Gasser
- Department of Neurodegenerative Diseases, German Center for Neurodegenerative Diseases (DZNE), and Hertie-Institute for Clinical Brain Research, University of Tübingen, Tübingen, 72076, Germany
| | - David P Bartel
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA
| | - Marc Vidal
- Center for Cancer Systems Biology (CCSB) and Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA 02215, USA; Department of Genetics, Harvard Medical School, Boston, MA 02115, USA
| | - Michela Deleidi
- Department of Neurodegenerative Diseases, German Center for Neurodegenerative Diseases (DZNE), and Hertie-Institute for Clinical Brain Research, University of Tübingen, Tübingen, 72076, Germany
| | - Charles Boone
- Banting and Best Department of Medical Research, University of Toronto, Toronto, ON M5G 1L6, Canada
| | - Ernest Fraenkel
- Department of Biological Engineering, MIT, Cambridge, MA 02139, USA.
| | - Bonnie Berger
- Harvard Stem Cell Institute, Cambridge, MA 02138, USA.
| | - Susan Lindquist
- Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA; HHMI, Department of Biology, MIT, Cambridge, MA 02139, USA
| |
Collapse
|
2
|
Price GA, Crooks GE, Green RE, Brenner SE. Statistical evaluation of pairwise protein sequence comparison with the Bayesian bootstrap. Bioinformatics 2005; 21:3824-31. [PMID: 16105900 DOI: 10.1093/bioinformatics/bti627] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Protein sequence comparison methods are routinely used to infer the intricate network of evolutionary relationships found within the rapidly growing library of protein sequences, and thereby to predict the structure and function of uncharacterized proteins. In the present study, we detail an improved statistical benchmark of pairwise protein sequence comparison algorithms. We use bootstrap resampling techniques to determine standard statistical errors and to estimate the confidence of our conclusions. We show that the underlying structure within benchmark databases causes Efron's standard, non-parametric bootstrap to be biased. Consequently, the standard bootstrap underpredicts average performance when used in the context of evaluating sequence comparison methods. We have developed, as an alternative, an unbiased statistical evaluation based on the Bayesian bootstrap, a resampling method operationally similar to the standard bootstrap. RESULTS We apply our analysis to the comparative study of amino acid substitution matrix families and find that using modern matrices results in a small, but statistically significant improvement in remote homology detection compared with the classic PAM and BLOSUM matrices. AVAILABILITY The sequence sets and code for performing these analyses are available from http://compbio.berkeley.edu/. CONTACT brenner@compbio.berkeley.edu.
Collapse
Affiliation(s)
- Gavin A Price
- Department of Bioengineering, University of California, Berkeley, 94720, USA
| | | | | | | |
Collapse
|
7
|
Reddy BV, Li WW, Shindyalov IN, Bourne PE. Conserved key amino acid positions (CKAAPs) derived from the analysis of common substructures in proteins. Proteins 2001; 42:148-63. [PMID: 11119639 DOI: 10.1002/1097-0134(20010201)42:2<148::aid-prot20>3.0.co;2-r] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
An all-against-all protein structure comparison using the Combinatorial Extension (CE) algorithm applied to a representative set of PDB structures revealed a gallery of common substructures in proteins (http://cl.sdsc.edu/ce.html). These substructures represent commonly identified folds, domains, or components thereof. Most of the subsequences forming these similar substructures have no significant sequence similarity. We present a method to identify conserved amino acid positions and residue-dependent property clusters within these subsequences starting with structure alignments. Each of the subsequences is aligned to its homologues in SWALL, a nonredundant protein sequence database. The most similar sequences are purged into a common frequency matrix, and weighted homologues of each one of the subsequences are used in scoring for conserved key amino acid positions (CKAAPs). We have set the top 20% of the high-scoring positions in each substructure to be CKAAPs. It is hypothesized that CKAAPs may be responsible for the common folding patterns in either a local or global view of the protein-folding pathway. Where a significant number of structures exist, CKAAPs have also been identified in structure alignments of complete polypeptide chains from the same protein family or superfamily. Evidence to support the presence of CKAAPs comes from other computational approaches and experimental studies of mutation and protein-folding experiments, notably the Paracelsus challenge. Finally, the structural environment of CKAAPs versus non-CKAAPs is examined for solvent accessibility, hydrogen bonding, and secondary structure. The identification of CKAAPs has important implications for protein engineering, fold recognition, modeling, and structure prediction studies and is dependent on the availability of structures and an accurate structure alignment methodology. Proteins 2001;42:148-163.
Collapse
Affiliation(s)
- B V Reddy
- San Diego Supercomputer Center, University of California, San Diego, La Jolla, California 92093-0505, USA
| | | | | | | |
Collapse
|