51
|
An extended catalogue of tandem alternative splice sites in human tissue transcriptomes. PLoS Comput Biol 2021; 17:e1008329. [PMID: 33826604 PMCID: PMC8055015 DOI: 10.1371/journal.pcbi.1008329] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2020] [Revised: 04/19/2021] [Accepted: 03/22/2021] [Indexed: 12/18/2022] Open
Abstract
Tandem alternative splice sites (TASS) is a special class of alternative splicing events that are characterized by a close tandem arrangement of splice sites. Most TASS lack functional characterization and are believed to arise from splicing noise. Based on the RNA-seq data from the Genotype Tissue Expression project, we present an extended catalogue of TASS in healthy human tissues and analyze their tissue-specific expression. The expression of TASS is usually dominated by one major splice site (maSS), while the expression of minor splice sites (miSS) is at least an order of magnitude lower. Among 46k miSS with sufficient read support, 9k (20%) are significantly expressed above the expected noise level, and among them 2.5k are expressed tissue-specifically. We found significant correlations between tissue-specific expression of RNA-binding proteins (RBP), tissue-specific expression of miSS, and miSS response to RBP inactivation by shRNA. In combination with RBP profiling by eCLIP, this allowed prediction of novel cases of tissue-specific splicing regulation including a miSS in QKI mRNA that is likely regulated by PTBP1. The analysis of human primary cell transcriptomes suggested that both tissue-specific and cell-type-specific factors contribute to the regulation of miSS expression. More than 20% of tissue-specific miSS affect structured protein regions and may adjust protein-protein interactions or modify the stability of the protein core. The significantly expressed miSS evolve under the same selection pressure as maSS, while other miSS lack signatures of evolutionary selection and conservation. Using mixture models, we estimated that not more than 15% of maSS and not more than 54% of tissue-specific miSS are noisy, while the proportion of noisy splice sites among non-significantly expressed miSS is above 63%. Pre-mRNA splicing is an important step in the processing of the genomic information during gene expression. During splicing, introns are excised from a gene transcript, and the remaining exons are ligated. Our work concerns one its particular subtype, which involves the so-called tandem alternative splice sites, a group of closely located exon borders that are used alternatively. We analyzed RNA-seq measurements of gene expression provided by the Genotype-Tissue Expression (GTEx) project, the largest to-date collection of such measurements in healthy human tissues, and constructed a detailed catalogue of tandem alternative splice sites. Within this catalogue, we characterized patterns of tissue-specific expression, regulation, impact on protein structure, and evolutionary selection acting on tandem alternative splice sites. In a number of genes, we predicted regulatory mechanisms that could be responsible for choosing one of many tandem alternative splice sites. The results of this study provide an invaluable resource for molecular biologists studying alternative splicing.
Collapse
|
52
|
Goretzki B, Guhl C, Tebbe F, Harder JM, Hellmich UA. Unstructural Biology of TRP Ion Channels: The Role of Intrinsically Disordered Regions in Channel Function and Regulation. J Mol Biol 2021; 433:166931. [PMID: 33741410 DOI: 10.1016/j.jmb.2021.166931] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2021] [Revised: 03/02/2021] [Accepted: 03/06/2021] [Indexed: 12/13/2022]
Abstract
The first genuine high-resolution single particle cryo-electron microscopy structure of a membrane protein determined was a transient receptor potential (TRP) ion channel, TRPV1, in 2013. This methodical breakthrough opened up a whole new world for structural biology and ion channel aficionados alike. TRP channels capture the imagination due to the sheer endless number of tasks they carry out in all aspects of animal physiology. To date, structures of at least one representative member of each of the six mammalian TRP channel subfamilies as well as of a few non-mammalian families have been determined. These structures were instrumental for a better understanding of TRP channel function and regulation. However, all of the TRP channel structures solved so far are incomplete since they miss important information about highly flexible regions found mostly in the channel N- and C-termini. These intrinsically disordered regions (IDRs) can represent between a quarter to almost half of the entire protein sequence and act as important recruitment hubs for lipids and regulatory proteins. Here, we analyze the currently available TRP channel structures with regard to the extent of these "missing" regions and compare these findings to disorder predictions. We discuss select examples of intra- and intermolecular crosstalk of TRP channel IDRs with proteins and lipids as well as the effect of splicing and post-translational modifications, to illuminate their importance for channel function and to complement the prevalently discussed structural biology of these versatile and fascinating proteins with their equally relevant 'unstructural' biology.
Collapse
Affiliation(s)
- Benedikt Goretzki
- Faculty of Chemistry and Earth Sciences, Institute of Organic Chemistry and Macromolecular Chemistry, Friedrich-Schiller-University, Humboldtstrasse 10, 07743 Jena, Germany; Centre for Biomolecular Magnetic Resonance (BMRZ), Goethe-University, Max-von-Laue-Strasse 9, 60438 Frankfurt, Germany
| | - Charlotte Guhl
- Faculty of Chemistry and Earth Sciences, Institute of Organic Chemistry and Macromolecular Chemistry, Friedrich-Schiller-University, Humboldtstrasse 10, 07743 Jena, Germany; Centre for Biomolecular Magnetic Resonance (BMRZ), Goethe-University, Max-von-Laue-Strasse 9, 60438 Frankfurt, Germany; TransMED - Mainz Research School of Translational Medicine, Johannes Gutenberg-University, University Medical Center, Langenbeckstr. 1, 55131 Mainz, Germany
| | - Frederike Tebbe
- Faculty of Chemistry and Earth Sciences, Institute of Organic Chemistry and Macromolecular Chemistry, Friedrich-Schiller-University, Humboldtstrasse 10, 07743 Jena, Germany; Centre for Biomolecular Magnetic Resonance (BMRZ), Goethe-University, Max-von-Laue-Strasse 9, 60438 Frankfurt, Germany
| | - Jean-Martin Harder
- Faculty of Chemistry and Earth Sciences, Institute of Organic Chemistry and Macromolecular Chemistry, Friedrich-Schiller-University, Humboldtstrasse 10, 07743 Jena, Germany
| | - Ute A Hellmich
- Faculty of Chemistry and Earth Sciences, Institute of Organic Chemistry and Macromolecular Chemistry, Friedrich-Schiller-University, Humboldtstrasse 10, 07743 Jena, Germany; Centre for Biomolecular Magnetic Resonance (BMRZ), Goethe-University, Max-von-Laue-Strasse 9, 60438 Frankfurt, Germany; TransMED - Mainz Research School of Translational Medicine, Johannes Gutenberg-University, University Medical Center, Langenbeckstr. 1, 55131 Mainz, Germany; Cluster of Excellence Balance of the Microverse, Friedrich-Schiller-University, 07743 Jena, Germany.
| |
Collapse
|
53
|
Langella E, Buonanno M, De Simone G, Monti SM. Intrinsically disordered features of carbonic anhydrase IX proteoglycan-like domain. Cell Mol Life Sci 2021; 78:2059-2067. [PMID: 33201250 PMCID: PMC11072538 DOI: 10.1007/s00018-020-03697-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2020] [Revised: 09/26/2020] [Accepted: 10/31/2020] [Indexed: 12/25/2022]
Abstract
hCA IX is a multi-domain protein belonging to the family of hCAs which are ubiquitous zinc enzymes that catalyze the reversible hydration of CO2 to HCO3- and H+. hCA IX is a tumor-associated enzyme with a limited distribution in normal tissues, but over-expressed in many tumors, and is a promising drug target. Although many studies concerning the CA IX catalytic domain were performed, little is known about the proteoglycan-like (PG-like) domain of hCA IX which has been poorly investigated so far. Here we attempt to fill this gap by providing an overview on the functional, structural and therapeutic studies of the PG-like domain of hCA IX which represents a unique feature within the CA family. The main studies and recent advances concerning PG role in modulating hCA IX catalytic activity as well as in tumor spreading and migration are here reported. Special attention has been paid to the newly discovered disordered features of the PG domain which open new perspectives about its molecular mechanisms of action under physiological and pathological conditions, since disorder is likely involved in mediating interactions with partner proteins. The emerged disordered features of PG domain will be explored for putative diagnostic and therapeutic applications involving CA IX targeting in tumors.
Collapse
Affiliation(s)
- Emma Langella
- Institute of Biostructures and Bioimaging, CNR, via Mezzocannone, 16, 80134, Naples, Italy.
| | - Martina Buonanno
- Institute of Biostructures and Bioimaging, CNR, via Mezzocannone, 16, 80134, Naples, Italy
| | - Giuseppina De Simone
- Institute of Biostructures and Bioimaging, CNR, via Mezzocannone, 16, 80134, Naples, Italy
| | - Simona Maria Monti
- Institute of Biostructures and Bioimaging, CNR, via Mezzocannone, 16, 80134, Naples, Italy.
| |
Collapse
|
54
|
Jagannathan NS, Hogue CWV, Tucker-Kellogg L. Computational modeling suggests binding-induced expansion of Epsin disordered regions upon association with AP2. PLoS Comput Biol 2021; 17:e1008474. [PMID: 33406091 PMCID: PMC7787433 DOI: 10.1371/journal.pcbi.1008474] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Accepted: 10/27/2020] [Indexed: 11/22/2022] Open
Abstract
Intrinsically disordered regions (IDRs) are prevalent in the eukaryotic proteome. Common functional roles of IDRs include forming flexible linkers or undergoing allosteric folding-upon-binding. Recent studies have suggested an additional functional role for IDRs: generating steric pressure on the plasma membrane during endocytosis, via molecular crowding. However, in order to accomplish useful functions, such crowding needs to be regulated in space (e.g., endocytic hotspots) and time (e.g., during vesicle formation). In this work, we explore binding-induced regulation of IDR steric volume. We simulate the IDRs of two proteins from Clathrin-mediated endocytosis (CME) to see if their conformational spaces are regulated via binding-induced expansion. Using Monte-Carlo computational modeling of excluded volumes, we generate large conformational ensembles (3 million) for the IDRs of Epsin and Eps15 and dock the conformers to the alpha subunit of Adaptor Protein 2 (AP2α), their CME binding partner. Our results show that as more molecules of AP2α are bound, the Epsin-derived ensemble shows a significant increase in global dimensions, measured as the radius of Gyration (RG) and the end-to-end distance (EED). Unlike Epsin, Eps15-derived conformers that permit AP2α binding at one motif were found to be more likely to accommodate binding of AP2α at other motifs, suggesting a tendency toward co-accessibility of binding motifs. Co-accessibility was not observed for any pair of binding motifs in Epsin. Thus, we speculate that the disordered regions of Epsin and Eps15 perform different roles during CME, with accessibility in Eps15 allowing it to act as a recruiter of AP2α molecules, while binding-induced expansion of the Epsin disordered region could impose steric pressure and remodel the plasma membrane during vesicle formation. Protein functions were originally believed to arise from ordered protein structures. This dogma was later challenged by the identification of intrinsically disordered proteins that lack specific structure. The functional roles of such proteins usually fell in two categories–exploiting the disorder for flexibility (like floppy connector), or imposing order upon binding to an external partner. In this study we explore the possibility of an alternative mechanism that harnesses disorder for function through regulated molecular crowding. Specifically, we use modeling to study two proteins involved in reshaping the cell membrane, Epsin and Eps15. We ask if they undergo binding-induced expansion, where binding of an external partner AP2 causes not a transition toward order, but rather an energetically favorable increase in propensity to occupy larger volumes. Our results show that Epsin tends to occupy a larger volume when bound to AP2, consistent with increased molecular crowding, which could help reshape the cell membrane. Such regulation of disorder via binding (without folding) opens hitherto unexplored avenues that cells might employ to harness disorder.
Collapse
Affiliation(s)
- N. Suhas Jagannathan
- Cancer & Stem Cell Biology, and Centre for Computational Biology, Duke-NUS Medical School, 8 College Road, Singapore
- Singapore-MIT Alliance, Computation and Systems Biology Program, National University of Singapore, Singapore
| | - Christopher W. V. Hogue
- Singapore-MIT Alliance, Computation and Systems Biology Program, National University of Singapore, Singapore
- Mechanobiology Institute, National University of Singapore, Singapore
| | - Lisa Tucker-Kellogg
- Cancer & Stem Cell Biology, and Centre for Computational Biology, Duke-NUS Medical School, 8 College Road, Singapore
- Singapore-MIT Alliance, Computation and Systems Biology Program, National University of Singapore, Singapore
- * E-mail:
| |
Collapse
|
55
|
Salladini E, Jørgensen MLM, Theisen FF, Skriver K. Intrinsic Disorder in Plant Transcription Factor Systems: Functional Implications. Int J Mol Sci 2020; 21:E9755. [PMID: 33371315 PMCID: PMC7767404 DOI: 10.3390/ijms21249755] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Revised: 12/17/2020] [Accepted: 12/18/2020] [Indexed: 01/07/2023] Open
Abstract
Eukaryotic cells are complex biological systems that depend on highly connected molecular interaction networks with intrinsically disordered proteins as essential components. Through specific examples, we relate the conformational ensemble nature of intrinsic disorder (ID) in transcription factors to functions in plants. Transcription factors contain large regulatory ID-regions with numerous orphan sequence motifs, representing potential important interaction sites. ID-regions may affect DNA-binding through electrostatic interactions or allosterically as for the bZIP transcription factors, in which the DNA-binding domains also populate ensembles of dynamic transient structures. The flexibility of ID is well-suited for interaction networks requiring efficient molecular adjustments. For example, Radical Induced Cell Death1 depends on ID in transcription factors for its numerous, structurally heterogeneous interactions, and the JAZ:MYC:MED15 regulatory unit depends on protein dynamics, including binding-associated unfolding, for regulation of jasmonate-signaling. Flexibility makes ID-regions excellent targets of posttranslational modifications. For example, the extent of phosphorylation of the NAC transcription factor SOG1 regulates target gene expression and the DNA-damage response, and phosphorylation of the AP2/ERF transcription factor DREB2A acts as a switch enabling heat-regulated degradation. ID-related phase separation is emerging as being important to transcriptional regulation with condensates functioning in storage and inactivation of transcription factors. The applicative potential of ID-regions is apparent, as removal of an ID-region of the AP2/ERF transcription factor WRI1 affects its stability and consequently oil biosynthesis. The highlighted examples show that ID plays essential functional roles in plant biology and has a promising potential in engineering.
Collapse
Affiliation(s)
| | | | | | - Karen Skriver
- REPIN and the Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, DK-2200 Copenhagen, Denmark; (E.S.); (M.L.M.J.); (F.F.T.)
| |
Collapse
|
56
|
Vinterhalter G, Kovačević JJ, Uversky VN, Pavlović-Lažetić GM. Bioinformatics analysis of correlation between protein function and intrinsic disorder. Int J Biol Macromol 2020; 167:446-456. [PMID: 33278435 DOI: 10.1016/j.ijbiomac.2020.11.211] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2020] [Revised: 11/20/2020] [Accepted: 11/30/2020] [Indexed: 10/22/2022]
Abstract
The correlation of molecular function and protein intrinsic disorder is an important aspect of understanding the relationship between function, sequence and structure. This research was inspired by statistical correlation evaluation method described by Xie et al. (J Proteome Res 6 (2007) 1882-1898, reference study), where the authors analyzed the relationship between structure and function of proteins from Swiss-Prot database and where these functions were described with Swiss-Prot function keywords. In this research, we investigated whether the conclusions from the reference study stand for another dataset with richer functional annotation. We used CAFA3 challenge training dataset where the function was described with terms from Gene Ontology (GO terms). In order to compare the results with the previous work, we associated the GO terms with the corresponding Swiss-Prot function keywords. The results were compared with the reference study by first repeating the analysis with Swiss-Prot function keywords and then by GO terms. We used PONDR VSL2b disorder predictor to label over 66,000 CAFA3 proteins as putatively disordered or ordered. Out of 186 Swiss-Prot keywords (belonging to molecular function type) with more than 20 annotated proteins, we found 47 to be highly order related and 44 highly disorder related. Using the same dataset and annotation constraints, out of 1781 GO term (belonging to molecular function type), we found 746 to be highly order related and 564 highly disorder related. GO term results are presented as interactive graphs displaying complex hierarchical structure of Gene Ontology. Comparison of two functional annotations, GO and Swiss-Prot keywords, showed consistent results in cases when it was possible to map a Swiss-Prot keyword to a corresponding GO term. Because of the small number of such cases, we propose a new method for deriving the missing mappings between Swiss-Prot keywords and GO terms with the highest likelihood by measuring similarity (Jaccard index) between sets of protein annotated by different functions. Comparison with results from the reference study revealed prevalence of binding related functions (disorder related) in the current dataset even though the same functions were not present in previous results.
Collapse
Affiliation(s)
- Goran Vinterhalter
- Agilent Technologies Belgium NV, De Kleetlaan 5/bus 9, 1831 Diegem, Belgium
| | - Jovana J Kovačević
- Faculty of Mathematics, University of Belgrade, Studentski trg 16, 11000 Belgrade, Serbia.
| | - Vladimir N Uversky
- Department of Molecular Medicine, USFHealth Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, United States of America.
| | | |
Collapse
|
57
|
Tsang B, Pritišanac I, Scherer SW, Moses AM, Forman-Kay JD. Phase Separation as a Missing Mechanism for Interpretation of Disease Mutations. Cell 2020; 183:1742-1756. [DOI: 10.1016/j.cell.2020.11.050] [Citation(s) in RCA: 69] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2020] [Revised: 11/04/2020] [Accepted: 11/25/2020] [Indexed: 02/08/2023]
|
58
|
Chong S, Mir M. Towards Decoding the Sequence-Based Grammar Governing the Functions of Intrinsically Disordered Protein Regions. J Mol Biol 2020; 433:166724. [PMID: 33248138 DOI: 10.1016/j.jmb.2020.11.023] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2020] [Revised: 11/14/2020] [Accepted: 11/19/2020] [Indexed: 01/03/2023]
Abstract
A substantial portion of the proteome consists of intrinsically disordered regions (IDRs) that do not fold into well-defined 3D structures yet perform numerous biological functions and are associated with a broad range of diseases. It has been a long-standing enigma how different IDRs successfully execute their specific functions. Further putting a spotlight on IDRs are recent discoveries of functionally relevant biomolecular assemblies, which in some cases form through liquid-liquid phase separation. At the molecular level, the formation of biomolecular assemblies is largely driven by weak, multivalent, but selective IDR-IDR interactions. Emerging experimental and computational studies suggest that the primary amino acid sequences of IDRs encode a variety of their interaction behaviors. In this review, we focus on findings and insights that connect sequence-derived features of IDRs to their conformations, propensities to form biomolecular assemblies, selectivity of interaction partners, functions in the context of physiology and disease, and regulation of function. We also discuss directions of future research to facilitate establishing a comprehensive sequence-function paradigm that will eventually allow prediction of selective interactions and specificity of function mediated by IDRs.
Collapse
Affiliation(s)
- Shasha Chong
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA 94720, United States; The Howard Hughes Medical Institute, University of California Berkeley, Berkeley, CA 94720, United States.
| | - Mustafa Mir
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA 94720, United States
| |
Collapse
|
59
|
Uversky VN. Functions of short lifetime biological structures at large: the case of intrinsically disordered proteins. Brief Funct Genomics 2020; 19:60-68. [PMID: 29982297 DOI: 10.1093/bfgp/ely023] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Although for more than a century a protein function was intimately associated with the presence of unique structure in a protein molecule, recent years witnessed a skyrocket rise of the appreciation of protein intrinsic disorder concept that emphasizes the importance of the biologically active proteins without ordered structures. In different proteins, the depth and breadth of disorder penetrance are different, generating an amusing spatiotemporal heterogeneity of intrinsically disordered proteins (IDPs) and intrinsically disordered protein region regions (IDPRs), which are typically described as highly dynamic ensembles of rapidly interconverting conformations (or a multitude of short lifetime structures). IDPs/IDPRs constitute a substantial part of protein kingdom and have unique functions complementary to functional repertoires of ordered proteins. They are recognized as interaction specialists and global controllers that play crucial roles in regulation of functions of their binding partners and in controlling large biological networks. IDPs/IDPRs are characterized by immense binding promiscuity and are able to use a broad spectrum of binding modes, often resulting in the formation of short lifetime complexes. In their turn, functions of IDPs and IDPRs are controlled by various means, such as numerous posttranslational modifications and alternative splicing. Some of the functions of IDPs/IDPRs are briefly considered in this review to shed some light on the biological roles of short-lived structures at large.
Collapse
Affiliation(s)
- Vladimir N Uversky
- Department of Molecular Medicine, USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA and Laboratory of New Methods in Biology, Institute for Biological Instrumentation, Russian Academy of Sciences, 142290 Pushchino, Moscow Region, Russia
| |
Collapse
|
60
|
Pelham JF, Dunlap JC, Hurley JM. Intrinsic disorder is an essential characteristic of components in the conserved circadian circuit. Cell Commun Signal 2020; 18:181. [PMID: 33176800 PMCID: PMC7656774 DOI: 10.1186/s12964-020-00658-y] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2020] [Accepted: 09/06/2020] [Indexed: 12/12/2022] Open
Abstract
INTRODUCTION The circadian circuit, a roughly 24 h molecular feedback loop, or clock, is conserved from bacteria to animals and allows for enhanced organismal survival by facilitating the anticipation of the day/night cycle. With circadian regulation reportedly impacting as high as 80% of protein coding genes in higher eukaryotes, the protein-based circadian clock broadly regulates physiology and behavior. Due to the extensive interconnection between the clock and other cellular systems, chronic disruption of these molecular rhythms leads to a decrease in organismal fitness as well as an increase of disease rates in humans. Importantly, recent research has demonstrated that proteins comprising the circadian clock network display a significant amount of intrinsic disorder. MAIN BODY In this work, we focus on the extent of intrinsic disorder in the circadian clock and its potential mechanistic role in circadian timing. We highlight the conservation of disorder by quantifying the extent of computationally-predicted protein disorder in the core clock of the key eukaryotic circadian model organisms Drosophila melanogaster, Neurospora crassa, and Mus musculus. We further examine previously published work, as well as feature novel experimental evidence, demonstrating that the core negative arm circadian period drivers FREQUENCY (Neurospora crassa) and PERIOD-2 (PER2) (Mus musculus), possess biochemical characteristics of intrinsically disordered proteins. Finally, we discuss the potential contributions of the inherent biophysical principals of intrinsically disordered proteins that may explain the vital mechanistic roles they play in the clock to drive their broad evolutionary conservation in circadian timekeeping. CONCLUSION The pervasive conservation of disorder amongst the clock in the crown eukaryotes suggests that disorder is essential for optimal circadian timing from fungi to animals, providing vital homeostatic cellular maintenance and coordinating organismal physiology across phylogenetic kingdoms. Video abstract.
Collapse
Affiliation(s)
- Jacqueline F. Pelham
- Department of Biological Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180 USA
| | - Jay C. Dunlap
- Department of Molecular and Systems Biology, Geisel School of Medicine at Dartmouth, Hanover, NH 03755 USA
| | - Jennifer M. Hurley
- Department of Biological Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180 USA
- Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY 12018 USA
| |
Collapse
|
61
|
Koterniak B, Pilaka PP, Gracida X, Schneider LM, Pritišanac I, Zhang Y, Calarco JA. Global regulatory features of alternative splicing across tissues and within the nervous system of C. elegans. Genome Res 2020; 30:1766-1780. [PMID: 33127752 PMCID: PMC7706725 DOI: 10.1101/gr.267328.120] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2020] [Accepted: 10/28/2020] [Indexed: 12/27/2022]
Abstract
Alternative splicing plays a major role in shaping tissue-specific transcriptomes. Among the broad tissue types present in metazoans, the central nervous system contains some of the highest levels of alternative splicing. Although many documented examples of splicing differences between broad tissue types exist, there remains much to be understood about the splicing factors and the cis sequence elements controlling tissue and neuron subtype-specific splicing patterns. By using translating ribosome affinity purification coupled with deep-sequencing (TRAP-seq) in Caenorhabditis elegans, we have obtained high coverage profiles of ribosome-associated mRNA for three broad tissue classes (nervous system, muscle, and intestine) and two neuronal subtypes (dopaminergic and serotonergic neurons). We have identified hundreds of splice junctions that exhibit distinct splicing patterns between tissue types or within the nervous system. Alternative splicing events differentially regulated between tissues are more often frame-preserving, are more highly conserved across Caenorhabditis species, and are enriched in specific cis regulatory motifs, when compared with other types of exons. By using this information, we have identified a likely mechanism of splicing repression by the RNA-binding protein UNC-75/CELF via interactions with cis elements that overlap a 5′ splice site. Alternatively spliced exons also overlap more frequently with intrinsically disordered peptide regions than constitutive exons. Moreover, regulated exons are often shorter than constitutive exons but are flanked by longer intron sequences. Among these tissue-regulated exons are several highly conserved microexons <27 nt in length. Collectively, our results indicate a rich layer of tissue-specific gene regulation at the level of alternative splicing in C. elegans that parallels the evolutionary forces and constraints observed across metazoa.
Collapse
Affiliation(s)
- Bina Koterniak
- Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario M5S 3G5, Canada
| | - Pallavi P Pilaka
- Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario M5S 3G5, Canada
| | - Xicotencatl Gracida
- Department of Organismal and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138, USA
| | - Lisa-Marie Schneider
- Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario M5S 3G5, Canada.,Department of Chemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | - Iva Pritišanac
- Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario M5S 3G5, Canada.,Program in Molecular Medicine, The Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
| | - Yun Zhang
- Department of Organismal and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138, USA
| | - John A Calarco
- Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario M5S 3G5, Canada
| |
Collapse
|
62
|
Rodriguez JM, Pozo F, di Domenico T, Vazquez J, Tress ML. An analysis of tissue-specific alternative splicing at the protein level. PLoS Comput Biol 2020; 16:e1008287. [PMID: 33017396 PMCID: PMC7561204 DOI: 10.1371/journal.pcbi.1008287] [Citation(s) in RCA: 53] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2020] [Revised: 10/15/2020] [Accepted: 08/25/2020] [Indexed: 01/09/2023] Open
Abstract
The role of alternative splicing is one of the great unanswered questions in cellular biology. There is strong evidence for alternative splicing at the transcript level, and transcriptomics experiments show that many splice events are tissue specific. It has been suggested that alternative splicing evolved in order to remodel tissue-specific protein-protein networks. Here we investigated the evidence for tissue-specific splicing among splice isoforms detected in a large-scale proteomics analysis. Although the data supporting alternative splicing is limited at the protein level, clear patterns emerged among the small numbers of alternative splice events that we could detect in the proteomics data. More than a third of these splice events were tissue-specific and most were ancient: over 95% of splice events that were tissue-specific in both proteomics and RNAseq analyses evolved prior to the ancestors of lobe-finned fish, at least 400 million years ago. By way of contrast, three in four alternative exons in the human gene set arose in the primate lineage, so our results cannot be extrapolated to the whole genome. Tissue-specific alternative protein forms in the proteomics analysis were particularly abundant in nervous and muscle tissues and their genes had roles related to the cytoskeleton and either the structure of muscle fibres or cell-cell connections. Our results suggest that this conserved tissue-specific alternative splicing may have played a role in the development of the vertebrate brain and heart. We manually curated a set of 255 splice events detected in a large-scale tissue-based proteomics experiment and found that more than a third had evidence of significant tissue-specific differences. Events that were significantly tissue-specific at the protein level were highly conserved; almost 75% evolved over 400 million years ago. The tissues in which we found most evidence for tissue-specific splicing were nervous tissues and cardiac tissues. Genes with tissue-specific events in these two tissues had functions related to important cellular structures in brain and heart tissues. These splice events may have been essential for the development of vertebrate heart and muscle. However, our data set may not be representative of alternative exons as a whole. We found that most tissue specific splicing was strongly conserved, but just 5% of annotated alternative exons in the human gene set are ancient. More than three quarters of alternative exons are primate-derived. Although the analysis does not provide a definitive answer to the question of the functional role of alternative splicing, our results do indicate that alternative splice variants may have played a significant part in the evolution of brain and heart tissues in vertebrates.
Collapse
Affiliation(s)
- Jose Manuel Rodriguez
- Cardiovascular Proteomics Laboratory, Centro Nacional de Investigaciones Cardiovasculares (CNIC), Calle Melchor Fernandez, Madrid, Spain
| | - Fernando Pozo
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), Calle Melchor Fernandez, Madrid, Spain
| | - Tomas di Domenico
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), Calle Melchor Fernandez, Madrid, Spain
| | - Jesus Vazquez
- Cardiovascular Proteomics Laboratory, Centro Nacional de Investigaciones Cardiovasculares (CNIC), Calle Melchor Fernandez, Madrid, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Cardiovasculares (CIBERCV), Madrid, Spain
| | - Michael L. Tress
- Bioinformatics Unit, Spanish National Cancer Research Centre (CNIO), Calle Melchor Fernandez, Madrid, Spain
- * E-mail:
| |
Collapse
|
63
|
Livingstone I, Uversky VN, Furniss D, Wiberg A. The Pathophysiological Significance of Fibulin-3. Biomolecules 2020; 10:E1294. [PMID: 32911658 PMCID: PMC7563619 DOI: 10.3390/biom10091294] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2020] [Revised: 09/03/2020] [Accepted: 09/04/2020] [Indexed: 02/07/2023] Open
Abstract
Fibulin-3 (also known as EGF-containing fibulin extracellular matrix protein 1 (EFEMP1)) is a secreted extracellular matrix glycoprotein, encoded by the EFEMP1 gene that belongs to the eight-membered fibulin protein family. It has emerged as a functionally unique member of this family, with a diverse array of pathophysiological associations predominantly centered on its role as a modulator of extracellular matrix (ECM) biology. Fibulin-3 is widely expressed in the human body, especially in elastic-fibre-rich tissues and ocular structures, and interacts with enzymatic ECM regulators, including tissue inhibitor of metalloproteinase-3 (TIMP-3). A point mutation in EFEMP1 causes an inherited early-onset form of macular degeneration called Malattia Leventinese/Doyne honeycomb retinal dystrophy (ML/DHRD). EFEMP1 genetic variants have also been associated in genome-wide association studies with numerous complex inherited phenotypes, both physiological (namely, developmental anthropometric traits) and pathological (many of which involve abnormalities of connective tissue function). Furthermore, EFEMP1 expression changes are implicated in the progression of numerous types of cancer, an area in which fibulin-3 has putative significance as a therapeutic target. Here we discuss the potential mechanistic roles of fibulin-3 in these pathologies and highlight how it may contribute to the development, structural integrity, and emergent functionality of the ECM and connective tissues across a range of anatomical locations. Its myriad of aetiological roles positions fibulin-3 as a molecule of interest across numerous research fields and may inform our future understanding and therapeutic approach to many human diseases in clinical settings.
Collapse
Affiliation(s)
- Imogen Livingstone
- Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Botnar Research Centre, Nuffield Orthopaedic Centre, Oxford OX3 7LD, UK; (I.L.); (D.F.)
| | - Vladimir N. Uversky
- Laboratory of New Methods in Biology, Institute for Biological Instrumentation, Russian Academy of Sciences, Federal Research Center “Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences”, Pushchino 142290, Moscow Region, Russia;
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA
| | - Dominic Furniss
- Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Botnar Research Centre, Nuffield Orthopaedic Centre, Oxford OX3 7LD, UK; (I.L.); (D.F.)
- Department of Plastic and Reconstructive Surgery, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford OX3 9DU, UK
| | - Akira Wiberg
- Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Botnar Research Centre, Nuffield Orthopaedic Centre, Oxford OX3 7LD, UK; (I.L.); (D.F.)
- Department of Plastic and Reconstructive Surgery, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford OX3 9DU, UK
| |
Collapse
|
64
|
Seiffert P, Bugge K, Nygaard M, Haxholm GW, Martinsen JH, Pedersen MN, Arleth L, Boomsma W, Kragelund BB. Orchestration of signaling by structural disorder in class 1 cytokine receptors. Cell Commun Signal 2020; 18:132. [PMID: 32831102 PMCID: PMC7444064 DOI: 10.1186/s12964-020-00626-6] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2020] [Accepted: 07/08/2020] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Class 1 cytokine receptors (C1CRs) are single-pass transmembrane proteins responsible for transmitting signals between the outside and the inside of cells. Remarkably, they orchestrate key biological processes such as proliferation, differentiation, immunity and growth through long disordered intracellular domains (ICDs), but without having intrinsic kinase activity. Despite these key roles, their characteristics remain rudimentarily understood. METHODS The current paper asks the question of why disorder has evolved to govern signaling of C1CRs by reviewing the literature in combination with new sequence and biophysical analyses of chain properties across the family. RESULTS We uncover that the C1CR-ICDs are fully disordered and brimming with SLiMs. Many of these short linear motifs (SLiMs) are overlapping, jointly signifying a complex regulation of interactions, including network rewiring by isoforms. The C1CR-ICDs have unique properties that distinguish them from most IDPs and we forward the perception that the C1CR-ICDs are far from simple strings with constitutively bound kinases. Rather, they carry both organizational and operational features left uncovered within their disorder, including mechanisms and complexities of regulatory functions. CONCLUSIONS Critically, the understanding of the fascinating ability of these long, completely disordered chains to orchestrate complex cellular signaling pathways is still in its infancy, and we urge a perceptional shift away from the current simplistic view towards uncovering their full functionalities and potential. Video abstract.
Collapse
Affiliation(s)
- Pernille Seiffert
- REPIN, Department of Biology, University of Copenhagen, Ole Maaloes Vej 5, DK-2200 Copenhagen N, Denmark
- Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Ole Maaloes Vej 5, DK-2200 Copenhagen N, Denmark
| | - Katrine Bugge
- REPIN, Department of Biology, University of Copenhagen, Ole Maaloes Vej 5, DK-2200 Copenhagen N, Denmark
- Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Ole Maaloes Vej 5, DK-2200 Copenhagen N, Denmark
| | - Mads Nygaard
- REPIN, Department of Biology, University of Copenhagen, Ole Maaloes Vej 5, DK-2200 Copenhagen N, Denmark
- Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Ole Maaloes Vej 5, DK-2200 Copenhagen N, Denmark
| | - Gitte W. Haxholm
- REPIN, Department of Biology, University of Copenhagen, Ole Maaloes Vej 5, DK-2200 Copenhagen N, Denmark
- Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Ole Maaloes Vej 5, DK-2200 Copenhagen N, Denmark
| | - Jacob H. Martinsen
- REPIN, Department of Biology, University of Copenhagen, Ole Maaloes Vej 5, DK-2200 Copenhagen N, Denmark
- Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Ole Maaloes Vej 5, DK-2200 Copenhagen N, Denmark
| | - Martin N. Pedersen
- Niels Bohr Institute, University of Copenhagen, Blegdamsvej 17, 2100 Copenhagen Ø, Denmark
| | - Lise Arleth
- Niels Bohr Institute, University of Copenhagen, Blegdamsvej 17, 2100 Copenhagen Ø, Denmark
| | - Wouter Boomsma
- Department of Computer Science, University of Copenhagen, Universitetsparken 1, 2100 Copenhagen Ø, Denmark
| | - Birthe B. Kragelund
- REPIN, Department of Biology, University of Copenhagen, Ole Maaloes Vej 5, DK-2200 Copenhagen N, Denmark
- Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Ole Maaloes Vej 5, DK-2200 Copenhagen N, Denmark
| |
Collapse
|
65
|
Weisz J, Uversky VN. Zooming into the Dark Side of Human Annexin-S100 Complexes: Dynamic Alliance of Flexible Partners. Int J Mol Sci 2020; 21:ijms21165879. [PMID: 32824294 PMCID: PMC7461550 DOI: 10.3390/ijms21165879] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Revised: 08/10/2020] [Accepted: 08/13/2020] [Indexed: 02/06/2023] Open
Abstract
Annexins and S100 proteins form two large families of Ca2+-binding proteins. They are quite different both structurally and functionally, with S100 proteins being small (10–12 kDa) acidic regulatory proteins from the EF-hand superfamily of Ca2+-binding proteins, and with annexins being at least three-fold larger (329 ± 12 versus 98 ± 7 residues) and using non-EF-hand-based mechanism for calcium binding. Members of both families have multiple biological roles, being able to bind to a large cohort of partners and possessing a multitude of functions. Furthermore, annexins and S100 proteins can interact with each other in either a Ca2+-dependent or Ca2+-independent manner, forming functional annexin-S100 complexes. Such functional polymorphism and binding indiscrimination are rather unexpected, since structural information is available for many annexins and S100 proteins, which therefore are considered as ordered proteins that should follow the classical “one protein–one structure–one function” model. On the other hand, the ability to be engaged in a wide range of interactions with multiple, often unrelated, binding partners and possess multiple functions represent characteristic features of intrinsically disordered proteins (IDPs) and intrinsically disordered protein regions (IDPRs); i.e., functional proteins or protein regions lacking unique tertiary structures. The aim of this paper is to provide an overview of the functional roles of human annexins and S100 proteins, and to use the protein intrinsic disorder perspective to explain their exceptional multifunctionality and binding promiscuity.
Collapse
Affiliation(s)
- Judith Weisz
- Departments of Gynecology and Pathology, Pennsylvania State University College of Medicine, Hershey, PA 17033, USA;
| | - Vladimir N. Uversky
- Institute for Biological Instrumentation of the Russian Academy of Sciences, Federal Research Center “Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences”, Pushchino, 142290 Moscow, Russia
- Department of Molecular Medicine and USF Health Byrd Alzheimer’s Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA
- Correspondence: ; Tel.: +1-813-974-5816 (ext. 123); Fax: +1-813-974-7357
| |
Collapse
|
66
|
Mínguez-Toral M, Cuevas-Zuviría B, Garrido-Arandia M, Pacios LF. A computational structural study on the DNA-protecting role of the tardigrade-unique Dsup protein. Sci Rep 2020; 10:13424. [PMID: 32770133 PMCID: PMC7414916 DOI: 10.1038/s41598-020-70431-1] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2020] [Accepted: 07/29/2020] [Indexed: 01/10/2023] Open
Abstract
The remarkable ability of tardigrades to withstand a wide range of physical and chemical extremes has attracted a considerable interest in these small invertebrates, with a particular focus on the protective roles of proteins expressed during such conditions. The discovery that a tardigrade-unique protein named Dsup (damage suppressor) protects DNA from damage produced by radiation and radicals, has raised expectations concerning its potential applications in biotechnology and medicine. We present in this paper what might be dubbed a “computational experiment” on the Dsup-DNA system. By means of molecular modelling, calculations of electrostatic potentials and electric fields, and all-atom molecular dynamics simulations, we obtained a dynamic picture of the Dsup-DNA interaction. Our results suggest that the protein is intrinsically disordered, which enables Dsup to adjust its structure to fit DNA shape. Strong electrostatic attractions and high protein flexibility drive the formation of a molecular aggregate in which Dsup shields DNA. While the precise mechanism of DNA protection conferred by Dsup remains to be elucidated, our study provides some molecular clues of their association that could be of interest for further investigation in this line.
Collapse
Affiliation(s)
- Marina Mínguez-Toral
- Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM)-Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, Pozuelo de Alarcón, 28223, Madrid, Spain
| | - Bruno Cuevas-Zuviría
- Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM)-Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, Pozuelo de Alarcón, 28223, Madrid, Spain
| | - María Garrido-Arandia
- Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM)-Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, Pozuelo de Alarcón, 28223, Madrid, Spain
| | - Luis F Pacios
- Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), Universidad Politécnica de Madrid (UPM)-Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA), Campus de Montegancedo-UPM, Pozuelo de Alarcón, 28223, Madrid, Spain. .,Departamento de Biotecnología-Biología Vegetal, Escuela Técnica Superior de Ingeniería Agronómica, Alimentaria y de Biosistemas (ETSIAAB), Universidad Politécnica de Madrid (UPM), 28040, Madrid, Spain.
| |
Collapse
|
67
|
Avelar GST, Gonçalves LO, Guimarães FG, Guimarães PAS, do Nascimento Rocha LG, Carvalho MGR, de Melo Resende D, Ruiz JC. Diversity and genome mapping assessment of disordered and functional domains in trypanosomatids. J Proteomics 2020; 227:103919. [PMID: 32721629 DOI: 10.1016/j.jprot.2020.103919] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Revised: 06/27/2020] [Accepted: 07/20/2020] [Indexed: 12/20/2022]
Abstract
The proteins that have structural disorder exemplify a class of proteins which is part of a new frontier in structural biology that demands a new understanding of the paradigm of structure/function correlations. In order to address the location, relative distances and the functional/structural correlation between disordered and conserved domains, consensus disordered predictions were mapped together with CDD domains in Leishmania braziliensis M2904, Leishmania infantum JPCM5, Trypanosoma cruzi CL-Brener Esmeraldo-like, Trypanosoma cruzi Dm28c, Trypanosoma cruzi Sylvio X10, Blechomonas ayalai B08-376 and Paratrypanosoma confusum CUL13 predicted proteomes. Our results depicts the role of protein disorder in key aspects of parasites biology highlighting: a) statistical significant association between genome structural location of protein disordered consensus stretches and functional domains; b) that disordered protein stretches appear in greater percentage at upstream or downstream position of the predicted domain; c) a possible role of structural disorder in several gene expression, control points that includes but are not limited to: i) protein folding; ii) protein transport and degradation; and iii) protein modification. In addition, for values of protein with disorder content greater than 40%, a small percentage of protein binding sites in IDPs/IDRs, a higher hypothetical protein annotation frequency was observed than expected by chance and trypanosomatid multigene families linked with virulence are rich in protein with disorder content. SIGNIFICANCE: T. cruzi and Leishmania spp are the etiological agents of Chagas disease and leishmaniasis, respectively. Currently, no vaccine or effective drug treatment is available against these neglected diseases and the knowledge about the post-transcriptional and post-translational mechanisms of these organisms, which are key for this scenario, remain scarce. This study depicts the potential impact of the proximity between protein structural disorder and functional domains in the post-transcriptional regulation of pathogenic versus human non-pathogenic trypanosomatids. Our results revealed a significant statistical relationship between the genome structural locations of these two variables and disordered regions appearing more frequently at upstream or downstream positions of the CDD locus domain. This flexibility feature would maintain structural accessibility of functional sites for post-translational modifications, shedding light into this important aspect of parasite biology. This hypothesis is corroborated by the functional enrichment analysis of disordered proteins subset that highlight the involvement of this class of proteins in protein folding, protein transport and degradation and protein modification. Furthermore, our results pointed out: a) the impact of protein disorder in the process of genome annotation (proteins tend to be annotated as hypothetical when the disorder content reaches ~40%); b) that trypanosomatid multigenic families linked with virulence have a key protein disorder content.
Collapse
Affiliation(s)
- Grace Santos Tavares Avelar
- Programa de Pós-graduação em Biologia Computacional e Sistemas, Instituto Oswaldo Cruz, Fiocruz, Rio de Janeiro, RJ, Brazil; Grupo Informática de Biossistemas, Instituto René Rachou, Fiocruz Minas, Belo Horizonte, MG, Brazil
| | - Leilane Oliveira Gonçalves
- Programa de Pós-graduação em Biologia Computacional e Sistemas, Instituto Oswaldo Cruz, Fiocruz, Rio de Janeiro, RJ, Brazil; Grupo Informática de Biossistemas, Instituto René Rachou, Fiocruz Minas, Belo Horizonte, MG, Brazil
| | - Frederico Gonçalves Guimarães
- Programa de Pós-graduação em Ciências da Saúde, Instituto René Rachou, Fiocruz Minas, Belo Horizonte, MG, Brazil; Grupo Informática de Biossistemas, Instituto René Rachou, Fiocruz Minas, Belo Horizonte, MG, Brazil
| | - Paul Anderson Souza Guimarães
- Programa de Pós-graduação em Biologia Computacional e Sistemas, Instituto Oswaldo Cruz, Fiocruz, Rio de Janeiro, RJ, Brazil; Grupo Informática de Biossistemas, Instituto René Rachou, Fiocruz Minas, Belo Horizonte, MG, Brazil
| | - Luiz Gustavo do Nascimento Rocha
- Programa de Pós-graduação em Ciências da Saúde, Instituto René Rachou, Fiocruz Minas, Belo Horizonte, MG, Brazil; Grupo Informática de Biossistemas, Instituto René Rachou, Fiocruz Minas, Belo Horizonte, MG, Brazil
| | | | - Daniela de Melo Resende
- Programa de Pós-graduação em Biologia Computacional e Sistemas, Instituto Oswaldo Cruz, Fiocruz, Rio de Janeiro, RJ, Brazil; Programa de Pós-graduação em Ciências da Saúde, Instituto René Rachou, Fiocruz Minas, Belo Horizonte, MG, Brazil; Grupo Informática de Biossistemas, Instituto René Rachou, Fiocruz Minas, Belo Horizonte, MG, Brazil
| | - Jeronimo Conceição Ruiz
- Programa de Pós-graduação em Biologia Computacional e Sistemas, Instituto Oswaldo Cruz, Fiocruz, Rio de Janeiro, RJ, Brazil; Programa de Pós-graduação em Ciências da Saúde, Instituto René Rachou, Fiocruz Minas, Belo Horizonte, MG, Brazil; Grupo Informática de Biossistemas, Instituto René Rachou, Fiocruz Minas, Belo Horizonte, MG, Brazil.
| |
Collapse
|
68
|
Sulakhe D, D'Souza M, Wang S, Balasubramanian S, Athri P, Xie B, Canzar S, Agam G, Gilliam TC, Maltsev N. Exploring the functional impact of alternative splicing on human protein isoforms using available annotation sources. Brief Bioinform 2020; 20:1754-1768. [PMID: 29931155 DOI: 10.1093/bib/bby047] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2018] [Revised: 05/02/2018] [Indexed: 12/30/2022] Open
Abstract
In recent years, the emphasis of scientific inquiry has shifted from whole-genome analyses to an understanding of cellular responses specific to tissue, developmental stage or environmental conditions. One of the central mechanisms underlying the diversity and adaptability of the contextual responses is alternative splicing (AS). It enables a single gene to encode multiple isoforms with distinct biological functions. However, to date, the functions of the vast majority of differentially spliced protein isoforms are not known. Integration of genomic, proteomic, functional, phenotypic and contextual information is essential for supporting isoform-based modeling and analysis. Such integrative proteogenomics approaches promise to provide insights into the functions of the alternatively spliced protein isoforms and provide high-confidence hypotheses to be validated experimentally. This manuscript provides a survey of the public databases supporting isoform-based biology. It also presents an overview of the potential global impact of AS on the human canonical gene functions, molecular interactions and cellular pathways.
Collapse
Affiliation(s)
- Dinanath Sulakhe
- Department of Human Genetics, University of Chicago, 920 E. 58th Street, Chicago, IL, USA.,Computation Institute, University of Chicago, 5735 S. Ellis Avenue, Chicago, IL, USA
| | - Mark D'Souza
- Department of Human Genetics, University of Chicago, 920 E. 58th Street, Chicago, IL, USA
| | - Sheng Wang
- Department of Human Genetics, University of Chicago, 920 E. 58th Street, Chicago, IL, USA.,Toyota Technological Institute at Chicago, 6045 S. Kenwood Avenue, Chicago, IL, USA
| | - Sandhya Balasubramanian
- Department of Human Genetics, University of Chicago, 920 E. 58th Street, Chicago, IL, USA.,Genentech, Inc. 1 DNA Way, Mail Stop: 35-6J, South San Francisco, CA, USA
| | - Prashanth Athri
- Department of Computer Science and Engineering, Amrita School of Engineering, Bengaluru, Amrita Vishwa Vidyapeetham, Kasavanahalli, Carmelaram P.O., Bengaluru, Karnataka, India
| | - Bingqing Xie
- Department of Human Genetics, University of Chicago, 920 E. 58th Street, Chicago, IL, USA.,Department of Computer Science, Illinois Institute of Technology, Chicago, IL, USA
| | - Stefan Canzar
- Toyota Technological Institute at Chicago, 6045 S. Kenwood Avenue, Chicago, IL, USA.,Gene Center, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Gady Agam
- Department of Computer Science, Illinois Institute of Technology, Chicago, IL, USA
| | - T Conrad Gilliam
- Department of Human Genetics, University of Chicago, 920 E. 58th Street, Chicago, IL, USA.,Computation Institute, University of Chicago, 5735 S. Ellis Avenue, Chicago, IL, USA
| | - Natalia Maltsev
- Department of Human Genetics, University of Chicago, 920 E. 58th Street, Chicago, IL, USA.,Computation Institute, University of Chicago, 5735 S. Ellis Avenue, Chicago, IL, USA
| |
Collapse
|
69
|
Dayhoff GW, Regenmortel MHV, Uversky VN. Intrinsic disorder in protein sense‐antisense recognition. J Mol Recognit 2020; 33:e2868. [DOI: 10.1002/jmr.2868] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2020] [Revised: 05/04/2020] [Accepted: 05/18/2020] [Indexed: 01/03/2023]
Affiliation(s)
- Guy W. Dayhoff
- Department of Chemistry, College of Art and SciencesUniversity of South Florida Tampa Florida USA
| | | | - Vladimir N. Uversky
- Laboratory of New Methods in BiologyInstitute for Biological Instrumentation of the Russian Academy of Sciences, Federal Research Center “Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences” Pushchino Russia
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research InstituteMorsani College of Medicine, University of South Florida Tampa Florida USA
| |
Collapse
|
70
|
Genomic Analysis of Intrinsically Disordered Proteins in the Genus Camelus. Int J Mol Sci 2020; 21:ijms21114010. [PMID: 32503351 PMCID: PMC7312968 DOI: 10.3390/ijms21114010] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2020] [Revised: 05/14/2020] [Accepted: 05/18/2020] [Indexed: 12/11/2022] Open
Abstract
Intrinsically disordered proteins/regions (IDPs/IDRs) fail to fold completely into 3D structures, but have major roles in determining protein function. While natively disordered proteins/regions have been found to fulfill a wide variety of primary cellular roles, the functions of many disordered proteins in numerous species remain to be uncovered. Here, we perform the first large-scale study of IDPs/IDRs in the genus Camelus, one of the most important mammalians in Asia and North Africa, in order to explore the biological roles of these proteins. The study includes the prediction of disordered proteins/regions in Camelus species and in humans using multiple state-of-the-art prediction tools. Additionally, we provide a comparative analysis of Camelus and Homo sapiens IDPs/IDRs for the sake of highlighting the distinctive use of disorder in each genus. Our findings indicate that the human proteome is more disordered than the Camelus proteome. Gene Ontology analysis also revealed that Camelus IDPs are enriched in glutathione catabolism and lactose biosynthesis.
Collapse
|
71
|
Yan J, Cheng J, Kurgan L, Uversky VN. Structural and functional analysis of "non-smelly" proteins. Cell Mol Life Sci 2020; 77:2423-2440. [PMID: 31486849 PMCID: PMC11105052 DOI: 10.1007/s00018-019-03292-1] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Revised: 08/21/2019] [Accepted: 08/28/2019] [Indexed: 01/09/2023]
Abstract
Cysteine and aromatic residues are major structure-promoting residues. We assessed the abundance, structural coverage, and functional characteristics of the "non-smelly" proteins, i.e., proteins that do not contain cysteine residues (C-depleted) or cysteine and aromatic residues (CFYWH-depleted), across 817 proteomes from all domains of life. The analysis revealed that although these proteomes contained significant levels of the C-depleted proteins, with prokaryotes being significantly more enriched in such proteins than eukaryotes, the CFYWH-depleted proteins were relatively rare, accounting for about 0.05% of proteomes. Furthermore, CFYWH-depleted proteins were virtually never found in PDB. Depletion in cysteine and in aromatic residues was associated with the substantially increased intrinsic disorder levels across all domains of life. Archaeal and eukaryotic organisms with higher levels of the C-depleted proteins were shown to have higher levels of the intrinsic disorder and lower levels of structural coverage. We also showed that the "non-smelly" proteins typically did not independently fold into monomeric structures, and instead, they fold by interacting with nucleic acids as constituents of the ribosome and nucleosome complexes. They were shown to be involved in translation, transcription, nucleosome assembly, transmembrane transport, and protein folding functions, all of which are known to be associated with the intrinsic disorder. Our data suggested that, in general, structure of monomeric proteins is crucially dependent on the presence of cysteine and aromatic residues.
Collapse
Affiliation(s)
- Jing Yan
- Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Canada
| | - Jianlin Cheng
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, USA
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, 401 West Main Street, Room E4225, Richmond, VA, 23284, USA.
| | - Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, 12901 Bruce B. Downs Blvd., MDC07, Tampa, FL, 33612, USA.
- Protein Research Group, Institute for Biological Instrumentation of the Russian Academy of Sciences, 142290, Pushchino, Moscow Region, Russia.
| |
Collapse
|
72
|
Intrinsic Disorder in Tetratricopeptide Repeat Proteins. Int J Mol Sci 2020; 21:ijms21103709. [PMID: 32466138 PMCID: PMC7279152 DOI: 10.3390/ijms21103709] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Revised: 05/12/2020] [Accepted: 05/22/2020] [Indexed: 12/27/2022] Open
Abstract
Among the realm of repeat containing proteins that commonly serve as “scaffolds” promoting protein-protein interactions, there is a family of proteins containing between 2 and 20 tetratricopeptide repeats (TPRs), which are functional motifs consisting of 34 amino acids. The most distinguishing feature of TPR domains is their ability to stack continuously one upon the other, with these stacked repeats being able to affect interaction with binding partners either sequentially or in combination. It is known that many repeat-containing proteins are characterized by high levels of intrinsic disorder, and that many protein tandem repeats can be intrinsically disordered. Furthermore, it seems that TPR-containing proteins share many characteristics with hybrid proteins containing ordered domains and intrinsically disordered protein regions. However, there has not been a systematic analysis of the intrinsic disorder status of TPR proteins. To fill this gap, we analyzed 166 human TPR proteins to determine the degree to which proteins containing TPR motifs are affected by intrinsic disorder. Our analysis revealed that these proteins are characterized by different levels of intrinsic disorder and contain functional disordered regions that are utilized for protein-protein interactions and often serve as targets of various posttranslational modifications.
Collapse
|
73
|
de la Fuente L, Arzalluz-Luque Á, Tardáguila M, Del Risco H, Martí C, Tarazona S, Salguero P, Scott R, Lerma A, Alastrue-Agudo A, Bonilla P, Newman JRB, Kosugi S, McIntyre LM, Moreno-Manzano V, Conesa A. tappAS: a comprehensive computational framework for the analysis of the functional impact of differential splicing. Genome Biol 2020; 21:119. [PMID: 32423416 PMCID: PMC7236505 DOI: 10.1186/s13059-020-02028-w] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2019] [Accepted: 04/23/2020] [Indexed: 12/26/2022] Open
Abstract
Recent advances in long-read sequencing solve inaccuracies in alternative transcript identification of full-length transcripts in short-read RNA-Seq data, which encourages the development of methods for isoform-centered functional analysis. Here, we present tappAS, the first framework to enable a comprehensive Functional Iso-Transcriptomics (FIT) analysis, which is effective at revealing the functional impact of context-specific post-transcriptional regulation. tappAS uses isoform-resolved annotation of coding and non-coding functional domains, motifs, and sites, in combination with novel analysis methods to interrogate different aspects of the functional readout of transcript variants and isoform regulation. tappAS software and documentation are available at https://app.tappas.org.
Collapse
Affiliation(s)
- Lorena de la Fuente
- Genomics of Gene Expression Laboratory, Prince Felipe Research Center, Valencia, Spain
- Present Address: Bioinformatics Unit, IIS Fundación Jiménez Díaz, Madrid, Spain
| | - Ángeles Arzalluz-Luque
- Department of Statistics and Operational Research, Polytechnical University of Valencia, Valencia, Spain
| | - Manuel Tardáguila
- Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, FL, USA
- Present Address: Human Genetics Department, Wellcome Trust Sanger Institute, Hinxton, Cambridge, UK
| | - Héctor Del Risco
- Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, FL, USA
| | - Cristina Martí
- Genomics of Gene Expression Laboratory, Prince Felipe Research Center, Valencia, Spain
| | - Sonia Tarazona
- Department of Statistics and Operational Research, Polytechnical University of Valencia, Valencia, Spain
| | - Pedro Salguero
- Genomics of Gene Expression Laboratory, Prince Felipe Research Center, Valencia, Spain
| | - Raymond Scott
- Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, FL, USA
| | - Alberto Lerma
- Genomics of Gene Expression Laboratory, Prince Felipe Research Center, Valencia, Spain
| | - Ana Alastrue-Agudo
- Present Address: Human Genetics Department, Wellcome Trust Sanger Institute, Hinxton, Cambridge, UK
| | - Pablo Bonilla
- Present Address: Human Genetics Department, Wellcome Trust Sanger Institute, Hinxton, Cambridge, UK
| | - Jeremy R B Newman
- Genetics Institute, University of Florida, Gainesville, FL, USA
- Department of Pathology, University of Florida, Gainesville, FL, USA
| | - Shunichi Kosugi
- Genetics Institute, University of Florida, Gainesville, FL, USA
- Laboratory for Statistical and Translational Genetics, Center for Integrative Medical Sciences, RIKEN, Wako, Japan
| | - Lauren M McIntyre
- Genetics Institute, University of Florida, Gainesville, FL, USA
- Department of Molecular Genetics and Microbiology, University of Florida, Gainesville, FL, USA
| | | | - Ana Conesa
- Department of Microbiology and Cell Science, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, FL, USA.
- Genetics Institute, University of Florida, Gainesville, FL, USA.
| |
Collapse
|
74
|
Functional and structural features of proteins associated with alternative splicing. Int J Biol Macromol 2020; 147:513-520. [PMID: 31931065 DOI: 10.1016/j.ijbiomac.2019.09.241] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2019] [Revised: 09/16/2019] [Accepted: 09/21/2019] [Indexed: 12/16/2022]
Abstract
The alternative splicing is a mechanism increasing the number of expressed proteins and a variety of these functions. We uncovered the protein domains most frequently lacked or occurred in the splice variants. Proteins presented by several isoforms participate in such processes as transcription regulation, immune response, etc. Our results displayed the association of alternative splicing with branched regulatory pathways. By considering the published data on the protein proteins encoded by the 18th human chromosome, we noted that alternative products display the differences in several functional features, such as phosphorylation, subcellular location, ligand specificity, protein-protein interactions, etc. The investigation of alternative variants referred to the protein kinase domain was performed by comparing the alternative sequences with 3D structures. It was shown that large enough insertions/deletions could be compatible with the kinase fold if they match between the conserved secondary structures. Using the 3D data on human proteins, we showed that conformational flexibility could accommodate fold alterations in splice variants. The investigations of structural and functional differences in splice isoforms are required to understand how to distinguish the isoforms expressed as functioning proteins from the non-realized transcripts. These studies allow filling the gap between genomic and proteomic data.
Collapse
|
75
|
Bhat MY, Singh LR, Dar TA. Taurine Induces an Ordered but Functionally Inactive Conformation in Intrinsically Disordered Casein Proteins. Sci Rep 2020; 10:3503. [PMID: 32103094 PMCID: PMC7044306 DOI: 10.1038/s41598-020-60430-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Accepted: 01/31/2020] [Indexed: 11/30/2022] Open
Abstract
Intrinsically disordered proteins (IDPs) are involved in various important biological processes, such as cell signalling, transcription, translation, cell division regulation etc. Many IDPs need to maintain their disordered conformation for proper function. Osmolytes, natural organic compounds responsible for maintaining osmoregulation, have been believed to regulate the functional activity of macromolecules including globular proteins and IDPs due to their ability of modulating the macromolecular structure, conformational stability, and functional integrity. In the present study, we have investigated the effect of all classes of osmolytes on two model IDPs, α- and β-casein. It was observed that osmolytes can serve either as folding inducers or folding evaders. Folding evaders, in general, do not induce IDP folding and therefore had no significant effect on structural and functional integrity of IDPs. On the other hand, osmolytes taurine and TMAO serve as folding inducers by promoting structural collapse of IDPs that eventually leads to altered structural and functional integrity of IDPs. This study sheds light on the osmolyte-induced regulation of IDPs and their possible role in various disease pathologies.
Collapse
Affiliation(s)
- Mohd Younus Bhat
- Department of Clinical Biochemistry, University of Kashmir, Srinagar, J&K, 190006, India
| | | | - Tanveer Ali Dar
- Department of Clinical Biochemistry, University of Kashmir, Srinagar, J&K, 190006, India.
| |
Collapse
|
76
|
Phillips AH, Kriwacki RW. Intrinsic protein disorder and protein modifications in the processing of biological signals. Curr Opin Struct Biol 2020; 60:1-6. [DOI: 10.1016/j.sbi.2019.09.003] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2019] [Accepted: 09/04/2019] [Indexed: 12/15/2022]
|
77
|
Minde DP, Ramakrishna M, Lilley KS. Biotin proximity tagging favours unfolded proteins and enables the study of intrinsically disordered regions. Commun Biol 2020; 3:38. [PMID: 31969649 PMCID: PMC6976632 DOI: 10.1038/s42003-020-0758-y] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Accepted: 12/16/2019] [Indexed: 02/07/2023] Open
Abstract
Intrinsically Disordered Regions (IDRs) are enriched in disease-linked proteins known to have multiple post-translational modifications, but there is limited in vivo information about how locally unfolded protein regions contribute to biological functions. We reasoned that IDRs should be more accessible to targeted in vivo biotinylation than ordered protein regions, if they retain their flexibility in human cells. Indeed, we observed increased biotinylation density in predicted IDRs in several cellular compartments >20,000 biotin sites from four proximity proteomics studies. We show that in a biotin ‘painting’ time course experiment, biotinylation events in Escherichia coli ribosomes progress from unfolded and exposed regions at 10 s, to structured and less accessible regions after five minutes. We conclude that biotin proximity tagging favours sites of local disorder in proteins and suggest the possibility of using biotin painting as a method to gain unique insights into in vivo condition-dependent subcellular plasticity of proteins. David-Paul Minde, Manasa Ramakrishna et al. look at intrinsically disordered regions of disease-linked proteins in vivo by biotinylation. They show that biotin “painting” could be used as a method to examine the condition-dependent plasticity of proteins in vivo.
Collapse
Affiliation(s)
- David-Paul Minde
- Department of Biochemistry, Cambridge Centre for Proteomics, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QR, UK.
| | - Manasa Ramakrishna
- Medical Research Council Toxicology Unit, University of Cambridge, Lancaster Road, Leicester, LE1 9HN, UK
| | - Kathryn S Lilley
- Department of Biochemistry, Cambridge Centre for Proteomics, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QR, UK.
| |
Collapse
|
78
|
Oldfield CJ, Fan X, Wang C, Dunker AK, Kurgan L. Computational Prediction of Intrinsic Disorder in Protein Sequences with the disCoP Meta-predictor. Methods Mol Biol 2020; 2141:21-35. [PMID: 32696351 DOI: 10.1007/978-1-0716-0524-0_2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Intrinsically disordered proteins are either entirely disordered or contain disordered regions in their native state. These proteins and regions function without the prerequisite of a stable structure and were found to be abundant across all kingdoms of life. Experimental annotation of disorder lags behind the rapidly growing number of sequenced proteins, motivating the development of computational methods that predict disorder in protein sequences. DisCoP is a user-friendly webserver that provides accurate sequence-based prediction of protein disorder. It relies on meta-architecture in which the outputs generated by multiple disorder predictors are combined together to improve predictive performance. The architecture of disCoP is presented, and its accuracy relative to several other disorder predictors is briefly discussed. We describe usage of the web interface and explain how to access and read results generated by this computational tool. We also provide an example of prediction results and interpretation. The disCoP's webserver is publicly available at http://biomine.cs.vcu.edu/servers/disCoP/ .
Collapse
Affiliation(s)
| | - Xiao Fan
- Department of Pediatrics, Columbia University, New York, NY, USA
| | - Chen Wang
- Department of Medicine, Columbia University, New York, NY, USA
| | - A Keith Dunker
- Department of Biochemistry and Molecular Biology, Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, IN, USA
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA.
| |
Collapse
|
79
|
Oldfield CJ, Peng Z, Uversky VN, Kurgan L. Codon selection reduces GC content bias in nucleic acids encoding for intrinsically disordered proteins. Cell Mol Life Sci 2020; 77:149-160. [PMID: 31175370 PMCID: PMC11104855 DOI: 10.1007/s00018-019-03166-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2019] [Revised: 05/14/2019] [Accepted: 05/28/2019] [Indexed: 02/06/2023]
Abstract
Protein-coding nucleic acids exhibit composition and codon biases between sequences coding for intrinsically disordered regions (IDRs) and those coding for structured regions. IDRs are regions of proteins that are folding self-insufficient and which function without the prerequisite of folded structure. Several authors have investigated composition bias or codon selection in regions encoding for IDRs, primarily in Eukaryota, and concluded that elevated GC content is the result of the biased amino acid composition of IDRs. We substantively extend previous work by examining GC content in regions encoding IDRs, from 44 species in Eukaryota, Archaea, and Bacteria, spanning a wide range of GC content. We confirm that regions coding for IDRs show a significantly elevated GC content, even across all domains of life. Although this is largely attributable to the amino acid composition bias of IDRs, we show that this bias is independent of the overall GC content and, most importantly, we are the first to observe that GC content bias in IDRs is significantly different than expected from IDR amino acid composition alone. We empirically find compensatory codon selection that reduces the observed GC content bias in IDRs. This selection is dependent on the overall GC content of the organism. The codon selection bias manifests as use of infrequent, AT-rich codons in encoding IDRs. Further, we find these relationships to be independent of the intrinsic disorder prediction method used, and independent of estimated translation efficiency. These observations are consistent with the previous work, and we speculate on whether the observed biases are causal or symptomatic of other driving forces.
Collapse
Affiliation(s)
- Christopher J Oldfield
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, 23284, USA.
| | - Zhenling Peng
- Center for Applied Mathematics, Tianjin University, Tianjin, 300072, China
| | - Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, 33612, USA
- Institute for Biological Instrumentation, Russian Academy of Sciences, 142290, Pushchino, Moscow Region, Russia
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, 23284, USA.
| |
Collapse
|
80
|
Abstract
Intrinsically disordered regions (IDRs) are estimated to be highly abundant in nature. While only several thousand proteins are annotated with experimentally derived IDRs, computational methods can be used to predict IDRs for the millions of currently uncharacterized protein chains. Several dozen disorder predictors were developed over the last few decades. While some of these methods provide accurate predictions, unavoidably they also make some mistakes. Consequently, one of the challenges facing users of these methods is how to decide which predictions can be trusted and which are likely incorrect. This practical problem can be solved using quality assessment (QA) scores that predict correctness of the underlying (disorder) predictions at a residue level. We motivate and describe a first-of-its-kind toolbox of QA methods, QUARTER (QUality Assessment for pRotein inTrinsic disordEr pRedictions), which provides the scores for a diverse set of ten disorder predictors. QUARTER is available to the end users as a free and convenient webserver at http://biomine.cs.vcu.edu/servers/QUARTER/ . We briefly describe the predictive architecture of QUARTER and provide detailed instructions on how to use the webserver. We also explain how to interpret results produced by QUARTER with the help of a case study.
Collapse
|
81
|
Smithers B, Oates M, Gough J. 'Why genes in pieces?'-revisited. Nucleic Acids Res 2019; 47:4970-4973. [PMID: 30997511 PMCID: PMC6547436 DOI: 10.1093/nar/gkz284] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2019] [Revised: 04/05/2019] [Accepted: 04/15/2019] [Indexed: 02/04/2023] Open
Abstract
The alignment between the boundaries of protein domains and the boundaries of exons could provide evidence for the evolution of proteins via domain shuffling, but literature in the field has so far struggled to conclusively show this. Here, on larger data sets than previously possible, we do finally show that this phenomenon is indisputably found widely across the eukaryotic tree. In contrast, the alignment between exons and the boundaries of intrinsically disordered regions of proteins is not a general property of eukaryotes. Most interesting of all is the discovery that domain-exon alignment is much more common in recently evolved protein sequences than older ones.
Collapse
Affiliation(s)
- Ben Smithers
- Department of Computer Science, University of Bristol, Bristol BS8 1UB, UK
| | - Matt Oates
- Department of Computer Science, University of Bristol, Bristol BS8 1UB, UK
| | - Julian Gough
- Department of Computer Science, University of Bristol, Bristol BS8 1UB, UK.,MRC Laboratory of Molecular Biology, Cambridge CB2 0QH, UK
| |
Collapse
|
82
|
Zhang H, Mao R, Wang Y, Zhang L, Wang C, Lv S, Liu X, Wang Y, Ji W. Transcriptome-wide alternative splicing modulation during plant-pathogen interactions in wheat. PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2019; 288:110160. [PMID: 31521219 DOI: 10.1016/j.plantsci.2019.05.023] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/29/2018] [Revised: 03/11/2019] [Accepted: 05/29/2019] [Indexed: 05/07/2023]
Abstract
Alternative splicing (AS) enhances the diversities of both transcripts and proteins in eukaryotes, which contribute to stress adaptation. To catalog wheat (Triticum aestivum L.) AS genes, we characterized 45 RNA-seq libraries from wheat seedlings infected by powdery mildew, Blumeria graminis f. sp. tritici (Bgt) or stripe rust fungus, Puccinia striiformis f. sp. tritici (Pst). We discovered that 11.2% and 10.4% of the multiexon genes had AS transcripts during Bgt and Pst infections, respectively. In response to fungal infection, wheat modulated AS not only in disease resistance proteins, but also in splicing related factors. Apart from the stress induced or activated splicing variants by pathogen, the differential expression profiles were fold increased through changing the ratio of full spliced transcripts versus intron retention (IR) transcripts. Comparing AS transcripts produced by the same gene in Bgt with Pst stress, the spliced terminal exons and the stranded introns are independent and different. This demonstrated that differential induction of specific splice variants were activated against two fungal pathogens. The specific induced AS genes in the Pst-resistant plants were enriched in improving the membrane permeability and protein modification ability, whereas gene expression involved in protein translation and transport were strengthened in Pst-susceptible plants.
Collapse
Affiliation(s)
- Hong Zhang
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Yangling, 712100, Shaanxi, PR China
| | - Rui Mao
- College of Information Engineering, Northwest A&F University, Yangling, 712100, Shaanxi, PR China
| | - Yanzhen Wang
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Yangling, 712100, Shaanxi, PR China
| | - Lu Zhang
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Yangling, 712100, Shaanxi, PR China
| | - Changyou Wang
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Yangling, 712100, Shaanxi, PR China
| | - Shikai Lv
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Yangling, 712100, Shaanxi, PR China
| | - Xinlun Liu
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Yangling, 712100, Shaanxi, PR China
| | - Yajuan Wang
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Yangling, 712100, Shaanxi, PR China
| | - Wanquan Ji
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Yangling, 712100, Shaanxi, PR China.
| |
Collapse
|
83
|
Vindin H, Mithieux SM, Weiss AS. Elastin architecture. Matrix Biol 2019; 84:4-16. [DOI: 10.1016/j.matbio.2019.07.005] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2019] [Revised: 07/08/2019] [Accepted: 07/08/2019] [Indexed: 11/15/2022]
|
84
|
Yang J, Gao M, Xiong J, Su Z, Huang Y. Features of molecular recognition of intrinsically disordered proteins via coupled folding and binding. Protein Sci 2019; 28:1952-1965. [PMID: 31441158 PMCID: PMC6798136 DOI: 10.1002/pro.3718] [Citation(s) in RCA: 52] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Revised: 08/16/2019] [Accepted: 08/20/2019] [Indexed: 12/12/2022]
Abstract
The sequence-structure-function paradigm of proteins has been revolutionized by the discovery of intrinsically disordered proteins (IDPs) or intrinsically disordered regions (IDRs). In contrast to traditional ordered proteins, IDPs/IDRs are unstructured under physiological conditions. The absence of well-defined three-dimensional structures in the free state of IDPs/IDRs is fundamental to their function. Folding upon binding is an important mode of molecular recognition for IDPs/IDRs. While great efforts have been devoted to investigating the complex structures and binding kinetics and affinities, our knowledge on the binding mechanisms of IDPs/IDRs remains very limited. Here, we review recent advances on the binding mechanisms of IDPs/IDRs. The structures and kinetic parameters of IDPs/IDRs can vary greatly, and the binding mechanisms can be highly dependent on the structural properties of IDPs/IDRs. IDPs/IDRs can employ various combinations of conformational selection and induced fit in a binding process, which can be templated by the target and/or encoded by the IDP/IDR. Further studies should provide deeper insights into the molecular recognition of IDPs/IDRs and enable the rational design of IDP/IDR binding mechanisms in the future.
Collapse
Affiliation(s)
- Jing Yang
- Department of Biological Engineering and Key Laboratory of Industrial Fermentation (Ministry of Education)Hubei University of TechnologyWuhanHubeiChina
- Institute of Biomedical and Pharmaceutical SciencesHubei University of TechnologyWuhanHubeiChina
| | - Meng Gao
- Department of Biological Engineering and Key Laboratory of Industrial Fermentation (Ministry of Education)Hubei University of TechnologyWuhanHubeiChina
- Institute of Biomedical and Pharmaceutical SciencesHubei University of TechnologyWuhanHubeiChina
| | - Junwen Xiong
- Department of Biological Engineering and Key Laboratory of Industrial Fermentation (Ministry of Education)Hubei University of TechnologyWuhanHubeiChina
- Institute of Biomedical and Pharmaceutical SciencesHubei University of TechnologyWuhanHubeiChina
| | - Zhengding Su
- Department of Biological Engineering and Key Laboratory of Industrial Fermentation (Ministry of Education)Hubei University of TechnologyWuhanHubeiChina
- Institute of Biomedical and Pharmaceutical SciencesHubei University of TechnologyWuhanHubeiChina
| | - Yongqi Huang
- Department of Biological Engineering and Key Laboratory of Industrial Fermentation (Ministry of Education)Hubei University of TechnologyWuhanHubeiChina
- Institute of Biomedical and Pharmaceutical SciencesHubei University of TechnologyWuhanHubeiChina
| |
Collapse
|
85
|
Fonin AV, Darling AL, Kuznetsova IM, Turoverov KK, Uversky VN. Multi-functionality of proteins involved in GPCR and G protein signaling: making sense of structure-function continuum with intrinsic disorder-based proteoforms. Cell Mol Life Sci 2019; 76:4461-4492. [PMID: 31428838 PMCID: PMC11105632 DOI: 10.1007/s00018-019-03276-1] [Citation(s) in RCA: 46] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2019] [Revised: 08/05/2019] [Accepted: 08/12/2019] [Indexed: 12/21/2022]
Abstract
GPCR-G protein signaling system recognizes a multitude of extracellular ligands and triggers a variety of intracellular signaling cascades in response. In humans, this system includes more than 800 various GPCRs and a large set of heterotrimeric G proteins. Complexity of this system goes far beyond a multitude of pair-wise ligand-GPCR and GPCR-G protein interactions. In fact, one GPCR can recognize more than one extracellular signal and interact with more than one G protein. Furthermore, one ligand can activate more than one GPCR, and multiple GPCRs can couple to the same G protein. This defines an intricate multifunctionality of this important signaling system. Here, we show that the multifunctionality of GPCR-G protein system represents an illustrative example of the protein structure-function continuum, where structures of the involved proteins represent a complex mosaic of differently folded regions (foldons, non-foldons, unfoldons, semi-foldons, and inducible foldons). The functionality of resulting highly dynamic conformational ensembles is fine-tuned by various post-translational modifications and alternative splicing, and such ensembles can undergo dramatic changes at interaction with their specific partners. In other words, GPCRs and G proteins exist as sets of conformational/basic, inducible/modified, and functioning proteoforms characterized by a broad spectrum of structural features and possessing various functional potentials.
Collapse
Affiliation(s)
- Alexander V Fonin
- Laboratory of structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, St. Petersburg, 194064, Russian Federation
| | - April L Darling
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, USA
| | - Irina M Kuznetsova
- Laboratory of structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, St. Petersburg, 194064, Russian Federation
| | - Konstantin K Turoverov
- Laboratory of structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, St. Petersburg, 194064, Russian Federation
- Department of Biophysics, Peter the Great St. Petersburg Polytechnic University, Polytechnicheskaya av. 29, St. Petersburg, 195251, Russian Federation
| | - Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, USA.
- Institute for Biological Instrumentation, Russian Academy of Sciences, Pushchino, Moscow, Russian Federation.
| |
Collapse
|
86
|
El Hadidy N, Uversky VN. Intrinsic Disorder of the BAF Complex: Roles in Chromatin Remodeling and Disease Development. Int J Mol Sci 2019; 20:ijms20215260. [PMID: 31652801 PMCID: PMC6862534 DOI: 10.3390/ijms20215260] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2019] [Revised: 10/12/2019] [Accepted: 10/21/2019] [Indexed: 12/13/2022] Open
Abstract
The two-meter-long DNA is compressed into chromatin in the nucleus of every cell, which serves as a significant barrier to transcription. Therefore, for processes such as replication and transcription to occur, the highly compacted chromatin must be relaxed, and the processes required for chromatin reorganization for the aim of replication or transcription are controlled by ATP-dependent nucleosome remodelers. One of the most highly studied remodelers of this kind is the BRG1- or BRM-associated factor complex (BAF complex, also known as SWItch/sucrose non-fermentable (SWI/SNF) complex), which is crucial for the regulation of gene expression and differentiation in eukaryotes. Chromatin remodeling complex BAF is characterized by a highly polymorphic structure, containing from four to 17 subunits encoded by 29 genes. The aim of this paper is to provide an overview of the role of BAF complex in chromatin remodeling and also to use literature mining and a set of computational and bioinformatics tools to analyze structural properties, intrinsic disorder predisposition, and functionalities of its subunits, along with the description of the relations of different BAF complex subunits to the pathogenesis of various human diseases.
Collapse
Affiliation(s)
- Nashwa El Hadidy
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, 12901 Bruce B. Downs Blvd. MDC07, Tampa, FL 33612, USA.
| | - Vladimir N Uversky
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, 12901 Bruce B. Downs Blvd. MDC07, Tampa, FL 33612, USA.
- Laboratory of New Methods in Biology, Institute for Biological Instrumentation of the Russian Academy of Sciences, Federal Research Center "Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences", Pushchino, 142290 Moscow Region, Russia.
| |
Collapse
|
87
|
Millard PS, Kragelund BB, Burow M. R2R3 MYB Transcription Factors - Functions outside the DNA-Binding Domain. TRENDS IN PLANT SCIENCE 2019; 24:934-946. [PMID: 31358471 DOI: 10.1016/j.tplants.2019.07.003] [Citation(s) in RCA: 97] [Impact Index Per Article: 16.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/16/2019] [Revised: 07/02/2019] [Accepted: 07/05/2019] [Indexed: 05/20/2023]
Abstract
Several transcription factor (TF) families, including the MYB family, regulate a wide array of biological processes. TFs contain DNA-binding domains (DBDs) and regulatory regions; although information on protein structure is scarce for plant MYB TFs, various in silico methods suggest that the non-MYB regions contain extensive intrinsically disordered regions (IDRs). Although IDRs do not fold into stable globular structures, they comprise functional regions including interaction motifs, and recent research has shown that IDRs perform crucial biological roles. We map here domain organization, disorder predictions, and functional regions across the entire Arabidopsis thaliana R2R3 MYB TF family, and highlight where an increased research focus will be necessary to shape a new understanding of structure-function relationships in plant TFs.
Collapse
Affiliation(s)
- Peter S Millard
- DynaMo Center, Department of Plant and Environmental Sciences, University of Copenhagen, Copenhagen, Denmark; Copenhagen Plant Science Centre, Department of Plant and Environmental Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Birthe B Kragelund
- Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Meike Burow
- DynaMo Center, Department of Plant and Environmental Sciences, University of Copenhagen, Copenhagen, Denmark; Copenhagen Plant Science Centre, Department of Plant and Environmental Sciences, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
88
|
Appadurai R, Uversky VN, Srivastava A. The Structural and Functional Diversity of Intrinsically Disordered Regions in Transmembrane Proteins. J Membr Biol 2019; 252:273-292. [PMID: 31139867 PMCID: PMC7617717 DOI: 10.1007/s00232-019-00069-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2019] [Accepted: 05/17/2019] [Indexed: 10/26/2022]
Abstract
The intrinsically disordered proteins and protein regions (IDPs/IDPRs) do not have unique structures, but are known to be functionally important and their conformational flexibility and structural plasticity have engendered a paradigmatic shift in the classical sequence-structure-function maxim. Fundamental understanding in this field has significantly evolved since the discovery of this class of proteins about 25 years ago. Though the IDPRs of transmembrane proteins (TMP-IDPRs) comply with the broad definition of typical IDPs and IDPRs found in water-soluble globular proteins, much less is explored and known about them. In this review, we assimilate the key emerging biophysical principles from the limited studies on TMP-IDPRs and provide several context-specific biological examples to highlight the ubiquitous nature of TMP-IDPRs and their functional importance in cellular functions. Besides providing a spectrum of insights from sequence to structural disorder and functions, we also review the challenges and methodological advances in studying the structure-function relationship of TMP-IDPRs. We also lay stress upon the importance of an integrative framework, where ensemble-averaged (and mostly low-resolution) data from multiple experiments can be faithfully integrated with modelling techniques such as advanced sampling, coarse-graining, and free energy minimization methods for a high-fidelity characterization of TMP-IDPRs. We close the review by providing futuristic perspective with suggestions on how we could use the ideas and methods from the exciting field of protein engineering in conjunction with integrative modelling framework to advance the IDPR field and harness the sequence-disorder-function paradigm towards functional design of proteins.
Collapse
Affiliation(s)
- Rajeswari Appadurai
- Molecular Biophysics Unit, Biological Sciences Division, Indian Institute of Science, Bangalore, Karnataka, 560012, India
| | - Vladimir N Uversky
- Department of Molecular Medicine and Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, 33612, USA
- Protein Research Group, Institute for Biological Instrumentation of the Russian Academy of Sciences, Institutskaya Str., 7, Pushchino, Moscow Region, Russia, 142290
| | - Anand Srivastava
- Molecular Biophysics Unit, Biological Sciences Division, Indian Institute of Science, Bangalore, Karnataka, 560012, India.
| |
Collapse
|
89
|
Capitanchik C, Dixon CR, Swanson SK, Florens L, Kerr ARW, Schirmer EC. Analysis of RNA-Seq datasets reveals enrichment of tissue-specific splice variants for nuclear envelope proteins. Nucleus 2019; 9:410-430. [PMID: 29912636 PMCID: PMC7000147 DOI: 10.1080/19491034.2018.1469351] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
Laminopathies yield tissue-specific pathologies, yet arise from mutation of ubiquitously-expressed genes. A little investigated hypothesis to explain this is that the mutated proteins or their partners have tissue-specific splice variants. To test this, we analyzed RNA-Seq datasets, finding novel isoforms or isoform tissue-specificity for: Lap2, linked to cardiomyopathy; Nesprin 2, linked to Emery-Dreifuss muscular dystrophy and Lmo7, that regulates the Emery-Dreifuss muscular dystrophy linked emerin gene. Interestingly, the muscle-specific Lmo7 exon is rich in serine phosphorylation motifs, suggesting regulatory function. Muscle-specific splice variants in non-nuclear envelope proteins linked to other muscular dystrophies were also found. Nucleoporins tissue-specific variants were found for Nup54, Nup133, Nup153 and Nup358/RanBP2. RT-PCR confirmed novel Lmo7 and RanBP2 variants and specific knockdown of the Lmo7 variantreduced myogenic index. Nuclear envelope proteins were enriched for tissue-specific splice variants compared to the rest of the genome, suggesting that splice variants contribute to its tissue-specific functions.
Collapse
Affiliation(s)
- Charlotte Capitanchik
- a The Wellcome Centre for Cell Biology and Institute of Cell Biology , University of Edinburgh , Edinburgh , UK
| | - Charles R Dixon
- a The Wellcome Centre for Cell Biology and Institute of Cell Biology , University of Edinburgh , Edinburgh , UK
| | - Selene K Swanson
- b Stowers Institute for Medical Research , Kansas City , MO , USA
| | - Laurence Florens
- b Stowers Institute for Medical Research , Kansas City , MO , USA
| | - Alastair R W Kerr
- a The Wellcome Centre for Cell Biology and Institute of Cell Biology , University of Edinburgh , Edinburgh , UK
| | - Eric C Schirmer
- a The Wellcome Centre for Cell Biology and Institute of Cell Biology , University of Edinburgh , Edinburgh , UK
| |
Collapse
|
90
|
Shamilov R, Aneskievich BJ. Intrinsic Disorder in Nuclear Receptor Amino Termini: From Investigational Challenge to Therapeutic Opportunity. NUCLEAR RECEPTOR RESEARCH 2019. [DOI: 10.32527/2019/101417] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Affiliation(s)
- Rambon Shamilov
- Graduate Program in Pharmacology & Toxicology, University of Connecticut, Storrs, CT 06269-3092, USA
| | - Brian J. Aneskievich
- Department of Pharmaceutical Sciences, University of Connecticut, Storrs, CT 06269-3092, USA
| |
Collapse
|
91
|
Djulbegovic MB, Uversky VN. Ferroptosis - An iron- and disorder-dependent programmed cell death. Int J Biol Macromol 2019; 135:1052-1069. [PMID: 31175900 DOI: 10.1016/j.ijbiomac.2019.05.221] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2019] [Revised: 05/30/2019] [Accepted: 05/31/2019] [Indexed: 12/20/2022]
Abstract
Programmed cell death (PCD) is an integral component of both developmental and pathological features of an organism. Recently, ferroptosis, a new form of PCD that is dependent on reactive oxygen species and iron, has been described. As with apoptosis, necroptosis, and autophagy, ferroptosis is associated with a large set of proteins assembled in protein-protein interaction (PPI) networks, interactability of which is driven by the presence of intrinsically disordered proteins (IDPs) and IDP regions (IDPRs). Previous investigations have identified the prevalence and functionality of IDPs/IDPRs in non-ferroptosis PCD. The intrinsic disorder in protein structures is used to increase the regulatory powers of these processes. As uncontrolled PCD is associated with the onset of various pathological traits, uncovering the association between intrinsic disorder and ferroptosis-related proteins is crucial. To understand this association, 31 human ferroptosis-related proteins were analyzed via a multi-dimensional array of bioinformatics and computational techniques. In addition, a disorder meta-predictor (PONDR® FIT) was implored to look at the evolutionary conservation of intrinsic disorder in these proteins. This study presents evidence that IDPs and IDPRs are prevalent in ferroptosis. The intrinsic disorder found in ferroptosis has far-reaching functional implications related to ferroptosis-related PPIs and molecular interactions.
Collapse
Affiliation(s)
- Mak B Djulbegovic
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA
| | - Vladimir N Uversky
- Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA; USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA; Protein Research Group, Institute for Biological Instrumentation of the Russian Academy of Sciences, 142290 Pushchino, Moscow region, Russia.
| |
Collapse
|
92
|
Thomas BT, Ogunkanmi LA, Iwalokun BA, Popoola OD. Transition-transversion mutations in the polyketide synthase gene of Aspergillus section Nigri. Heliyon 2019; 5:e01881. [PMID: 31338447 PMCID: PMC6579908 DOI: 10.1016/j.heliyon.2019.e01881] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2018] [Revised: 02/25/2019] [Accepted: 05/30/2019] [Indexed: 11/21/2022] Open
Abstract
This study determined the transition-transversion mutation in the pks gene of Aspergillus section Nigri in order to gain insight into the patterns of nucleotide base substitution and the process of molecular evolution using standard recommended techniques. Results obtained depict frequent occurrence of transition (23 ± 0.96) than transversion (11.37 ± 1.38) (p < 0.05) with C/T being the most frequently observed transitional base substitution and C/A the most frequently occurring transversional base change. The number of single base insertions (56 ± 1.00) were significantly higher than the observed single base deletions (38 ± 2.00) (p < 0.05) while varying degrees of two or more base deletions and insertions were also observed both inside and outside the open reading frame. The maximum likelihood value estimated for the pks gene was calculated to be -9458.80 in 423 positions of the final dataset while the transition-transversion ratio was estimated to be 0.50. The Tajima's neutrality test approaches seven (7) with the nucleotide diversity estimated to be approximately 65%. Evolutionary test depicts positive selection as ratio of non synonymous to synonymous divergence was found to be greater than ratio of the number of non synonymous to synonymous polymorphisms. The proportion of substitution driven by positive selection was calculated to be approximately 96.2%. This research therefore provides an insight into the understanding of pks gene mutation patterns as some of the observed indels resulted in frame shift mutations.
Collapse
Affiliation(s)
- Benjamin Thoha Thomas
- Department of Microbiology, Olabisi Onabanjo University, Ago Iwoye, Ogun State, Nigeria
| | | | - Bamidele Abiodun Iwalokun
- Division of Molecular Biology and Biotechnology, Nigeria Institute of Medical Research, Yaba, Lagos, Nigeria
| | | |
Collapse
|
93
|
Zuo Y, Feng F, Qi W, Song R. Dek42 encodes an RNA-binding protein that affects alternative pre-mRNA splicing and maize kernel development. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2019; 61:728-748. [PMID: 30839161 DOI: 10.1111/jipb.12798] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/12/2018] [Accepted: 02/28/2019] [Indexed: 05/22/2023]
Abstract
RNA-binding proteins (RBPs) play an important role in post-transcriptional gene regulation. However, the functions of RBPs in plants remain poorly understood. Maize kernel mutant dek42 has small defective kernels and lethal seedlings. Dek42 was cloned by Mutator tag isolation and further confirmed by an independent mutant allele and clustered regularly interspaced short palindromic repeats (CRISPR)-CRISPR-associated protein 9 materials. Dek42 encodes an RRM_RBM48 type RNA-binding protein that localizes to the nucleus. Dek42 is constitutively expressed in various maize tissues. The dek42 mutation caused a significant reduction in the accumulation of DEK42 protein in mutant kernels. RNA-seq analysis showed that the dek42 mutation significantly disturbed the expression of thousands of genes during maize kernel development. Sequence analysis also showed that the dek42 mutation significantly changed alternative splicing in expressed genes, which were especially enriched for the U12-type intron-retained type. Yeast two-hybrid screening identified SF3a1 as a DEK42-interacting protein. DEK42 also interacts with the spliceosome component U1-70K. These results suggested that DEK42 participates in the regulation of pre-messenger RNA splicing through its interaction with other spliceosome components. This study showed the function of a newly identified RBP and provided insights into alternative splicing regulation during maize kernel development.
Collapse
Affiliation(s)
- Yi Zuo
- State Key Laboratory of Plant Physiology and Biochemistry, National Maize Improvement Center, Beijing Key Laboratory of Crop Genetic Improvement, Joint International Research Laboratory of Crop Molecular Breeding, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100193, China
| | - Fan Feng
- Shanghai Key Laboratory of Bio-Energy Crops, Plant Science Center, School of Life Sciences, Shanghai University, Shanghai, 200444, China
| | - Weiwei Qi
- Shanghai Key Laboratory of Bio-Energy Crops, Plant Science Center, School of Life Sciences, Shanghai University, Shanghai, 200444, China
| | - Rentao Song
- State Key Laboratory of Plant Physiology and Biochemistry, National Maize Improvement Center, Beijing Key Laboratory of Crop Genetic Improvement, Joint International Research Laboratory of Crop Molecular Breeding, College of Agronomy and Biotechnology, China Agricultural University, Beijing, 100193, China
| |
Collapse
|
94
|
Identification and Analysis of Micro-Exon Genes in the Rice Genome. Int J Mol Sci 2019; 20:ijms20112685. [PMID: 31159166 PMCID: PMC6600660 DOI: 10.3390/ijms20112685] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Revised: 05/25/2019] [Accepted: 05/29/2019] [Indexed: 11/24/2022] Open
Abstract
Micro-exons are a kind of exons with lengths no more than 51 nucleotides. They are generally ignored in genome annotation due to the short length, whereas recent studies indicate that they have special splicing properties and important functions. Considering that there has been no genome-wide study of micro-exons in plants up to now, we screened and analyzed genes containing micro-exons in two indica rice varieties in this study. According to the annotation of Zhenshan 97 (ZS97) and Minghui 63 (MH63), ~23% of genes possess micro-exons. We then identified micro-exons from RNA-seq data and found that >65% micro-exons had been annotated and most of novel micro-exons were located in gene regions. About 60% micro-exons were constitutively spliced, and the others were alternatively spliced in different tissues. Besides, we observed that approximately 54% of genes harboring micro-exons tended to be ancient genes, and 13% were Oryza genus-specific. Micro-exon genes were highly conserved in Oryza genus with consistent domains. In particular, the predicted protein structures showed that alternative splicing of in-frame micro-exons led to a local structural recombination, which might affect some core structure of domains, and alternative splicing of frame-shifting micro-exons usually resulted in premature termination of translation by introducing a stop codon or missing functional domains. Overall, our study provided the genome-wide distribution, evolutionary conservation, and potential functions of micro-exons in rice.
Collapse
|
95
|
Katuwawala A, Ghadermarzi S, Kurgan L. Computational prediction of functions of intrinsically disordered regions. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2019; 166:341-369. [PMID: 31521235 DOI: 10.1016/bs.pmbts.2019.04.006] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Intrinsically disorder regions (IDRs) are abundant in nature, particularly among Eukaryotes. While they facilitate a wide spectrum of cellular functions including signaling, molecular assembly and recognition, translation, transcription and regulation, only several hundred IDRs are annotated functionally. This annotation gap motivates the development of fast and accurate computational methods that predict IDR functions directly from protein sequences. We introduce and describe a comprehensive collection of 25 methods that provide accurate predictions of IDRs that interact with proteins and nucleic acids, that function as flexible linkers and that moonlight multiple functions. Virtually all of these predictors can be accessed online and many were developed in the last few years. They utilize a wide range of predictive architectures and take advantage of modern machine learning algorithms. Our empirical analysis shows that predictors that are available as webservers enjoy high rates of citations, attesting to their practical value and popularity. The most cited methods include DISOPRED3, ANCHOR, alpha-MoRFpred, MoRFpred, fMoRFpred and MoRFCHiBi. We present two case studies to demonstrate that predictions produced by these computational tools are relatively easy to interpret and that they deliver valuable functional clues. However, the current computational tools cover a relatively narrow range of disorder functions. Further development efforts that would cover a broader range of functions should be pursued. We demonstrate that a sufficient amount of functionally annotated IDRs that are associated with several other disorder functions is already available and can be used to design and validate novel predictors.
Collapse
Affiliation(s)
- Akila Katuwawala
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
| | - Sina Ghadermarzi
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, United States.
| |
Collapse
|
96
|
Redwan EM, Alkarim SA, El-Hanafy AA, Saad YM, Almehdar HA, Uversky VN. Disorder in milk proteins: adipophilin and TIP47, important constituents of the milk fat globule membrane. J Biomol Struct Dyn 2019; 38:1214-1229. [PMID: 30896308 DOI: 10.1080/07391102.2019.1592027] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Milk fat globules (MFGs), which are secreted by the epithelial cells of the lactating mammary glands, account for the most of the nutritional value of milk. They are enveloped by the milk fat globule membrane (MFGM), a complex structure consisting of three phospholipid membrane monolayers and containing various lipids. Depending on the origin of milk, specific proteins accounts for 5-70% of the MFGM mass. Proteome of MFGMs includes hundreds of proteins, with nine major components being adipophilin, butyrophilin, cluster of differentiation 36, fatty acid binding protein, lactadherin, mucin 1, mucin 15, tail-interacting protein 47 (TIP47), and xanthine oxidoreductase. Two of the MFGM components, adipophilin and TIP47, belong to the five-member perilipin family of lipid droplet proteins. Adipophilin is involved in the formation of cytoplasmic lipid droplets and secretion of MFGs. This protein is also related to the formation of other lipid droplets that exist in most cell types, playing an important role in the transport of lipids from ER to the surface of lipid droplets. TIP47 acts as a cytoplasmic sorting factor for mannose 6-phosphate receptors and is recruited to the MFGM. Therefore, both adipophilin and TIP47 are moonlighting proteins, each possessing several unrelated functions. This review focuses on the main functions and specific structural features of adipophilin and TIP47, analyzes similarities and differences of these proteins among different species, and describes these proteins in the context of other members of the perilipin family.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Elrashdy M Redwan
- Biological Sciences Department, Faculty of Sciences, King Abdulaziz University, Jeddah, Saudi Arabia.,Protein Research Department, Therapeutic and Protective Proteins Laboratory, Genetic Engineering and Biotechnology Research Institute, City for Scientific Research and Technology Applications, Alexandria, Egypt
| | - Saleh A Alkarim
- Biological Sciences Department, Faculty of Sciences, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Amr A El-Hanafy
- Biological Sciences Department, Faculty of Sciences, King Abdulaziz University, Jeddah, Saudi Arabia.,Department of Nucleic Acid Research, Genetic Engineering and Biotechnology Research Institute, City for Scientific Research & Technology Applications, Borg EL-Arab, Alexandria, Egypt
| | - Yasser M Saad
- Biological Sciences Department, Faculty of Sciences, King Abdulaziz University, Jeddah, Saudi Arabia.,Genetics Laboratory, National Institute of Oceanography and Fisheries, Cairo, Egypt
| | - Hussein A Almehdar
- Biological Sciences Department, Faculty of Sciences, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Vladimir N Uversky
- Biological Sciences Department, Faculty of Sciences, King Abdulaziz University, Jeddah, Saudi Arabia.,Institute for Biological Instrumentation of the Russian Academy of Sciences, Pushchino, Russia Moscow Region.,Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, USA
| |
Collapse
|
97
|
Oldfield CJ, Uversky VN, Dunker AK, Kurgan L. Introduction to intrinsically disordered proteins and regions. Proteins 2019. [DOI: 10.1016/b978-0-12-816348-1.00001-6] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
|
98
|
Uversky VN. Protein intrinsic disorder and structure-function continuum. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2019; 166:1-17. [DOI: 10.1016/bs.pmbts.2019.05.003] [Citation(s) in RCA: 56] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
|
99
|
Gurevich VV, Gurevich EV, Uversky VN. Arrestins: structural disorder creates rich functionality. Protein Cell 2018; 9:986-1003. [PMID: 29453740 PMCID: PMC6251804 DOI: 10.1007/s13238-017-0501-8] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2017] [Accepted: 11/27/2017] [Indexed: 01/14/2023] Open
Abstract
Arrestins are soluble relatively small 44-46 kDa proteins that specifically bind hundreds of active phosphorylated GPCRs and dozens of non-receptor partners. There are binding partners that demonstrate preference for each of the known arrestin conformations: free, receptor-bound, and microtubule-bound. Recent evidence suggests that conformational flexibility in every functional state is the defining characteristic of arrestins. Flexibility, or plasticity, of proteins is often described as structural disorder, in contrast to the fixed conformational order observed in high-resolution crystal structures. However, protein-protein interactions often involve highly flexible elements that can assume many distinct conformations upon binding to different partners. Existing evidence suggests that arrestins are no exception to this rule: their flexibility is necessary for functional versatility. The data on arrestins and many other multi-functional proteins indicate that in many cases, "order" might be artificially imposed by highly non-physiological crystallization conditions and/or crystal packing forces. In contrast, conformational flexibility (and its extreme case, intrinsic disorder) is a more natural state of proteins, representing true biological order that underlies their physiologically relevant functions.
Collapse
Affiliation(s)
- Vsevolod V Gurevich
- Department of Pharmacology, Vanderbilt University, Nashville, TN, 37232, USA.
| | - Eugenia V Gurevich
- Department of Pharmacology, Vanderbilt University, Nashville, TN, 37232, USA
| | - Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, 33612, USA
- Institute for Biological Instrumentation, Russian Academy of Sciences, Pushchino, Moscow Region, Russia, 142290
| |
Collapse
|
100
|
Fonin AV, Darling AL, Kuznetsova IM, Turoverov KK, Uversky VN. Intrinsically disordered proteins in crowded milieu: when chaos prevails within the cellular gumbo. Cell Mol Life Sci 2018; 75:3907-3929. [PMID: 30066087 PMCID: PMC11105604 DOI: 10.1007/s00018-018-2894-9] [Citation(s) in RCA: 70] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2018] [Revised: 07/24/2018] [Accepted: 07/26/2018] [Indexed: 12/18/2022]
Abstract
Effects of macromolecular crowding on structural and functional properties of ordered proteins, their folding, interactability, and aggregation are well documented. Much less is known about how macromolecular crowding might affect structural and functional behaviour of intrinsically disordered proteins (IDPs) or intrinsically disordered protein regions (IDPRs). To fill this gap, this review represents a systematic analysis of the available literature data on the behaviour of IDPs/IDPRs in crowded environment. Although it was hypothesized that, due to the excluded-volume effects present in crowded environments, IDPs/IDPRs would invariantly fold in the presence of high concentrations of crowding agents or in the crowded cellular environment, accumulated data indicate that, based on their response to the presence of crowders, IDPs/IDPRs can be grouped into three major categories, foldable, non-foldable, and unfoldable. This is because natural cellular environment is not simply characterized by the presence of high concentration of "inert" macromolecules, but represents an active milieu, components of which are engaged in direct physical interactions and soft interactions with target proteins. Some of these interactions with cellular components can cause (local) unfolding of query proteins. In other words, since crowding can cause both folding and unfolding of an IDP or its regions, the outputs of the placing of a query protein to the crowded environment would depend on the balance between these two processes. As a result, and because of the spatio-temporal heterogeneity in structural organization of IDPs, macromolecular crowding can differently affect structures of different IDPs. Recent studies indicate that some IDPs are able to undergo liquid-liquid-phase transitions leading to the formation of various proteinaceous membrane-less organelles (PMLOs). Although interiors of such PMLOs are self-crowded, being characterized by locally increased concentrations of phase-separating IDPs, these IDPs are minimally foldable or even non-foldable at all (at least within the physiologically safe time-frame of normal PMLO existence).
Collapse
Affiliation(s)
- Alexander V Fonin
- Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, St. Petersburg, Russian Federation
| | - April L Darling
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, USA
| | - Irina M Kuznetsova
- Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, St. Petersburg, Russian Federation
| | - Konstantin K Turoverov
- Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, St. Petersburg, Russian Federation
- St. Petersburg State Polytechnical University, St. Petersburg, Russian Federation
| | - Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, USA.
| |
Collapse
|