1
|
Disordered C-terminal domain drives spatiotemporal confinement of RNAPII to enhance search for chromatin targets. Nat Cell Biol 2024; 26:581-592. [PMID: 38548891 DOI: 10.1038/s41556-024-01382-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Accepted: 02/21/2024] [Indexed: 04/09/2024]
Abstract
Efficient gene expression requires RNA polymerase II (RNAPII) to find chromatin targets precisely in space and time. How RNAPII manages this complex diffusive search in three-dimensional nuclear space remains largely unknown. The disordered carboxy-terminal domain (CTD) of RNAPII, which is essential for recruiting transcription-associated proteins, forms phase-separated droplets in vitro, hinting at a potential role in modulating RNAPII dynamics. In the present study, we use single-molecule tracking and spatiotemporal mapping in living yeast to show that the CTD is required for confining RNAPII diffusion within a subnuclear region enriched for active genes, but without apparent phase separation into condensates. Both Mediator and global chromatin organization are required for sustaining RNAPII confinement. Remarkably, truncating the CTD disrupts RNAPII spatial confinement, prolongs target search, diminishes chromatin binding, impairs pre-initiation complex formation and reduces transcription bursting. The present study illuminates the pivotal role of the CTD in driving spatiotemporal confinement of RNAPII for efficient gene expression.
Collapse
|
2
|
The molecular basis for cellular function of intrinsically disordered protein regions. Nat Rev Mol Cell Biol 2024; 25:187-211. [PMID: 37957331 DOI: 10.1038/s41580-023-00673-0] [Citation(s) in RCA: 17] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/26/2023] [Indexed: 11/15/2023]
Abstract
Intrinsically disordered protein regions exist in a collection of dynamic interconverting conformations that lack a stable 3D structure. These regions are structurally heterogeneous, ubiquitous and found across all kingdoms of life. Despite the absence of a defined 3D structure, disordered regions are essential for cellular processes ranging from transcriptional control and cell signalling to subcellular organization. Through their conformational malleability and adaptability, disordered regions extend the repertoire of macromolecular interactions and are readily tunable by their structural and chemical context, making them ideal responders to regulatory cues. Recent work has led to major advances in understanding the link between protein sequence and conformational behaviour in disordered regions, yet the link between sequence and molecular function is less well defined. Here we consider the biochemical and biophysical foundations that underlie how and why disordered regions can engage in productive cellular functions, provide examples of emerging concepts and discuss how protein disorder contributes to intracellular information processing and regulation of cellular function.
Collapse
|
3
|
Macromolecular Crowding, Phase Separation, and Homeostasis in the Orchestration of Bacterial Cellular Functions. Chem Rev 2024; 124:1899-1949. [PMID: 38331392 PMCID: PMC10906006 DOI: 10.1021/acs.chemrev.3c00622] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 12/01/2023] [Accepted: 01/10/2024] [Indexed: 02/10/2024]
Abstract
Macromolecular crowding affects the activity of proteins and functional macromolecular complexes in all cells, including bacteria. Crowding, together with physicochemical parameters such as pH, ionic strength, and the energy status, influences the structure of the cytoplasm and thereby indirectly macromolecular function. Notably, crowding also promotes the formation of biomolecular condensates by phase separation, initially identified in eukaryotic cells but more recently discovered to play key functions in bacteria. Bacterial cells require a variety of mechanisms to maintain physicochemical homeostasis, in particular in environments with fluctuating conditions, and the formation of biomolecular condensates is emerging as one such mechanism. In this work, we connect physicochemical homeostasis and macromolecular crowding with the formation and function of biomolecular condensates in the bacterial cell and compare the supramolecular structures found in bacteria with those of eukaryotic cells. We focus on the effects of crowding and phase separation on the control of bacterial chromosome replication, segregation, and cell division, and we discuss the contribution of biomolecular condensates to bacterial cell fitness and adaptation to environmental stress.
Collapse
|
4
|
Disordered C-terminal domain drives spatiotemporal confinement of RNAPII to enhance search for chromatin targets. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.31.551302. [PMID: 37577667 PMCID: PMC10418089 DOI: 10.1101/2023.07.31.551302] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/15/2023]
Abstract
Efficient gene expression requires RNA Polymerase II (RNAPII) to find chromatin targets precisely in space and time. How RNAPII manages this complex diffusive search in 3D nuclear space remains largely unknown. The disordered carboxy-terminal domain (CTD) of RNAPII, which is essential for recruiting transcription-associated proteins, forms phase-separated droplets in vitro, hinting at a potential role in modulating RNAPII dynamics. Here, we use single-molecule tracking and spatiotemporal mapping in living yeast to show that the CTD is required for confining RNAPII diffusion within a subnuclear region enriched for active genes, but without apparent phase separation into condensates. Both Mediator and global chromatin organization are required for sustaining RNAPII confinement. Remarkably, truncating the CTD disrupts RNAPII spatial confinement, prolongs target search, diminishes chromatin binding, impairs pre-initiation complex formation, and reduces transcription bursting. This study illuminates the pivotal role of the CTD in driving spatiotemporal confinement of RNAPII for efficient gene expression.
Collapse
|
5
|
Complex Conformational Space of the RNA Polymerase II C-Terminal Domain upon Phosphorylation. J Phys Chem B 2023; 127:9223-9235. [PMID: 37870995 PMCID: PMC10626582 DOI: 10.1021/acs.jpcb.3c02655] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Revised: 10/03/2023] [Indexed: 10/25/2023]
Abstract
Intrinsically disordered proteins (IDPs) have been closely studied during the past decade due to their importance in many biological processes. The disordered nature of this group of proteins makes it difficult to observe its full span of the conformational space using either experimental or computational studies. In this article, we explored the conformational space of the C-terminal domain (CTD) of RNA polymerase II (Pol II), which is also an intrinsically disordered low complexity domain, using enhanced sampling methods. We provided a detailed conformational analysis of model systems of CTD with different lengths; first with the last 44 residues of the human CTD sequence and finally the CTD model with 2-heptapeptide repeating units. We then investigated the effects of phosphorylation on CTD conformations by performing simulations at different phosphorylated states. We obtained broad conformational spaces in nonphosphorylated CTD models, and phosphorylation has complex effects on the conformations of the CTD. These complex effects depend on the length of the CTD, spacing between the multiple phosphorylation sites, ion coordination, and interactions with the nearby residues.
Collapse
|
6
|
Driving forces behind phase separation of the carboxy-terminal domain of RNA polymerase II. Nat Commun 2023; 14:5979. [PMID: 37749095 PMCID: PMC10519987 DOI: 10.1038/s41467-023-41633-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Accepted: 09/10/2023] [Indexed: 09/27/2023] Open
Abstract
Eukaryotic gene regulation and pre-mRNA transcription depend on the carboxy-terminal domain (CTD) of RNA polymerase (Pol) II. Due to its highly repetitive, intrinsically disordered sequence, the CTD enables clustering and phase separation of Pol II. The molecular interactions that drive CTD phase separation and Pol II clustering are unclear. Here, we show that multivalent interactions involving tyrosine impart temperature- and concentration-dependent self-coacervation of the CTD. NMR spectroscopy, molecular ensemble calculations and all-atom molecular dynamics simulations demonstrate the presence of diverse tyrosine-engaging interactions, including tyrosine-proline contacts, in condensed states of human CTD and other low-complexity proteins. We further show that the network of multivalent interactions involving tyrosine is responsible for the co-recruitment of the human Mediator complex and CTD during phase separation. Our work advances the understanding of the driving forces of CTD phase separation and thus provides the basis to better understand CTD-mediated Pol II clustering in eukaryotic gene transcription.
Collapse
|
7
|
Abstract
The understanding on how short peptide assemblies transit from disorder to order remains limited due to the lack of atomistic structures. Here we report cryo-EM structure of the nanofibers short intrinsically disordered peptides (IDPs). Upon lowering pH or adding calcium ions, the IDP transitions from individual nanoparticles to nanofibers containing an aromatic core and a disordered periphery comprised of 2 to 5 amino acids. Protonating the phosphate or adding more metal ions further assembles the nanofibers into filament bundles. The assemblies of the IDP analogs with controlled chemistry, such as phosphorylation site, hydrophobic interactions, and sequences indicate that metal ions interact with the flexible periphery of the nanoparticles of the IDPs to form fibrils and enhance the interfibrillar interactions to form filament bundles. Illustrating that an IDP self-assembles from disorder to order, this work offers atomistic molecular insights to understand assemblies of short peptides driven by noncovalent interactions.
Collapse
|
8
|
SOURSOP: A Python Package for the Analysis of Simulations of Intrinsically Disordered Proteins. J Chem Theory Comput 2023; 19:5609-5620. [PMID: 37463458 DOI: 10.1021/acs.jctc.3c00190] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/20/2023]
Abstract
Conformational heterogeneity is a defining hallmark of intrinsically disordered proteins and protein regions (IDRs). The functions of IDRs and the emergent cellular phenotypes they control are associated with sequence-specific conformational ensembles. Simulations of conformational ensembles that are based on atomistic and coarse-grained models are routinely used to uncover the sequence-specific interactions that may contribute to IDR functions. These simulations are performed either independently or in conjunction with data from experiments. Functionally relevant features of IDRs can span a range of length scales. Extracting these features requires analysis routines that quantify a range of properties. Here, we describe a new analysis suite simulation analysis of unfolded regions of proteins (SOURSOP), an object-oriented and open-source toolkit designed for the analysis of simulated conformational ensembles of IDRs. SOURSOP implements several analysis routines motivated by principles in polymer physics, offering a unique collection of simple-to-use functions to characterize IDR ensembles. As an extendable framework, SOURSOP supports the development and implementation of new analysis routines that can be easily packaged and shared.
Collapse
|
9
|
Structure and phase separation of the C-terminal domain of RNA polymerase II. Biol Chem 2023; 404:839-844. [PMID: 37331973 DOI: 10.1515/hsz-2023-0136] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Accepted: 06/01/2023] [Indexed: 06/20/2023]
Abstract
The repetitive heptads in the C-terminal domain (CTD) of RPB1, the largest subunit of RNA Polymerase II (Pol II), play a critical role in the regulation of Pol II-based transcription. Recent findings on the structure of the CTD in the pre-initiation complex determined by cryo-EM and the novel phase separation properties of key transcription components offers an expanded mechanistic interpretation of the spatiotemporal distribution of Pol II during transcription. Current experimental evidence further suggests an exquisite balance between CTD's local structure and an array of multivalent interactions that drive phase separation of Pol II and thus shape its transcriptional activity.
Collapse
|
10
|
SOURSOP: A Python package for the analysis of simulations of intrinsically disordered proteins. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.16.528879. [PMID: 36824878 PMCID: PMC9949127 DOI: 10.1101/2023.02.16.528879] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/19/2023]
Abstract
Conformational heterogeneity is a defining hallmark of intrinsically disordered proteins and protein regions (IDRs). The functions of IDRs and the emergent cellular phenotypes they control are associated with sequence-specific conformational ensembles. Simulations of conformational ensembles that are based on atomistic and coarse-grained models are routinely used to uncover the sequence-specific interactions that may contribute to IDR functions. These simulations are performed either independently or in conjunction with data from experiments. Functionally relevant features of IDRs can span a range of length scales. Extracting these features requires analysis routines that quantify a range of properties. Here, we describe a new analysis suite SOURSOP, an object-oriented and open-source toolkit designed for the analysis of simulated conformational ensembles of IDRs. SOURSOP implements several analysis routines motivated by principles in polymer physics, offering a unique collection of simple-to-use functions to characterize IDR ensembles. As an extendable framework, SOURSOP supports the development and implementation of new analysis routines that can be easily packaged and shared.
Collapse
|
11
|
From structure to molecular condensates: emerging mechanisms for Mediator function. FEBS J 2023; 290:286-309. [PMID: 34698446 DOI: 10.1111/febs.16250] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2021] [Revised: 10/15/2021] [Accepted: 10/25/2021] [Indexed: 02/05/2023]
Abstract
Mediator is a large modular protein assembly whose function as a coactivator of transcription is conserved in all eukaryotes. The Mediator complex can integrate and relay signals from gene-specific activators bound at enhancers to activate the general transcription machinery located at promoters. It has thus been described as a bridge between these elements during initiation of transcription. Here, we review recent studies on Mediator relating to its structure, gene specificity and general requirement, roles in chromatin architecture as well as novel concepts involving phase separation and transcriptional bursting. We revisit the mechanism of action of Mediator and ultimately put forward models for its mode of action in gene activation.
Collapse
|
12
|
Abstract
In stark contrast to foldable proteins with a unique folded state, intrinsically disordered proteins and regions (IDPs) persist in perpetually disordered ensembles. Yet an IDP ensemble has conformational features-even when averaged-that are specific to its sequence. In fact, subtle changes in an IDP sequence can modulate its conformational features and its function. Recent advances in theoretical physics reveal a set of elegant mathematical expressions that describe the intricate relationships among IDP sequences, their ensemble conformations, and the regulation of their biological functions. These equations also describe the molecular properties of IDP sequences that predict similarities and dissimilarities in their functions and facilitate classification of sequences by function, an unmet challenge to traditional bioinformatics. These physical sequence-patterning metrics offer a promising new avenue for advancing synthetic biology at a time when multiple novel functional modes mediated by IDPs are emerging.
Collapse
|
13
|
Abstract
To predict transcription, one needs a mechanistic understanding of how the numerous required transcription factors (TFs) explore the nuclear space to find their target genes, assemble, cooperate, and compete with one another. Advances in fluorescence microscopy have made it possible to visualize real-time TF dynamics in living cells, leading to two intriguing observations: first, most TFs contact chromatin only transiently; and second, TFs can assemble into clusters through their intrinsically disordered regions. These findings suggest that highly dynamic events and spatially structured nuclear microenvironments might play key roles in transcription regulation that are not yet fully understood. The emerging model is that while some promoters directly convert TF-binding events into on/off cycles of transcription, many others apply complex regulatory layers that ultimately lead to diverse phenotypic outputs. Cracking this kinetic code is an ongoing and challenging task that is made possible by combining innovative imaging approaches with biophysical models.
Collapse
|
14
|
Folded domain charge properties influence the conformational behavior of disordered tails. Curr Res Struct Biol 2021; 3:216-228. [PMID: 34557680 PMCID: PMC8446786 DOI: 10.1016/j.crstbi.2021.08.002] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2021] [Accepted: 08/26/2021] [Indexed: 12/22/2022] Open
Abstract
Intrinsically disordered proteins and protein regions (IDRs) make up around 30% of the human proteome where they play essential roles in dictating and regulating many core biological processes. While IDRs are often studied as isolated domains, in naturally occurring proteins most IDRs are found adjacent to folded domains, where they exist as either N- or C-terminal tails or as linkers connecting two folded domains. Prior work has shown that charge properties of IDRs can influence their conformational behavior, both in isolation and in the context of folded domains. In contrast, the converse scenario is less well-explored: how do the charge properties of folded domains influence IDR conformational behavior? To answer this question, we combined a large-scale structural bioinformatics analysis with all-atom implicit solvent simulations of both rationally designed and naturally occurring proteins. Our results reveal three key takeaways. Firstly, the relative position and accessibility of charged residues across the surface of a folded domain can dictate IDR conformational behavior, overriding expectations based on net surface charge properties. Secondly, naturally occurring proteins possess multiple charge patches that are physically accessible to local IDRs. Finally, even modest changes in the local electrostatic environment of a folded domain can substantially modulate IDR-folded domain interactions. Taken together, our results suggest that folded domain surfaces can act as local determinants of IDR conformational behavior. Intrinsically disordered regions (IDRs) are mostly found adjacent to folded domains. Here we propose that the folded domain surface properties influence IDR behavior. We combine all-atom simulations and sequence design of IDRs and folded domains. IDR conformational behavior is determined by a complex combination of factors. Folded domains can substantially alter IDR conformational biases.
Collapse
|
15
|
Intrinsically disordered protein regions and phase separation: sequence determinants of assembly or lack thereof. Emerg Top Life Sci 2021; 4:307-329. [PMID: 33078839 DOI: 10.1042/etls20190164] [Citation(s) in RCA: 121] [Impact Index Per Article: 40.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2020] [Revised: 09/23/2020] [Accepted: 09/28/2020] [Indexed: 02/07/2023]
Abstract
Intrinsically disordered protein regions (IDRs) - regions that do not fold into a fixed three-dimensional structure but instead exist in a heterogeneous ensemble of conformations - have recently entered mainstream cell biology in the context of liquid-liquid phase separation (LLPS). IDRs are frequently found to be enriched in phase-separated compartments. Due to this observation, the presence of an IDR in a protein is frequently assumed to be diagnostic of its ability to phase separate. In this review, we clarify the role of IDRs in biological assembly and explore the physical principles through which amino acids can confer the attractive molecular interactions that underlie phase separation. While some disordered regions will robustly drive phase separation, many others will not. We emphasize that rather than 'disorder' driving phase separation, multivalency drives phase separation. As such, whether or not a disordered region is capable of driving phase separation will depend on the physical chemistry encoded within its amino acid sequence. Consequently, an in-depth understanding of that physical chemistry is a prerequisite to make informed inferences on how and why an IDR may be involved in phase separation or, more generally, in protein-mediated intermolecular interactions.
Collapse
|
16
|
What's all the phos about? Insights into the phosphorylation state of the RNA polymerase II C-terminal domain via mass spectrometry. RSC Chem Biol 2021; 2:1084-1095. [PMID: 34458825 PMCID: PMC8341212 DOI: 10.1039/d1cb00083g] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2021] [Accepted: 06/03/2021] [Indexed: 12/31/2022] Open
Abstract
RNA polymerase II (RNAP II) is one of the primary enzymes responsible for expressing protein-encoding genes and some small nuclear RNAs. The enigmatic carboxy-terminal domain (CTD) of RNAP II and its phosphorylation state are critically important in regulating transcription in vivo. Early methods of identifying phosphorylation on the CTD heptad were plagued by issues of low specificity and ambiguous signals. However, advancements in the field of mass spectrometry (MS) have presented the opportunity to gain new insights into well-studied processes as well as explore new frontiers in transcription. By using MS, residues which are modified within the CTD heptad and across repeats are now able to be pinpointed. Likewise, identification of kinase and phosphatase specificity towards residues of the CTD has reached a new level of accuracy. Now, MS is being used to investigate the crosstalk between modified residues of the CTD and may be a critical technique for understanding how phosphorylation plays a role in the new LLPS model of transcription. Herein, we discuss the development of various MS techniques and evaluate their capabilities. By highlighting the pros and cons of each technique, we aim to provide future investigators with a comprehensive overview of how MS can be used to investigate the complexities of RNAP-II mediated transcription.
Collapse
|
17
|
Simplicity is the Ultimate Sophistication-Crosstalk of Post-translational Modifications on the RNA Polymerase II. J Mol Biol 2021; 433:166912. [PMID: 33676925 PMCID: PMC8184622 DOI: 10.1016/j.jmb.2021.166912] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Revised: 02/23/2021] [Accepted: 02/26/2021] [Indexed: 12/19/2022]
Abstract
The highly conserved C-terminal domain (CTD) of the largest subunit of RNA polymerase II comprises a consensus heptad (Y1S2P3T4S5P6S7) repeated multiple times. Despite the simplicity of its sequence, the essential CTD domain orchestrates eukaryotic transcription and co-transcriptional processes, including transcription initiation, elongation, and termination, and mRNA processing. These distinct facets of the transcription cycle rely on specific post-translational modifications (PTM) of the CTD, in which five out of the seven residues in the heptad repeat are subject to phosphorylation. A hypothesis termed the "CTD code" has been proposed in which these PTMs and their combinations generate a sophisticated landscape for spatiotemporal recruitment of transcription regulators to Pol II. In this review, we summarize the recent experimental evidence understanding the biological role of the CTD, implicating a context-dependent theme that significantly enhances the ability of accurate transcription by RNA polymerase II. Furthermore, feedback communication between the CTD and histone modifications coordinates chromatin states with RNA polymerase II-mediated transcription, ensuring the effective and accurate conversion of information into cellular responses.
Collapse
|
18
|
Allosteric conformational ensembles have unlimited capacity for integrating information. eLife 2021; 10:65498. [PMID: 34106049 PMCID: PMC8189718 DOI: 10.7554/elife.65498] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2020] [Accepted: 04/30/2021] [Indexed: 12/24/2022] Open
Abstract
Integration of binding information by macromolecular entities is fundamental to cellular functionality. Recent work has shown that such integration cannot be explained by pairwise cooperativities, in which binding is modulated by binding at another site. Higher-order cooperativities (HOCs), in which binding is collectively modulated by multiple other binding events, appear to be necessary but an appropriate mechanism has been lacking. We show here that HOCs arise through allostery, in which effective cooperativity emerges indirectly from an ensemble of dynamically interchanging conformations. Conformational ensembles play important roles in many cellular processes but their integrative capabilities remain poorly understood. We show that sufficiently complex ensembles can implement any form of information integration achievable without energy expenditure, including all patterns of HOCs. Our results provide a rigorous biophysical foundation for analysing the integration of binding information through allostery. We discuss the implications for eukaryotic gene regulation, where complex conformational dynamics accompanies widespread information integration.
Collapse
|
19
|
Evaluating Spatiotemporal Dynamics of Phosphorylation of RNA Polymerase II Carboxy-Terminal Domain by Ultraviolet Photodissociation Mass Spectrometry. J Am Chem Soc 2021; 143:8488-8498. [PMID: 34053220 DOI: 10.1021/jacs.1c03321] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
The critical role of site-specific phosphorylation in eukaryotic transcription has motivated efforts to decipher the complex phosphorylation patterns exhibited by the carboxyl-terminal domain (CTD) of RNA polymerase II. Phosphorylation remains a challenging post-translational modification to characterize by mass spectrometry owing to the labile phosphate ester linkage and low stoichiometric prevalence, two features that complicate analysis by high-throughput MS/MS methods. Identifying phosphorylation sites represents one significant hurdle in decrypting the CTD phosphorylation, a problem exaggerated by a large number of potential phosphorylation sites. An even greater obstacle is decoding the dynamic phosphorylation pattern along the length of the periodic CTD sequence. Ultraviolet photodissociation (UVPD) is a high-energy ion activation method that provides ample backbone cleavages of peptides while preserving labile post-translational modifications that facilitate their confident localization. Herein, we report a quantitative parallel reaction monitoring (PRM) method developed to monitor spatiotemporal changes in site-specific Ser5 phosphorylation of the CTD by cyclin-dependent kinase 7 (CDK7) using UVPD for sequence identification, phosphosite localization, and differentiation of phosphopeptide isomers. We capitalize on the series of phospho-retaining fragment ions produced by UVPD to create unique transition lists that are pivotal for distinguishing the array of phosphopeptides generated from the CTD.
Collapse
|
20
|
Intrachain interaction topology can identify functionally similar intrinsically disordered proteins. Biophys J 2021; 120:1860-1868. [PMID: 33865811 DOI: 10.1016/j.bpj.2020.11.2282] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2020] [Revised: 10/17/2020] [Accepted: 11/19/2020] [Indexed: 01/06/2023] Open
Abstract
Functionally similar IDPs (intrinsically disordered proteins) often have little sequence similarity. This is in stark contrast to folded proteins and poses a challenge for the inverse problem, functional classification of IDPs using sequence alignment. The problem is further compounded because of the lack of structure in IDPs, preventing structural alignment as an alternate tool for classification. Recent advances in heteropolymer theory unveiled a powerful set of sequence-patterning metrics bridging molecular interaction with chain conformation. Focusing only on charge patterning, these set of metrics yield a sequence charge decoration matrix (SCDM). SCDMs can potentially identify functionally similar IDPs not apparent from sequence alignment alone. Here, we illustrate how these information-rich "molecular blueprints" encoded in SCDMs can be used for functional classification of IDPs with specific application in three protein families-Ste50, PSC, and RAM-in which electrostatics is known to be important. For both the Ste50 and PSC protein family, the set of metrics appropriately classifies proteins in functional and nonfunctional groups in agreement with experiment. Furthermore, our algorithm groups synthetic variants of the disordered RAM region of the Notch receptor protein-important in gene expression-in reasonable accordance with classification based on experimentally measured binding constants of RAM and transcription factor. Taken together, the novel classification scheme reveals the critical role of a high-dimensional set of metrics-manifest in self-interaction maps and topology-in functional annotation of IDPs even when there is low sequence homology, providing the much-needed alternate to a traditional sequence alignment tool.
Collapse
|
21
|
Stress-induced nuclear condensation of NELF drives transcriptional downregulation. Mol Cell 2021; 81:1013-1026.e11. [PMID: 33548202 PMCID: PMC7939545 DOI: 10.1016/j.molcel.2021.01.016] [Citation(s) in RCA: 70] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2020] [Revised: 10/20/2020] [Accepted: 01/11/2021] [Indexed: 12/21/2022]
Abstract
In response to stress, human cells coordinately downregulate transcription and translation of housekeeping genes. To downregulate transcription, the negative elongation factor (NELF) is recruited to gene promoters impairing RNA polymerase II elongation. Here we report that NELF rapidly forms nuclear condensates upon stress in human cells. Condensate formation requires NELF dephosphorylation and SUMOylation induced by stress. The intrinsically disordered region (IDR) in NELFA is necessary for nuclear NELF condensation and can be functionally replaced by the IDR of FUS or EWSR1 protein. We find that biomolecular condensation facilitates enhanced recruitment of NELF to promoters upon stress to drive transcriptional downregulation. Importantly, NELF condensation is required for cellular viability under stressful conditions. We propose that stress-induced NELF condensates reported here are nuclear counterparts of cytosolic stress granules. These two stress-inducible condensates may drive the coordinated downregulation of transcription and translation, likely forming a critical node of the stress survival strategy.
Collapse
|
22
|
Testing the length limit of loop grafting in a helical repeat protein. Curr Res Struct Biol 2020; 3:30-40. [PMID: 34235484 PMCID: PMC8244534 DOI: 10.1016/j.crstbi.2020.12.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2020] [Revised: 11/13/2020] [Accepted: 12/02/2020] [Indexed: 11/19/2022] Open
Abstract
Alpha-helical repeat proteins such as consensus-designed tetratricopeptide repeats (CTPRs) are exceptionally stable molecules that are able to tolerate destabilizing sequence alterations and are therefore becoming increasingly valued as a modular platform for biotechnology and biotherapeutic applications. A simple approach to functionalize the CTPR scaffold that we are pioneering is the insertion of short linear motifs (SLiMs) into the loops between adjacent repeats. Here, we test the limits of the scaffold by inserting 17 highly diverse amino acid sequences of up to 58 amino acids in length into a two-repeat protein and examine the impact on protein folding, stability and solubility. The sequences include three SLiMs that bind oncoproteins and eleven naturally occurring linker sequences all predicted to be intrinsically disordered but with conformational preferences ranging from compact globules to expanded coils. We show that the loop-grafted proteins retain the native CTPR structure and are thermally stable with melting temperatures above 60 °C, despite the longest loop sequence being almost the same size as the CTPR scaffold itself (68 amino acids). Although the main determinant of the effect of stability was found to be loop length and was relatively insensitive to amino acid composition, the relationship between protein solubility and the loop sequences was more complex, with the presence of negatively charged amino acids enhancing the solubility. Our findings will help us to fully realize the potential of the repeat-protein scaffold, allowing a rational design approach to create artificial modular proteins with customized functional capabilities.
Collapse
Key Words
- CD, circular dichroism
- CTPRs, consensus-designed tetratricopeptide repeats
- FCR, fraction of charged residues
- IDPs, intrinsically disordered proteins
- IDRs, intrinsically disordered regions
- Intrinsically disordered protein
- Intrinsically disordered region
- NCPR, net charge per residue
- PBIP1, polo-box interacting protein 1
- Peptide grafting
- SLiMs, short linear motifs
- TBP, tankyrase-binding peptides
- Tandem-repeat protein
- Tetratricopeptide repeat
- ves, effective solvation volume
Collapse
|
23
|
Intrinsic Disorder in the T Cell Receptor Creates Cooperativity and Controls ZAP70 Binding. Biophys J 2020; 120:379-392. [PMID: 33285117 PMCID: PMC7840419 DOI: 10.1016/j.bpj.2020.11.2266] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Revised: 10/24/2020] [Accepted: 11/19/2020] [Indexed: 12/31/2022] Open
Abstract
Many immunoreceptors have cytoplasmic domains that are intrinsically disordered (i.e., have high configurational entropy), have multiple sites of posttranslational modification (e.g., tyrosine phosphorylation), and participate in nonlinear signaling pathways (e.g., exhibiting switch-like behavior). Several hypotheses to explain the origin of these nonlinearities fall under the broad hypothesis that modification at one site changes the immunoreceptor’s entropy, which in turn changes further modification dynamics. Here, we use coarse-grain simulation to study three scenarios, all related to the chains that constitute the T cell receptor (TCR). We find that first, if phosphorylation induces local changes in the flexibility of the TCR ζ-chain, this naturally leads to rate enhancements and cooperativity. Second, we find that TCR CD3ɛ can provide a switch by modulating its residence in the plasma membrane. By constraining our model to be consistent with the previous observation that both basic residues and phosphorylation control membrane residence, we find that there is only a moderate rate enhancement of 10% between first and subsequent phosphorylation events. Third, we find that volume constraints do not limit the number of ZAP70s that can bind the TCR but that entropic penalties lead to a 200-fold decrease in binding rate by the seventh ZAP70, potentially explaining the observation that each TCR has around six ZAP70 molecules bound after receptor triggering. In all three scenarios, our results demonstrate that phenomena that change an immunoreceptor chain’s entropy (stiffening, confinement to a membrane, and multiple simultaneous binding) can lead to nonlinearities (rate enhancement, switching, and negative cooperativity) in how the receptor participates in signaling. These polymer-entropy-driven nonlinearities may augment the nonlinearities that arise from, e.g., kinetic proofreading and cluster formation. They also suggest different design strategies for engineered receptors, e.g., whether or not to put signaling modules on one chain or multiple clustered chains.
Collapse
|
24
|
RNA Pol II Length and Disorder Enable Cooperative Scaling of Transcriptional Bursting. Mol Cell 2020; 79:207-220.e8. [DOI: 10.1016/j.molcel.2020.05.030] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2019] [Revised: 04/09/2020] [Accepted: 05/19/2020] [Indexed: 12/15/2022]
|
25
|
Electrophoretic Mobility Shift Assay of in vitro Phosphorylated RNA Polymerase II Carboxyl-terminal Domain Substrates. Bio Protoc 2020; 10:e3648. [PMID: 33659319 DOI: 10.21769/bioprotoc.3648] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2019] [Revised: 04/29/2020] [Accepted: 04/29/2020] [Indexed: 11/02/2022] Open
Abstract
Eukaryotic RNA polymerase II transcribes all protein-coding mRNAs and is highly regulated. A key mechanism directing RNA polymerase II and facilitating the co-transcriptional processing of mRNAs is the phosphorylation of its highly repetitive carboxyl-terminal domain (CTD) of its largest subunit, RPB1, at specific residues. A variety of techniques exist to identify and quantify the degree of CTD phosphorylation, including phosphorylation-specific antibodies and mass spectrometry. Electrophoretic mobility shift assays (EMSAs) have been utilized since the discovery of CTD phosphorylation and continue to represent a simple, direct, and widely applicable approach for qualitatively monitoring CTD phosphorylation. We present a standardized method for EMSA analysis of recombinant GST-CTD substrates phosphorylated by a variety of CTD kinases. Strategies to analyze samples under both denatured/reduced and semi-native conditions are provided. This method represents a simple, direct, and reproducible means to monitor CTD phosphorylation in recombinant substrates utilizing equipment common to molecular biology labs and readily applicable to downstream analyses including immunoblotting and mass spectrometry.
Collapse
|
26
|
Abstract
The production of mRNA is a dynamic process that is highly regulated by reversible post-translational modifications of the C-terminal domain (CTD) of RNA polymerase II. The CTD is a highly repetitive domain consisting mostly of the consensus heptad sequence Tyr1-Ser2-Pro3-Thr4-Ser5-Pro6-Ser7. Phosphorylation of serine residues within this repeat sequence is well studied, but modifications of all residues have been described. Here, we focus on integrating newly identified and lesser-studied CTD post-translational modifications into the existing framework. We also review the growing body of work demonstrating crosstalk between different CTD modifications and the functional consequences of such crosstalk on the dynamics of transcriptional regulation.
Collapse
|
27
|
Physical basis of the disorder-order transition. Arch Biochem Biophys 2020; 685:108305. [DOI: 10.1016/j.abb.2020.108305] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2019] [Revised: 02/10/2020] [Accepted: 02/14/2020] [Indexed: 12/29/2022]
|
28
|
Abstract
Intrinsically disordered proteins (IDPs) can adopt a range of conformations from globules to swollen coils. This large range of conformational preferences for different IDPs raises the question of how conformational preferences are encoded by sequence. Global compositional features of a sequence such as the fraction of charged residues and the net charge per residue engender certain conformational biases. However, more specific sequence features such as the patterning of oppositely charged residues, expansion driving residues, or residues that can undergo posttranslational modifications can also influence the conformational ensembles of an IDP. Here, we outline how to calculate important global compositional features and patterning metrics that can be used to classify IDPs into different conformational classes and predict relative changes in conformation for sequences with the same amino acid composition. Although increased effort has been devoted to determining conformational properties of IDPs in recent years, quantitative predictions of conformation directly from sequence remain difficult and often inaccurate. Thus, if quantitative predictions of conformational properties are desired, then sequence-specific simulations must be performed.
Collapse
|
29
|
Abstract
Intrinsically disordered proteins and protein regions are ubiquitous across eukaryotic proteomes where they play a range of functional roles. Unlike folded proteins, IDRs lack a well-defined native state but exist in heterogeneous ensembles of conformations. In the absence of a defined native state, structure-guided mutations to test specific mechanistic hypotheses are generally not possible. Despite this, the use of mutations to alter sequence properties has become a relatively common approach for teasing out the relationship between sequence, ensemble, and function. A key step in designing informative mutants is the ability to identify specific sequence features that may reveal an interpretable response if perturbed. Here, we provide guidance on using the CIDER and localCIDER tools for amino acid sequence analysis, with a focus on building intuition with respect to the most commonly described features.
Collapse
|
30
|
The Significance of the Intrinsically Disordered Regions for the Functions of the bHLH Transcription Factors. Int J Mol Sci 2019; 20:E5306. [PMID: 31653121 PMCID: PMC6862971 DOI: 10.3390/ijms20215306] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Revised: 10/22/2019] [Accepted: 10/22/2019] [Indexed: 11/17/2022] Open
Abstract
The bHLH proteins are a family of eukaryotic transcription factors regulating expression of a wide range of genes involved in cell differentiation and development. They contain the Helix-Loop-Helix (HLH) domain, preceded by a stretch of basic residues, which are responsible for dimerization and binding to E-box sequences. In addition to the well-preserved DNA-binding bHLH domain, these proteins may contain various additional domains determining the specificity of performed transcriptional regulation. According to this, the family has been divided into distinct classes. Our aim was to emphasize the significance of existing disordered regions within the bHLH transcription factors for their functionality. Flexible, intrinsically disordered regions containing various motives and specific sequences allow for multiple interactions with transcription co-regulators. Also, based on in silico analysis and previous studies, we hypothesize that the bHLH proteins have a general ability to undergo spontaneous phase separation, forming or participating into liquid condensates which constitute functional centers involved in transcription regulation. We shortly introduce recent findings on the crucial role of the thermodynamically liquid-liquid driven phase separation in transcription regulation by disordered regions of regulatory proteins. We believe that further experimental studies should be performed in this field for better understanding of the mechanism of gene expression regulation (among others regarding oncogenes) by important and linked to many diseases the bHLH transcription factors.
Collapse
|
31
|
Tyr1 phosphorylation promotes phosphorylation of Ser2 on the C-terminal domain of eukaryotic RNA polymerase II by P-TEFb. eLife 2019; 8:48725. [PMID: 31385803 PMCID: PMC6715403 DOI: 10.7554/elife.48725] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2019] [Accepted: 08/05/2019] [Indexed: 12/18/2022] Open
Abstract
The Positive Transcription Elongation Factor b (P-TEFb) phosphorylates Ser2 residues of the C-terminal domain (CTD) of the largest subunit (RPB1) of RNA polymerase II and is essential for the transition from transcription initiation to elongation in vivo. Surprisingly, P-TEFb exhibits Ser5 phosphorylation activity in vitro. The mechanism garnering Ser2 specificity to P-TEFb remains elusive and hinders understanding of the transition from transcription initiation to elongation. Through in vitro reconstruction of CTD phosphorylation, mass spectrometry analysis, and chromatin immunoprecipitation sequencing (ChIP-seq) analysis, we uncover a mechanism by which Tyr1 phosphorylation directs the kinase activity of P-TEFb and alters its specificity from Ser5 to Ser2. The loss of Tyr1 phosphorylation causes an accumulation of RNA polymerase II in the promoter region as detected by ChIP-seq. We demonstrate the ability of Tyr1 phosphorylation to generate a heterogeneous CTD modification landscape that expands the CTD’s coding potential. These findings provide direct experimental evidence for a combinatorial CTD phosphorylation code wherein previously installed modifications direct the identity and abundance of subsequent coding events by influencing the behavior of downstream enzymes. DNA contains the instructions for making proteins, which build and maintain our cells. So that the information encoded in DNA can be used, a molecular machine called RNA polymerase II makes copies of specific genes. These copies, in the form of a molecule called RNA, convey the instructions for making proteins to the rest of the cell. To ensure that RNA polymerase II copies the correct genes at the correct time, a group of regulatory proteins are needed to control its activity. Many of these proteins interact with RNA polymerase II at a region known as the C-terminal domain, or CTD for short. For example, before RNA polymerase can make a full copy of a gene, a small molecule called a phosphate group must first be added to CTD at specific units known as Ser2. The regulatory protein P-TEFb was thought to be responsible for phosphorylating Ser2. However, it was previously not known how P-TEFb added this phosphate group, and why it did not also add phosphate groups to other positions in the CTD domain that are structurally similar to Ser2. To investigate this, Mayfield, Irani et al. mixed the CTD domain with different regulatory proteins, and used various biochemical approaches to examine which specific positions of the domain had phosphate groups attached. These experiments revealed a previously unknown aspect of P-TEFb activity: its specificity for Ser2 increased dramatically if a different regulatory protein first added a phosphate group to a nearby location in CTD. This additional phosphate group directed P-TEFb to then add its phosphate specifically at Ser2. To confirm the activity of this mechanism in living human cells, Mayfield, Irani et al. used a drug that prevented the first phosphate from being added. In the drug treated cells, RNA polymerase II was found more frequently ‘stalled’ at positions on the DNA just before a gene starts. This suggests that living cells needs this two-phosphate code system in order for RNA polymerase II to progress and make copies of specific genes. These results are a step forward in understanding the complex control mechanisms cells use to make proteins from their DNA. Moreover, the model presented here – one phosphate addition priming a second specific phosphate addition – provides a template that may underlie similar regulatory processes.
Collapse
|
32
|
Abstract
In this issue of Molecular Cell,Sharma et al. (2019) show that normal cell growth requires conversion of an arginine residue in the RNA polymerase II C-terminal domain (CTD) to citrulline, uncovering a potential regulatory pathway involving opposing arginine modifications.
Collapse
|
33
|
Efficient and robust preparation of tyrosine phosphorylated intrinsically disordered proteins. Biotechniques 2019; 67:16-22. [PMID: 31092000 DOI: 10.2144/btn-2019-0033] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Intrinsically disordered proteins (IDPs) are subject to post-translational modifications. This allows the same polypeptide to be involved in different interaction networks with different consequences, ranging from regulatory signalling networks to the formation of membrane-less organelles. We report a robust method for co-expression of modification enzyme and SUMO-tagged IDPs with a subsequent purification procedure that allows for the production of modified IDP. The robustness of our protocol is demonstrated using a challenging system: RNA polymerase II C-terminal domain (CTD); that is, a low-complexity repetitive region with multiple phosphorylation sites. In vitro phosphorylation approaches fail to yield multiple-site phosphorylated CTD, whereas our in vivo protocol allows the rapid production of near homogeneous phosphorylated CTD at a low cost. These samples can be used in functional and structural studies.
Collapse
|
34
|
Genetic analysis of the RNA polymerase II CTD in Drosophila. Methods 2019; 159-160:129-137. [PMID: 30684537 PMCID: PMC6589110 DOI: 10.1016/j.ymeth.2019.01.010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2018] [Revised: 01/18/2019] [Accepted: 01/21/2019] [Indexed: 02/03/2023] Open
Abstract
The Carboxy-terminal Domain (CTD) of RNA polymerase II (Pol II) plays essential roles in regulating gene expression in eukaryotes. Here, we describe multiple genetic approaches for studying the CTD in Drosophila that complement pre-existing molecular analyses of the Pol II CTD in other experimental models. These approaches will allow one to assess the effects of any CTD mutations in a developmentally complex organism. The approaches discussed in this work can in principle, be applied to analyze other transcription components in eukaryotes.
Collapse
|
35
|
The C-Terminal Domain of RNA Polymerase II Is a Multivalent Targeting Sequence that Supports Drosophila Development with Only Consensus Heptads. Mol Cell 2019; 73:1232-1242.e4. [PMID: 30765194 DOI: 10.1016/j.molcel.2019.01.008] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2018] [Revised: 12/03/2018] [Accepted: 01/04/2019] [Indexed: 12/29/2022]
Abstract
The C-terminal domain (CTD) of RNA polymerase II (Pol II) is composed of repeats of the consensus YSPTSPS and is an essential binding scaffold for transcription-associated factors. Metazoan CTDs have well-conserved lengths and sequence compositions arising from the evolution of divergent motifs, features thought to be essential for development. On the contrary, we show that a truncated CTD composed solely of YSPTSPS repeats supports Drosophila viability but that a CTD with enough YSPTSPS repeats to match the length of the wild-type Drosophila CTD is defective. Furthermore, a fluorescently tagged CTD lacking the rest of Pol II dynamically enters transcription compartments, indicating that the CTD functions as a signal sequence. However, CTDs with too many YSPTSPS repeats are more prone to localize to static nuclear foci separate from the chromosomes. We propose that the sequence complexity of the CTD offsets aberrant behavior caused by excessive repetitive sequences without compromising its targeting function.
Collapse
|
36
|
Balanced between order and disorder: a new phase in transcription elongation control and beyond. Transcription 2019; 10:157-163. [PMID: 30663929 DOI: 10.1080/21541264.2019.1570812] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
We recently reported that the cyclin T1 histidine-rich domain creates a phase-separated environment to promote hyperphosphorylation of RNA polymerase II C-terminal domain and robust transcriptional elongation by P-TEFb. Here, we discuss this and several other recent discoveries to demonstrate that phase separation is important for controlling various aspects of transcription.
Collapse
|
37
|
RNA polymerase II clustering through carboxy-terminal domain phase separation. Nat Struct Mol Biol 2018; 25:833-840. [PMID: 30127355 DOI: 10.1038/s41594-018-0112-y] [Citation(s) in RCA: 359] [Impact Index Per Article: 59.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2018] [Accepted: 07/17/2018] [Indexed: 12/25/2022]
Abstract
The carboxy-terminal domain (CTD) of RNA polymerase (Pol) II is an intrinsically disordered low-complexity region that is critical for pre-mRNA transcription and processing. The CTD consists of hepta-amino acid repeats varying in number from 52 in humans to 26 in yeast. Here we report that human and yeast CTDs undergo cooperative liquid phase separation, with the shorter yeast CTD forming less-stable droplets. In human cells, truncation of the CTD to the length of the yeast CTD decreases Pol II clustering and chromatin association, whereas CTD extension has the opposite effect. CTD droplets can incorporate intact Pol II and are dissolved by CTD phosphorylation with the transcription initiation factor IIH kinase CDK7. Together with published data, our results suggest that Pol II forms clusters or hubs at active genes through interactions between CTDs and with activators and that CTD phosphorylation liberates Pol II enzymes from hubs for promoter escape and transcription elongation.
Collapse
|
38
|
Sequence-to-Conformation Relationships of Disordered Regions Tethered to Folded Domains of Proteins. J Mol Biol 2018; 430:2403-2421. [DOI: 10.1016/j.jmb.2018.05.012] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2018] [Revised: 04/16/2018] [Accepted: 05/07/2018] [Indexed: 12/20/2022]
|
39
|
Abstract
Proteins can collapse into compact globules or form expanded, solvent-accessible, coil-like conformations. Additionally, they can fold into well-defined three-dimensional structures or remain partially or entirely disordered. Recent discoveries have shown that the tendency for proteins to collapse or remain expanded is not intrinsically coupled to their ability to fold. These observations suggest that proteins do not have to form compact globules in aqueous solutions. They can be intrinsically disordered, collapsed, or expanded, and even form well-folded, elongated structures. This ability to decouple collapse from folding is determined by the sequence details of proteins. In this review, we highlight insights gleaned from studies over the past decade. Using a polymer physics framework, we explain how the interplay among sidechains, backbone units, and solvent determines the driving forces for collapsed versus expanded states in aqueous solvents.
Collapse
|
40
|
Ovule identity mediated by pre-mRNA processing in Arabidopsis. PLoS Genet 2018; 14:e1007182. [PMID: 29329291 PMCID: PMC5785034 DOI: 10.1371/journal.pgen.1007182] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2017] [Revised: 01/25/2018] [Accepted: 01/02/2018] [Indexed: 11/18/2022] Open
Abstract
Ovules are fundamental for plant reproduction and crop yield as they are the precursors of seeds. Therefore, ovule specification is a critical developmental program. In Arabidopsis thaliana, ovule identity is redundantly conferred by the homeotic D-class genes SHATTERPROOF1 (SHP1), SHP2 and SEEDSTICK (STK), phylogenetically related to the MADS-domain regulatory gene AGAMOUS (AG), essential in floral organ specification. Previous studies have shown that the HUA-PEP activity, comprised of a suite of RNA-binding protein (RBP) encoding genes, regulates AG pre-mRNA processing and thus flower patterning and organ identity. Here, we report that the HUA-PEP activity additionally governs ovule morphogenesis. Accordingly, in severe hua-pep backgrounds ovules transform into flower organ-like structures. These homeotic transformations are most likely due to the dramatic reduction in SHP1, SHP2 and STK activity. Our molecular and genome-wide profiling strategies revealed the accumulation of prematurely terminated transcripts of D-class genes in hua-pep mutants and reduced amounts of their respective functional messengers, which points to pre-mRNA processing misregulation as the origin of the ovule developmental defects in such backgrounds. RNA processing and transcription are coordinated by the RNA polymerase II (RNAPII) carboxyl-terminal domain (CTD). Our results show that HUA-PEP activity members can interact with the CTD regulator C-TERMINAL DOMAIN PHOSPHATASE-LIKE1 (CPL1), supporting a co-transcriptional mode of action for the HUA-PEP activity. Our findings expand the portfolio of reproductive developmental programs in which HUA-PEP activity participates, and further substantiates the importance of RNA regulatory mechanisms (pre-mRNA co-transcriptional regulation) for correct gene expression during plant morphogenesis.
Collapse
|
41
|
Substrate Specificity of the Kinase P-TEFb towards the RNA Polymerase II C-Terminal Domain. Biophys J 2017; 113:1909-1911. [DOI: 10.1016/j.bpj.2017.09.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2017] [Revised: 08/24/2017] [Accepted: 09/11/2017] [Indexed: 10/18/2022] Open
|
42
|
Lysines in the RNA Polymerase II C-Terminal Domain Contribute to TAF15 Fibril Recruitment. Biochemistry 2017; 57:2549-2563. [PMID: 28945358 DOI: 10.1021/acs.biochem.7b00310] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Many cancer-causing chromosomal translocations result in transactivating protein products encoding FET family (FUS, EWSR1, TAF15) low-complexity (LC) domains fused to a DNA binding domain from one of several transcription factors. Recent work demonstrates that higher-order assemblies of FET LC domains bind the carboxy-terminal domain of the large subunit of RNA polymerase II (RNA pol II CTD), suggesting FET oncoproteins may mediate aberrant transcriptional activation by recruiting RNA polymerase II to promoters of target genes. Here we use nuclear magnetic resonance (NMR) spectroscopy and hydrogel fluorescence microscopy localization and fluorescence recovery after photobleaching to visualize atomic details of a model of this process, interactions of RNA pol II CTD with high-molecular weight TAF15 LC assemblies. We report NMR resonance assignments of the intact degenerate repeat half of human RNA pol II CTD alone and verify its predominant intrinsic disorder by molecular simulation. By measuring NMR spin relaxation and dark-state exchange saturation transfer, we characterize the interaction of RNA pol II CTD with amyloid-like hydrogel fibrils of TAF15 and hnRNP A2 LC domains and observe that heptads far from the acidic C-terminal tail of RNA pol II CTD bind TAF15 fibrils most avidly. Mutation of CTD lysines in heptad position 7 to consensus serines reduced the overall level of TAF15 fibril binding, suggesting that electrostatic interactions contribute to complex formation. Conversely, mutations of position 7 asparagine residues and truncation of the acidic tail had little effect. Thus, weak, multivalent interactions between TAF15 fibrils and heptads throughout RNA pol II CTD collectively mediate complex formation.
Collapse
|
43
|
Abstract
RNA polymerase II contains a long C-terminal domain (CTD) that regulates interactions at the site of transcription. The CTD architecture remains poorly understood due to its low sequence complexity, dynamic phosphorylation patterns, and structural variability. We used integrative structural biology to visualize the architecture of the CTD in complex with Rtt103, a 3'-end RNA-processing and transcription termination factor. Rtt103 forms homodimers via its long coiled-coil domain and associates densely on the repetitive sequence of the phosphorylated CTD via its N-terminal CTD-interacting domain. The CTD-Rtt103 association opens the compact random coil structure of the CTD, leading to a beads-on-a-string topology in which the long rod-shaped Rtt103 dimers define the topological and mobility restraints of the entire assembly. These findings underpin the importance of the structural plasticity of the CTD, which is templated by a particular set of CTD-binding proteins.
Collapse
|