1
|
Stroup EK, Ji Z. Delineating yeast cleavage and polyadenylation signals using deep learning. Genome Res 2024; 34:1066-1080. [PMID: 38914436 PMCID: PMC11368178 DOI: 10.1101/gr.278606.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Accepted: 06/17/2024] [Indexed: 06/26/2024]
Abstract
3'-end cleavage and polyadenylation is an essential process for eukaryotic mRNA maturation. In yeast species, the polyadenylation signals that recruit the processing machinery are degenerate and remain poorly characterized compared with the well-defined regulatory elements in mammals. Here we address this issue by developing deep learning models to deconvolute degenerate cis-regulatory elements and quantify their positional importance in mediating yeast poly(A) site formation, cleavage heterogeneity, and strength. In S. cerevisiae, cleavage heterogeneity is promoted by the depletion of U-rich elements around poly(A) sites as well as multiple occurrences of upstream UA-rich elements. Sites with high cleavage heterogeneity show overall lower strength. The site strength and tandem site distances modulate alternative polyadenylation (APA) under the diauxic stress. Finally, we develop a deep learning model to reveal the distinct motif configuration of S. pombe poly(A) sites, which show more precise cleavage than S. cerevisiae Altogether, our deep learning models provide unprecedented insights into poly(A) site formation of yeast species, and our results highlight divergent poly(A) signals across distantly related species.
Collapse
Affiliation(s)
- Emily Kunce Stroup
- Department of Pharmacology, Feinberg School of Medicine, Northwestern University, Chicago, Illinois 60611, USA
| | - Zhe Ji
- Department of Pharmacology, Feinberg School of Medicine, Northwestern University, Chicago, Illinois 60611, USA;
- Department of Biomedical Engineering, McCormick School of Engineering, Northwestern University, Evanston, Illinois 60628, USA
| |
Collapse
|
2
|
Murari E, Meadows D, Cuda N, Mangone M. A comprehensive analysis of 3'UTRs in Caenorhabditis elegans. Nucleic Acids Res 2024; 52:7523-7538. [PMID: 38917330 PMCID: PMC11260456 DOI: 10.1093/nar/gkae543] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Revised: 04/29/2024] [Accepted: 06/11/2024] [Indexed: 06/27/2024] Open
Abstract
3'Untranslated regions (3'UTRs) are essential portions of genes containing elements necessary for pre-mRNA 3'end processing and are involved in post-transcriptional gene regulation. Despite their importance, they remain poorly characterized in eukaryotes. Here, we have used a multi-pronged approach to extract and curate 3'UTR data from 11533 publicly available datasets, corresponding to the entire collection of Caenorhabditis elegans transcriptomes stored in the NCBI repository from 2009 to 2023. We have also performed high throughput cloning pipelines to identify and validate rare 3'UTR isoforms and incorporated and manually curated 3'UTR isoforms from previously published datasets. This updated C. elegans 3'UTRome (v3) is the most comprehensive resource in any metazoan to date, covering 97.4% of the 20362 experimentally validated protein-coding genes with refined and updated 3'UTR boundaries for 23489 3'UTR isoforms. We also used this novel dataset to identify and characterize sequence elements involved in pre-mRNA 3'end processing and update miRNA target predictions. This resource provides important insights into the 3'UTR formation, function, and regulation in eukaryotes.
Collapse
Affiliation(s)
- Emma Murari
- The Biodesign Institute at Arizona State University, 1001 S McAllister Ave, Tempe, AZ, USA
- School of Life Sciences, Arizona State University, 427 E Tyler Mall, Tempe, AZ, USA
| | - Dalton Meadows
- The Biodesign Institute at Arizona State University, 1001 S McAllister Ave, Tempe, AZ, USA
- School of Life Sciences, Arizona State University, 427 E Tyler Mall, Tempe, AZ, USA
| | - Nicholas Cuda
- The Biodesign Institute at Arizona State University, 1001 S McAllister Ave, Tempe, AZ, USA
- School of Life Sciences, Arizona State University, 427 E Tyler Mall, Tempe, AZ, USA
| | - Marco Mangone
- The Biodesign Institute at Arizona State University, 1001 S McAllister Ave, Tempe, AZ, USA
| |
Collapse
|
3
|
Stroup EK, Ji Z. Delineating yeast cleavage and polyadenylation signals using deep learning. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.10.561764. [PMID: 37873420 PMCID: PMC10592759 DOI: 10.1101/2023.10.10.561764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
3'-end cleavage and polyadenylation is an essential process for eukaryotic mRNA maturation. In yeast species, the polyadenylation signals that recruit the processing machinery are degenerate and remain poorly characterized compared to well-defined regulatory elements in mammals. Especially, recent deep sequencing experiments showed extensive cleavage heterogeneity for some mRNAs in Saccharomyces cerevisiae and uncovered the polyA motif differences between S. cerevisiae vs. Schizosaccharomyces pombe . The findings raised the fundamental question of how polyadenylation signals are formed in yeast. Here we addressed this question by developing deep learning models to deconvolute degenerate cis -regulatory elements and quantify their positional importance in mediating yeast polyA site formation, cleavage heterogeneity, and strength. In S. cerevisiae , cleavage heterogeneity is promoted by the depletion of U-rich elements around polyA sites as well as multiple occurrences of upstream UA-rich elements. Sites with high cleavage heterogeneity show overall lower strength. The site strength and tandem site distances modulate alternative polyadenylation (APA) under the diauxic stress. Finally, we developed a deep learning model to reveal the distinct motif configuration of S. pombe polyA sites which show more precise cleavage than S. cerevisiae . Altogether, our deep learning models provide unprecedented insights into polyA site formation across yeast species.
Collapse
|
4
|
Li J, Querl L, Coban I, Salinas G, Krebber H. Surveillance of 3' mRNA cleavage during transcription termination requires CF IB/Hrp1. Nucleic Acids Res 2023; 51:8758-8773. [PMID: 37351636 PMCID: PMC10484732 DOI: 10.1093/nar/gkad530] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Revised: 05/31/2023] [Accepted: 06/07/2023] [Indexed: 06/24/2023] Open
Abstract
CF IB/Hrp1 is part of the cleavage and polyadenylation factor (CPF) and cleavage factor (CF) complex (CPF-CF), which is responsible for 3' cleavage and maturation of pre-mRNAs. Although Hrp1 supports this process, its presence is not essential for the cleavage event. Here, we show that the main function of Hrp1 in the CPF-CF complex is the nuclear mRNA quality control of proper 3' cleavage. As such, Hrp1 acts as a nuclear mRNA retention factor that hinders transcripts from leaving the nucleus until processing is completed. Only after proper 3' cleavage, which is sensed through contacting Rna14, Hrp1 recruits the export receptor Mex67, allowing nuclear export. Consequently, its absence results in the leakage of elongated mRNAs into the cytoplasm. If cleavage is defective, the presence of Hrp1 on the mRNA retains these elongated transcripts until they are eliminated by the nuclear exosome. Together, we identify Hrp1 as the key quality control factor for 3' cleavage.
Collapse
Affiliation(s)
- Jing Li
- Abteilung für Molekulare Genetik, Institut für Mikrobiologie und Genetik, Göttinger Zentrum für Molekulare Biowissenschaften (GZMB), Georg-August Universität Göttingen, D-37075 Göttingen, Germany
| | - Luisa Querl
- Abteilung für Molekulare Genetik, Institut für Mikrobiologie und Genetik, Göttinger Zentrum für Molekulare Biowissenschaften (GZMB), Georg-August Universität Göttingen, D-37075 Göttingen, Germany
| | - Ivo Coban
- Abteilung für Molekulare Genetik, Institut für Mikrobiologie und Genetik, Göttinger Zentrum für Molekulare Biowissenschaften (GZMB), Georg-August Universität Göttingen, D-37075 Göttingen, Germany
| | - Gabriela Salinas
- NGS-Serviceeinrichtung für Integrative Genomik (NIG), Institut für Humangenetik, Universitätsmedizin Göttingen, D-37075 Göttingen, Germany
| | - Heike Krebber
- Abteilung für Molekulare Genetik, Institut für Mikrobiologie und Genetik, Göttinger Zentrum für Molekulare Biowissenschaften (GZMB), Georg-August Universität Göttingen, D-37075 Göttingen, Germany
| |
Collapse
|
5
|
Chaves-Arquero B, Martínez-Lumbreras S, Camero S, Santiveri CM, Mirassou Y, Campos-Olivas R, Jiménez MÁ, Calvo O, Pérez-Cañadillas JM. Structural basis of Nrd1-Nab3 heterodimerization. Life Sci Alliance 2022; 5:5/4/e202101252. [PMID: 35022249 PMCID: PMC8761494 DOI: 10.26508/lsa.202101252] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Revised: 12/14/2021] [Accepted: 12/15/2021] [Indexed: 11/25/2022] Open
Abstract
The NMR structure of an Nrd1–Nab3 chimera describes the structural bases of Nrd1/Nab3 heterodimerization. Nrd1 embraces a bundle of helices in Nab3, building a large interface. Key mutations at that interface compromise cell fitness. Heterodimerization of RNA binding proteins Nrd1 and Nab3 is essential to communicate the RNA recognition in the nascent transcript with the Nrd1 recognition of the Ser5-phosphorylated Rbp1 C-terminal domain in RNA polymerase II. The structure of a Nrd1–Nab3 chimera reveals the basis of heterodimerization, filling a missing gap in knowledge of this system. The free form of the Nrd1 interaction domain of Nab3 (NRID) forms a multi-state three-helix bundle that is clamped in a single conformation upon complex formation with the Nab3 interaction domain of Nrd1 (NAID). The latter domain forms two long helices that wrap around NRID, resulting in an extensive protein–protein interface that would explain the highly favorable free energy of heterodimerization. Mutagenesis of some conserved hydrophobic residues involved in the heterodimerization leads to temperature-sensitive phenotypes, revealing the importance of this interaction in yeast cell fitness. The Nrd1–Nab3 structure resembles the previously reported Rna14/Rna15 heterodimer structure, which is part of the poly(A)-dependent termination pathway, suggesting that both machineries use similar structural solutions despite they share little sequence homology and are potentially evolutionary divergent.
Collapse
Affiliation(s)
- Belén Chaves-Arquero
- Departamento de Química-Física Biológica, Instituto de Química-Física "Rocasolano" (IQFR), Consejo Superior de Investigaciones Científicas (CSIC), Madrid, Spain.,Research Department of Structural and Molecular Biology, University College London, London, UK
| | - Santiago Martínez-Lumbreras
- Departamento de Química-Física Biológica, Instituto de Química-Física "Rocasolano" (IQFR), Consejo Superior de Investigaciones Científicas (CSIC), Madrid, Spain.,Institute of Structural Biology, Helmholtz Zentrum München, Neuherberg, Germany and Bavarian NMR Centre, Chemistry Department, Technical University of Munich, Garching, Germany
| | - Sergio Camero
- Departamento de Química-Física Biológica, Instituto de Química-Física "Rocasolano" (IQFR), Consejo Superior de Investigaciones Científicas (CSIC), Madrid, Spain
| | - Clara M Santiveri
- Spectroscopy and Nuclear Magnetic Resonance Unit, Structural Biology Programme, Spanish National Cancer Research Centre, Madrid, Spain
| | - Yasmina Mirassou
- Departamento de Química-Física Biológica, Instituto de Química-Física "Rocasolano" (IQFR), Consejo Superior de Investigaciones Científicas (CSIC), Madrid, Spain.,Centro Nacional de Análisis Genómico (CNAG)-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Ramón Campos-Olivas
- Spectroscopy and Nuclear Magnetic Resonance Unit, Structural Biology Programme, Spanish National Cancer Research Centre, Madrid, Spain
| | - Maria Ángeles Jiménez
- Departamento de Química-Física Biológica, Instituto de Química-Física "Rocasolano" (IQFR), Consejo Superior de Investigaciones Científicas (CSIC), Madrid, Spain
| | - Olga Calvo
- Instituto de Biología Funcional y Genómica, Consejo Superior de Investigaciones Científicas, Universidad de Salamanca, Salamanca, Spain
| | - José Manuel Pérez-Cañadillas
- Departamento de Química-Física Biológica, Instituto de Química-Física "Rocasolano" (IQFR), Consejo Superior de Investigaciones Científicas (CSIC), Madrid, Spain
| |
Collapse
|
6
|
Architectural and functional details of CF IA proteins involved in yeast 3'-end pre-mRNA processing and its significance for eukaryotes: A concise review. Int J Biol Macromol 2021; 193:387-400. [PMID: 34699898 DOI: 10.1016/j.ijbiomac.2021.10.129] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2021] [Revised: 10/04/2021] [Accepted: 10/18/2021] [Indexed: 11/22/2022]
Abstract
In eukaryotes, maturation of pre-mRNA relies on its precise 3'-end processing. This processing involves co-transcriptional steps regulated by sequence elements and other proteins. Although, it holds tremendous importance, defect in the processing machinery will result in erroneous pre-mRNA maturation leading to defective translation. Remarkably, more than 20 proteins in humans and yeast share homology and execute this processing. The defects in this processing are associated with various diseases in humans. We shed light on the CF IA subunit of yeast Saccharomyces cerevisiae that contains four proteins (Pcf11, Clp1, Rna14 and Rna15) involved in this processing. Structural details of various domains of CF IA and their roles during 3'-end processing, like cleavage and polyadenylation at 3'-UTR of pre-mRNA and other cellular events are explained. Further, the chronological development and important discoveries associated with 3'-end processing are summarized. Moreover, the mammalian homologues of yeast CF IA proteins, along with their key roles are described. This knowledge would be helpful for better comprehension of the mechanism associated with this marvel; thus opening up vast avenues in this area.
Collapse
|
7
|
Bruni F, Giancaspero TA, Oreb M, Tolomeo M, Leone P, Boles E, Roberti M, Caselle M, Barile M. Subcellular Localization of Fad1p in Saccharomyces cerevisiae: A Choice at Post-Transcriptional Level? Life (Basel) 2021; 11:967. [PMID: 34575116 PMCID: PMC8470081 DOI: 10.3390/life11090967] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Revised: 09/06/2021] [Accepted: 09/13/2021] [Indexed: 11/24/2022] Open
Abstract
FAD synthase is the last enzyme in the pathway that converts riboflavin into FAD. In Saccharomyces cerevisiae, the gene encoding for FAD synthase is FAD1, from which a sole protein product (Fad1p) is expected to be generated. In this work, we showed that a natural Fad1p exists in yeast mitochondria and that, in its recombinant form, the protein is able, per se, to both enter mitochondria and to be destined to cytosol. Thus, we propose that FAD1 generates two echoforms-that is, two identical proteins addressed to different subcellular compartments. To shed light on the mechanism underlying the subcellular destination of Fad1p, the 3' region of FAD1 mRNA was analyzed by 3'RACE experiments, which revealed the existence of (at least) two FAD1 transcripts with different 3'UTRs, the short one being 128 bp and the long one being 759 bp. Bioinformatic analysis on these 3'UTRs allowed us to predict the existence of a cis-acting mitochondrial localization motif, present in both the transcripts and, presumably, involved in protein targeting based on the 3'UTR context. Here, we propose that the long FAD1 transcript might be responsible for the generation of mitochondrial Fad1p echoform.
Collapse
Affiliation(s)
- Francesco Bruni
- Department of Biosciences, Biotechnologies and Biopharmaceutics, University of Bari Aldo Moro, Via Orabona 4, 70125 Bari, Italy; (F.B.); (T.A.G.); (M.T.); (P.L.); (M.R.)
| | - Teresa Anna Giancaspero
- Department of Biosciences, Biotechnologies and Biopharmaceutics, University of Bari Aldo Moro, Via Orabona 4, 70125 Bari, Italy; (F.B.); (T.A.G.); (M.T.); (P.L.); (M.R.)
| | - Mislav Oreb
- Institute of Molecular Biosciences, Goethe-University Frankfurt, Max-von-Laue-Str. 9, 60438 Frankfurt am Main, Germany; (M.O.); (E.B.)
| | - Maria Tolomeo
- Department of Biosciences, Biotechnologies and Biopharmaceutics, University of Bari Aldo Moro, Via Orabona 4, 70125 Bari, Italy; (F.B.); (T.A.G.); (M.T.); (P.L.); (M.R.)
| | - Piero Leone
- Department of Biosciences, Biotechnologies and Biopharmaceutics, University of Bari Aldo Moro, Via Orabona 4, 70125 Bari, Italy; (F.B.); (T.A.G.); (M.T.); (P.L.); (M.R.)
| | - Eckhard Boles
- Institute of Molecular Biosciences, Goethe-University Frankfurt, Max-von-Laue-Str. 9, 60438 Frankfurt am Main, Germany; (M.O.); (E.B.)
| | - Marina Roberti
- Department of Biosciences, Biotechnologies and Biopharmaceutics, University of Bari Aldo Moro, Via Orabona 4, 70125 Bari, Italy; (F.B.); (T.A.G.); (M.T.); (P.L.); (M.R.)
| | - Michele Caselle
- Physics Department, University of Turin and INFN, Via P. Giuria 1, 10125 Turin, Italy;
| | - Maria Barile
- Department of Biosciences, Biotechnologies and Biopharmaceutics, University of Bari Aldo Moro, Via Orabona 4, 70125 Bari, Italy; (F.B.); (T.A.G.); (M.T.); (P.L.); (M.R.)
| |
Collapse
|
8
|
Turner RE, Harrison PF, Swaminathan A, Kraupner-Taylor CA, Goldie BJ, See M, Peterson AL, Schittenhelm RB, Powell DR, Creek DJ, Dichtl B, Beilharz TH. Genetic and pharmacological evidence for kinetic competition between alternative poly(A) sites in yeast. eLife 2021; 10:65331. [PMID: 34232857 PMCID: PMC8263057 DOI: 10.7554/elife.65331] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Accepted: 06/22/2021] [Indexed: 01/23/2023] Open
Abstract
Most eukaryotic mRNAs accommodate alternative sites of poly(A) addition in the 3’ untranslated region in order to regulate mRNA function. Here, we present a systematic analysis of 3’ end formation factors, which revealed 3’UTR lengthening in response to a loss of the core machinery, whereas a loss of the Sen1 helicase resulted in shorter 3’UTRs. We show that the anti-cancer drug cordycepin, 3’ deoxyadenosine, caused nucleotide accumulation and the usage of distal poly(A) sites. Mycophenolic acid, a drug which reduces GTP levels and impairs RNA polymerase II (RNAP II) transcription elongation, promoted the usage of proximal sites and reversed the effects of cordycepin on alternative polyadenylation. Moreover, cordycepin-mediated usage of distal sites was associated with a permissive chromatin template and was suppressed in the presence of an rpb1 mutation, which slows RNAP II elongation rate. We propose that alternative polyadenylation is governed by temporal coordination of RNAP II transcription and 3’ end processing and controlled by the availability of 3’ end factors, nucleotide levels and chromatin landscape.
Collapse
Affiliation(s)
- Rachael Emily Turner
- Development and Stem Cells Program, Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, Australia
| | - Paul F Harrison
- Development and Stem Cells Program, Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, Australia.,Monash Bioinformatics Platform, Monash University, Melbourne, Australia
| | - Angavai Swaminathan
- Development and Stem Cells Program, Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, Australia
| | - Calvin A Kraupner-Taylor
- Development and Stem Cells Program, Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, Australia
| | - Belinda J Goldie
- Development and Stem Cells Program, Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, Australia
| | - Michael See
- Development and Stem Cells Program, Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, Australia.,Monash Bioinformatics Platform, Monash University, Melbourne, Australia
| | - Amanda L Peterson
- Drug Delivery, Disposition and Dynamics, Monash Institute of Pharmaceutical Sciences, Monash University, Parkville, Australia
| | - Ralf B Schittenhelm
- Monash Proteomics & Metabolomics Facility, Department of Biochemistry and Molecular Biology, Monash Biomedicine Discovery Institute, Monash University, Melbourne, Australia
| | - David R Powell
- Monash Bioinformatics Platform, Monash University, Melbourne, Australia
| | - Darren J Creek
- Drug Delivery, Disposition and Dynamics, Monash Institute of Pharmaceutical Sciences, Monash University, Parkville, Australia
| | - Bernhard Dichtl
- School of Life and Environmental Sciences, Deakin University, Geelong, Australia
| | - Traude H Beilharz
- Development and Stem Cells Program, Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, Australia
| |
Collapse
|
9
|
Hill CH, Boreikaitė V, Kumar A, Casañal A, Kubík P, Degliesposti G, Maslen S, Mariani A, von Loeffelholz O, Girbig M, Skehel M, Passmore LA. Activation of the Endonuclease that Defines mRNA 3' Ends Requires Incorporation into an 8-Subunit Core Cleavage and Polyadenylation Factor Complex. Mol Cell 2019; 73:1217-1231.e11. [PMID: 30737185 PMCID: PMC6436931 DOI: 10.1016/j.molcel.2018.12.023] [Citation(s) in RCA: 60] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2018] [Revised: 11/02/2018] [Accepted: 12/21/2018] [Indexed: 01/19/2023]
Abstract
Cleavage and polyadenylation factor (CPF/CPSF) is a multi-protein complex essential for formation of eukaryotic mRNA 3' ends. CPF cleaves pre-mRNAs at a specific site and adds a poly(A) tail. The cleavage reaction defines the 3' end of the mature mRNA, and thus the activity of the endonuclease is highly regulated. Here, we show that reconstitution of specific pre-mRNA cleavage with recombinant yeast proteins requires incorporation of the Ysh1 endonuclease into an eight-subunit "CPFcore" complex. Cleavage also requires the accessory cleavage factors IA and IB, which bind substrate pre-mRNAs and CPF, likely facilitating assembly of an active complex. Using X-ray crystallography, electron microscopy, and mass spectrometry, we determine the structure of Ysh1 bound to Mpe1 and the arrangement of subunits within CPFcore. Together, our data suggest that the active mRNA 3' end processing machinery is a dynamic assembly that is licensed to cleave only when all protein factors come together at the polyadenylation site.
Collapse
Affiliation(s)
- Chris H Hill
- MRC Laboratory of Molecular Biology, Cambridge CB2 0QH, UK
| | | | | | - Ana Casañal
- MRC Laboratory of Molecular Biology, Cambridge CB2 0QH, UK
| | - Peter Kubík
- MRC Laboratory of Molecular Biology, Cambridge CB2 0QH, UK
| | | | - Sarah Maslen
- MRC Laboratory of Molecular Biology, Cambridge CB2 0QH, UK
| | | | - Ottilie von Loeffelholz
- Centre for Integrative Biology, Department of Integrated Structural Biology, Institute of Genetics and of Molecular and Cellular Biology, Illkirch, Université de Strasbourg, Strasbourg, France; Centre National de la Recherche Scientifique UMR 7104, Illkirch, Université de Strasbourg, Strasbourg, France; INSERM U964, Illkirch, Université de Strasbourg, Strasbourg, France
| | - Mathias Girbig
- MRC Laboratory of Molecular Biology, Cambridge CB2 0QH, UK
| | - Mark Skehel
- MRC Laboratory of Molecular Biology, Cambridge CB2 0QH, UK
| | | |
Collapse
|
10
|
RNA Polymerase II Transcription Attenuation at the Yeast DNA Repair Gene, DEF1, Involves Sen1-Dependent and Polyadenylation Site-Dependent Termination. G3-GENES GENOMES GENETICS 2018; 8:2043-2058. [PMID: 29686108 PMCID: PMC5982831 DOI: 10.1534/g3.118.200072] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Termination of RNA Polymerase II (Pol II) activity serves a vital cellular role by separating ubiquitous transcription units and influencing RNA fate and function. In the yeast Saccharomyces cerevisiae, Pol II termination is carried out by cleavage and polyadenylation factor (CPF-CF) and Nrd1-Nab3-Sen1 (NNS) complexes, which operate primarily at mRNA and non-coding RNA genes, respectively. Premature Pol II termination (attenuation) contributes to gene regulation, but there is limited knowledge of its prevalence and biological significance. In particular, it is unclear how much crosstalk occurs between CPF-CF and NNS complexes and how Pol II attenuation is modulated during stress adaptation. In this study, we have identified an attenuator in the DEF1 DNA repair gene, which includes a portion of the 5′-untranslated region (UTR) and upstream open reading frame (ORF). Using a plasmid-based reporter gene system, we conducted a genetic screen of 14 termination mutants and their ability to confer Pol II read-through defects. The DEF1 attenuator behaved as a hybrid terminator, relying heavily on CPF-CF and Sen1 but without Nrd1 and Nab3 involvement. Our genetic selection identified 22 cis-acting point mutations that clustered into four regions, including a polyadenylation site efficiency element that genetically interacts with its cognate binding-protein Hrp1. Outside of the reporter gene context, a DEF1 attenuator mutant increased mRNA and protein expression, exacerbating the toxicity of a constitutively active Def1 protein. Overall, our data support a biologically significant role for transcription attenuation in regulating DEF1 expression, which can be modulated during the DNA damage response.
Collapse
|
11
|
Zhou Z, Dang Y, Zhou M, Yuan H, Liu Y. Codon usage biases co-evolve with transcription termination machinery to suppress premature cleavage and polyadenylation. eLife 2018; 7:33569. [PMID: 29547124 PMCID: PMC5869017 DOI: 10.7554/elife.33569] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2017] [Accepted: 03/15/2018] [Indexed: 12/13/2022] Open
Abstract
Codon usage biases are found in all genomes and influence protein expression levels. The codon usage effect on protein expression was thought to be mainly due to its impact on translation. Here, we show that transcription termination is an important driving force for codon usage bias in eukaryotes. Using Neurospora crassa as a model organism, we demonstrated that introduction of rare codons results in premature transcription termination (PTT) within open reading frames and abolishment of full-length mRNA. PTT is a wide-spread phenomenon in Neurospora, and there is a strong negative correlation between codon usage bias and PTT events. Rare codons lead to the formation of putative poly(A) signals and PTT. A similar role for codon usage bias was also observed in mouse cells. Together, these results suggest that codon usage biases co-evolve with the transcription termination machinery to suppress premature termination of transcription and thus allow for optimal gene expression.
Collapse
Affiliation(s)
- Zhipeng Zhou
- Department of Physiology, The University of Texas Southwestern Medical Center, Dallas, United States
| | - Yunkun Dang
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China.,Center for Life Science, School of Life Sciences, Yunnan University, Kunming, China
| | - Mian Zhou
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, China
| | - Haiyan Yuan
- Department of Physiology, The University of Texas Southwestern Medical Center, Dallas, United States
| | - Yi Liu
- Department of Physiology, The University of Texas Southwestern Medical Center, Dallas, United States
| |
Collapse
|
12
|
Morse NJ, Gopal MR, Wagner JM, Alper HS. Yeast Terminator Function Can Be Modulated and Designed on the Basis of Predictions of Nucleosome Occupancy. ACS Synth Biol 2017; 6:2086-2095. [PMID: 28771342 DOI: 10.1021/acssynbio.7b00138] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
The design of improved synthetic parts is a major goal of synthetic biology. Mechanistically, nucleosome occupancy in the 3' terminator region of a gene has been found to correlate with transcriptional expression. Here, we seek to establish a predictive relationship between terminator function and predicted nucleosome positioning to design synthetic terminators in the yeast Saccharomyces cerevisiae. In doing so, terminators improved net protein output from these expression cassettes nearly 4-fold over their original sequence with observed increases in termination efficiency to 96%. The resulting terminators were indeed depleted of nucleosomes on the basis of mapping experiments. This approach was successfully applied to synthetic, de novo, and native terminators. The mode of action of these modifications was mainly through increased termination efficiency, rather than half-life increases, perhaps suggesting a role in improved mRNA maturation. Collectively, these results suggest that predicted nucleosome depletion can be used as a heuristic approach for improving terminator function, though the underlying mechanism remains to be shown.
Collapse
Affiliation(s)
- Nicholas J. Morse
- McKetta
Department of Chemical Engineering, The University of Texas at Austin, 200 E Dean Keeton Street Stop C0400, Austin, Texas 78712, United States
| | - Madan R. Gopal
- McKetta
Department of Chemical Engineering, The University of Texas at Austin, 200 E Dean Keeton Street Stop C0400, Austin, Texas 78712, United States
| | - James M. Wagner
- McKetta
Department of Chemical Engineering, The University of Texas at Austin, 200 E Dean Keeton Street Stop C0400, Austin, Texas 78712, United States
| | - Hal S. Alper
- McKetta
Department of Chemical Engineering, The University of Texas at Austin, 200 E Dean Keeton Street Stop C0400, Austin, Texas 78712, United States
- Institute
for Cellular and Molecular Biology, The University of Texas at Austin, 2500 Speedway Avenue, Austin, Texas 78712, United States
| |
Collapse
|
13
|
Misra A, Green MR. From polyadenylation to splicing: Dual role for mRNA 3' end formation factors. RNA Biol 2015; 13:259-64. [PMID: 26891005 DOI: 10.1080/15476286.2015.1112490] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022] Open
Abstract
Recent genome-wide protein-RNA interaction studies have significantly reshaped our understanding of the role of mRNA 3' end formation factors in RNA biology. Originally thought to function solely in mediating cleavage and polyadenylation of mRNAs during their maturation, 3' end formation factors have now been shown to play a role in alternative splicing, even at internal introns--an unanticipated role for factors thought only to act at the 3' end of the mRNA. Here, we discuss the recent advances in our understanding of the role of 3' end formation factors in promoting global changes in alternative splicing at internal exon-intron junctions and how they act as cofactors for well known splicing regulators. Additionally, we review the mechanism by which these factors affect the recruitment of early intron recognition components to the 5' and 3' splice site. Our understanding of the roles of 3' end formation factors is still evolving, and the final picture might be more complex than originally envisioned.
Collapse
Affiliation(s)
- Ashish Misra
- a Howard Hughes Medical Institute and Department of Molecular, Cell and Cancer Biology, University of Massachusetts Medical School , Worcester , MA USA
| | - Michael R Green
- a Howard Hughes Medical Institute and Department of Molecular, Cell and Cancer Biology, University of Massachusetts Medical School , Worcester , MA USA
| |
Collapse
|
14
|
Machinaga A, Takase-Yoden S. Polyadenylation of Friend murine leukemia virus env-mRNA is affected by its splicing. Microbiol Immunol 2015; 58:474-82. [PMID: 24935657 DOI: 10.1111/1348-0421.12170] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2014] [Revised: 05/19/2014] [Accepted: 06/10/2014] [Indexed: 01/27/2023]
Abstract
As splicing was previously found to be important for increasing Friend murine leukemia virus env-mRNA stability and translation, we investigated whether splicing of env-mRNA affected the poly(A) tail length using env expression vectors that yielded unspliced or spliced env-mRNA. Incomplete polyadenylation was detected in a fraction of the unspliced env-mRNA products in an env gene-dependent manner, showing that splicing of Friend murine leukemia virus plays an important role in the efficiency of complete polyadenylation of env-mRNA. These results suggested that the promotion of complete polyadenylation of env-mRNA by splicing might partially explain up-regulation of Env protein expression as a result of splicing.
Collapse
Affiliation(s)
- Akihito Machinaga
- Department of Bioinformatics, Faculty of Engineering, Soka University, 1-236, Tangi-machi, Hachioji-shi, Tokyo, 192-8577, Japan
| | | |
Collapse
|
15
|
Baejen C, Torkler P, Gressel S, Essig K, Söding J, Cramer P. Transcriptome Maps of mRNP Biogenesis Factors Define Pre-mRNA Recognition. Mol Cell 2014; 55:745-57. [DOI: 10.1016/j.molcel.2014.08.005] [Citation(s) in RCA: 87] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2014] [Revised: 07/08/2014] [Accepted: 07/31/2014] [Indexed: 12/15/2022]
|
16
|
Efficient mRNA polyadenylation requires a ubiquitin-like domain, a zinc knuckle, and a RING finger domain, all contained in the Mpe1 protein. Mol Cell Biol 2014; 34:3955-67. [PMID: 25135474 DOI: 10.1128/mcb.00077-14] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Almost all eukaryotic mRNAs must be polyadenylated at their 3' ends to function in protein synthesis. This modification occurs via a large nuclear complex that recognizes signal sequences surrounding a poly(A) site on mRNA precursor, cleaves at that site, and adds a poly(A) tail. While the composition of this complex is known, the functions of some subunits remain unclear. One of these is a multidomain protein called Mpe1 in the yeast Saccharomyces cerevisiae and RBBP6 in metazoans. The three conserved domains of Mpe1 are a ubiquitin-like (UBL) domain, a zinc knuckle, and a RING finger domain characteristic of some ubiquitin ligases. We show that mRNA 3'-end processing requires all three domains of Mpe1 and that more than one region of Mpe1 is involved in contact with the cleavage/polyadenylation factor in which Mpe1 resides. Surprisingly, both the zinc knuckle and the RING finger are needed for RNA-binding activity. Consistent with a role for Mpe1 in ubiquitination, mutation of Mpe1 decreases the association of ubiquitin with Pap1, the poly(A) polymerase, and suppressors of mpe1 mutants are linked to ubiquitin ligases. Furthermore, an inhibitor of ubiquitin-mediated interactions blocks cleavage, demonstrating for the first time a direct role for ubiquitination in mRNA 3'-end processing.
Collapse
|
17
|
Delineating the structural blueprint of the pre-mRNA 3'-end processing machinery. Mol Cell Biol 2014; 34:1894-910. [PMID: 24591651 DOI: 10.1128/mcb.00084-14] [Citation(s) in RCA: 66] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Processing of mRNA precursors (pre-mRNAs) by polyadenylation is an essential step in gene expression. Polyadenylation consists of two steps, cleavage and poly(A) synthesis, and requires multiple cis elements in the pre-mRNA and a megadalton protein complex bearing the two essential enzymatic activities. While genetic and biochemical studies remain the major approaches in characterizing these factors, structural biology has emerged during the past decade to help understand the molecular assembly and mechanistic details of the process. With structural information about more proteins and higher-order complexes becoming available, we are coming closer to obtaining a structural blueprint of the polyadenylation machinery that explains both how this complex functions and how it is regulated and connected to other cellular processes.
Collapse
|
18
|
Zheng D, Tian B. RNA-binding proteins in regulation of alternative cleavage and polyadenylation. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2014; 825:97-127. [PMID: 25201104 DOI: 10.1007/978-1-4939-1221-6_3] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Almost all eukaryotic pre-mRNAs are processed at the 3' end by the cleavage and polyadenylation (C/P) reaction, which preludes termination of transcription and gives rise to the poly(A) tail of mature mRNA. Genomic studies in recent years have indicated that most eukaryotic mRNA genes have multiple cleavage and polyadenylation sites (pAs), leading to alternative cleavage and polyadenylation (APA) products. APA isoforms generally differ in their 3' untranslated regions (3' UTRs), but can also have different coding sequences (CDSs). APA expands the repertoire of transcripts expressed from the genome, and is highly regulated under various physiological and pathological conditions. Growing lines of evidence have shown that RNA-binding proteins (RBPs) play important roles in regulation of APA. Some RBPs are part of the machinery for C/P; others influence pA choice through binding to adjacent regions. In this chapter, we review cis elements and trans factors involved in C/P, the significance of APA, and increasingly elucidated roles of RBPs in APA regulation. We also discuss analysis of APA using transcriptome-wide techniques as well as molecular biology approaches.
Collapse
Affiliation(s)
- Dinghai Zheng
- Department of Biochemistry and Molecular Biology, University of Medicine and Dentistry of New Jersey (UMDNJ)-New Jersey Medical School, 185 South Orange Ave., Newark, NJ, 07103, USA
| | | |
Collapse
|
19
|
Abstract
Systemic response to DNA damage and other stresses is a complex process that includes changes in the regulation and activity of nearly all stages of gene expression. One gene regulatory mechanism used by eukaryotes is selection among alternative transcript isoforms that differ in polyadenylation [poly(A)] sites, resulting in changes either to the coding sequence or to portions of the 3' UTR that govern translation, stability, and localization. To determine the extent to which this means of regulation is used in response to DNA damage, we conducted a global analysis of poly(A) site usage in Saccharomyces cerevisiae after exposure to the UV mimetic, 4-nitroquinoline 1-oxide (4NQO). Two thousand thirty-one genes were found to have significant variation in poly(A) site distributions following 4NQO treatment, with a strong bias toward loss of short transcripts, including many with poly(A) sites located within the protein coding sequence (CDS). We further explored one possible mechanism that could contribute to the widespread differences in mRNA isoforms. The change in poly(A) site profile was associated with an inhibition of cleavage and polyadenylation in cell extract and a decrease in the levels of several key subunits in the mRNA 3'-end processing complex. Sequence analysis identified differences in the cis-acting elements that flank putatively suppressed and enhanced poly(A) sites, suggesting a mechanism that could discriminate between variable and constitutive poly(A) sites. Our analysis indicates that variation in mRNA length is an important part of the regulatory response to DNA damage.
Collapse
|
20
|
Affiliation(s)
- C A Niño
- Institut Jacques Monod, Paris Diderot University , Sorbonne Paris Cité, CNRS UMR7592, Equipe labellisée Ligue contre le cancer, 15 rue Hélène Brion, 75205 Paris Cedex 13, France
| | | | | | | |
Collapse
|
21
|
Structural and biochemical analysis of the assembly and function of the yeast pre-mRNA 3' end processing complex CF I. Proc Natl Acad Sci U S A 2012; 109:21342-7. [PMID: 23236150 DOI: 10.1073/pnas.1214102110] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
The accuracy of the 3'-end processing by cleavage and polyadenylation is essential for mRNA biogenesis and transcription termination. In yeast, two poorly conserved neighboring elements upstream of cleavage sites are important for accuracy and efficiency of this process. These two RNA sequences are recognized by the RNA binding proteins Hrp1 and Rna15, but efficient processing in vivo requires a bridging protein (Rna14), which forms a stable dimer of hetero-dimers with Rna15 to stabilize the RNA-protein complex. We earlier reported the structure of the ternary complex of Rna15 and Hrp1 bound to the RNA processing element. We now report the use of solution NMR to study the interaction of Hrp1 with the Rna14-Rna15 heterodimer in the presence and absence of 3'-end processing signals. By using methyl selective labeling on Hrp1, in vivo activity and pull-down assays, we were able to study this complex of several hundred kDa, identify the interface within Hrp1 responsible for recruitment of Rna14 and validate the functional significance of this interaction through structure-driven mutational analysis.
Collapse
|
22
|
Mischo HE, Proudfoot NJ. Disengaging polymerase: terminating RNA polymerase II transcription in budding yeast. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2012; 1829:174-85. [PMID: 23085255 PMCID: PMC3793857 DOI: 10.1016/j.bbagrm.2012.10.003] [Citation(s) in RCA: 106] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/02/2012] [Revised: 10/01/2012] [Accepted: 10/05/2012] [Indexed: 11/29/2022]
Abstract
Termination of transcription by RNA polymerase II requires two distinct processes: The formation of a defined 3′ end of the transcribed RNA, as well as the disengagement of RNA polymerase from its DNA template. Both processes are intimately connected and equally pivotal in the process of functional messenger RNA production. However, research in recent years has elaborated how both processes can additionally be employed to control gene expression in qualitative and quantitative ways. This review embraces these new findings and attempts to paint a broader picture of how this final step in the transcription cycle is of critical importance to many aspects of gene regulation. This article is part of a Special Issue entitled: RNA polymerase II Transcript Elongation.
Collapse
Affiliation(s)
- Hannah E Mischo
- Cancer Research UK London Research Institute, Blanche Lane South Mimms, Herts, UK.
| | | |
Collapse
|
23
|
Ruepp MD, Schümperli D, Barabino SML. mRNA 3' end processing and more--multiple functions of mammalian cleavage factor I-68. WILEY INTERDISCIPLINARY REVIEWS-RNA 2012; 2:79-91. [PMID: 21956970 DOI: 10.1002/wrna.35] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The formation of defined 3(') ends is an important step in the biogenesis of mRNAs. In eukaryotic cells, all mRNA 3(') ends are generated by endonucleolytic cleavage of primary transcripts in reactions that are essentially posttranscriptional. Nevertheless, 3(') end formation is tightly connected to transcription in vivo, and a link with mRNA export to the cytoplasm has been postulated. Here, we briefly review the current knowledge about the two types of mRNA 3(') end processing reactions, cleavage/polyadenylation and histone RNA processing. We then focus on factors shared between these two reactions. In particular, we discuss evidence for new functions of the mammalian cleavage factor I subunit CF I(m) 68 in histone RNA 3(') processing and in the export of mature mRNAs from the nucleus to the cytoplasm.
Collapse
Affiliation(s)
- Marc-David Ruepp
- Institute of Cell Biology, University of Bern, Bern, Switzerland
| | | | | |
Collapse
|
24
|
Tian B, Graber JH. Signals for pre-mRNA cleavage and polyadenylation. WILEY INTERDISCIPLINARY REVIEWS-RNA 2011; 3:385-96. [PMID: 22012871 DOI: 10.1002/wrna.116] [Citation(s) in RCA: 172] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Pre-mRNA cleavage and polyadenylation is an essential step for 3' end formation of almost all protein-coding transcripts in eukaryotes. The reaction, involving cleavage of nascent mRNA followed by addition of a polyadenylate or poly(A) tail, is controlled by cis-acting elements in the pre-mRNA surrounding the cleavage site. Experimental and bioinformatic studies in the past three decades have elucidated conserved and divergent elements across eukaryotes, from yeast to human. Here we review histories and current models of these elements in a broad range of species.
Collapse
Affiliation(s)
- Bin Tian
- UMDNJ-New Jersey Medical School, Newark, NJ, USA.
| | | |
Collapse
|
25
|
Yang Q, Doublié S. Structural biology of poly(A) site definition. WILEY INTERDISCIPLINARY REVIEWS-RNA 2011; 2:732-47. [PMID: 21823232 DOI: 10.1002/wrna.88] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
3' processing is an essential step in the maturation of all messenger RNAs (mRNAs) and is a tightly coupled two-step reaction: endonucleolytic cleavage at the poly(A) site is followed by the addition of a poly(A) tail, except for metazoan histone mRNAs, which are cleaved but not polyadenylated. The recognition of a poly(A) site is coordinated by the sequence elements in the mRNA 3' UTR and associated protein factors. In mammalian cells, three well-studied sequence elements, UGUA, AAUAAA, and GU-rich, are recognized by three multisubunit factors: cleavage factor I(m) (CFI(m) ), cleavage and polyadenylation specificity factor (CPSF), and cleavage stimulation factor (CstF), respectively. In the yeast Saccharomyces cerevisiae, UA repeats and A-rich sequence elements are recognized by Hrp1p and cleavage factor IA. Structural studies of protein-RNA complexes have helped decipher the mechanisms underlying sequence recognition and shed light on the role of protein factors in poly(A) site selection and 3' processing machinery assembly. In this review we focus on the interactions between the mRNA cis-elements and the protein factors (CFI(m) , CPSF, CstF, and homologous factors from yeast and other eukaryotes) that define the poly(A) site. WIREs RNA 2011 2 732-747 DOI: 10.1002/wrna.88 For further resources related to this article, please visit the WIREs website.
Collapse
Affiliation(s)
- Qin Yang
- Department of Microbiology and Molecular Genetics, University of Vermont, Burlington, USA
| | | |
Collapse
|
26
|
Dominski Z. The hunt for the 3' endonuclease. WILEY INTERDISCIPLINARY REVIEWS-RNA 2010; 1:325-40. [PMID: 21935893 DOI: 10.1002/wrna.33] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
Pre-mRNAs are typically processed at the 3(') end by cleavage/polyadenylation. This is a two-step processing reaction initiated by endonucleolytic cleavage of pre-mRNAs downstream of the AAUAAA sequence or its variant, followed by extension of the newly generated 3(') end with a poly(A) tail. In metazoans, replication-dependent histone transcripts are cleaved by a different 3(') end processing mechanism that depends on the U7 small nuclear ribonucleoprotein and the polyadenylation step is omitted. Each of the two mechanisms occurs in a macromolecular assembly that primarily functions to juxtapose the scissile bond with the 3(') endonuclease. Remarkably, despite characterizing a number of processing factors, the identity of this most critical component remained elusive until recently. For cleavage coupled to polyadenylation, much needed help was offered by bioinformatics, which pointed to CPSF-73, a known processing factor required for both cleavage and polyadenylation, as the possible 3(') endonuclease. In silico structural analysis indicated that this protein is a member of the large metallo-β-lactamase family of hydrolytic enzymes and belongs to the β-CASP subfamily that includes several RNA and DNA-specific nucleases. Subsequent experimental studies supported the notion that CPSF-73 does function as the endonuclease in the formation of polyadenylated mRNAs, but some controversy still remains as a different cleavage and polyadenylation specificity factor (CPSF) subunit, CPSF-30, displays an endonuclease activity in vitro while recombinant CPSF-73 is inactive. Unexpectedly, CPSF-73 as the 3(') endonuclease in cleavage coupled to polyadenylation found a strong ally in U7-dependent processing of histone pre-mRNAs, which was shown to utilize the same protein as the cleaving enzyme. It thus seems likely that these two processing reactions evolved from a common mechanism, with CPSF-73 as the endonuclease.
Collapse
Affiliation(s)
- Zbigniew Dominski
- Department of Biochemistry and Biophysics and Program in Molecular Biology and Biotechnology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.
| |
Collapse
|
27
|
Leeper TC, Qu X, Lu C, Moore C, Varani G. Novel protein-protein contacts facilitate mRNA 3'-processing signal recognition by Rna15 and Hrp1. J Mol Biol 2010; 401:334-49. [PMID: 20600122 DOI: 10.1016/j.jmb.2010.06.032] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2009] [Revised: 06/10/2010] [Accepted: 06/10/2010] [Indexed: 12/22/2022]
Abstract
Precise 3'-end processing of mRNA is essential for correct gene expression, yet in yeast, 3'-processing signals consist of multiple ambiguous sequence elements. Two neighboring elements upstream of the cleavage site are particularly important for the accuracy (positioning element) and efficiency (efficiency element) of 3'-processing and are recognized by the RNA-binding proteins Rna15 and Hrp1, respectively. In vivo, these interactions are strengthened by the scaffolding protein Rna14 that stabilizes their association. The NMR structure of the 34 -kDa ternary complex of the RNA recognition motif (RRM) domains of Hrp1 and Rna15 bound to this pair of RNA elements was determined by residual dipolar coupling and paramagnetic relaxation experiments. It reveals how each of the proteins binds to RNA and introduces a novel class of protein-protein contact in regions of previously unknown function. These interdomain contacts had previously been overlooked in other multi-RRM structures, although a careful analysis suggests that they may be frequently present. Mutations in the regions of these contacts disrupt 3'-end processing, suggesting that they may structurally organize the ribonucleoprotein complexes responsible for RNA processing.
Collapse
Affiliation(s)
- Thomas C Leeper
- Department of Chemistry, University of Washington, Seattle, WA 98195-1700, USA.
| | | | | | | | | |
Collapse
|
28
|
Pancevac C, Goldstone DC, Ramos A, Taylor IA. Structure of the Rna15 RRM-RNA complex reveals the molecular basis of GU specificity in transcriptional 3'-end processing factors. Nucleic Acids Res 2010; 38:3119-32. [PMID: 20097654 PMCID: PMC2875009 DOI: 10.1093/nar/gkq002] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
Rna15 is a core subunit of cleavage factor IA (CFIA), an essential transcriptional 3′-end processing factor from Saccharomyces cerevisiae. CFIA is required for polyA site selection/cleavage targeting RNA sequences that surround polyadenylation sites in the 3′-UTR of RNA polymerase-II transcripts. RNA recognition by CFIA is mediated by an RNA recognition motif (RRM) contained in the Rna15 subunit of the complex. We show here that Rna15 has a strong and unexpected preference for GU containing RNAs and reveal the molecular basis for a base selectivity mechanism that accommodates G or U but discriminates against C and A bases. This mode of base selectivity is rather different to that observed in other RRM-RNA structures and is structurally conserved in CstF64, the mammalian counterpart of Rna15. Our observations provide evidence for a highly conserved mechanism of base recognition amongst the 3′-end processing complexes that interact with the U-rich or U/G-rich elements at 3′-end cleavage/polyadenylation sites.
Collapse
Affiliation(s)
- Christina Pancevac
- Division of Molecular Structure, MRC National Institute for Medical Research, The Ridgeway, Mill Hill, London NW7 1AA, UK
| | | | | | | |
Collapse
|
29
|
Roth KM, Byam J, Fang F, Butler JS. Regulation of NAB2 mRNA 3'-end formation requires the core exosome and the Trf4p component of the TRAMP complex. RNA (NEW YORK, N.Y.) 2009; 15:1045-58. [PMID: 19369424 PMCID: PMC2685527 DOI: 10.1261/rna.709609] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
The nuclear exosome functions in a variety of pathways catalyzing formation of mature RNA 3'-ends or the destruction of aberrant RNA transcripts. The RNA 3'-end formation activity of the exosome appeared restricted to small noncoding RNAs. However, the nuclear exosome controls the level of the mRNA encoding the poly(A)-binding protein Nab2p in a manner requiring an A(26) sequence in the mRNA 3' untranslated regions (UTR), and the activities of Nab2p and the exosome-associated exoribonuclease Rrp6p. Here we show that the A(26) sequence inhibits normal 3'-end processing of NAB2 mRNA in vivo and in vitro, and makes formation of the mature 3'-end dependent on trimming of the transcript by the core exosome and the Trf4p component of the TRAMP complex from a downstream site. The detection of mature, polyadenylated transcripts ending at, or within, the A(26) sequence indicates that exosome trimming sometimes gives way to polyadenylation of the mRNA. Alternatively, Rrp6p and the TRAMP-associated Mtr4p degrade these transcripts thereby limiting the amount of Nab2p in the cell. These findings suggest that NAB2 mRNA 3'-end formation requires the exosome and TRAMP complex, and that competition between polyadenylation and Rrp6p-dependent degradation controls the level of this mRNA.
Collapse
Affiliation(s)
- Kelly M Roth
- Department of Microbiology and Immunology, University of Rochester Medical Center, Rochester, New York 14642, USA
| | | | | | | |
Collapse
|
30
|
Abstract
Most eukaryotic mRNA precursors (premRNAs) must undergo extensive processing, including cleavage and polyadenylation at the 3'-end. Processing at the 3'-end is controlled by sequence elements in the pre-mRNA (cis elements) as well as protein factors. Despite the seeming biochemical simplicity of the processing reactions, more than 14 proteins have been identified for the mammalian complex, and more than 20 proteins have been identified for the yeast complex. The 3'-end processing machinery also has important roles in transcription and splicing. The mammalian machinery contains several sub-complexes, including cleavage and polyadenylation specificity factor, cleavage stimulation factor, cleavage factor I, and cleavage factor II. Additional protein factors include poly(A) polymerase, poly(A)-binding protein, symplekin, and the C-terminal domain of RNA polymerase II largest subunit. The yeast machinery includes cleavage factor IA, cleavage factor IB, and cleavage and polyadenylation factor.
Collapse
Affiliation(s)
- C. R. Mandel
- Department of Biological Sciences, Columbia University, New York, NY 10027 USA
| | - Y. Bai
- Department of Biological Sciences, Columbia University, New York, NY 10027 USA
| | - L. Tong
- Department of Biological Sciences, Columbia University, New York, NY 10027 USA
| |
Collapse
|
31
|
Viphakone N, Voisinet-Hakil F, Minvielle-Sebastia L. Molecular dissection of mRNA poly(A) tail length control in yeast. Nucleic Acids Res 2008; 36:2418-33. [PMID: 18304944 PMCID: PMC2367721 DOI: 10.1093/nar/gkn080] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
In eukaryotic cells, newly synthesized mRNAs acquire a poly(A) tail that plays several fundamental roles in export, translation and mRNA decay. In mammals, PABPN1 controls the processivity of polyadenylation and the length of poly(A) tails during de novo synthesis. This regulation is less well-detailed in yeast. We have recently demonstrated that Nab2p is necessary and sufficient for the regulation of polyadenylation and that the Pab1p/PAN complex may act at a later stage in mRNA metabolism. Here, we show that the presence of both Pab1p and Nab2p in reconstituted pre-mRNA 3′-end processing reactions has no stimulating nor inhibitory effect on poly(A) tail regulation. Importantly, the poly(A)-binding proteins are essential to protect the mature mRNA from being subjected to a second round of processing. We have determined which domains of Nab2p are important to control polyadenylation and found that the RGG-box work in conjunction with the two last essential CCCH-type zinc finger domains. Finally, we have tried to delineate the mechanism by which Nab2p performs its regulation function during polyadenylation: it likely forms a complex with poly(A) tails different from a simple linear deposit of proteins as it has been observed with Pab1p.
Collapse
Affiliation(s)
- Nicolas Viphakone
- Université Victor Segalen Bordeaux 2, CNRS, Institut de Biochimie et Génétique Cellulaires, Bordeaux, France
| | | | | |
Collapse
|
32
|
|
33
|
Kim Guisbert KS, Li H, Guthrie C. Alternative 3' pre-mRNA processing in Saccharomyces cerevisiae is modulated by Nab4/Hrp1 in vivo. PLoS Biol 2007; 5:e6. [PMID: 17194212 PMCID: PMC1717019 DOI: 10.1371/journal.pbio.0050006] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2006] [Accepted: 11/03/2006] [Indexed: 11/18/2022] Open
Abstract
The Saccharomyces cerevisiae RNA-binding protein Nab4/Hrp1 is a component of the cleavage factor complex required for 3' pre-mRNA processing. Although the precise role of Nab4/Hrp1 remains unclear, it has been implicated in correct positioning of the cleavage site in vitro. Here, we show that mutation or overexpression of NAB4/HRP1 alters polyA cleavage site selection in vivo. Using bioinformatic analysis, we identified four related motifs that are statistically enriched in Nab4-associated transcripts; each motif is similar to the known binding site for Nab4/Hrp1. Site-directed mutations in predicted Nab4/Hrp1 binding elements result in decreased use of adjacent cleavage sites. Additionally, we show that the nab4-7 mutant displays a striking resistance to toxicity from excess copper. We identify a novel target of alternative 3' pre-mRNA processing, CTR2, and demonstrate that CTR2 is required for the copper resistance phenotype in the nab4-7 strain. We propose that alternative 3' pre-mRNA processing is mediated by a Nab4-based mechanism and that these alternative processing events could help control gene expression as part of a physiological response in S. cerevisiae.
Collapse
Affiliation(s)
- Karen S. Kim Guisbert
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, United States of America
| | - Hao Li
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, United States of America
| | - Christine Guthrie
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, California, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
34
|
Kim M, Vasiljeva L, Rando OJ, Zhelkovsky A, Moore C, Buratowski S. Distinct pathways for snoRNA and mRNA termination. Mol Cell 2007; 24:723-734. [PMID: 17157255 DOI: 10.1016/j.molcel.2006.11.011] [Citation(s) in RCA: 150] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2006] [Revised: 10/17/2006] [Accepted: 11/13/2006] [Indexed: 10/23/2022]
Abstract
Transcription termination at mRNA genes is linked to polyadenylation. Cleavage at the poly(A) site generates an entry point for the Rat1/Xrn2 exonuclease, which degrades the downstream transcript to promote termination. Small nucleolar RNAs (snoRNAs) are also transcribed by RNA polymerase II but are not polyadenylated. Chromatin immunoprecipitation experiments show that polyadenylation factors and Rat1 localize to snoRNA genes, but mutations that disrupt poly(A) site cleavage or Rat1 activity do not lead to termination defects at these genes. Conversely, mutations of Nrd1, Sen1, and Ssu72 affect termination at snoRNAs but not at several mRNA genes. The exosome complex was required for 3' trimming, but not termination, of snoRNAs. Both the mRNA and snoRNA pathways require Pcf11 but show differential effects of individual mutant alleles. These results suggest that in yeast the transcribing RNA polymerase II can choose between two distinct termination mechanisms but keeps both options available during elongation.
Collapse
Affiliation(s)
- Minkyu Kim
- Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, 240 Longwood Avenue, Boston, Massachusetts 02115
| | - Lidia Vasiljeva
- Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, 240 Longwood Avenue, Boston, Massachusetts 02115
| | - Oliver J Rando
- Bauer Center for Genomics Research, Harvard University, 7 Divinity Avenue, Cambridge, Massachusetts 02138
| | - Alexander Zhelkovsky
- Department of Molecular Microbiology, Tufts University School of Medicine, 136 Harrison Avenue, Boston, Massachusetts 02111
| | - Claire Moore
- Department of Molecular Microbiology, Tufts University School of Medicine, 136 Harrison Avenue, Boston, Massachusetts 02111
| | - Stephen Buratowski
- Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, 240 Longwood Avenue, Boston, Massachusetts 02115.
| |
Collapse
|
35
|
Noble CG, Beuth B, Taylor IA. Structure of a nucleotide-bound Clp1-Pcf11 polyadenylation factor. Nucleic Acids Res 2006; 35:87-99. [PMID: 17151076 PMCID: PMC1761425 DOI: 10.1093/nar/gkl1010] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2006] [Revised: 10/30/2006] [Accepted: 10/30/2006] [Indexed: 01/10/2023] Open
Abstract
Pcf11 and Clp1 are subunits of cleavage factor IA (CFIA), an essential polyadenylation factor in Saccahromyces cerevisiae. We have determined the structure of a ternary complex of Clp1 together with ATP and the Clp1-binding region of Pcf11. Clp1 contains three domains, a small N-terminal beta sandwich domain, a C-terminal domain containing a novel alpha/beta-fold and a central domain that binds ATP. The arrangement of the nucleotide binding site is similar to that observed in SIMIBI-class ATPase subunits found in other multisubunit macromolecular complexes. However, despite this similarity, nucleotide hydrolysis does not occur. The Pcf11 binding site is also located in the central domain where three highly conserved residues in Pcf11 mediate many of the protein-protein interactions. We propose that this conserved Clp1-Pcf11 interaction is responsible for maintaining a tight coupling between the Clp1 nucleotide binding subunit and the other components of the polyadenylation machinery. Moreover, we suggest that this complex represents a stabilized ATP bound form of Clp1 that requires the participation of other non-CFIA processing factors in order to initiate timely ATP hydrolysis during 3' end processing.
Collapse
Affiliation(s)
- Christian G. Noble
- Division of Molecular Structure, National Institute for Medical ResearchThe Ridgeway, Mill Hill, London NW7 1AA, UK
| | - Barbara Beuth
- Division of Molecular Structure, National Institute for Medical ResearchThe Ridgeway, Mill Hill, London NW7 1AA, UK
| | - Ian A. Taylor
- Division of Molecular Structure, National Institute for Medical ResearchThe Ridgeway, Mill Hill, London NW7 1AA, UK
| |
Collapse
|
36
|
Abstract
Polyadenylation is an essential processing step for most eukaryotic mRNAs. In the nucleus, poly(A) polymerase adds poly(A) tails to mRNA 3' ends, contributing to their export, stability and translatability. Recently, a novel class of non-canonical poly(A) polymerases was discovered in yeast, worms and vertebrates. Different members of the Cid1 family, named after its founding member in the fission yeast Schizosaccharomyces pombe, are localized in the nucleus and the cytoplasm and are thought to target specific RNAs for polyadenylation. Polyadenylation of a target RNA by a Cid1-like poly(A) polymerase can lead to its degradation or stabilization, depending on the enzyme involved. Cid1-like proteins have important roles in diverse biological processes, including RNA surveillance pathways, DNA integrity checkpoint responses and RNAi-dependent heterochromatin formation.
Collapse
Affiliation(s)
- Abigail L Stevenson
- Sir William Dunn School of Pathology, University of Oxford, Oxford OX1 3RE, UK
| | | |
Collapse
|
37
|
Kyburz A, Friedlein A, Langen H, Keller W. Direct interactions between subunits of CPSF and the U2 snRNP contribute to the coupling of pre-mRNA 3' end processing and splicing. Mol Cell 2006; 23:195-205. [PMID: 16857586 DOI: 10.1016/j.molcel.2006.05.037] [Citation(s) in RCA: 145] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2006] [Revised: 04/13/2006] [Accepted: 05/22/2006] [Indexed: 10/24/2022]
Abstract
Eukaryotic pre-mRNAs are capped at their 5' ends, polyadenylated at their 3' ends, and spliced before being exported from the nucleus to the cytoplasm. Although the three processing reactions can be studied separately in vitro, they are coupled in vivo. We identified subunits of the U2 snRNP in highly purified CPSF and showed that the two complexes physically interact. We therefore tested whether this interaction contributes to the coupling of 3' end processing and splicing. We found that CPSF is necessary for efficient splicing activity in coupled assays and that mutations in the pre-mRNA binding site of the U2 snRNP resulted in impaired splicing and in much reduced cleavage efficiency. Moreover, we showed that efficient cleavage required the presence of the U2 snRNA in coupled assays. We therefore propose that the interaction between CPSF and the U2 snRNP contributes to the coupling of splicing and 3' end formation.
Collapse
Affiliation(s)
- Andrea Kyburz
- Department of Cell Biology, Biozentrum, University of Basel, CH-4056 Basel, Switzerland
| | | | | | | |
Collapse
|
38
|
Zhelkovsky A, Tacahashi Y, Nasser T, He X, Sterzer U, Jensen TH, Domdey H, Moore C. The role of the Brr5/Ysh1 C-terminal domain and its homolog Syc1 in mRNA 3'-end processing in Saccharomyces cerevisiae. RNA (NEW YORK, N.Y.) 2006; 12:435-45. [PMID: 16431986 PMCID: PMC1383582 DOI: 10.1261/rna.2267606] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
The cleavage/polyadenylation factor (CPF) of Saccharomyces cerevisiae is thought to provide the catalytic activities of the mRNA 3'-end processing machinery, which include endonucleolytic cleavage at the poly(A) site, followed by synthesis of an adenosine polymer onto the new 3'-end by the CPF subunit Pap1. Because of similarity to other nucleases in the metallo-beta-lactamase family, the Brr5/Ysh1 subunit has been proposed to be the endonuclease. The C-terminal domain of Brr5 lies outside of beta-lactamase homology, and its function has not been elucidated. We show here that this region of Brr5 is necessary for cell viability and mRNA 3'-end processing. It is highly homologous to another CPF subunit, Syc1. Syc1 is not essential, but its removal improves the growth of other processing mutants at restrictive temperatures and restores in vitro processing activity to cleavage/ polyadenylation-defective brr5-1 extract. Our findings suggest that Syc1, by mimicking the essential Brr5 C-terminus, serves as a negative regulator of mRNA 3'-end formation.
Collapse
Affiliation(s)
- Alexander Zhelkovsky
- Department of Molecular Microbiology, Tufts University School of Medicine, Boston, MA 02111, USA
| | | | | | | | | | | | | | | |
Collapse
|
39
|
Forbes KP, Addepalli B, Hunt AG. An Arabidopsis Fip1 homolog interacts with RNA and provides conceptual links with a number of other polyadenylation factor subunits. J Biol Chem 2005; 281:176-86. [PMID: 16282318 DOI: 10.1074/jbc.m510964200] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The protein Fip1 is an important subunit of the eukaryotic polyadenylation apparatus, since it provides a bridge of sorts between poly(A) polymerase, other subunits of the polyadenylation apparatus, and the substrate RNA. In this study, a previously unreported Arabidopsis Fip1 homolog is characterized. The gene for this protein resides on chromosome V and encodes a 1196-amino acid polypeptide. Yeast two-hybrid and in vitro assays indicate that the N-terminal 137 amino acids of the Arabidopsis Fip1 protein interact with poly(A) polymerase (PAP). This domain also stimulates the activity of the PAP. Interestingly, this part of the Arabidopsis Fip1 interacts with Arabidopsis homologs of CstF77, CPSF30, CFIm-25, and PabN1. The interactions with CstF77, CPSF30, and CFIm-25 are reminiscent in various respects of similar interactions seen in yeast and mammals, although the part of the Arabidopsis Fip1 protein that participates in these interactions has no apparent counterpart in other eukaryotic Fip1 proteins. Interactions between Fip1 and PabN1 have not been reported in other systems; this may represent plant-specific associations. The C-terminal 789 amino acids of the Arabidopsis Fip1 protein were found to contain an RNA-binding domain; this domain correlated with an intact arginine-rich region and had a marked preference for poly(G) among the four homopolymers studied. These results indicate that the Arabidopsis Fip1, like its human counterpart, is an RNA-binding protein. Moreover, they provide conceptual links between PAP and several other Arabidopsis polyadenylation factor subunit homologs.
Collapse
Affiliation(s)
- Kevin P Forbes
- Plant Physiology, Biochemistry, and Molecular Biology Program, Department of Plant and Soil Sciences, University of Kentucky, Lexington, Kentucky 40546-0312, USA
| | | | | |
Collapse
|
40
|
Vanˇácˇová S, Wolf J, Martin G, Blank D, Dettwiler S, Friedlein A, Langen H, Keith G, Keller W. A new yeast poly(A) polymerase complex involved in RNA quality control. PLoS Biol 2005; 3:e189. [PMID: 15828860 PMCID: PMC1079787 DOI: 10.1371/journal.pbio.0030189] [Citation(s) in RCA: 475] [Impact Index Per Article: 23.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2005] [Accepted: 03/28/2005] [Indexed: 11/18/2022] Open
Abstract
Eukaryotic cells contain several unconventional poly(A) polymerases in addition to the canonical enzymes responsible for the synthesis of poly(A) tails of nuclear messenger RNA precursors. The yeast protein Trf4p has been implicated in a quality control pathway that leads to the polyadenylation and subsequent exosome-mediated degradation of hypomethylated initiator tRNAMet (tRNAiMet). Here we show that Trf4p is the catalytic subunit of a new poly(A) polymerase complex that contains Air1p or Air2p as potential RNA-binding subunits, as well as the putative RNA helicase Mtr4p. Comparison of native tRNAiMet with its in vitro transcribed unmodified counterpart revealed that the unmodified RNA was preferentially polyadenylated by affinity-purified Trf4 complex from yeast, as well as by complexes reconstituted from recombinant components. These results and additional experiments with other tRNA substrates suggested that the Trf4 complex can discriminate between native tRNAs and molecules that are incorrectly folded. Moreover, the polyadenylation activity of the Trf4 complex stimulated the degradation of unmodified tRNAiMet by nuclear exosome fractions in vitro. Degradation was most efficient when coupled to the polyadenylation activity of the Trf4 complex, indicating that the poly(A) tails serve as signals for the recruitment of the exosome. This polyadenylation-mediated RNA surveillance resembles the role of polyadenylation in bacterial RNA turnover. A new molecular surveillance mechanism is uncovered in eukaryotes, in which incorrectly folded tRNAs are polyadenylated and then targeted for degradation
Collapse
Affiliation(s)
| | - Jeannette Wolf
- 1Department of Cell Biology, BiozentrumUniversity of Basel, BaselSwitzerland
| | - Georges Martin
- 1Department of Cell Biology, BiozentrumUniversity of Basel, BaselSwitzerland
| | - Diana Blank
- 1Department of Cell Biology, BiozentrumUniversity of Basel, BaselSwitzerland
| | - Sabine Dettwiler
- 1Department of Cell Biology, BiozentrumUniversity of Basel, BaselSwitzerland
| | - Arno Friedlein
- 2Roche Genetics, F. Hoffmann-La Roche AGBaselSwitzerland
| | - Hanno Langen
- 2Roche Genetics, F. Hoffmann-La Roche AGBaselSwitzerland
| | - Gérard Keith
- 3Institut de Biologie Moléculaire et Cellulaire du CNRS, Université Louis PasteurStrasbourgFrance
| | - Walter Keller
- 1Department of Cell Biology, BiozentrumUniversity of Basel, BaselSwitzerland
| |
Collapse
|
41
|
Cann H, Brown SV, Oguariri RM, Golightly LM. 3' UTR signals necessary for expression of the Plasmodium gallinaceum ookinete protein, Pgs28, share similarities with those of yeast and plants. Mol Biochem Parasitol 2005; 137:239-45. [PMID: 15383294 DOI: 10.1016/j.molbiopara.2004.06.005] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2004] [Revised: 06/15/2004] [Accepted: 06/18/2004] [Indexed: 11/15/2022]
Abstract
During metazoan development, 3' UTR signals mediate the time and place of gene expression. For protozoan Plasmodium parasites, the formation of ookinetes from gametes in the mosquito midgut is an analogous developmental process. Previous studies of the 3' UTR signals necessary for expression of Pgs28, the major surface protein of Plasmodium gallinaceum ookinetes, suggested that a 3' UTR T-rich region and DNA sequences containing an ATTAAA eukaryotic polyadenylation consensus motif were necessary for its expression. During metazoan development, U-rich elements may function in conjunction with eukaryotic polyadenylation consensus signals to mediate developmental protein expression. To define whether the putative Plasmodium elements were mediators of Pgs28 expression mutations of these nucleotide sequences were made in plasmid constructs. The effect of the mutations on Pgs28 expression was tested by the transient gene transfection of sexual stage P. gallinaceum parasites. These studies reveal that two different mutations of the ATTAAA motif, which alter gene expression in higher eukaryotes and yeast, do not alter the expression of Pgs28. However, the U-rich element, adjacent nucleotides UUUACAAAAUUGUUUUAACU and downstream nucleotides UAUAUAAAA are able to mediate expression to varying degrees. The organization and overlapping function of these elements appears to more closely resemble that of yeasts or plants than those of metazoans.
Collapse
Affiliation(s)
- Helen Cann
- Department of Medicine, Division of International Medicine and Infectious Diseases, Weill Medical College of Cornell University, 1300 York Avenue, Room A421, New York, NY 10021, USA
| | | | | | | |
Collapse
|
42
|
Güldener U, Münsterkötter M, Kastenmüller G, Strack N, van Helden J, Lemer C, Richelles J, Wodak SJ, García-Martínez J, Pérez-Ortín JE, Michael H, Kaps A, Talla E, Dujon B, André B, Souciet JL, De Montigny J, Bon E, Gaillardin C, Mewes HW. CYGD: the Comprehensive Yeast Genome Database. Nucleic Acids Res 2005; 33:D364-8. [PMID: 15608217 PMCID: PMC540007 DOI: 10.1093/nar/gki053] [Citation(s) in RCA: 210] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
The Comprehensive Yeast Genome Database (CYGD) compiles a comprehensive data resource for information on the cellular functions of the yeast Saccharomyces cerevisiae and related species, chosen as the best understood model organism for eukaryotes. The database serves as a common resource generated by a European consortium, going beyond the provision of sequence information and functional annotations on individual genes and proteins. In addition, it provides information on the physical and functional interactions among proteins as well as other genetic elements. These cellular networks include metabolic and regulatory pathways, signal transduction and transport processes as well as co-regulated gene clusters. As more yeast genomes are published, their annotation becomes greatly facilitated using S.cerevisiae as a reference. CYGD provides a way of exploring related genomes with the aid of the S.cerevisiae genome as a backbone and SIMAP, the Similarity Matrix of Proteins. The comprehensive resource is available under http://mips.gsf.de/genre/proj/yeast/.
Collapse
Affiliation(s)
- U Güldener
- Institute for Bioinformatics, GSF National Research Center for Environment and Health, Ingolstädter Landstrasse 1, D-85764 Neuherberg, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
43
|
Kaplan CD, Holland MJ, Winston F. Interaction between Transcription Elongation Factors and mRNA 3′-End Formation at the Saccharomyces cerevisiae GAL10-GAL7 Locus. J Biol Chem 2005; 280:913-22. [PMID: 15531585 DOI: 10.1074/jbc.m411108200] [Citation(s) in RCA: 79] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Spt6 is a conserved transcription factor that associates with RNA polymerase II (pol II) during elongation. Spt6 is essential for viability in Saccharomyces cerevisiae and regulates chromatin structure during pol II transcription. Here we present evidence that mutations that impair Spt6, a second elongation factor, Spt4, and pol II can affect 3'-end formation at GAL10. Additional analysis suggests that Spt6 is required for cotranscriptional association of the factor Ctr9, a member of the Paf1 complex, with GAL10 and GAL7, and that Ctr9 association with chromatin 3' of GAL10 is regulated by the GAL10 polyadenylation signal. Overall, these results provide new evidence for a connection between the transcription elongation factor Spt6 and 3'-end formation in vivo.
Collapse
Affiliation(s)
- Craig D Kaplan
- Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, USA.
| | | | | |
Collapse
|
44
|
Hajarnavis A, Korf I, Durbin R. A probabilistic model of 3' end formation in Caenorhabditis elegans. Nucleic Acids Res 2004; 32:3392-9. [PMID: 15247332 PMCID: PMC443532 DOI: 10.1093/nar/gkh656] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2004] [Revised: 05/31/2004] [Accepted: 05/31/2004] [Indexed: 11/14/2022] Open
Abstract
The 3' ends of mRNAs terminate with a poly(A) tail. This post-transcriptional modification is directed by sequence features present in the 3'-untranslated region (3'-UTR). We have undertaken a computational analysis of 3' end formation in Caenorhabditis elegans. By aligning cDNAs that diverge from genomic sequence at the poly(A) tract, we accurately identified a large set of true cleavage sites. When there are many transcripts aligned to a particular locus, local variation of the cleavage site over a span of a few bases is frequently observed. We find that in addition to the well-known AAUAAA motif there are several regions with distinct nucleotide compositional biases. We propose a generalized hidden Markov model that describes sequence features in C.elegans 3'-UTRs. We find that a computer program employing this model accurately predicts experimentally observed 3' ends even when there are multiple AAUAAA motifs and multiple cleavage sites. We have made available a complete set of polyadenylation site predictions for the C.elegans genome, including a subset of 6570 supported by aligned transcripts.
Collapse
Affiliation(s)
- Ashwin Hajarnavis
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB10 1SA, UK
| | | | | |
Collapse
|
45
|
Mangus DA, Smith MM, McSweeney JM, Jacobson A. Identification of factors regulating poly(A) tail synthesis and maturation. Mol Cell Biol 2004; 24:4196-206. [PMID: 15121841 PMCID: PMC400472 DOI: 10.1128/mcb.24.10.4196-4206.2004] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Posttranscriptional maturation of the 3' end of eukaryotic pre-mRNAs occurs as a three-step pathway involving site-specific cleavage, polymerization of a poly(A) tail, and trimming of the newly synthesized tail to its mature length. While most of the factors essential for catalyzing these reactions have been identified, those that regulate them remain to be characterized. Previously, we demonstrated that the yeast protein Pbp1p associates with poly(A)-binding protein (Pab1p) and controls the extent of mRNA polyadenylation. To further elucidate the function of Pbp1p, we conducted a two-hybrid screen to identify factors with which it interacts. Five genes encoding putative Pbp1p-interacting proteins were identified, including (i) FIR1/PIP1 and UFD1/PIP3, genes encoding factors previously implicated in mRNA 3'-end processing; (ii) PBP1 itself, confirming directed two-hybrid results and suggesting that Pbp1p can multimerize; (iii) DIG1, encoding a mitogen-activated protein kinase-associated protein; and (iv) PBP4 (YDL053C), a previously uncharacterized gene. In vitro polyadenylation reactions utilizing extracts derived from fir1 Delta and pbp1 Delta cells and from cells lacking the Fir1p interactor, Ref2p, demonstrated that Pbp1p, Fir1p, and Ref2p are all required for the formation of a normal-length poly(A) tail on precleaved CYC1 pre-mRNA. Kinetic analyses of the respective polyadenylation reactions indicated that Pbp1p is a negative regulator of poly(A) nuclease (PAN) activity and that Fir1p and Ref2p are, respectively, a positive regulator and a negative regulator of poly(A) synthesis. We suggest a model in which these three factors and Ufd1p are part of a regulatory complex that exploits Pab1p to link cleavage and polyadenylation factors of CFIA and CFIB (cleavage factors IA and IB) to the polyadenylation factors of CPF (cleavage and polyadenylation factor).
Collapse
Affiliation(s)
- David A Mangus
- Department of Molecular Genetics and Microbiology, University of Massachusetts Medical School, Worcester, Massachusetts 01655, USA
| | | | | | | |
Collapse
|
46
|
Morlando M, Ballarino M, Greco P, Caffarelli E, Dichtl B, Bozzoni I. Coupling between snoRNP assembly and 3' processing controls box C/D snoRNA biosynthesis in yeast. EMBO J 2004; 23:2392-401. [PMID: 15167896 PMCID: PMC423293 DOI: 10.1038/sj.emboj.7600254] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2004] [Accepted: 05/04/2004] [Indexed: 11/08/2022] Open
Abstract
RNA polymerase II transcribes genes encoding proteins and a large number of small stable RNAs. While pre-mRNA 3'-end formation requires a machinery ensuring tight coupling between cleavage and polyadenylation, small RNAs utilize polyadenylation-independent pathways. In yeast, specific factors required for snRNA and snoRNA 3'-end formation were characterized as components of the APT complex that is associated with the core complex of the cleavage/polyadenylation machinery (core-CPF). Other essential factors were identified as independent components: Nrd1p, Nab3p and Sen1p. Here we report that mutations in the conserved box D of snoRNAs and in the snoRNP-specific factor Nop1p interfere with transcription and 3'-end formation of box C/D snoRNAs. We demonstrate that Nop1p is associated with box C/D snoRNA genes and that it interacts with APT components. These data suggest a mechanism of quality control in which efficient transcription and 3'-end formation occur only when nascent snoRNAs are successfully assembled into functional particles.
Collapse
Affiliation(s)
- Mariangela Morlando
- Department of Genetics and Molecular Biology, Institute Pasteur Cenci-Bolognetti, University of Rome ‘La Sapienza', Rome, Italy
| | - Monica Ballarino
- Department of Genetics and Molecular Biology, Institute Pasteur Cenci-Bolognetti, University of Rome ‘La Sapienza', Rome, Italy
| | - Paolo Greco
- Department of Genetics and Molecular Biology, Institute Pasteur Cenci-Bolognetti, University of Rome ‘La Sapienza', Rome, Italy
| | - Elisa Caffarelli
- Institute of Molecular Biology and Pathology of CNR, University of Rome ‘La Sapienza', Rome, Italy
| | - Bernhard Dichtl
- Department of Cell Biology, Biozentrum, University of Basel, Klingelbergstrasse, Basel, Switzerland
| | - Irene Bozzoni
- Department of Genetics and Molecular Biology, Institute Pasteur Cenci-Bolognetti, University of Rome ‘La Sapienza', Rome, Italy
- Institute of Molecular Biology and Pathology of CNR, University of Rome ‘La Sapienza', Rome, Italy
| |
Collapse
|
47
|
Dichtl B, Aasland R, Keller W. Functions for S. cerevisiae Swd2p in 3' end formation of specific mRNAs and snoRNAs and global histone 3 lysine 4 methylation. RNA (NEW YORK, N.Y.) 2004; 10:965-77. [PMID: 15146080 PMCID: PMC1370588 DOI: 10.1261/rna.7090104] [Citation(s) in RCA: 58] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/12/2023]
Abstract
The Saccharomyces cerevisiae WD-40 repeat protein Swd2p associates with two functionally distinct multiprotein complexes: the cleavage and polyadenylation factor (CPF) that is involved in pre-mRNA and snoRNA 3' end formation and the SET1 complex (SET1C) that methylates histone 3 lysine 4. Based on bioinformatic analysis we predict a seven-bladed beta-propeller structure for Swd2p proteins. Northern, transcriptional run-on and in vitro 3' end cleavage analyses suggest that temperature sensitive swd2 strains were defective in 3' end formation of specific mRNAs and snoRNAs. Protein-protein interaction studies support a role for Swd2p in the assembly of 3' end formation complexes. Furthermore, histone 3 lysine 4 di-and tri-methylation were adversely affected and telomeres were shortened in swd2 mutants. Underaccumulation of the Set1p methyltransferase accounts for the observed loss of SET1C activity and suggests a requirement for Swd2p for the stability or assembly of this complex. We also provide evidence that the roles of Swd2p as component of CPF and SET1C are functionally independent. Taken together, our results establish a dual requirement for Swd2p in 3' end formation and histone tail modification.
Collapse
MESH Headings
- Amino Acid Sequence
- Base Sequence
- DNA, Fungal/genetics
- DNA-Binding Proteins/genetics
- DNA-Binding Proteins/metabolism
- Histone-Lysine N-Methyltransferase
- Histones/chemistry
- Histones/metabolism
- Lysine/chemistry
- Macromolecular Substances
- Methylation
- Molecular Sequence Data
- Multiprotein Complexes
- RNA, Fungal/chemistry
- RNA, Fungal/genetics
- RNA, Fungal/metabolism
- RNA, Messenger/chemistry
- RNA, Messenger/genetics
- RNA, Messenger/metabolism
- RNA, Small Nucleolar/chemistry
- RNA, Small Nucleolar/genetics
- RNA, Small Nucleolar/metabolism
- Repetitive Sequences, Amino Acid
- Saccharomyces cerevisiae/genetics
- Saccharomyces cerevisiae/metabolism
- Saccharomyces cerevisiae Proteins/genetics
- Saccharomyces cerevisiae Proteins/metabolism
- Sequence Homology, Amino Acid
- Transcription Factors/genetics
- Transcription Factors/metabolism
- mRNA Cleavage and Polyadenylation Factors/genetics
- mRNA Cleavage and Polyadenylation Factors/metabolism
Collapse
Affiliation(s)
- Bernhard Dichtl
- Department of Cell Biology, Biozentrum, University of Basel, CH-4056 Basel, Switzerland.
| | | | | |
Collapse
|
48
|
Dettwiler S, Aringhieri C, Cardinale S, Keller W, Barabino SML. Distinct sequence motifs within the 68-kDa subunit of cleavage factor Im mediate RNA binding, protein-protein interactions, and subcellular localization. J Biol Chem 2004; 279:35788-97. [PMID: 15169763 DOI: 10.1074/jbc.m403927200] [Citation(s) in RCA: 131] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Cleavage factor I(m) (CF I(m)) is required for the first step in pre-mRNA 3'-end processing and can be reconstituted in vitro from its heterologously expressed 25- and 68-kDa subunits. The binding of CF I(m) to the pre-mRNA is one of the earliest steps in the assembly of the cleavage and polyadenylation machinery and facilitates the recruitment of other processing factors. We identified regions in the subunits of CF I(m) involved in RNA binding, protein-protein interactions, and subcellular localization. CF I(m)68 has a modular domain organization consisting of an N-terminal RNA recognition motif and a C-terminal alternating charge domain. However, the RNA recognition motif of CF I(m)68 on its own is not sufficient to bind RNA but is necessary for association with the 25-kDa subunit. RNA binding appears to require a CF I(m)68/25 heterodimer. Whereas multiple protein interactions with other 3'-end-processing factors are detected with CF I(m)25, CF I(m)68 interacts with SRp20, 9G8, and hTra2beta, members of the SR family of splicing factors, via its C-terminal alternating charge domain. This domain is also required for targeting CF I(m)68 to the nucleus. However, CF I(m)68 does not concentrate in splicing speckles but in foci that partially colocalize with paraspeckles, a subnuclear component in which other proteins involved in transcriptional control and RNA processing have been found.
Collapse
Affiliation(s)
- Sabine Dettwiler
- Department of Cell Biology, Biozentrum, University of Basel, Klingelbergstrasse 70, CH-4056 Basel, Switzerland
| | | | | | | | | |
Collapse
|
49
|
Kaufmann I, Martin G, Friedlein A, Langen H, Keller W. Human Fip1 is a subunit of CPSF that binds to U-rich RNA elements and stimulates poly(A) polymerase. EMBO J 2004; 23:616-26. [PMID: 14749727 PMCID: PMC1271804 DOI: 10.1038/sj.emboj.7600070] [Citation(s) in RCA: 203] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2003] [Accepted: 12/17/2003] [Indexed: 11/09/2022] Open
Abstract
In mammals, polyadenylation of mRNA precursors (pre-mRNAs) by poly(A) polymerase (PAP) depends on cleavage and polyadenylation specificity factor (CPSF). CPSF is a multisubunit complex that binds to the canonical AAUAAA hexamer and to U-rich upstream sequence elements on the pre-mRNA, thereby stimulating the otherwise weakly active and nonspecific polymerase to elongate efficiently RNAs containing a poly(A) signal. Based on sequence similarity to the Saccharomyces cerevisiae polyadenylation factor Fip1p, we have identified human Fip1 (hFip1) and found that the protein is an integral subunit of CPSF. hFip1 interacts with PAP and has an arginine-rich RNA-binding motif that preferentially binds to U-rich sequence elements on the pre-mRNA. Recombinant hFip1 is sufficient to stimulate the in vitro polyadenylation activity of PAP in a U-rich element-dependent manner. hFip1, CPSF160 and PAP form a ternary complex in vitro, suggesting that hFip1 and CPSF160 act together in poly(A) site recognition and in cooperative recruitment of PAP to the RNA. These results show that hFip1 significantly contributes to CPSF-mediated stimulation of PAP activity.
Collapse
Affiliation(s)
- Isabelle Kaufmann
- Department of Cell Biology, Biozentrum, University of Basel, Basel, Switzerland
| | - Georges Martin
- Department of Cell Biology, Biozentrum, University of Basel, Basel, Switzerland
| | - Arno Friedlein
- Roche Genetics, F Hoffmann-La Roche Ltd, Basel, Switzerland
| | - Hanno Langen
- Roche Genetics, F Hoffmann-La Roche Ltd, Basel, Switzerland
| | - Walter Keller
- Department of Cell Biology, Biozentrum, University of Basel, Basel, Switzerland
- Department of Cell Biology, Biozentrum, University of Basel, Klingelbergstrasse 70, CH-4056 Basel, Switzerland. Tel.: +41 61 267 20 60; Fax: +41 61 267 20 79; E-mail:
| |
Collapse
|
50
|
Abstract
The frequencies of individual nucleotides exhibit significant fluctuations across eukaryotic genes. In this paper, we investigate nucleotide variation across an averaged representation of all known human genes. Such a representation allows us to average out random fluctuations that constitute noise and uncover remarkable systematic trends in nucleotide distributions, particularly near boundaries between genetic elements--the promoter, exons, and introns. We propose that such variations result from differential mutational pressures and from the presence of specific regulatory motifs, such as transcription and splicing factor binding sites. Specifically, we observe significant GC and TA biases (excess of G over C and T over A) in noncoding regions of genes. Such biases are most probably caused by transcription-coupled mismatch repair, an effect that has recently been detected in mammalian genes. Subsequently, we examine the distribution of all hexanucleotides and identify motifs that are overrepresented within regulatory regions. By clustering and aligning such sequences, we recognize families of putative regulatory elements involved in exonic and intronic splicing control, and 3' mRNA processing. Some of our motifs have been identified in prior theoretical and experimental studies, thus validating our approach, but we detect several novel sequences that we propose as candidates for future functional assays and mutation screens for genetic disorders.
Collapse
|