1
|
On the function and relevance of alternative 3'-UTRs in gene expression regulation. WILEY INTERDISCIPLINARY REVIEWS-RNA 2021; 12:e1653. [PMID: 33843145 DOI: 10.1002/wrna.1653] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/05/2021] [Revised: 03/15/2021] [Accepted: 03/16/2021] [Indexed: 12/12/2022]
Abstract
Messanger RNA (mRNA) isoforms with alternative 3'-untranslated regions (3'-UTRs) are produced by alternative polyadenylation (APA), which occurs during transcription in most eukaryotic genes. APA fine-tunes gene expression in a cell-type- and cellular state-dependent manner. Selection of an APA site entails the binding of core cleavage and polyadenylation factors to a particular polyadenylation site localized in the pre-mRNA and is controlled by multiple regulatory determinants, including transcription, pre-mRNA cis-regulatory sequences, and protein factors. Alternative 3'-UTRs serve as platforms for specific RNA binding proteins and microRNAs, which regulate gene expression in a coordinated manner by controlling mRNA fate and function in the cell. Genome-wide studies illustrated the full extent of APA prevalence and revealed that specific 3'-UTR profiles are associated with particular cellular states and diseases. Generally, short 3'-UTRs are associated with proliferative and cancer cells, and long 3'-UTRs are mostly found in polarized and differentiated cells. Fundamental new insights on the physiological consequences of this widespread event and the molecular mechanisms involved have been revealed through single-cell studies. Publicly available comprehensive databases that cover all APA mRNA isoforms identified in many cellular states and diseases reveal specific APA signatures. Therapies tackling APA mRNA isoforms or APA regulators may be regarded as innovative and attractive tools for diagnostics or treatment of several pathologies. We highlight the function of APA and alternative 3'-UTRs in gene expression regulation, the control of these mechanisms, their physiological consequences, and their potential use as new biomarkers and therapeutic tools. This article is categorized under: RNA Processing > 3' End Processing RNA Interactions with Proteins and Other Molecules > Protein-RNA Interactions: Functional Implications RNA in Disease and Development > RNA in Disease.
Collapse
|
2
|
Advances in the Bioinformatics Knowledge of mRNA Polyadenylation in Baculovirus Genes. Viruses 2020; 12:v12121395. [PMID: 33291215 PMCID: PMC7762203 DOI: 10.3390/v12121395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2020] [Revised: 11/19/2020] [Accepted: 11/30/2020] [Indexed: 11/17/2022] Open
Abstract
Baculoviruses are a group of insect viruses with large circular dsDNA genomes exploited in numerous biotechnological applications, such as the biological control of agricultural pests, the expression of recombinant proteins or the gene delivery of therapeutic sequences in mammals, among others. Their genomes encode between 80 and 200 proteins, of which 38 are shared by all reported species. Thanks to multi-omic studies, there is remarkable information about the baculoviral proteome and the temporality in the virus gene expression. This allows some functional elements of the genome to be very well described, such as promoters and open reading frames. However, less information is available about the transcription termination signals and, consequently, there are still imprecisions about what are the limits of the transcriptional units present in the baculovirus genomes and how is the processing of the 3′ end of viral mRNA. Regarding to this, in this review we provide an update about the characteristics of DNA signals involved in this process and we contribute to their correct prediction through an exhaustive analysis that involves bibliography information, data mining, RNA structure and a comprehensive study of the core gene 3′ ends from 180 baculovirus genomes.
Collapse
|
3
|
Tissue-specific mechanisms of alternative polyadenylation: Testis, brain, and beyond (2018 update). WILEY INTERDISCIPLINARY REVIEWS-RNA 2019; 10:e1526. [PMID: 30816016 PMCID: PMC6617714 DOI: 10.1002/wrna.1526] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/31/2018] [Revised: 11/05/2018] [Accepted: 01/14/2019] [Indexed: 12/21/2022]
Abstract
Alternative polyadenylation (APA) is how genes choose different sites for 3′ end formation for mRNAs during transcription. APA often occurs in a tissue‐ or developmental stage‐specific manner that can significantly affect gene activity by changing the protein product generated, the stability of the transcript, its localization within the cell, or its translatability. Despite the important regulatory effects that APA has on tissue‐specific gene expression, only a few examples have been characterized mechanistically. In this 2018 update to our 2010 review, we examine mechanisms for the control of APA and update our understanding of the older mechanisms since 2010. We once postulated the existence of tissue‐specific factors in APA. However, while a few tissue‐specific polyadenylation factors are known, the emerging conclusion is that the majority of APA is accomplished by altering levels of core polyadenylation proteins. Examples of those core proteins include CSTF2, CPSF1, and subunits of mammalian cleavage factor I. But despite support for these mechanisms, no one has yet documented any of these proteins changing in either a tissue‐specific or developmental manner. Given the profound effect that APA can have on gene expression and human health, improved understanding of tissue‐specific APA could lead to numerous advances in gene activity control. This article is categorized under:RNA Processing > 3′ End Processing RNA in Disease and Development > RNA in Development
Collapse
|
4
|
The structural basis of CstF-77 modulation of cleavage and polyadenylation through stimulation of CstF-64 activity. Nucleic Acids Res 2018; 46:12022-12039. [PMID: 30257008 PMCID: PMC6294498 DOI: 10.1093/nar/gky862] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2018] [Revised: 08/31/2018] [Accepted: 09/12/2018] [Indexed: 01/14/2023] Open
Abstract
Cleavage and polyadenylation (C/P) of mRNA is an important cellular process that promotes increased diversity of mRNA isoforms and could change their stability in different cell types. The cleavage stimulation factor (CstF) complex, part of the C/P machinery, binds to U- and GU-rich sequences located downstream from the cleavage site through its RNA-binding subunit, CstF-64. Less is known about the function of the other two subunits of CstF, CstF-77 and CstF-50. Here, we show that the carboxy-terminus of CstF-77 plays a previously unrecognized role in enhancing C/P by altering how the RNA recognition motif (RRM) of CstF-64 binds RNA. In support of this finding, we also show that CstF-64 relies on CstF-77 to be transported to the nucleus; excess CstF-64 localizes to the cytoplasm, possibly via interaction with cytoplasmic RNAs. Reverse genetics and nuclear magnetic resonance studies of recombinant CstF-64 (RRM-Hinge) and CstF-77 (monkeytail-carboxy-terminal domain) indicate that the last 30 amino acids of CstF-77 increases the stability of the RRM, thus altering the affinity of the complex for RNA. These results provide new insights into the mechanism by which CstF regulates the location of the RNA cleavage site during C/P.
Collapse
|
5
|
Localization of RNAPII and 3' end formation factor CstF subunits on C. elegans genes and operons. Transcription 2016; 7:96-110. [PMID: 27124504 DOI: 10.1080/21541264.2016.1168509] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Transcription termination is mechanistically coupled to pre-mRNA 3' end formation to prevent transcription much beyond the gene 3' end. C. elegans, however, engages in polycistronic transcription of operons in which 3' end formation between genes is not accompanied by termination. We have performed RNA polymerase II (RNAPII) and CstF ChIP-seq experiments to investigate at a genome-wide level how RNAPII can transcribe through multiple poly-A signals without causing termination. Our data shows that transcription proceeds in some ways as if operons were composed of multiple adjacent single genes. Total RNAPII shows a small peak at the promoter of the gene cluster and a much larger peak at 3' ends. These 3' peaks coincide with maximal phosphorylation of Ser2 within the C-terminal domain (CTD) of RNAPII and maximal localization of the 3' end formation factor CstF. This pattern occurs at all 3' ends including those at internal sites in operons where termination does not occur. Thus the normal mechanism of 3' end formation does not always result in transcription termination. Furthermore, reduction of CstF50 by RNAi did not substantially alter the pattern of CstF64, total RNAPII, or Ser2 phosphorylation at either internal or terminal 3' ends. However, CstF50 RNAi did result in a subtle reduction of CstF64 binding upstream of the site of 3' cleavage, suggesting that the CstF50/CTD interaction may facilitate bringing the 3' end machinery to the transcription complex.
Collapse
|
6
|
CstF-64 supports pluripotency and regulates cell cycle progression in embryonic stem cells through histone 3' end processing. Nucleic Acids Res 2014; 42:8330-42. [PMID: 24957598 PMCID: PMC4117776 DOI: 10.1093/nar/gku551] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Embryonic stem cells (ESCs) exhibit a unique cell cycle with a shortened G1 phase that supports their pluripotency, while apparently buffering them against pro-differentiation stimuli. In ESCs, expression of replication-dependent histones is a main component of this abbreviated G1 phase, although the details of this mechanism are not well understood. Similarly, the role of 3' end processing in regulation of ESC pluripotency and cell cycle is poorly understood. To better understand these processes, we examined mouse ESCs that lack the 3' end-processing factor CstF-64. These ESCs display slower growth, loss of pluripotency and a lengthened G1 phase, correlating with increased polyadenylation of histone mRNAs. Interestingly, these ESCs also express the τCstF-64 paralog of CstF-64. However, τCstF-64 only partially compensates for lost CstF-64 function, despite being recruited to the histone mRNA 3' end-processing complex. Reduction of τCstF-64 in CstF-64-deficient ESCs results in even greater levels of histone mRNA polyadenylation, suggesting that both CstF-64 and τCstF-64 function to inhibit polyadenylation of histone mRNAs. These results suggest that CstF-64 plays a key role in modulating the cell cycle in ESCs while simultaneously controlling histone mRNA 3' end processing.
Collapse
|
7
|
The conserved intronic cleavage and polyadenylation site of CstF-77 gene imparts control of 3' end processing activity through feedback autoregulation and by U1 snRNP. PLoS Genet 2013; 9:e1003613. [PMID: 23874216 PMCID: PMC3708835 DOI: 10.1371/journal.pgen.1003613] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2012] [Accepted: 05/22/2013] [Indexed: 11/29/2022] Open
Abstract
The human gene encoding the cleavage/polyadenylation (C/P) factor CstF-77 contains 21 exons. However, intron 3 (In3) accounts for nearly half of the gene region, and contains a C/P site (pA) with medium strength, leading to short mRNA isoforms with no apparent protein products. This intron contains a weak 5′ splice site (5′SS), opposite to the general trend for large introns in the human genome. Importantly, the intron size and strengths of 5′SS and pA are all highly conserved across vertebrates, and perturbation of these parameters drastically alters intronic C/P. We found that the usage of In3 pA is responsive to the expression level of CstF-77 as well as several other C/P factors, indicating it attenuates the expression of CstF-77 via a negative feedback mechanism. Significantly, intronic C/P of CstF-77 pre-mRNA correlates with global 3′UTR length across cells and tissues. In addition, inhibition of U1 snRNP also leads to regulation of the usage of In3 pA, suggesting that the C/P activity in the cell can be cross-regulated by splicing, leading to coordination between these two processes. Importantly, perturbation of CstF-77 expression leads to widespread alternative cleavage and polyadenylation (APA) and disturbance of cell proliferation and differentiation. Thus, the conserved intronic pA of the CstF-77 gene may function as a sensor for cellular C/P and splicing activities, controlling the homeostasis of CstF-77 and C/P activity and impacting cell proliferation and differentiation. Autoregulation is commonly used in biological systems to control the homeostasis of certain activity, and cross-regulation coordinates multiple processes. We show that vertebrate genes encoding the cleavage/polyadenylation (C/P) factor CstF-77 contain a conserved intronic C/P site (pA) which regulates CstF-77 expression through a negative feedback loop. Since the usage of this intronic pA is also responsive to the expression of other C/P factors, the pA can function as a sensor for the cellular C/P activity. Because the CstF-77 level is important for the usage of a large number of pAs in the genome and is particularly critical for expression of genes involved in cell cycle, this autoregulatory mechanism has far-reaching implications for cell proliferation and differentiation. The human intron harboring the pA is large and has a weak 5′ splice site, both of which are also highly conserved in other vertebrates. Inhibition of U1 snRNP, which recognizes the 5′ splice site of intron, leads to upregulation of the intronic pA isoform of CstF-77 gene, suggesting that the C/P activity in the cell can be cross-regulated by splicing, leading to coordination between these two processes.
Collapse
|
8
|
Tissue-specific mechanisms of alternative polyadenylation: testis, brain, and beyond. WILEY INTERDISCIPLINARY REVIEWS-RNA 2012; 1:494-501. [PMID: 21956945 DOI: 10.1002/wrna.29] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Changing the position of the poly(A) tail in an mRNA--alternative polyadenylation--is an important mechanism to increase the diversity of gene expression, especially in metazoans. Alternative polyadenylation often occurs in a tissue- or developmental stage-specific manner and can significantly affect gene activity by changing the protein product generated, the stability of the transcript, its localization, or its translatability. Despite the important regulatory effects that alternative polyadenylation have on gene expression, only a sparse few examples have been mechanistically characterized. Here, we review the known mechanisms for the control of alternative polyadenylation, catalog the tissues that demonstrate a propensity for alternative polyadenylation, and focus on the proteins that are known to regulate alternative polyadenylation in specific tissues. We conclude that the field of alternative polyadenylation remains in its infancy, with possibilities for future investigation on the horizon. Given the profound effect alternative polyadenylation can have on gene expression and human health, improved understanding of alternative polyadenylation could lead to numerous advances in control of gene activity.
Collapse
|
9
|
Structural biology of poly(A) site definition. WILEY INTERDISCIPLINARY REVIEWS-RNA 2011; 2:732-47. [PMID: 21823232 DOI: 10.1002/wrna.88] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
3' processing is an essential step in the maturation of all messenger RNAs (mRNAs) and is a tightly coupled two-step reaction: endonucleolytic cleavage at the poly(A) site is followed by the addition of a poly(A) tail, except for metazoan histone mRNAs, which are cleaved but not polyadenylated. The recognition of a poly(A) site is coordinated by the sequence elements in the mRNA 3' UTR and associated protein factors. In mammalian cells, three well-studied sequence elements, UGUA, AAUAAA, and GU-rich, are recognized by three multisubunit factors: cleavage factor I(m) (CFI(m) ), cleavage and polyadenylation specificity factor (CPSF), and cleavage stimulation factor (CstF), respectively. In the yeast Saccharomyces cerevisiae, UA repeats and A-rich sequence elements are recognized by Hrp1p and cleavage factor IA. Structural studies of protein-RNA complexes have helped decipher the mechanisms underlying sequence recognition and shed light on the role of protein factors in poly(A) site selection and 3' processing machinery assembly. In this review we focus on the interactions between the mRNA cis-elements and the protein factors (CFI(m) , CPSF, CstF, and homologous factors from yeast and other eukaryotes) that define the poly(A) site. WIREs RNA 2011 2 732-747 DOI: 10.1002/wrna.88 For further resources related to this article, please visit the WIREs website.
Collapse
|
10
|
Global changes in processing of mRNA 3' untranslated regions characterize clinically distinct cancer subtypes. Cancer Res 2010; 69:9422-30. [PMID: 19934316 DOI: 10.1158/0008-5472.can-09-2236] [Citation(s) in RCA: 117] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Molecular cancer diagnostics are an important clinical advance in cancer management, but new methods are still needed. In this context, gene expression signatures obtained by microarray represent a useful molecular diagnostic. Here, we describe novel probe-level microarray analyses that reveal connections between mRNA processing and neoplasia in multiple tumor types, with diagnostic potential. We now show that characteristic differences in mRNA processing, primarily in the 3'-untranslated region, define molecular signatures that can distinguish similar tumor subtypes with different survival characteristics, with at least 74% accuracy. Using a mouse model of B-cell leukemia/lymphoma, we find that differences in transcript isoform abundance are likely due to both alternative polyadenylation (APA) and differential degradation. While truncation of the 3'-UTR is the most common observed pattern, genes with elongated transcripts were also observed, and distinct groups of affected genes are found in related but distinct tumor types. Genes with elongated transcripts are overrepresented in ontology categories related to cell-cell adhesion and morphology. Analysis of microarray data from human primary tumor samples revealed similar phenomena. Western blot analysis of selected proteins confirms that changes in the 3'-UTR can correlate with changes in protein expression. Our work suggests that alternative mRNA processing, particularly APA, can be a powerful molecular biomarker with prognostic potential. Finally, these findings provide insights into the molecular mechanisms of gene deregulation in tumorigenesis.
Collapse
|
11
|
The hinge domain of the cleavage stimulation factor protein CstF-64 is essential for CstF-77 interaction, nuclear localization, and polyadenylation. J Biol Chem 2009; 285:695-704. [PMID: 19887456 DOI: 10.1074/jbc.m109.061705] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Because polyadenylation is essential for cell growth, in vivo examination of polyadenylation protein function has been difficult. Here we describe a new in vivo assay that allows structure-function assays on CstF-64, a protein that binds to pre-mRNAs downstream of the cleavage site for accurate and efficient polyadenylation. In this assay (the stem-loop luciferase assay for polyadenylation, SLAP), expression of a luciferase pre-mRNA with a modified downstream sequence element was made dependent upon co-expression of an MS2-CstF-64 fusion protein. We show here that SLAP accurately reflects CstF-64-dependent polyadenylation, confirming the validity of this assay. Using SLAP, we determined that CstF-64 domains involved in RNA binding, interaction with CstF-77 (the "Hinge" domain), and coupling to transcription are critical for polyadenylation. Further, we showed that the Hinge domain is necessary for CstF-64 interaction with CstF-77 and consequent nuclear localization, suggesting that nuclear import of a preformed CstF complex is an essential step in polyadenylation.
Collapse
|
12
|
Enterovirus 71 3C protease cleaves a novel target CstF-64 and inhibits cellular polyadenylation. PLoS Pathog 2009; 5:e1000593. [PMID: 19779565 PMCID: PMC2742901 DOI: 10.1371/journal.ppat.1000593] [Citation(s) in RCA: 120] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2009] [Accepted: 08/27/2009] [Indexed: 12/23/2022] Open
Abstract
Identification of novel cellular proteins as substrates to viral proteases would provide a new insight into the mechanism of cell-virus interplay. Eight nuclear proteins as potential targets for enterovirus 71 (EV71) 3C protease (3C(pro)) cleavages were identified by 2D electrophoresis and MALDI-TOF analysis. Of these proteins, CstF-64, which is a critical factor for 3' pre-mRNA processing in a cell nucleus, was selected for further study. A time-course study to monitor the expression levels of CstF-64 in EV71-infected cells also revealed that the reduction of CstF-64 during virus infection was correlated with the production of viral 3C(pro). CstF-64 was cleaved in vitro by 3C(pro) but neither by mutant 3C(pro) (in which the catalytic site was inactivated) nor by another EV71 protease 2A(pro). Serial mutagenesis was performed in CstF-64, revealing that the 3C(pro) cleavage sites are located at position 251 in the N-terminal P/G-rich domain and at multiple positions close to the C-terminus of CstF-64 (around position 500). An accumulation of unprocessed pre-mRNA and the depression of mature mRNA were observed in EV71-infected cells. An in vitro assay revealed the inhibition of the 3'-end pre-mRNA processing and polyadenylation in 3C(pro)-treated nuclear extract, and this impairment was rescued by adding purified recombinant CstF-64 protein. In summing up the above results, we suggest that 3C(pro) cleavage inactivates CstF-64 and impairs the host cell polyadenylation in vitro, as well as in virus-infected cells. This finding is, to our knowledge, the first to demonstrate that a picornavirus protein affects the polyadenylation of host mRNA.
Collapse
|
13
|
A core complex of CPSF73, CPSF100, and Symplekin may form two different cleavage factors for processing of poly(A) and histone mRNAs. Mol Cell 2009; 34:322-32. [PMID: 19450530 DOI: 10.1016/j.molcel.2009.04.024] [Citation(s) in RCA: 94] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2009] [Revised: 04/14/2009] [Accepted: 04/24/2009] [Indexed: 11/23/2022]
Abstract
Metazoan histone mRNAs are unique: their pre-mRNAs contain no introns, and the mRNAs are not polyadenylated, ending instead in a conserved stem-loop structure. In Drosophila, canonical poly(A) signals are located downstream of the normal cleavage site of each histone gene and are utilized when histone 3' end formation is inhibited. Here we define a subcomplex of poly(A) factors that are required for histone pre-mRNA processing. We demonstrate that Symplekin, CPSF73, and CPSF100 are present in a stable complex and interact with histone-specific processing factors. We use chromatin immunoprecipitation to show that Symplekin and CPSF73, but not CstF50, cotranscriptionally associate with histone genes. Depletion of SLBP recruits CstF50 to histone genes. Knockdown of CPSF160 or CstF64 downregulates Symplekin but does not affect histone pre-mRNA processing or association of Symplekin with the histone locus. These results suggest that a common core cleavage factor is required for processing of histone and polyadenylated pre-mRNAs.
Collapse
|
14
|
Abstract
Most eukaryotic mRNA precursors (premRNAs) must undergo extensive processing, including cleavage and polyadenylation at the 3'-end. Processing at the 3'-end is controlled by sequence elements in the pre-mRNA (cis elements) as well as protein factors. Despite the seeming biochemical simplicity of the processing reactions, more than 14 proteins have been identified for the mammalian complex, and more than 20 proteins have been identified for the yeast complex. The 3'-end processing machinery also has important roles in transcription and splicing. The mammalian machinery contains several sub-complexes, including cleavage and polyadenylation specificity factor, cleavage stimulation factor, cleavage factor I, and cleavage factor II. Additional protein factors include poly(A) polymerase, poly(A)-binding protein, symplekin, and the C-terminal domain of RNA polymerase II largest subunit. The yeast machinery includes cleavage factor IA, cleavage factor IB, and cleavage and polyadenylation factor.
Collapse
|
15
|
A birth-to-death view of mRNA from the RNA recognition motif perspective. BIOCHEMISTRY AND MOLECULAR BIOLOGY EDUCATION : A BIMONTHLY PUBLICATION OF THE INTERNATIONAL UNION OF BIOCHEMISTRY AND MOLECULAR BIOLOGY 2008; 36:1-8. [PMID: 21591152 DOI: 10.1002/bmb.20149] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
RNA binding proteins are a large and varied group of factors that are the driving force behind post-transcriptional gene regulation. By analogy with transcription factors, RNA binding proteins bind to various regions of the mRNAs that they regulate, usually upstream or downstream from the coding region, and modulate one of the five major processes in mRNA metabolism: splicing, polyadenylation, export, translation and decay. The most abundant RNA binding protein domain is called the RNA Recognition Motif (RRM)1. It is probably safe to say that an RRM-containing protein is making some contact with an mRNA throughout its existence. The transcriptional counterpart would likely be the histones, yet the multitude of specific functions that are results of RRM based interactions belies the universality of the motif. This complex and diverse application of a single protein motif was used as the basis to develop an advanced graduate level seminar course in RNA:protein interactions. The course, utilizing a learner-centered empowerment model, was developed to dissect each step in RNA metabolism from the perspective of an RRM containing protein. This provided a framework to discuss the development of specificity for the RRM for each required process.
Collapse
|
16
|
Partner proteins that interact with Clonorchis sinensis WD40-repeat protein. Parasitol Res 2007; 101:1233-8. [PMID: 17618461 DOI: 10.1007/s00436-007-0625-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2007] [Accepted: 05/31/2007] [Indexed: 11/30/2022]
Abstract
WD40-repeat proteins have four to eight repeat units, which have Gly-His (GH) and Trp-Asp (WD) at both termini and fold into a beta-propeller. In particular, the WD40-repeat protein of Clonorchis sinensis (CsWD1) has seven WD-repeat units and is expressed stage-specifically in metacercariae. By yeast two-hybrid screening, putative interacting protein cDNAs were cloned from a C. sinensis metacercaria cDNA library and purified further by higher stringency screening and lacZ colony-lift assay. After assessing their nucleotide and polypeptide sequences, 21 putative partner protein cDNAs were selected and assembled into 14 clones. Using YRG2 strain yeast, 12 putative partner protein clones were confirmed to interact with CsWD1 protein. These 12 proteins were grouped into functional categories, i.e., signal proteins, transporters, proteases, and muscle proteins. These results suggest that CsWD1 protein is associated with intracellular protein translocation and cell cycle control in C. sinensis metacercaria.
Collapse
|
17
|
The C-terminal domains of vertebrate CstF-64 and its yeast orthologue Rna15 form a new structure critical for mRNA 3'-end processing. J Biol Chem 2006; 282:2101-15. [PMID: 17116658 DOI: 10.1074/jbc.m609981200] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
Yeast Rna15 and its vertebrate orthologue CstF-64 play critical roles in mRNA 3 '-end processing and in transcription termination downstream of poly(A) sites. These proteins contain N-terminal domains that recognize the poly(A) site, but little is known about their highly conserved C-terminal regions. Here we show by NMR that the C-terminal domains of CstF-64 and Rna15 fold into a three-helix bundle with an uncommon topological arrangement. The structure defines a cluster of evolutionary conserved yet exposed residues we show to be essential for the interaction between Pcf11 and Rna15. Furthermore, we demonstrate that this interaction is critical for the function of Rna15 in 3 '-end processing but dispensable for transcription termination. The C-terminal domain of the Rna15 homologue Pti1 contains critical sequence alterations within this region that are predicted to prevent Pcf11 interaction, providing an explanation for the distinct functions of these two closely related proteins in the 3 '-end formation of RNA polymerase II transcripts. These results define the role of the C-terminal half of Rna15 and provide insight into the network of protein/protein interactions responsible for assembly of the 3 '-end processing apparatus.
Collapse
|
18
|
An intronic polyadenylation site in human and mouse CstF-77 genes suggests an evolutionarily conserved regulatory mechanism. Gene 2006; 366:325-34. [PMID: 16316725 DOI: 10.1016/j.gene.2005.09.024] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2005] [Revised: 08/09/2005] [Accepted: 09/22/2005] [Indexed: 01/24/2023]
Abstract
Human CstF-77 is one of the three subunits of cleavage stimulation factor (CstF) that is essential for mRNA polyadenylation. Its Drosophila homologue, suppressor of forked [su(f)], contains an intronic poly(A) site, which can lead to a short transcript without a stop codon. By both bioinformatic searches and validation with molecular biology experiments, we found that human and mouse CstF-77 genes also contain an intronic poly(A) site, which can be utilized to produce short CstF-77 transcripts lacking sequences encoding domains that are involved in many of the CstF-77 functions. The genomic sequence surrounding the poly(A) site is highly conserved among all vertebrates, but is not present in non-vertebrate species. Using public Serial Analysis of Gene Expression (SAGE) data, we found that the intronic poly(A) site is utilized in a wide range of tissues. This finding indicates that vertebrates may employ a similar alternative polyadenylation mechanism to modulate CstF-77, highlighting the importance of the regulation of CstF-77 in various species.
Collapse
|
19
|
Analysis of gene expression in rabbit nuclear transfer embryos: Use of single-embryo mRNA differential display. Dev Growth Differ 2004; 45:543-51. [PMID: 14706078 DOI: 10.1111/j.1440-169x.2003.00720.x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
Lack of or abnormal expression of developmentally important genes is believed to hamper early development of the nuclear transfer (NT) embryo. To identify stage-specific genes in rabbit NT embryo development, mRNA differential display was used to compare the mRNA content of rabbit NT embryos at different developmental stages, from Metaphase II oocytes to 8-16-cell stage embryos. Thirty-four zygotic transcripts, which abruptly appeared at the 8-16-cell stage in rabbit NT embryos, were isolated; 11 of these were potential novel genes with no matches in the current databases. Of the remaining 23, 12 were matched with established sequence tags with functions uncharacterized and the other 11 were homologous to those in the European Molecular Biology Laboratory (EMBL) and GenBank databases. The differential expression of eight of the 34 amplicons were confirmed by reverse Northern blotting, and four positive clones were validated. Previous studies and present data indicated that these three genes were probably related to preimplantation rabbit embryo development.
Collapse
|
20
|
Abstract
Vertebrate polyadenylation sites are identified by the AAUAAA signal and by GU-rich sequences downstream of the cleavage site. These are recognized by a heterotrimeric protein complex (CstF) through its 64 kDa subunit (CstF-64); the strength of this interaction affects the efficiency of poly(A) site utilization. We present the structure of the RNA-binding domain of CstF-64 containing an RNA recognition motif (RRM) augmented by N- and C-terminal helices. The C-terminal helix unfolds upon RNA binding and extends into the hinge domain where interactions with factors responsible for assembly of the polyadenylation complex occur. We propose that this conformational change initiates assembly. Consecutive Us are required for a strong CstF-GU interaction and we show how UU dinucleotides are recognized. Contacts outside the UU pocket fine tune the protein-RNA interaction and provide different affinities for distinct GU-rich elements. The protein-RNA interface remains mobile, most likely a requirement to bind many GU-rich sequences and yet discriminate against other RNAs. The structural distinction between sequences that form stable and unstable complexes provides an operational distinction between weakly and strongly processed poly(A) sites.
Collapse
|
21
|
Downstream elements of mammalian pre-mRNA polyadenylation signals: primary, secondary and higher-order structures. Nucleic Acids Res 2003; 31:1375-86. [PMID: 12595544 PMCID: PMC149834 DOI: 10.1093/nar/gkg241] [Citation(s) in RCA: 108] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2002] [Accepted: 01/13/2003] [Indexed: 01/06/2023] Open
Abstract
Primary, secondary and higher-order structures of downstream elements of mammalian pre-mRNA polyadenylation signals [poly(A) signals] are re viewed. We have carried out a detailed analysis on our database of 244 human pre-mRNA poly(A) signals in order to characterize elements in their downstream regions. We suggest that the downstream region of the mammalian pre-mRNA poly(A) signal consists of various simple elements located at different distances from each other. Thus, the downstream region is not described by any precise consensus. Searching our database, we found that approximately 80% of pre-mRNAs with the AAUAAA or AUUAAA core upstream elements contain simple downstream elements, consisting of U-rich and/or 2GU/U tracts, the former occurring approximately 2-fold more often than the latter. Approximately one-third of the pre-mRNAs analyzed here contain sequences that may form G-quadruplexes. A substantial number of these sequences are located immediately downstream of the poly(A) signal. A possible role of G-rich sequences in the polyadenylation process is discussed. A model of the secondary structure of the SV40 late pre-mRNA poly(A) signal downstream region is presented.
Collapse
|
22
|
Molecular cloning and sequence analysis of the Anticarsia gemmatalis multicapsid nuclear polyhedrosis virus GP64 glycoprotein. Virus Genes 2003; 26:57-69. [PMID: 12680694 DOI: 10.1023/a:1022382106174] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
The gp64 locus of Anticarsia gemmatalis multicapsid nucleopolyhedrovirus isolate Santa Fe (AgMNPV-SF) was characterised molecularly in our laboratory. To this end, we have located and cloned a AgMNPV-SF genomic DNA fragment containing the gp64 gene and sequenced the complete gp64 locus. Nucleotide sequence analysis indicated that the AgMNPV gp64 gene consists of a 1500 nucleotide open reading frame (ORF), encoding a protein of 499 amino acids. Of the seven gp64 homologues identified to date, the AgMNPV gp64 ORF shared most sequence similarity with the gp64 gene of Orgyia pseudotsugata MNPV. The GP64 from AgMNPV is the smallest baculoviral envelope glycoprotein found to date, differing in 10 or more residues from the other group I nucleopolyhedroviruses. The biological activity of AgMNPV GP64 protein was assessed by cell fusion assays in UFL-AG-286 cells using the obtained recombinant plasmids. In the upstream and downstream regions, relative to the gp64 ORF, we found different conserved transcriptional and post-transcriptional regulatory elements, respectively.
Collapse
|
23
|
The Gene CSTF2T, Encoding the Human Variant CstF-64 Polyadenylation Protein τCstF-64, Lacks Introns and May Be Associated with Male Sterility. Genomics 2002. [DOI: 10.1006/geno.2002.6862] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
24
|
Chimeric human CstF-77/Drosophila Suppressor of forked proteins rescue suppressor of forked mutant lethality and mRNA 3' end processing in Drosophila. Proc Natl Acad Sci U S A 2002; 99:10593-8. [PMID: 12149458 PMCID: PMC124984 DOI: 10.1073/pnas.162191899] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The Suppressor of forked [Su(f)] protein is the Drosophila homologue of CstF-77, a subunit of human cleavage stimulation factor (CstF) that is required for the first step of the mRNA 3' end processing reaction in vitro. We have addressed directly the role of su(f) in the mRNA 3' end processing reaction in vivo. We show that su(f) is required for the cleavage of pre-mRNA during mRNA 3' end formation. Analysis of the functional complementation between Su(f) and CstF-77 shows that most of the Drosophila protein (85%) can be exchanged for the human protein to produce chimeric CstF-77/Su(f) proteins that rescue lethality and cleavage defect during mRNA 3' end formation in su(f) mutants. Interestingly, we show that a domain in human CstF-77 is limiting for the rescue and that this domain is not able to reproduce protein interactions with the CstF subunits of Drosophila. We also show that chimeric CstF-77/Su(f) proteins that rescue lethality of su(f) mutants cannot restore utilization of a regulated poly(A) site in Drosophila. Taken together, these results demonstrate that CstF-77 and Su(f) have the same function in mRNA 3' end formation in vivo, but that these two proteins are not interchangeable for regulation of poly(A) site utilization.
Collapse
|
25
|
Overexpression of the CstF-64 and CPSF-160 polyadenylation protein messenger RNAs in mouse male germ cells. Biol Reprod 2001; 64:1722-9. [PMID: 11369601 DOI: 10.1095/biolreprod64.6.1722] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2022] Open
Abstract
Messenger RNAs for several components of the transcriptional apparatus are greatly overexpressed in postmeiotic male germ cells in rodents (Schmidt and Schibler, Development 1995; 121:2373-2383). Because of the tight coupling of polyadenylation and transcription, we examined expression in germ cells of mRNAs for key polyadenylation factors. The mRNA for the 64 000 M(r) subunit of the cleavage stimulation factor (CstF-64) was expressed at least 250-fold greater in mouse testicular RNA than in liver RNA. RNA blot analysis showed that the mRNA for the 160 000 M(r) subunit of the cleavage and polyadenylation specificity factor was similarly overexpressed, as was the mRNA for the large subunit of RNA polymerase II. General transcription factors, such as the TATA-binding protein and transcription factor IIH, and splicing factors, such as components of the small nuclear ribonucleoproteins, were also expressed in meiotic and postmeiotic germ cells. The X-linked CstF-64 protein is expressed before and after but not during meiosis in the mouse (Wallace et al., Proc Natl Acad Sci U S A 1999; 96:6763-6768), which suggests that overexpression of mRNA transcription and processing factors plays an essential role in postmeiotic germ cell mRNA metabolism.
Collapse
|
26
|
Evolutionarily conserved interaction between CstF-64 and PC4 links transcription, polyadenylation, and termination. Mol Cell 2001; 7:1013-23. [PMID: 11389848 DOI: 10.1016/s1097-2765(01)00236-2] [Citation(s) in RCA: 116] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/16/2022]
Abstract
Tight connections exist between transcription and subsequent processing of mRNA precursors, and interactions between the transcription and polyadenylation machineries seem especially extensive. Using a yeast two-hybrid screen to identify factors that interact with the polyadenylation factor CstF-64, we uncovered an interaction with the transcriptional coactivator PC4. Both human proteins have yeast homologs, Rna15p and Sub1p, respectively, and we show that these two proteins also interact. Given evidence that certain polyadenylation factors, including Rna15p, are necessary for termination in yeast, we show that deletion or overexpression of SUB1 suppresses or enhances, respectively, both growth and termination defects detected in an rna15 mutant strain. Our findings provide an additional, unexpected connection between transcription and polyadenylation and suggest that PC4/Sub1p, via its interaction with CstF-64/Rna15p, possesses an evolutionarily conserved antitermination activity.
Collapse
|
27
|
Abstract
The molecular connections between mRNA 3' end processing and transcriptional termination have been investigated in S. pombe using a genetic screen. By this approach, we have identified a RNAP II termination domain in the well-defined cleavage polyadenylation factor called CstF-64 in metazoans and Rna15p in S. cerevisiae. Furthermore, this C-terminal domain interacts with Res2, previously identified as a component of the G1/S transition-specific transcription factor MBF. Deletion of res2 in both fission and budding yeast results in a defect in 3' end formation. This raises the possibility that RNAP II transcriptional termination may in some situations be integrated with cell cycle events.
Collapse
|
28
|
The gene for a variant form of the polyadenylation protein CstF-64 is on chromosome 19 and is expressed in pachytene spermatocytes in mice. J Biol Chem 2001; 276:8044-50. [PMID: 11113135 DOI: 10.1074/jbc.m009091200] [Citation(s) in RCA: 41] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Many mRNAs in male germ cells lack the canonical AAUAAA but are normally polyadenylated (Wallace, A. M., Dass, B., Ravnik, S. E., Tonk, V., Jenkins, N. A., Gilbert, D. J., Copeland, N. G., and MacDonald, C. C. (1999) Proc. Natl. Acad Sci. U. S. A. 96, 6763-6768). Previously, we demonstrated the presence of two distinct forms of the M(r) 64,000 protein of the cleavage stimulation factor (CstF-64) in mouse male germ cells and in brain, a somatic M(r) 64,000 form and a variant M(r) 70,000 form. The variant form was specific to meiotic and postmeiotic germ cells. We localized the gene for the somatic CstF-64 to the X chromosome, which would be inactivated during male meiosis. This suggested that the variant CstF-64 was an autosomal homolog activated during that time. We have named the variant form "tau CstF-64," and we describe here the cloning and characterization of the mouse tauCstF-64 cDNA, which maps to chromosome 19. The mouse tauCstF-64 protein fits the criteria of the variant CstF-64, including antibody reactivity, size, germ cell expression, and a common proteolytic digest pattern with tauCstF-64 from testis. Features of mtauCstF-64 that might allow it to promote the germ cell pattern of polyadenylation include a Pro --> Ser substitution in the RNA-binding domain and significant changes in the region that interacts with CstF-77.
Collapse
|
29
|
Tissue-specific autoregulation of Drosophila suppressor of forked by alternative poly(A) site utilization leads to accumulation of the suppressor of forked protein in mitotically active cells. RNA (NEW YORK, N.Y.) 2000; 6:1529-1538. [PMID: 11105753 PMCID: PMC1370023 DOI: 10.1017/s1355838200001266] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
The Suppressor of forked protein is the Drosophila homolog of the 77K subunit of human cleavage stimulation factor, a complex required for the first step of the mRNA 3'-end-processing reaction. We have shown previously that wild-type su(f) function is required for the accumulation of a truncated su(f) transcript polyadenylated in intron 4 of the gene. This led us to propose a model in which the Su(f) protein would negatively regulate its own accumulation by stimulating 3'-end formation of this truncated su(f) RNA. In this article, we demonstrate this model and show that su(f) autoregulation is tissue specific. The Su(f) protein accumulates at a high level in dividing tissues, but not in nondividing tissues. We show that this distribution of the Su(f) protein results from stimulation by Su(f) of the tissue-specific utilization of the su(f) intronic poly(A) site, leading to the accumulation of the truncated su(f) transcript in nondividing tissues. Utilization of this intronic poly(A) site is affected in a su(f) mutant and restored in the mutant with a transgene encoding wild-type Su(f) protein. These data provide an in vivo example of cell-type-specific regulation of a protein level by poly(A) site choice, and confirm the role of Su(f) in regulation of poly(A) site utilization.
Collapse
|