51
Pan YJ, Liu BW, Pei DS. The Role of Alternative Splicing in Cancer: Regulatory Mechanism, Therapeutic Strategy, and Bioinformatics Application. DNA Cell Biol 2022; 41:790-809. [PMID: 35947859 DOI: 10.1089/dna.2022.0322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/12/2022] Open
Abstract
Alternative splicing (AS) can generate distinct transcripts and subsequent isoforms that play differential functions from the same pre-mRNA. Recently, increasing numbers of studies have emerged, unmasking the association between AS and cancer. In this review, we arranged AS events that are closely related to cancer progression and presented promising treatments based on AS for cancer therapy. Obtaining proliferative capacity, acquiring invasive properties, gaining angiogenic features, shifting metabolic ability, and getting immune escape inclination are all splicing events involved in biological processes. Spliceosome-targeted and antisense oligonucleotide technologies are two novel strategies that are hopeful in tumor therapy. In addition, bioinformatics applications based on AS were summarized for better prediction and elucidation of regulatory routines mingled in. Together, we aimed to provide a better understanding of complicated AS events associated with cancer biology and reveal AS a promising target of cancer treatment in the future.
Affiliation(s)
- Yao-Jie Pan
  - Department of Pathology, Laboratory of Clinical and Experimental Pathology, Xuzhou Medical University, Xuzhou, China
- Bo-Wen Liu
  - Department of General Surgery, Xuzhou Medical University, Xuzhou, China
- Dong-Sheng Pei
  - Department of Pathology, Laboratory of Clinical and Experimental Pathology, Xuzhou Medical University, Xuzhou, China
52
Vorländer MK, Pacheco-Fiallos B, Plaschka C. Structural basis of mRNA maturation: Time to put it together. Curr Opin Struct Biol 2022; 75:102431. [PMID: 35930970 DOI: 10.1016/j.sbi.2022.102431] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Received: 04/23/2022] [Revised: 06/02/2022] [Accepted: 06/14/2022] [Indexed: 11/27/2022]
Abstract
In eukaryotes, the expression of genetic information begins in the cell nucleus with precursor messenger RNA (pre-mRNA) transcription and processing into mature mRNA. The mRNA is subsequently recognized and packaged by proteins into an mRNA ribonucleoprotein complex (mRNP) and exported to the cytoplasm for translation. Each of the nuclear mRNA maturation steps is carried out by a dedicated molecular machine. Here, we highlight recent structural and mechanistic insights into how these machines function, including the capping enzyme, the spliceosome, the 3'-end processing machinery, and the transcription-export complex. While we increasingly understand individual steps of nuclear gene expression, many questions remain. For example, we are only beginning to reveal how mature mRNAs are recognized and packaged for nuclear export and how mRNA maturation events are coupled to transcription and to each other. Advances in the preparation of recombinant and endogenous protein-nucleic acid complexes, cryo-electron microscopy, and machine learning promise exciting insights into the mechanisms of nuclear gene expression and its spatial organization.
Affiliation(s)
- Matthias K Vorländer
  - Research Institute of Molecular Pathology (IMP), Vienna BioCenter (VBC), Campus-Vienna-Biocenter 1, 1030, Vienna, Austria. https://twitter.com/@MVorlandr
- Belén Pacheco-Fiallos
  - Research Institute of Molecular Pathology (IMP), Vienna BioCenter (VBC), Campus-Vienna-Biocenter 1, 1030, Vienna, Austria; Vienna BioCenter PhD Program, Doctoral School of the University of Vienna and Medical University of Vienna, 1030, Vienna, Austria. https://twitter.com/@bpachecofiallos
- Clemens Plaschka
  - Research Institute of Molecular Pathology (IMP), Vienna BioCenter (VBC), Campus-Vienna-Biocenter 1, 1030, Vienna, Austria.
53
Mohamed AA, Vazquez Nunez R, Vos SM. Structural advances in transcription elongation. Curr Opin Struct Biol 2022; 75:102422. [PMID: 35816930 PMCID: PMC9398977 DOI: 10.1016/j.sbi.2022.102422] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Received: 04/06/2022] [Revised: 05/22/2022] [Accepted: 06/02/2022] [Indexed: 11/03/2022]
Abstract
Transcription is the first step of gene expression and involves RNA polymerases. After transcription initiation, RNA polymerase enters elongation followed by transcription termination at the end of the gene. Only recently, structures of transcription elongation complexes bound to key transcription elongation factors have been determined in bacterial and eukaryotic systems. These structures have revealed numerous insights including the basis for transcriptional pausing, RNA polymerase interaction with large complexes such as the ribosome and the spliceosome, and the transition into productive elongation. Here, we review these structures and describe areas for future research.
Affiliation(s)
- Abdallah A Mohamed
  - Massachusetts Institute of Technology, Department of Biology, 31 Ames St., Cambridge, MA 02142, USA. https://twitter.com/AMohamed_98
- Roberto Vazquez Nunez
  - Massachusetts Institute of Technology, Department of Biology, 31 Ames St., Cambridge, MA 02142, USA. https://twitter.com/rjareth
- Seychelle M Vos
  - Massachusetts Institute of Technology, Department of Biology, 31 Ames St., Cambridge, MA 02142, USA.
54
Wu M, Schmid M, Jensen T, Sandelin A. Computational identification of signals predictive for nuclear RNA exosome degradation pathway targeting. NAR Genom Bioinform 2022; 4:lqac071. [PMID: 36128426 PMCID: PMC9477074 DOI: 10.1093/nargab/lqac071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/10/2022] [Revised: 08/05/2022] [Accepted: 09/01/2022] [Indexed: 11/15/2022] Open
Abstract
The RNA exosome degrades transcripts in the nucleoplasm of mammalian cells. Its substrate specificity is mediated by two adaptors: the ‘nuclear exosome targeting (NEXT)’ complex and the ‘poly(A) exosome targeting (PAXT)’ connection. Previous studies have revealed some DNA/RNA elements that differ between the two pathways, but how informative these features are for distinguishing pathway targeting, or whether additional genomic features that are informative for such classifications exist, is unknown. Here, we leverage the wealth of available genomic data and develop machine learning models that predict exosome targets and subsequently rank the features the models use by their predictive power. As expected, features around transcript end sites were most predictive; specifically, the lack of canonical 3′ end processing was highly predictive of NEXT targets. Other associated features, such as promoter-proximal G/C content and 5′ splice sites, were informative, but only for distinguishing NEXT and not PAXT targets. Finally, we discovered predictive features not previously associated with exosome targeting, in particular RNA helicase DDX3X binding sites. Overall, our results demonstrate that nucleoplasmic exosome targeting is to a large degree predictable, and our approach can assess the predictive power of previously known and new features in an unbiased way.
Affiliation(s)
- Mengjun Wu
  - The Bioinformatics Centre, Department of Biology and Biotech and Research Innovation Centre, University of Copenhagen, Ole Maaloes Vej 5, DK-2200 Copenhagen N, Denmark
  - SciLifeLab, Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, 171 65 Solna, Sweden
- Manfred Schmid
  - Department of Molecular Biology and Genetics, Aarhus University, Universitetsbyen 81, Aarhus, DK-8000, Denmark
- Torben Heick Jensen
  - Department of Molecular Biology and Genetics, Aarhus University, Universitetsbyen 81, Aarhus, DK-8000, Denmark
- Albin Sandelin
  - The Bioinformatics Centre, Department of Biology and Biotech and Research Innovation Centre, University of Copenhagen, Ole Maaloes Vej 5, DK-2200 Copenhagen N, Denmark
55
Reixachs‐Solé M, Eyras E. Uncovering the impacts of alternative splicing on the proteome with current omics techniques. WILEY INTERDISCIPLINARY REVIEWS. RNA 2022; 13:e1707. [PMID: 34979593 PMCID: PMC9542554 DOI: 10.1002/wrna.1707] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Received: 09/26/2021] [Revised: 11/27/2021] [Accepted: 11/29/2021] [Indexed: 12/15/2022]
Abstract
The high-throughput sequencing of cellular RNAs has underscored a broad effect of isoform diversification through alternative splicing on the transcriptome. Moreover, the differential production of transcript isoforms from gene loci has been recognized as a critical mechanism in cell differentiation, organismal development, and disease. Yet, the extent of the impact of alternative splicing on protein production and cellular function remains a matter of debate. Multiple experimental and computational approaches have been developed in recent years to address this question. These studies have unveiled how molecular changes at different steps in the RNA processing pathway can lead to differences in protein production and have functional effects. New and emerging experimental technologies open exciting new opportunities to develop new methods to fully establish the connection between messenger RNA expression and protein production and to further investigate how RNA variation impacts the proteome and cell function. This article is categorized under: RNA Processing > Splicing Regulation/Alternative Splicing; Translation > Regulation; RNA Evolution and Genomics > Computational Analyses of RNA.
Affiliation(s)
- Marina Reixachs‐Solé
  - The John Curtin School of Medical Research, Australian National University, Canberra, Australian Capital Territory, Australia
  - EMBL Australia Partner Laboratory Network and the Australian National University, Canberra, Australian Capital Territory, Australia
- Eduardo Eyras
  - The John Curtin School of Medical Research, Australian National University, Canberra, Australian Capital Territory, Australia
  - EMBL Australia Partner Laboratory Network and the Australian National University, Canberra, Australian Capital Territory, Australia
  - Catalan Institution for Research and Advanced Studies, Barcelona, Spain
  - Hospital del Mar Medical Research Institute (IMIM), Barcelona, Spain
56
van Dyck JF, Burns JR, Le Huray KIP, Konijnenberg A, Howorka S, Sobott F. Sizing up DNA nanostructure assembly with native mass spectrometry and ion mobility. Nat Commun 2022; 13:3610. [PMID: 35750666 PMCID: PMC9232653 DOI: 10.1038/s41467-022-31029-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 08/07/2021] [Accepted: 05/30/2022] [Indexed: 11/09/2022] Open
Abstract
Recent interest in biological and synthetic DNA nanostructures has highlighted the need for methods to comprehensively characterize intermediates and end products of multimeric DNA assembly. Here we use native mass spectrometry in combination with ion mobility to determine the mass, charge state and collision cross section of noncovalent DNA assemblies, and thereby elucidate their structural composition, oligomeric state, overall size and shape. We showcase the approach with a prototypical six-subunit DNA nanostructure to reveal how its assembly is governed by the ionic strength of the buffer, as well as how the mass and mobility of heterogeneous species can be well resolved by careful tuning of instrumental parameters. We find that the assembly of the hexameric, barrel-shaped complex is guided by positive cooperativity, while previously undetected higher-order 12- and 18-mer assemblies are assigned to defined larger-diameter geometric structures. Guided by our insight, ion mobility-mass spectrometry is poised to make significant contributions to understanding the formation and structural diversity of natural and synthetic oligonucleotide assemblies relevant in science and technology.
Affiliation(s)
- Jeroen F van Dyck
  - Biomolecular & Analytical Mass Spectrometry, Chemistry Department, University of Antwerp, Antwerpen, Belgium
- Jonathan R Burns
  - Department of Chemistry & Institute of Structural and Molecular Biology, University College London, London, UK
- Kyle I P Le Huray
  - School of Molecular and Cellular Biology & Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, UK
- Albert Konijnenberg
  - Biomolecular & Analytical Mass Spectrometry, Chemistry Department, University of Antwerp, Antwerpen, Belgium; Thermo Fisher Scientific, Eindhoven, The Netherlands
- Stefan Howorka
  - Department of Chemistry & Institute of Structural and Molecular Biology, University College London, London, UK.
- Frank Sobott
  - Biomolecular & Analytical Mass Spectrometry, Chemistry Department, University of Antwerp, Antwerpen, Belgium; School of Molecular and Cellular Biology & Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, UK.
57
Bushhouse DZ, Choi EK, Hertz LM, Lucks JB. How does RNA fold dynamically? J Mol Biol 2022; 434:167665. [PMID: 35659535 DOI: 10.1016/j.jmb.2022.167665] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Received: 02/22/2022] [Revised: 05/26/2022] [Accepted: 05/27/2022] [Indexed: 10/18/2022]
Abstract
Recent advances in interrogating RNA folding dynamics have shown the classical model of RNA folding to be incomplete. Here, we pose three prominent questions for the field that are at the forefront of our understanding of the importance of RNA folding dynamics for RNA function. The first centers on the most appropriate biophysical framework to describe changes to the RNA folding energy landscape that a growing RNA chain encounters during transcriptional elongation. The second focuses on the potential ubiquity of strand displacement - a process by which RNA can rapidly change conformations - and how this process may be generally present in broad classes of seemingly different RNAs. The third raises questions about the potential importance and roles of cellular protein factors in RNA conformational switching. Answers to these questions will greatly improve our fundamental knowledge of RNA folding and function, drive biotechnological advances that utilize engineered RNAs, and potentially point to new areas of biology yet to be discovered.
Affiliation(s)
- David Z Bushhouse
  - Interdisciplinary Biological Sciences Graduate Program, Northwestern University, Evanston, Illinois 60208, USA; Center for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, USA
- Edric K Choi
  - Interdisciplinary Biological Sciences Graduate Program, Northwestern University, Evanston, Illinois 60208, USA; Center for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, USA
- Laura M Hertz
  - Interdisciplinary Biological Sciences Graduate Program, Northwestern University, Evanston, Illinois 60208, USA; Center for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, USA
- Julius B Lucks
  - Interdisciplinary Biological Sciences Graduate Program, Northwestern University, Evanston, Illinois 60208, USA; Center for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, USA; Department of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois 60208, USA; Center for Water Research, Northwestern University, Evanston, Illinois 60208, USA; Center for Engineering Sustainability and Resilience, Northwestern University, Evanston, Illinois 60208, USA.
58
Lee ES, Smith HW, Wolf EJ, Guvenek A, Wang YE, Emili A, Tian B, Palazzo AF. ZFC3H1 and U1-70K promote the nuclear retention of mRNAs with 5' splice site motifs within nuclear speckles. RNA (NEW YORK, N.Y.) 2022; 28:878-894. [PMID: 35351812 PMCID: PMC9074902 DOI: 10.1261/rna.079104.122] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Received: 01/08/2022] [Accepted: 03/12/2022] [Indexed: 05/22/2023]
Abstract
Quality control of mRNA represents an important regulatory mechanism for gene expression in eukaryotes. One component of this quality control is the nuclear retention and decay of misprocessed RNAs. Previously, we demonstrated that mature mRNAs containing a 5' splice site (5'SS) motif, which is typically found in misprocessed RNAs such as intronic polyadenylated (IPA) transcripts, are nuclear retained and degraded. Using high-throughput sequencing of cellular fractions, we now demonstrate that IPA transcripts require the zinc finger protein ZFC3H1 for their nuclear retention and degradation. Using reporter mRNAs, we demonstrate that ZFC3H1 promotes the nuclear retention of mRNAs with intact 5'SS motifs by sequestering them into nuclear speckles. Furthermore, we find that U1-70K, a component of the spliceosomal U1 snRNP, is also required for the nuclear retention of these reporter mRNAs and likely functions in the same pathway as ZFC3H1. Finally, we show that the disassembly of nuclear speckles impairs the nuclear retention of reporter mRNAs with 5'SS motifs. Our results highlight a splicing independent role of U1 snRNP and indicate that it works in conjunction with ZFC3H1 in preventing the nuclear export of misprocessed mRNAs by sequestering them into nuclear speckles.
Affiliation(s)
- Eliza S Lee
  - Department of Biochemistry, University of Toronto, Ontario M5S 1A8, Canada
- Harrison W Smith
  - Department of Biochemistry, University of Toronto, Ontario M5S 1A8, Canada
- Eric J Wolf
  - Department of Molecular Genetics, University of Toronto, Ontario M5S 1A8, Canada
- Aysegul Guvenek
  - Rutgers New Jersey Medical School, Newark, New Jersey 07103, USA
- Yifan E Wang
  - Department of Biochemistry, University of Toronto, Ontario M5S 1A8, Canada
- Andrew Emili
  - Department of Molecular Genetics, University of Toronto, Ontario M5S 1A8, Canada
  - Department of Biochemistry, Boston University School of Medicine, Boston, Massachusetts 02118, USA
- Bin Tian
  - Rutgers New Jersey Medical School, Newark, New Jersey 07103, USA
  - Wistar Institute, Philadelphia, Pennsylvania 19104, USA
59
Screening thousands of transcribed coding and non-coding regions reveals sequence determinants of RNA polymerase II elongation potential. Nat Struct Mol Biol 2022; 29:613-620. [PMID: 35681023 DOI: 10.1038/s41594-022-00785-9] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Received: 07/04/2021] [Accepted: 04/28/2022] [Indexed: 01/07/2023]
Abstract
Precise regulation of transcription by RNA polymerase II (RNAPII) is critical for organismal growth and development. However, what determines whether an engaged RNAPII will synthesize a full-length transcript or terminate prematurely is poorly understood. Notably, RNAPII is far more susceptible to termination when transcribing non-coding RNAs than when synthesizing protein-coding mRNAs, but the mechanisms underlying this are unclear. To investigate the impact of transcribed sequence on elongation potential, we developed a method to screen the effects of thousands of INtegrated Sequences on Expression of RNA and Translation using high-throughput sequencing (INSERT-seq). We found that higher AT content in non-coding RNAs, rather than specific sequence motifs, drives RNAPII termination. Further, we demonstrate that 5' splice sites autonomously stimulate processive transcription, even in the absence of polyadenylation signals. Our results reveal a potent role for the transcribed sequence in dictating gene output and demonstrate the power of INSERT-seq toward illuminating these contributions.
60
Alternative splicing diversifies the skeletal muscle transcriptome during prolonged spaceflight. Skelet Muscle 2022; 12:11. [PMID: 35642060 PMCID: PMC9153194 DOI: 10.1186/s13395-022-00294-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 11/11/2021] [Accepted: 04/05/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND: As the interest in manned spaceflight increases, so does the requirement to understand the transcriptomic mechanisms that underlay the detrimental physiological adaptations of skeletal muscle to microgravity. While microgravity-induced differential gene expression (DGE) has been extensively investigated, the contribution of differential alternative splicing (DAS) to the plasticity and functional status of the skeletal muscle transcriptome has not been studied in an animal model. Therefore, by evaluating both DGE and DAS across spaceflight, we set out to provide the first comprehensive characterization of the transcriptomic landscape of skeletal muscle during exposure to microgravity. METHODS: RNA-sequencing, immunohistochemistry, and morphological analyses were conducted utilizing total RNA and tissue sections isolated from the gastrocnemius and quadriceps muscles of 30-week-old female BALB/c mice exposed to microgravity or ground control conditions for 9 weeks. RESULTS: In response to microgravity, the skeletal muscle transcriptome was remodeled via both DGE and DAS. Importantly, while DGE showed variable gene network enrichment, DAS was enriched in structural and functional gene networks of skeletal muscle, resulting in the expression of alternatively spliced transcript isoforms that have been associated with the physiological changes to skeletal muscle in microgravity, including muscle atrophy and altered fiber type function. Finally, RNA-binding proteins, which are required for regulation of pre-mRNA splicing, were themselves differentially spliced but not differentially expressed, an upstream event that is speculated to account for the downstream splicing changes identified in target skeletal muscle genes. CONCLUSIONS: Our work serves as the first investigation of coordinate changes in DGE and DAS in large limb muscles across spaceflight. It opens up a new opportunity to understand (i) the molecular mechanisms by which splice variants of skeletal muscle genes regulate the physiological adaptations of skeletal muscle to microgravity and (ii) how small molecule splicing regulator therapies might thwart muscle atrophy and alterations to fiber type function during prolonged spaceflight.
61
Structural insights into nuclear transcription by eukaryotic DNA-dependent RNA polymerases. Nat Rev Mol Cell Biol 2022; 23:603-622. [PMID: 35505252 DOI: 10.1038/s41580-022-00476-9] [Citation(s) in RCA: 58] [Impact Index Per Article: 19.3] [Accepted: 03/18/2022] [Indexed: 02/07/2023]
Abstract
The eukaryotic transcription apparatus synthesizes a staggering diversity of RNA molecules. The labour of nuclear gene transcription is, therefore, divided among multiple DNA-dependent RNA polymerases. RNA polymerase I (Pol I) transcribes ribosomal RNA, Pol II synthesizes messenger RNAs and various non-coding RNAs (including long non-coding RNAs, microRNAs and small nuclear RNAs) and Pol III produces transfer RNAs and other short RNA molecules. Pol I, Pol II and Pol III are large, multisubunit protein complexes that associate with a multitude of additional factors to synthesize transcripts that largely differ in size, structure and abundance. The three transcription machineries share common characteristics, but differ widely in various aspects, such as numbers of RNA polymerase subunits, regulatory elements and accessory factors, which allows them to specialize in transcribing their specific RNAs. Common to the three RNA polymerases is that the transcription process consists of three major steps: transcription initiation, transcript elongation and transcription termination. In this Review, we outline the common principles and differences between the Pol I, Pol II and Pol III transcription machineries and discuss key structural and functional insights obtained into the three stages of their transcription processes.
62
Reprogramming RNA processing: an emerging therapeutic landscape. Trends Pharmacol Sci 2022; 43:437-454. [PMID: 35331569 DOI: 10.1016/j.tips.2022.02.011] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Received: 01/06/2022] [Revised: 02/22/2022] [Accepted: 02/24/2022] [Indexed: 12/13/2022]
Abstract
The production of a mature mRNA requires coordination of multiple processing steps, which ultimately control its content, localization, and stability. These steps include some of the largest macromolecular machines in the cell, which were, until recently, considered undruggable due to their biological complexity. Building from an expanded understanding of the underlying mechanisms that drive these processes, a new wave of therapeutics is seeking to target RNA processing. With a focus on impacting gene regulation at the RNA level, such modalities offer potential for sequence-specific resolution in drug design. Here, we review our current understanding of RNA-processing events and their role in gene regulation, with a focus on the therapeutic opportunities that have emerged within this landscape.
63
Control of non-productive RNA polymerase II transcription via its early termination in metazoans. Biochem Soc Trans 2022; 50:283-295. [PMID: 35166324 DOI: 10.1042/bst20201140] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Received: 11/03/2021] [Revised: 01/11/2022] [Accepted: 01/24/2022] [Indexed: 11/17/2022]
Abstract
Transcription establishes the universal first step of gene expression where RNA is produced by a DNA-dependent RNA polymerase. The most versatile of eukaryotic RNA polymerases, RNA polymerase II (Pol II), transcribes a broad range of DNA including protein-coding and a variety of non-coding transcription units. Although Pol II can be configured as a durable enzyme capable of transcribing hundreds of kilobases, there is reliable evidence of widespread abortive Pol II transcription termination shortly after initiation, which is often followed by rapid degradation of the associated RNA. The molecular details underlying this phenomenon are still vague but likely reflect the action of quality control mechanisms on the early Pol II complex. Here, we summarize current knowledge of how and when such promoter-proximal quality control is asserted on metazoan Pol II.
64
Xiao L, Wang J, Ju S, Cui M, Jing R. Disorders and roles of tsRNA, snoRNA, snRNA and piRNA in cancer. J Med Genet 2022; 59:623-631. [PMID: 35145038 DOI: 10.1136/jmedgenet-2021-108327] [Citation(s) in RCA: 47] [Impact Index Per Article: 15.7] [Received: 11/07/2021] [Accepted: 01/24/2022] [Indexed: 11/04/2022]
Abstract
Most small non-coding RNAs (sncRNAs) with regulatory functions are encoded by majority sequences in the human genome, and the emergence of high-throughput sequencing technology has greatly expanded our understanding of sncRNAs. sncRNAs are composed of a variety of RNAs, including tRNA-derived small RNA (tsRNA), small nucleolar RNA (snoRNA), small nuclear RNA (snRNA), PIWI-interacting RNA (piRNA), etc. While for some, sncRNAs' implication in several pathologies is now well established, the potential involvement of tsRNA, snoRNA, snRNA and piRNA in human diseases is only beginning to emerge. Recently, accumulating pieces of evidence demonstrate that tsRNA, snoRNA, snRNA and piRNA play an important role in many biological processes, and their dysregulation is closely related to the progression of cancer. Abnormal expression of tsRNA, snoRNA, snRNA and piRNA participates in the occurrence and development of tumours through different mechanisms, such as transcriptional inhibition and post-transcriptional regulation. In this review, we describe the research progress in the classification, biogenesis and biological function of tsRNA, snoRNA, snRNA and piRNA. Moreover, we emphasised their dysregulation and mechanism of action in cancer and discussed their potential as diagnostic and prognostic biomarkers or therapeutic targets.
Affiliation(s)
- Lin Xiao
  - Department of Laboratory Medicine, Affiliated Hospital of Nantong University, Nantong, Jiangsu, China; Department of Medical School of Nantong University, Nantong University, Nantong, Jiangsu, China
- Jie Wang
  - Department of Medical School of Nantong University, Nantong University, Nantong, Jiangsu, China
- Shaoqing Ju
  - Department of Laboratory Medicine, Affiliated Hospital of Nantong University, Nantong, Jiangsu, China
- Ming Cui
  - Department of Laboratory Medicine, Affiliated Hospital of Nantong University, Nantong, Jiangsu, China; Department of Medical School of Nantong University, Nantong University, Nantong, Jiangsu, China
- Rongrong Jing
  - Department of Laboratory Medicine, Affiliated Hospital of Nantong University, Nantong, Jiangsu, China
65
Promoter-Bound Full-Length Intronic Circular RNAs-RNA Polymerase II Complexes Regulate Gene Expression in the Human Parasite Entamoeba histolytica. Noncoding RNA 2022; 8:ncrna8010012. [PMID: 35202086 PMCID: PMC8876499 DOI: 10.3390/ncrna8010012] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 10/27/2021] [Revised: 01/20/2022] [Accepted: 01/21/2022] [Indexed: 12/12/2022] Open
Abstract
Ubiquitous eukaryotic non-coding circular RNAs are involved in numerous co- and post-transcriptional regulatory mechanisms. Recently, we reported full-length intronic circular RNAs (flicRNAs) in Entamoeba histolytica, with 3′ss–5′ss ligation points and 5′ss GU-rich elements essential for their biogenesis and their suggested role in transcription regulation. Here, we explored how flicRNAs impact gene expression regulation. Using CLIP assays, followed by qRT-PCR, we identified that the RabX13 control flicRNA and virulence-associated flicRNAs were bound to the HA-tagged RNA Pol II C-terminus domain in E. histolytica transformants. The U2 snRNA was also present in such complexes, indicating that they belonged to transcription initiation/elongation complexes. Correspondingly, inhibition of the second step of splicing using boric acid reduced flicRNA formation and modified the expression of their parental genes and non-related genes. flicRNAs were also recovered from chromatin immunoprecipitation eluates, indicating that the flicRNA-Pol II complex was formed in the promoter of their cognate genes. Finally, two flicRNAs were found to be cytosolic, whose functions remain to be uncovered. Here, we provide novel evidence of the role of flicRNAs in gene expression regulation in cis, apparently in a widespread fashion, as an element bound to the RNA polymerase II transcription initiation complex, in E. histolytica.
|
66
|
Malard F, Mackereth CD, Campagne S. Principles and correction of 5'-splice site selection. RNA Biol 2022; 19:943-960. [PMID: 35866748 PMCID: PMC9311317 DOI: 10.1080/15476286.2022.2100971] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Received: 05/25/2022] [Accepted: 07/06/2022] [Indexed: 11/04/2022] Open
Abstract
In Eukarya, immature mRNA transcripts (pre-mRNA) often contain coding sequences, or exons, interleaved by non-coding sequences, or introns. Introns are removed upon splicing, and further regulation of the retained exons leads to alternatively spliced mRNA. The splicing reaction requires the stepwise assembly of the spliceosome, a macromolecular machine composed of small nuclear ribonucleoproteins (snRNPs). This review focuses on the early stage of spliceosome assembly, when U1 snRNP defines each intron 5'-splice site (5'ss) in the pre-mRNA. We first introduce the splicing reaction and the impact of alternative splicing on gene expression regulation. Thereafter, we extensively discuss splicing descriptors that influence the 5'ss selection by U1 snRNP, such as sequence determinants, and interactions mediated by U1-specific proteins or U1 small nuclear RNA (U1 snRNA). We also include examples of diseases that affect the 5'ss selection by U1 snRNP, and discuss recent therapeutic advances that manipulate U1 snRNP 5'ss selectivity with antisense oligonucleotides and small-molecule splicing switches.
Affiliation(s)
- Florian Malard
  - Inserm U1212, CNRS UMR5320, ARNA Laboratory, University of Bordeaux, Bordeaux Cedex, France
- Cameron D Mackereth
  - Inserm U1212, CNRS UMR5320, ARNA Laboratory, University of Bordeaux, Bordeaux Cedex, France
- Sébastien Campagne
  - Inserm U1212, CNRS UMR5320, ARNA Laboratory, University of Bordeaux, Bordeaux Cedex, France
|
67
|
Borao S, Ayté J, Hümmer S. Evolution of the Early Spliceosomal Complex-From Constitutive to Regulated Splicing. Int J Mol Sci 2021; 22:ijms222212444. [PMID: 34830325 PMCID: PMC8624252 DOI: 10.3390/ijms222212444] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 10/20/2021] [Revised: 11/15/2021] [Accepted: 11/16/2021] [Indexed: 12/14/2022] Open
Abstract
Pre-mRNA splicing is a major process in the regulated expression of genes in eukaryotes, and alternative splicing is used to generate different proteins from the same coding gene. Splicing is a catalytic process that removes introns and ligates exons to create the RNA sequence that codifies the final protein. While this is achieved in an autocatalytic process in ancestral group II introns in prokaryotes, the spliceosome has evolved during eukaryogenesis to assist in this process and to finally provide the opportunity for intron-specific splicing. In the early stage of splicing, the RNA 5' and 3' splice sites must be brought within proximity to correctly assemble the active spliceosome and perform the excision and ligation reactions. The assembly of this first complex, termed E-complex, is currently the least understood process. We focused in this review on the formation of the E-complex and compared its composition and function in three different organisms. We highlight the common ancestral mechanisms in S. cerevisiae, S. pombe, and mammals and conclude with a unifying model for intron definition in constitutive and regulated co-transcriptional splicing.
Affiliation(s)
- Sonia Borao
  - Oxidative Stress and Cell Cycle Group, Universitat Pompeu Fabra, 08003 Barcelona, Spain
- José Ayté
  - Oxidative Stress and Cell Cycle Group, Universitat Pompeu Fabra, 08003 Barcelona, Spain
  - Correspondence: (J.A.); (S.H.)
- Stefan Hümmer
  - Oxidative Stress and Cell Cycle Group, Universitat Pompeu Fabra, 08003 Barcelona, Spain
  - Translational Molecular Pathology, Vall d’Hebron Research Institute (VHIR), CIBERONC, 08035 Barcelona, Spain
  - Correspondence: (J.A.); (S.H.)
|
68
|
Aibara S, Dienemann C, Cramer P. Structure of an inactive RNA polymerase II dimer. Nucleic Acids Res 2021; 49:10747-10755. [PMID: 34530439 PMCID: PMC8501987 DOI: 10.1093/nar/gkab783] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 07/02/2021] [Revised: 08/24/2021] [Accepted: 09/14/2021] [Indexed: 02/05/2023] Open
Abstract
Eukaryotic gene transcription is carried out by three RNA polymerases: Pol I, Pol II and Pol III. Although it has long been known that Pol I can form homodimers, it is unclear whether and how the two other RNA polymerases dimerize. Here we present the cryo-electron microscopy (cryo-EM) structure of a mammalian Pol II dimer at 3.5 Å resolution. The structure differs from the Pol I dimer and reveals that one Pol II copy uses its RPB4-RPB7 stalk to penetrate the active centre cleft of the other copy, and vice versa, giving rise to a molecular handshake. The polymerase clamp domain is displaced and mobile, and the RPB7 oligonucleotide-binding fold mimics the DNA–RNA hybrid that occupies the cleft during active transcription. The Pol II dimer is incompatible with nucleic acid binding as required for transcription and may represent an inactive storage form of the polymerase.
Affiliation(s)
- Shintaro Aibara
  - Department of Molecular Biology, Max Planck Institute for Biophysical Chemistry, Am Fassberg 11, 37077, Göttingen, Germany
- Christian Dienemann
  - Department of Molecular Biology, Max Planck Institute for Biophysical Chemistry, Am Fassberg 11, 37077, Göttingen, Germany
- Patrick Cramer
  - Department of Molecular Biology, Max Planck Institute for Biophysical Chemistry, Am Fassberg 11, 37077, Göttingen, Germany
|
69
|
Bhat P, Honson D, Guttman M. Nuclear compartmentalization as a mechanism of quantitative control of gene expression. Nat Rev Mol Cell Biol 2021; 22:653-670. [PMID: 34341548 DOI: 10.1038/s41580-021-00387-1] [Citation(s) in RCA: 149] [Impact Index Per Article: 37.3] [Accepted: 05/28/2021] [Indexed: 01/08/2023]
Abstract
Gene regulation requires the dynamic coordination of hundreds of regulatory factors at precise genomic and RNA targets. Although many regulatory factors have specific affinity for their nucleic acid targets, molecular diffusion and affinity models alone cannot explain many of the quantitative features of gene regulation in the nucleus. One emerging explanation for these quantitative properties is that DNA, RNA and proteins organize within precise, 3D compartments in the nucleus to concentrate groups of functionally related molecules. Recently, nucleic acids and proteins involved in many important nuclear processes have been shown to engage in cooperative interactions, which lead to the formation of condensates that partition the nucleus. In this Review, we discuss an emerging perspective of gene regulation, which moves away from classic models of stoichiometric interactions towards an understanding of how spatial compartmentalization can lead to non-stoichiometric molecular interactions and non-linear regulatory behaviours. We describe key mechanisms of nuclear compartment formation, including emerging roles for non-coding RNAs in facilitating their formation, and discuss the functional role of nuclear compartments in transcription regulation, co-transcriptional and post-transcriptional RNA processing, and higher-order chromatin regulation. More generally, we discuss how compartmentalization may explain important quantitative aspects of gene regulation.
Affiliation(s)
- Prashant Bhat
  - Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, USA
  - David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
- Drew Honson
  - Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, USA
- Mitchell Guttman
  - Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, USA
|
70
|
Abstract
Transcription-coupled DNA repair removes bulky DNA lesions from the genome [1,2] and protects cells against ultraviolet (UV) irradiation [3]. Transcription-coupled DNA repair begins when RNA polymerase II (Pol II) stalls at a DNA lesion and recruits the Cockayne syndrome protein CSB, the E3 ubiquitin ligase CRL4^CSA and UV-stimulated scaffold protein A (UVSSA) [3]. Here we provide five high-resolution structures of Pol II transcription complexes containing human transcription-coupled DNA repair factors and the elongation factors PAF1 complex (PAF) and SPT6. Together with biochemical and published [3,4] data, the structures provide a model for transcription–repair coupling. Stalling of Pol II at a DNA lesion triggers replacement of the elongation factor DSIF by CSB, which binds to PAF and moves upstream DNA to SPT6. The resulting elongation complex, EC^TCR, uses the CSA-stimulated translocase activity of CSB to pull on upstream DNA and push Pol II forward. If the lesion cannot be bypassed, CRL4^CSA spans over the Pol II clamp and ubiquitylates the RPB1 residue K1268, enabling recruitment of TFIIH to UVSSA and DNA repair. Conformational changes in CRL4^CSA lead to ubiquitylation of CSB and to release of transcription-coupled DNA repair factors before transcription may continue over repaired DNA. The authors resolve the structure of five complexes containing RNA polymerase II and the CSA and CSB proteins, offering insight into how the repair of DNA lesions is coupled to transcription.
|
71
|
Ottesen EW, Luo D, Singh NN, Singh RN. High Concentration of an ISS-N1-Targeting Antisense Oligonucleotide Causes Massive Perturbation of the Transcriptome. Int J Mol Sci 2021; 22:ijms22168378. [PMID: 34445083 PMCID: PMC8395096 DOI: 10.3390/ijms22168378] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Received: 05/20/2021] [Revised: 07/14/2021] [Accepted: 07/31/2021] [Indexed: 12/17/2022] Open
Abstract
Intronic splicing silencer N1 (ISS-N1) located within Survival Motor Neuron 2 (SMN2) intron 7 is the target of a therapeutic antisense oligonucleotide (ASO), nusinersen (Spinraza), which is currently being used for the treatment of spinal muscular atrophy (SMA), a leading genetic disease associated with infant mortality. The discovery of ISS-N1 as a promising therapeutic target was enabled in part by Anti-N1, a 20-mer ASO that restored SMN2 exon 7 inclusion by annealing to ISS-N1. Here, we analyzed the transcriptome of SMA patient cells treated with 100 nM of Anti-N1 for 30 h. Such concentrations are routinely used to demonstrate the efficacy of an ASO. While 100 nM of Anti-N1 substantially stimulated SMN2 exon 7 inclusion, it also caused massive perturbations in the transcriptome and triggered widespread aberrant splicing, affecting expression of essential genes associated with multiple cellular processes such as transcription, splicing, translation, cell signaling, cell cycle, macromolecular trafficking, cytoskeletal dynamics, and innate immunity. We validated our findings with quantitative and semiquantitative PCR of 39 candidate genes associated with diverse pathways. We also showed a substantial reduction in off-target effects with shorter ISS-N1-targeting ASOs. Our findings are significant for implementing better ASO design and dosing regimens of ASO-based drugs.
|
72
|
Ravindran S. Profile of Patrick Cramer. Proc Natl Acad Sci U S A 2021; 118:e2111728118. [PMID: 34301909 PMCID: PMC8325307 DOI: 10.1073/pnas.2111728118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/18/2022] Open
|
73
|
The upstream 5' splice site remains associated to the transcription machinery during intron synthesis. Nat Commun 2021; 12:4545. [PMID: 34315864 PMCID: PMC8316553 DOI: 10.1038/s41467-021-24774-6] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Received: 05/09/2021] [Accepted: 07/02/2021] [Indexed: 12/28/2022] Open
Abstract
In the earliest step of spliceosome assembly, the two splice sites flanking an intron are brought into proximity by U1 snRNP and U2AF along with other proteins. The mechanism that facilitates this intron looping is poorly understood. Using a CRISPR interference-based approach to halt RNA polymerase II transcription in the middle of introns in human cells, we discovered that the nascent 5′ splice site base pairs with a U1 snRNA that is tethered to RNA polymerase II during intron synthesis. This association functionally corresponds with splicing outcome, involves bona fide 5′ splice sites and cryptic intronic sites, and occurs transcriptome-wide. Overall, our findings reveal that the upstream 5′ splice sites remain attached to the transcriptional machinery during intron synthesis and are thus brought into proximity of the 3′ splice sites, potentially mediating the rapid splicing of long introns. We know that most splicing reactions take place co-transcriptionally, but how the transcription machinery facilitates splicing of introns is unknown. Here the authors show that the 5′ splice site remains associated with the transcription machinery during intron synthesis through U1 snRNP, providing a basis for the rapid splicing reaction of introns.
|
74
|
Campagne S, de Vries T, Malard F, Afanasyev P, Dorn G, Dedic E, Kohlbrecher J, Boehringer D, Cléry A, Allain FHT. An in vitro reconstituted U1 snRNP allows the study of the disordered regions of the particle and the interactions with proteins and ligands. Nucleic Acids Res 2021; 49:e63. [PMID: 33677607 PMCID: PMC8216277 DOI: 10.1093/nar/gkab135] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Received: 11/10/2020] [Revised: 02/11/2021] [Accepted: 02/17/2021] [Indexed: 11/17/2022] Open
Abstract
U1 small nuclear ribonucleoparticle (U1 snRNP) plays a central role during RNA processing. Previous structures of U1 snRNP revealed how the ribonucleoparticle is organized and recognizes the pre-mRNA substrate at the exon–intron junction. As with many other ribonucleoparticles involved in RNA metabolism, U1 snRNP contains extensions made of low complexity sequences. Here, we developed a protocol to reconstitute U1 snRNP in vitro using mostly full-length components in order to perform liquid-state NMR spectroscopy. The accuracy of the reconstitution was validated by probing the shape and structure of the particle by SANS and cryo-EM. Using an NMR spectroscopy-based approach, we probed, for the first time, the U1 snRNP tails at atomic detail and our results confirm their high degree of flexibility. We also monitored the labile interaction between the splicing factor PTBP1 and U1 snRNP and validated the U1 snRNA stem loop 4 as a binding site for the splicing regulator on the ribonucleoparticle. Altogether, we developed a method to probe the intrinsically disordered regions of U1 snRNP and map the interactions controlling splicing regulation. This approach could be used to get insights into the molecular mechanisms of alternative splicing and screen for potential RNA therapeutics.
Affiliation(s)
- Sébastien Campagne
  - Institute of Biochemistry, Department of Biology, ETH Zurich, Hönggerbergring 64, CH-8093 Zürich, Switzerland
- Tebbe de Vries
  - Institute of Biochemistry, Department of Biology, ETH Zurich, Hönggerbergring 64, CH-8093 Zürich, Switzerland
- Florian Malard
  - Institute of Biochemistry, Department of Biology, ETH Zurich, Hönggerbergring 64, CH-8093 Zürich, Switzerland
- Pavel Afanasyev
  - Cryo-EM Knowledge Hub (CEMK), ETH Zurich, Hönggerbergring 64, CH-8093 Zürich, Switzerland
- Georg Dorn
  - Institute of Biochemistry, Department of Biology, ETH Zurich, Hönggerbergring 64, CH-8093 Zürich, Switzerland
- Emil Dedic
  - Institute of Biochemistry, Department of Biology, ETH Zurich, Hönggerbergring 64, CH-8093 Zürich, Switzerland
- Daniel Boehringer
  - Cryo-EM Knowledge Hub (CEMK), ETH Zurich, Hönggerbergring 64, CH-8093 Zürich, Switzerland
- Antoine Cléry
  - Institute of Biochemistry, Department of Biology, ETH Zurich, Hönggerbergring 64, CH-8093 Zürich, Switzerland
- Frédéric H-T Allain
  - Institute of Biochemistry, Department of Biology, ETH Zurich, Hönggerbergring 64, CH-8093 Zürich, Switzerland
|
75
|
Zhang Y, Cai Y, Roca X, Kwoh CK, Fullwood MJ. Chromatin loop anchors predict transcript and exon usage. Brief Bioinform 2021; 22:6319936. [PMID: 34263910 PMCID: PMC8575016 DOI: 10.1093/bib/bbab254] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 04/01/2021] [Revised: 06/16/2021] [Accepted: 05/25/2021] [Indexed: 11/24/2022] Open
Abstract
Epigenomics and transcriptomics data from high-throughput sequencing techniques such as RNA-seq and ChIP-seq have been successfully applied in predicting gene transcript expression. However, the locations of chromatin loops in the genome identified by techniques such as Chromatin Interaction Analysis with Paired End Tag sequencing (ChIA-PET) have never been used for prediction tasks. Here, we developed machine learning models to investigate if ChIA-PET could contribute to transcript and exon usage prediction. In doing so, we used a large set of transcription factors as well as ChIA-PET data. We developed different Gradient Boosting Trees models according to the different tasks with the integrated datasets from three cell lines, including GM12878, HeLaS3 and K562. We validated the models via 10-fold cross validation, chromosome-split validation and cross-cell validation. Our results show that both transcript and splicing-derived exon usage can be effectively predicted with at least 0.7512 and 0.7459 of accuracy, respectively, on all cell lines from all kinds of validations. Examining the predictive features, we found that RNA Polymerase II ChIA-PET was one of the most important features in both transcript and exon usage prediction, suggesting that chromatin loop anchors are predictive of both transcript and exon usage.
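The abstract above names chromosome-split validation without describing how the split was made. As a minimal sketch of the general idea, assuming a leave-one-chromosome-out scheme (the function name `chromosome_split_folds` and the toy labels are hypothetical and are not taken from the paper), samples can be grouped by chromosome so that no chromosome contributes to both the training and the test set of a fold:

```python
from collections import defaultdict


def chromosome_split_folds(chromosomes):
    """Yield (held_out, train_indices, test_indices) for each chromosome.

    `chromosomes` holds one chromosome label per sample. Each fold holds
    out every sample from one chromosome and trains on all the others.
    This is an assumed leave-one-chromosome-out scheme, not the paper's
    documented procedure.
    """
    by_chrom = defaultdict(list)
    for idx, chrom in enumerate(chromosomes):
        by_chrom[chrom].append(idx)

    for held_out in sorted(by_chrom):
        test_idx = by_chrom[held_out]
        train_idx = sorted(
            i for chrom, idxs in by_chrom.items() if chrom != held_out for i in idxs
        )
        yield held_out, train_idx, test_idx


# Hypothetical toy data: one chromosome label per exon or transcript sample.
sample_chroms = ["chr1", "chr1", "chr2", "chr3", "chr2", "chr3"]
folds = list(chromosome_split_folds(sample_chroms))
for held_out, train_idx, test_idx in folds:
    print(held_out, train_idx, test_idx)
```

Splitting by chromosome rather than at random keeps neighbouring genomic features, which tend to be correlated, from leaking between training and test sets. Libraries such as scikit-learn offer group-aware splitters (for example `LeaveOneGroupOut`) that serve the same purpose.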
Affiliation(s)
- Yu Zhang
  - School of Computer Science and Engineering, Nanyang Technological University, 50 Nanyang Avenue, Singapore, 639798, Singapore
- Yichao Cai
  - Cancer Science Institute of Singapore, National University of Singapore, 14 Medical Dr, Singapore 117599, Singapore
- Xavier Roca
  - School of Biological Sciences, Nanyang Technological University, 60 Nanyang Dr, Singapore 637551, Singapore
- Chee Keong Kwoh
  - School of Computer Science and Engineering, Nanyang Technological University, 50 Nanyang Avenue, Singapore, 639798, Singapore
- Melissa Jane Fullwood
  - Cancer Science Institute of Singapore, National University of Singapore, 14 Medical Dr, Singapore 117599, Singapore
  - School of Biological Sciences, Nanyang Technological University, 637551, Singapore
  - Institute of Molecular and Cell Biology, Agency for Science, Technology and Research (A*STAR), 61 Biopolis Dr, Singapore 138673, Singapore
|
76
|
Saha K, Fernandez MM, Biswas T, Joseph S, Ghosh G. Discovery of a pre-mRNA structural scaffold as a contributor to the mammalian splicing code. Nucleic Acids Res 2021; 49:7103-7121. [PMID: 34161584 PMCID: PMC8266590 DOI: 10.1093/nar/gkab533] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 02/06/2021] [Revised: 06/03/2021] [Accepted: 06/08/2021] [Indexed: 11/13/2022] Open
Abstract
The specific recognition of splice signals at or near exon-intron junctions is not explained by their weak conservation and instead is postulated to require a multitude of features embedded in the pre-mRNA strand. We explored the possibility of a 3D structural scaffold of AdML, a model pre-mRNA substrate, guiding early spliceosomal components to the splice signal sequences. We find that mutations in the non-cognate splice signal sequences impede recruitment of early spliceosomal components due to disruption of the global structure of the pre-mRNA. We further find that the pre-mRNA segments potentially interacting with the early spliceosomal component U1 snRNP are distributed across the intron, that there is a spatial proximity of 5' and 3' splice sites within the pre-mRNA scaffold, and that an interplay exists between the structural scaffold and splicing regulatory elements in recruiting early spliceosomal components. These results suggest that early spliceosomal components can recognize a 3D structural scaffold beyond the short splice signal sequences, and that in our model pre-mRNA, this scaffold is formed across the intron involving the major splice signals. This provides a conceptual basis to analyze the contribution of recognizable 3D structural scaffolds to the splicing code across the mammalian transcriptome.
Affiliation(s)
- Kaushik Saha
  - Department of Chemistry and Biochemistry, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0375, USA
- Mike Minh Fernandez
  - Department of Chemistry and Biochemistry, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0375, USA
- Tapan Biswas
  - Department of Chemistry and Biochemistry, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0375, USA
- Simpson Joseph
  - Department of Chemistry and Biochemistry, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0375, USA
- Gourisankar Ghosh
  - Department of Chemistry and Biochemistry, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0375, USA
|
77
|
ARS2/SRRT: at the nexus of RNA polymerase II transcription, transcript maturation and quality control. Biochem Soc Trans 2021; 49:1325-1336. [PMID: 34060620 DOI: 10.1042/bst20201008] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Received: 03/16/2021] [Revised: 05/04/2021] [Accepted: 05/06/2021] [Indexed: 01/26/2023]
Abstract
ARS2/SRRT is an essential eukaryotic protein that has emerged as a critical factor in the sorting of functional from non-functional RNA polymerase II (Pol II) transcripts. Through its interaction with the Cap Binding Complex (CBC), it associates with the cap of newly made RNAs and acts as a hub for competitive exchanges of protein factors that ultimately determine the fate of the associated RNA. The central position of the protein within the nuclear gene expression machinery likely explains why its depletion causes a broad range of phenotypes, yet an exact function of the protein remains elusive. Here, we consider the literature on ARS2/SRRT with the attempt to garner the threads into a unifying working model for ARS2/SRRT function at the nexus of Pol II transcription, transcript maturation and quality control.
|
78
|
Strobel EJ. Preparation of E. coli RNA polymerase transcription elongation complexes by selective photoelution from magnetic beads. J Biol Chem 2021; 297:100812. [PMID: 34023383 PMCID: PMC8212663 DOI: 10.1016/j.jbc.2021.100812] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Received: 03/15/2021] [Revised: 05/17/2021] [Accepted: 05/19/2021] [Indexed: 11/30/2022] Open
Abstract
In vitro studies of transcription frequently require the preparation of defined elongation complexes. Defined transcription elongation complexes (TECs) are typically prepared by constructing an artificial transcription bubble from synthetic oligonucleotides and RNA polymerase. This approach is optimal for diverse applications but is sensitive to nucleic acid length and sequence and is not compatible with systems where promoter-directed initiation or extensive transcription elongation is crucial. To complement scaffold-directed approaches for TEC assembly, I have developed a method for preparing promoter-initiated Escherichia coli TECs using a purification strategy called selective photoelution. This approach combines TEC-dependent sequestration of a biotin-triethylene glycol transcription stall site with photoreversible DNA immobilization to enrich TECs from an in vitro transcription reaction. I show that selective photoelution can be used to purify TECs that contain a 273-bp DNA template and 194-nt structured RNA. Selective photoelution is a straightforward and robust procedure that, in the systems assessed here, generates precisely positioned TECs with >95% purity and >30% yield. TECs prepared by selective photoelution can contain complex nucleic acid sequences and will therefore likely be useful for investigating RNA structure and function in the context of RNA polymerases.
Affiliation(s)
- Eric J Strobel
  - Department of Biological Sciences, The University at Buffalo, Buffalo, New York, USA
|
79
|
Wan Y, Anastasakis DG, Rodriguez J, Palangat M, Gudla P, Zaki G, Tandon M, Pegoraro G, Chow CC, Hafner M, Larson DR. Dynamic imaging of nascent RNA reveals general principles of transcription dynamics and stochastic splice site selection. Cell 2021; 184:2878-2895.e20. [PMID: 33979654 DOI: 10.1016/j.cell.2021.04.012] [Citation(s) in RCA: 106] [Impact Index Per Article: 26.5] [Received: 09/16/2019] [Revised: 11/12/2020] [Accepted: 04/08/2021] [Indexed: 01/06/2023]
Abstract
The activities of RNA polymerase and the spliceosome are responsible for the heterogeneity in the abundance and isoform composition of mRNA in human cells. However, the dynamics of these megadalton enzymatic complexes working in concert on endogenous genes have not been described. Here, we establish a quasi-genome-scale platform for observing synthesis and processing kinetics of single nascent RNA molecules in real time. We find that all observed genes show transcriptional bursting. We also observe large kinetic variation in intron removal for single introns in single cells, which is inconsistent with deterministic splice site selection. Transcriptome-wide footprinting of the U2AF complex, nascent RNA profiling, long-read sequencing, and lariat sequencing further reveal widespread stochastic recursive splicing within introns. We propose and validate a unified theoretical model to explain the general features of transcription and pervasive stochastic splice site selection.
Affiliation(s)
- Yihan Wan
  - Center for Cancer Research, National Cancer Institute, Bethesda, MD 20892, USA
- Dimitrios G Anastasakis
  - National Institute of Arthritis and Musculoskeletal and Skin Diseases, Bethesda, MD 20892, USA
- Murali Palangat
  - Center for Cancer Research, National Cancer Institute, Bethesda, MD 20892, USA
- Prabhakar Gudla
  - Center for Cancer Research, National Cancer Institute, Bethesda, MD 20892, USA
- George Zaki
  - Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
- Mayank Tandon
  - Center for Cancer Research, National Cancer Institute, Bethesda, MD 20892, USA
  - Advanced Biomedical Computational Science, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
- Gianluca Pegoraro
  - Center for Cancer Research, National Cancer Institute, Bethesda, MD 20892, USA
- Carson C Chow
  - Laboratory of Biological Modeling, NIDDK, Bethesda, MD, USA
- Markus Hafner
  - National Institute of Arthritis and Musculoskeletal and Skin Diseases, Bethesda, MD 20892, USA
- Daniel R Larson
  - Center for Cancer Research, National Cancer Institute, Bethesda, MD 20892, USA
|