1
|
Gheitanchi F, Giambra IJ, Hecker AS, Strube C, König S, May K. Relationships between liver and rumen fluke infections, milk somatic cells and polymorphisms in the Toll-like receptor 5 gene and vitamin D metabolism-related genes in Holstein dairy cows. Vet Immunol Immunopathol 2025; 283:110911. [PMID: 40058098 DOI: 10.1016/j.vetimm.2025.110911] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2024] [Revised: 02/21/2025] [Accepted: 02/25/2025] [Indexed: 05/02/2025]
Abstract
This study investigated polymorphisms in the genes CYP3A4, CYP2R1, and TLR5, and their associations with liver fluke (Fasciola hepatica) and rumen fluke (Calicophoron / Paramphistomum spp.) infections as well as with milk somatic cell count (SCC) as an indicator for mastitis in Holstein Friesian dairy cows. DNA sequencing of the genes' exons, 5'-, 3'-untranslated regions (UTR), introns, and flanking regions of 24 cows revealed 442 variants (388 SNPs and 54 InDels) including 116 previously unknown variants. We detected three novel non-synonymous variants leading to the derived amino acid exchanges, i.e. CYP3A4 p.Gly197Ser, CYP3A4 p.Ile388Val, and CYP2R1 p.Val11Ala. The newly identified SNP 25:36589922 T > C (ss11846100002) is positioned in the splice site of CYP3A4, but showed no impact on the binding score of the splice enzymes. The CYP2R1 and TLR5 genes presented 11 SNPs in the 5'- and 3'-UTR, partly influencing transcription factor binding or microRNA target sites. Associations between polymorphisms and constructed haplotypes with infection traits were analysed via (generalized) linear mixed models including further potential confounders. In total, 109 variants in CYP3A4, 37 variants in CYP2R1, and 18 variants in TLR5 were significantly associated with F. hepatica and rumen fluke infections, and with SCC. The CYP2R1 and TLR5 variants were mostly linked to SCC, indicating the genes' roles in immune responses to bacterial infections. Haplotype analysis revealed significant associations between specific CYP3A4 haplotypes and F. hepatica worm count and faecal egg counts. This study revealed significant insights into gene polymorphisms related to vitamin D metabolism and immune response, which seem to play a role in helminth and udder infections.
Collapse
Affiliation(s)
- Fatemeh Gheitanchi
- Institute of Animal Breeding and Genetics, Justus-Liebig-University Gießen, Gießen 35390, Germany.
| | - Isabella Jasmin Giambra
- Institute of Animal Breeding and Genetics, Justus-Liebig-University Gießen, Gießen 35390, Germany
| | - Anna Sophie Hecker
- Institute for Parasitology, Centre for Infection Medicine, University of Veterinary Medicine Hannover, Hannover 30559, Germany
| | - Christina Strube
- Institute for Parasitology, Centre for Infection Medicine, University of Veterinary Medicine Hannover, Hannover 30559, Germany
| | - Sven König
- Institute of Animal Breeding and Genetics, Justus-Liebig-University Gießen, Gießen 35390, Germany
| | - Katharina May
- Institute of Animal Breeding and Genetics, Justus-Liebig-University Gießen, Gießen 35390, Germany; Institute for Parasitology, Centre for Infection Medicine, University of Veterinary Medicine Hannover, Hannover 30559, Germany
| |
Collapse
|
2
|
Klavaris A, Kouma M, Ozdemir C, Nicolaidou V, Miller KM, Koufaris C, Kirmizis A. Biochemical Characterisation of the Short Isoform of Histone N-Terminal Acetyltransferase NAA40. Biomolecules 2024; 14:1100. [PMID: 39334865 PMCID: PMC11430322 DOI: 10.3390/biom14091100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2024] [Revised: 08/21/2024] [Accepted: 08/28/2024] [Indexed: 09/30/2024] Open
Abstract
N-alpha-acetyltransferase 40 (NAA40) is an evolutionarily conserved N-terminal acetyltransferase (NAT) linked to oncogenesis and chemoresistance. A recent study reported the generation of a second, shorter NAA40 isoform (NAA40S) through alternative translation, which we proceeded to further characterise. Notably, recombinant NAA40S had a greater in vitro enzymatic activity and affinity towards its histone H2A/H4 substrates compared to full-length NAA40 (NAA40L). Within cells, NAA40S was enzymatically active, based on its ability to suppress the H2A/H4S1Ph antagonistic mark in CRISPR-generated NAA40 knockout cells. Finally, we show that in addition to alternative translation, the NAA40S isoform could be derived from a primate and testis-specific transcript, which may align with the "out-of-testis" origin of recently evolved genes and isoforms. To summarise, our data reveal an even greater functional divergence between the two NAA40 isoforms than had been previously recognised.
Collapse
Affiliation(s)
- Ariel Klavaris
- Epigenetics and Gene Regulation Laboratory, Department of Biological Sciences, University of Cyprus, Nicosia 2109, Cyprus
| | - Maria Kouma
- Epigenetics and Gene Regulation Laboratory, Department of Biological Sciences, University of Cyprus, Nicosia 2109, Cyprus
| | - Cem Ozdemir
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX 78712, USA
| | - Vicky Nicolaidou
- Department of Life Sciences, University of Nicosia, Nicosia 2417, Cyprus
| | - Kyle M Miller
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX 78712, USA
| | - Costas Koufaris
- Epigenetics and Gene Regulation Laboratory, Department of Biological Sciences, University of Cyprus, Nicosia 2109, Cyprus
- Cyprus Cancer Research Institute, Nicosia 2109, Cyprus
| | - Antonis Kirmizis
- Epigenetics and Gene Regulation Laboratory, Department of Biological Sciences, University of Cyprus, Nicosia 2109, Cyprus
| |
Collapse
|
3
|
Gupta K, Yang C, McCue K, Bastani O, Sharp PA, Burge CB, Solar-Lezama A. Improved modeling of RNA-binding protein motifs in an interpretable neural model of RNA splicing. Genome Biol 2024; 25:23. [PMID: 38229106 PMCID: PMC10790492 DOI: 10.1186/s13059-023-03162-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Accepted: 12/28/2023] [Indexed: 01/18/2024] Open
Abstract
Sequence-specific RNA-binding proteins (RBPs) play central roles in splicing decisions. Here, we describe a modular splicing architecture that leverages in vitro-derived RNA affinity models for 79 human RBPs and the annotated human genome to produce improved models of RBP binding and activity. Binding and activity are modeled by separate Motif and Aggregator components that can be mixed and matched, enforcing sparsity to improve interpretability. Training a new Adjusted Motif (AM) architecture on the splicing task not only yields better splicing predictions but also improves prediction of RBP-binding sites in vivo and of splicing activity, assessed using independent data.
Collapse
Affiliation(s)
- Kavi Gupta
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Chenxi Yang
- Department of Computer Science, University of Texas at Austin, Austin, TX, 78712, USA
| | - Kayla McCue
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Osbert Bastani
- Department of Computer and Information Science, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Phillip A Sharp
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
- Koch Institute of Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Christopher B Burge
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA.
| | - Armando Solar-Lezama
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA.
| |
Collapse
|
4
|
Dai Y, Wang Y, Zeng L, Jia R, He L, Huang X, Zhao H, Liu D, Zhao H, Hu S, Gao L, Guo A, Xia W, Ji C. Genomic and Transcriptomic Insights into the Evolution and Divergence of MIKC-Type MADS-Box Genes in Carica papaya. Int J Mol Sci 2023; 24:14039. [PMID: 37762345 PMCID: PMC10531014 DOI: 10.3390/ijms241814039] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Revised: 09/02/2023] [Accepted: 09/08/2023] [Indexed: 09/29/2023] Open
Abstract
MIKC-type MADS-box genes, also known as type II genes, play a crucial role in regulating the formation of floral organs and reproductive development in plants. However, the genome-wide identification and characterization of type II genes as well as a transcriptomic survey of their potential roles in Carica papaya remain unresolved. Here, we identified and characterized 24 type II genes in the C. papaya genome, and investigated their evolutional scenario and potential roles with a widespread expression profile. The type II genes were divided into thirteen subclades, and gene loss events likely occurred in papaya, as evidenced by the contracted member size of most subclades. Gene duplication mainly contributed to MIKC-type gene formation in papaya, and the duplicated gene pairs displayed prevalent expression divergence, implying the evolutionary significance of gene duplication in shaping the diversity of type II genes in papaya. A large-scale transcriptome analysis of 152 samples indicated that different subclasses of these genes showed distinct expression patterns in various tissues, biotic stress response, and abiotic stress response, reflecting their divergent functions. The hub-network of male and female flowers and qRT-PCR suggested that TT16-3 and AGL8 participated in male flower development and seed germination. Overall, this study provides valuable insights into the evolution and functions of MIKC-type genes in C. papaya.
Collapse
Affiliation(s)
- Yunsu Dai
- Sanya Nanfan Research Institution of Hainan University, Sanya 572025, China; (Y.D.); (Y.W.); (D.L.)
- Hainan Key Laboratory for Biosafety Monitoring and Molecular Breeding in Off-Season Reproduction Regions, Sanya Research Institute & Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China; (R.J.); (X.H.); (H.Z.); (H.Z.); (S.H.); (L.G.); (A.G.)
| | - Yu Wang
- Sanya Nanfan Research Institution of Hainan University, Sanya 572025, China; (Y.D.); (Y.W.); (D.L.)
- Hainan Key Laboratory for Biosafety Monitoring and Molecular Breeding in Off-Season Reproduction Regions, Sanya Research Institute & Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China; (R.J.); (X.H.); (H.Z.); (H.Z.); (S.H.); (L.G.); (A.G.)
- National Key Laboratory for Tropical Crop Breeding, Sanya 572025, China
| | - Liwang Zeng
- Key Laboratory of Applied Research on Tropical Crop Information Technology of Hainan Province, Institute of Scientific and Technical Information, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China
| | - Ruizong Jia
- Hainan Key Laboratory for Biosafety Monitoring and Molecular Breeding in Off-Season Reproduction Regions, Sanya Research Institute & Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China; (R.J.); (X.H.); (H.Z.); (H.Z.); (S.H.); (L.G.); (A.G.)
- National Key Laboratory for Tropical Crop Breeding, Sanya 572025, China
| | - Linwen He
- College of Marine Science, Hainan University, Haikou 570228, China;
| | - Xueying Huang
- Hainan Key Laboratory for Biosafety Monitoring and Molecular Breeding in Off-Season Reproduction Regions, Sanya Research Institute & Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China; (R.J.); (X.H.); (H.Z.); (H.Z.); (S.H.); (L.G.); (A.G.)
- College of Marine Science, Hainan University, Haikou 570228, China;
| | - Hui Zhao
- Hainan Key Laboratory for Biosafety Monitoring and Molecular Breeding in Off-Season Reproduction Regions, Sanya Research Institute & Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China; (R.J.); (X.H.); (H.Z.); (H.Z.); (S.H.); (L.G.); (A.G.)
- National Key Laboratory for Tropical Crop Breeding, Sanya 572025, China
| | - Difa Liu
- Sanya Nanfan Research Institution of Hainan University, Sanya 572025, China; (Y.D.); (Y.W.); (D.L.)
- Tropical Crops Genetic Resources Institute, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China
| | - Haixu Zhao
- Hainan Key Laboratory for Biosafety Monitoring and Molecular Breeding in Off-Season Reproduction Regions, Sanya Research Institute & Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China; (R.J.); (X.H.); (H.Z.); (H.Z.); (S.H.); (L.G.); (A.G.)
| | - Shuai Hu
- Hainan Key Laboratory for Biosafety Monitoring and Molecular Breeding in Off-Season Reproduction Regions, Sanya Research Institute & Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China; (R.J.); (X.H.); (H.Z.); (H.Z.); (S.H.); (L.G.); (A.G.)
- National Key Laboratory for Tropical Crop Breeding, Sanya 572025, China
| | - Ling Gao
- Hainan Key Laboratory for Biosafety Monitoring and Molecular Breeding in Off-Season Reproduction Regions, Sanya Research Institute & Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China; (R.J.); (X.H.); (H.Z.); (H.Z.); (S.H.); (L.G.); (A.G.)
- Tropical Crops Genetic Resources Institute, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China
| | - Anping Guo
- Hainan Key Laboratory for Biosafety Monitoring and Molecular Breeding in Off-Season Reproduction Regions, Sanya Research Institute & Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China; (R.J.); (X.H.); (H.Z.); (H.Z.); (S.H.); (L.G.); (A.G.)
- National Key Laboratory for Tropical Crop Breeding, Sanya 572025, China
| | - Wei Xia
- Sanya Nanfan Research Institution of Hainan University, Sanya 572025, China; (Y.D.); (Y.W.); (D.L.)
| | - Changmian Ji
- Hainan Key Laboratory for Biosafety Monitoring and Molecular Breeding in Off-Season Reproduction Regions, Sanya Research Institute & Institute of Tropical Bioscience and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China; (R.J.); (X.H.); (H.Z.); (H.Z.); (S.H.); (L.G.); (A.G.)
- National Key Laboratory for Tropical Crop Breeding, Sanya 572025, China
| |
Collapse
|
5
|
Chakraborty A, Mondal S, Mahajan S, Sharma VK. High-quality genome assemblies provide clues on the evolutionary advantage of blue peafowl over green peafowl. Heliyon 2023; 9:e18571. [PMID: 37576271 PMCID: PMC10412995 DOI: 10.1016/j.heliyon.2023.e18571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Revised: 07/14/2023] [Accepted: 07/20/2023] [Indexed: 08/15/2023] Open
Abstract
An intriguing example of differential adaptability is the case of two Asian peafowl species, Pavo cristatus (blue peafowl) and Pavo muticus (green peafowl), where the former has a "Least Concern" conservation status and the latter is an "Endangered" species. To understand the genetic basis of this differential adaptability of the two peafowl species, a comparative analysis of these species is much needed to gain the genomic and evolutionary insights. Thus, we constructed a high-quality genome assembly of blue peafowl with an N50 value of 84.81 Mb (pseudochromosome-level assembly), and a high-confidence coding gene set to perform the genomic and evolutionary analyses of blue and green peafowls with 49 other avian species. The analyses revealed adaptive evolution of genes related to neuronal development, immunity, and skeletal muscle development in these peafowl species. Major genes related to axon guidance such as NEO1 and UNC5, semaphorin (SEMA), and ephrin receptor showed adaptive evolution in peafowl species. However, blue peafowl showed the presence of 42% more coding genes compared to the green peafowl along with a higher number of species-specific gene clusters, segmental duplicated genes and expanded gene families, and comparatively higher evolution in neuronal and developmental pathways. Blue peafowl also showed longer branch length compared to green peafowl in the species phylogenetic tree. These genomic insights obtained from the high-quality genome assembly of P. cristatus constructed in this study provide new clues on the superior adaptability of the blue peafowl over green peafowl despite having a recent species divergence time.
Collapse
Affiliation(s)
- Abhisek Chakraborty
- MetaBioSys Group, Department of Biological Sciences, Indian Institute of Science Education and Research Bhopal, Bhopal, 462066, Madhya Pradesh, India
| | - Samuel Mondal
- MetaBioSys Group, Department of Biological Sciences, Indian Institute of Science Education and Research Bhopal, Bhopal, 462066, Madhya Pradesh, India
| | - Shruti Mahajan
- MetaBioSys Group, Department of Biological Sciences, Indian Institute of Science Education and Research Bhopal, Bhopal, 462066, Madhya Pradesh, India
| | - Vineet K. Sharma
- MetaBioSys Group, Department of Biological Sciences, Indian Institute of Science Education and Research Bhopal, Bhopal, 462066, Madhya Pradesh, India
| |
Collapse
|
6
|
Ferrández-Peral L, Zhan X, Alvarez-Estape M, Chiva C, Esteller-Cucala P, García-Pérez R, Julià E, Lizano E, Fornas Ò, Sabidó E, Li Q, Marquès-Bonet T, Juan D, Zhang G. Transcriptome innovations in primates revealed by single-molecule long-read sequencing. Genome Res 2022; 32:1448-1462. [PMID: 35840341 PMCID: PMC9435740 DOI: 10.1101/gr.276395.121] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2021] [Accepted: 07/12/2022] [Indexed: 11/24/2022]
Abstract
Transcriptomic diversity greatly contributes to the fundamentals of disease, lineage-specific biology, and environmental adaptation. However, much of the actual isoform repertoire contributing to shaping primate evolution remains unknown. Here, we combined deep long- and short-read sequencing complemented with mass spectrometry proteomics in a panel of lymphoblastoid cell lines (LCLs) from human, three other great apes, and rhesus macaque, producing the largest full-length isoform catalog in primates to date. Around half of the captured isoforms are not annotated in their reference genomes, significantly expanding the gene models in primates. Furthermore, our comparative analyses unveil hundreds of transcriptomic innovations and isoform usage changes related to immune function and immunological disorders. The confluence of these evolutionary innovations with signals of positive selection and their limited impact in the proteome points to changes in alternative splicing in genes involved in immune response as an important target of recent regulatory divergence in primates.
Collapse
Affiliation(s)
| | | | | | - Cristina Chiva
- Center for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), 08003 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
| | | | | | - Eva Julià
- Center for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), 08003 Barcelona, Spain
| | - Esther Lizano
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, 08003 Barcelona, Spain
- Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, 08193 Barcelona, Spain
| | - Òscar Fornas
- Center for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), 08003 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
| | - Eduard Sabidó
- Center for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), 08003 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
| | - Qiye Li
- BGI-Shenzhen, Shenzhen 518083, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Tomàs Marquès-Bonet
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, 08003 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain
- Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, 08193 Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), 08010 Barcelona, Spain
- CNAG-CRG, Center for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), 08028 Barcelona, Spain
| | - David Juan
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, 08003 Barcelona, Spain
| | - Guojie Zhang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China
- Section for Ecology and Evolution, Department of Biology, University of Copenhagen, DK-2100 Copenhagen 2200, Denmark
- Evolutionary and Organismal Biology Research Center, School of Medicine, Zhejiang University, Hangzhou 310058, China
| |
Collapse
|
7
|
Zhang F, Raabe CA, Cardoso-Moreira M, Brosius J, Kaessmann H, Schmitz J. ExoPLOT: Representation of alternative splicing in human tissues and developmental stages with transposed element (TE) involvement. Genomics 2022; 114:110434. [PMID: 35863675 DOI: 10.1016/j.ygeno.2022.110434] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Revised: 07/04/2022] [Accepted: 07/14/2022] [Indexed: 02/08/2023]
Abstract
Advances in RNA high-throughput sequencing and large-scale functional assays yield new insights into the multifaceted activities of transposed elements (TE) and many other previously undiscovered sequence elements. Currently, no tool for easy access, analysis, quantification, and visualization of alternatively spliced exons across multiple tissues or developmental stages is available. Also, analysis pipelines demand computational skills or hardware requirements, which often are hard to meet by wet-lab scientists. We developed ExoPLOT to enable simplified access to massive RNA high throughput sequencing datasets to facilitate the analysis of alternative splicing across many biological samples. To demonstrate the functonality of ExoPLOT, we analyzed the contributon of exonized TEs to human coding sequences (CDS). mRNA splice variants containing the TE-derived exon were quantified and compared to expression levels of TE-free splice variants. For analysis, we utilized 313 human cerebrum, cerebellum, heart, kidney, liver, ovary, and testis transcriptomes, representing various pre- and postnatal developmental stages. ExoPLOT visualizes the relative expression levels of alternative transcripts, e.g., caused by the insertion of new TE-derived exons, across different developmental stages of and among multiple tissues. This tool also provides a unique link between evolution and function during exonization (gain of a new exon) and exaptation (recruitment/co-optation) of a new exon. As input for analysis, we derived a database of 1151 repeat-masked, exonized TEs, representing all prominent families of transposons in the human genome and the collection of human consensus coding sequences (CCDS). ExoPLOT screened preprocessed RNA high-throughput sequencing datasets from seven human tissues to quantify and visualize the dynamics in RNA splicing for these 1151 TE-derived exons during the entire human organ development. In addition, we successfully mapped and analyzed 993 recently described exonized sequences from the human frontal cortex onto these 313 transcriptome libraries. ExoPLOT's approach to preprocessing RNA deep sequencing datasets facilitates alternative splicing analysis and significantly reduces processing times. In addition, ExoPLOT's design allows studying alternative RNA isoforms other than TE-derived in a customized - coordinate-based manner and is available at http://retrogenomics3.uni-muenster.de:3838/exz-plot-d/.
Collapse
Affiliation(s)
- Fengjun Zhang
- Institute of Experimental Pathology, ZMBE, University of Münster, 48149 Münster, Germany.
| | | | | | - Jürgen Brosius
- Institute of Experimental Pathology, ZMBE, University of Münster, 48149 Münster, Germany; Institutes for Systems Genetics, Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, 610000 Chengdu, Sichuan, China
| | - Henrik Kaessmann
- Center for Molecular Biology of Heidelberg University, ZMBH, 69120 Heidelberg, Germany
| | - Jürgen Schmitz
- Institute of Experimental Pathology, ZMBE, University of Münster, 48149 Münster, Germany; EvoPAD-RTG, University of Münster, 48149 Münster, Germany.
| |
Collapse
|
8
|
Wright CJ, Smith CWJ, Jiggins CD. Alternative splicing as a source of phenotypic diversity. Nat Rev Genet 2022; 23:697-710. [PMID: 35821097 DOI: 10.1038/s41576-022-00514-4] [Citation(s) in RCA: 190] [Impact Index Per Article: 63.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/13/2022] [Indexed: 12/27/2022]
Abstract
A major goal of evolutionary genetics is to understand the genetic processes that give rise to phenotypic diversity in multicellular organisms. Alternative splicing generates multiple transcripts from a single gene, enriching the diversity of proteins and phenotypic traits. It is well established that alternative splicing contributes to key innovations over long evolutionary timescales, such as brain development in bilaterians. However, recent developments in long-read sequencing and the generation of high-quality genome assemblies for diverse organisms has facilitated comparisons of splicing profiles between closely related species, providing insights into how alternative splicing evolves over shorter timescales. Although most splicing variants are probably non-functional, alternative splicing is nonetheless emerging as a dynamic, evolutionarily labile process that can facilitate adaptation and contribute to species divergence.
Collapse
Affiliation(s)
- Charlotte J Wright
- Tree of Life, Wellcome Sanger Institute, Cambridge, UK. .,Department of Zoology, University of Cambridge, Cambridge, UK.
| | | | - Chris D Jiggins
- Department of Zoology, University of Cambridge, Cambridge, UK.
| |
Collapse
|
9
|
Fiszbein A, McGurk M, Calvo-Roitberg E, Kim G, Burge CB, Pai AA. Widespread occurrence of hybrid internal-terminal exons in human transcriptomes. SCIENCE ADVANCES 2022; 8:eabk1752. [PMID: 35044812 PMCID: PMC8769537 DOI: 10.1126/sciadv.abk1752] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2021] [Accepted: 11/23/2021] [Indexed: 06/12/2023]
Abstract
Messenger RNA isoform differences are predominantly driven by alternative first, internal, and last exons. Despite the importance of classifying exons to understand isoform structure, few tools examine isoform-specific exon usage. We recently observed that alternative transcription start sites often arise near internal exons, often creating “hybrid” first/internal exons. To systematically detect hybrid exons, we built the hybrid-internal-terminal (HIT) pipeline to classify exons depending on their isoform-specific usage. On the basis of splice junction reads in RNA sequencing data and probabilistic modeling, the HIT index identified thousands of previously misclassified hybrid first-internal and internal-last exons. Hybrid exons are enriched in long genes and genes involved in RNA splicing and have longer flanking introns and strong splice sites. Their usage varies considerably across human tissues. By developing the first method to classify exons according to isoform contexts, our findings document the occurrence of hybrid exons, a common quirk of the human transcriptome.
Collapse
Affiliation(s)
- Ana Fiszbein
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Biology, Boston University, Boston, MA, USA
| | - Michael McGurk
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
| | | | - GyeungYun Kim
- Department of Biology, Boston University, Boston, MA, USA
| | - Christopher B. Burge
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Athma A. Pai
- RNA Therapeutics Institute, University of Massachusetts Medical School, Worcester, MA, USA
| |
Collapse
|
10
|
Herbrechter R, Hube N, Buchholz R, Reiner A. Splicing and editing of ionotropic glutamate receptors: a comprehensive analysis based on human RNA-Seq data. Cell Mol Life Sci 2021; 78:5605-5630. [PMID: 34100982 PMCID: PMC8257547 DOI: 10.1007/s00018-021-03865-z] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2021] [Revised: 05/12/2021] [Accepted: 05/22/2021] [Indexed: 12/11/2022]
Abstract
Ionotropic glutamate receptors (iGluRs) play key roles for signaling in the central nervous system. Alternative splicing and RNA editing are well-known mechanisms to increase iGluR diversity and to provide context-dependent regulation. Earlier work on isoform identification has focused on the analysis of cloned transcripts, mostly from rodents. We here set out to obtain a systematic overview of iGluR splicing and editing in human brain based on RNA-Seq data. Using data from two large-scale transcriptome studies, we established a workflow for the de novo identification and quantification of alternative splice and editing events. We detected all canonical iGluR splice junctions, assessed the abundance of alternative events described in the literature, and identified new splice events in AMPA, kainate, delta, and NMDA receptor subunits. Notable events include an abundant transcript encoding the GluA4 amino-terminal domain, GluA4-ATD, a novel C-terminal GluD1 (delta receptor 1) isoform, GluD1-b, and potentially new GluK4 and GluN2C isoforms. C-terminal GluN1 splicing may be controlled by inclusion of a cassette exon, which shows preference for one of the two acceptor sites in the last exon. Moreover, we identified alternative untranslated regions (UTRs) and species-specific differences in splicing. In contrast, editing in exonic iGluR regions appears to be mostly limited to ten previously described sites, two of which result in silent amino acid changes. Coupling of proximal editing/editing and editing/splice events occurs to variable degree. Overall, this analysis provides the first inventory of alternative splicing and editing in human brain iGluRs and provides the impetus for further transcriptome-based and functional investigations.
Collapse
Affiliation(s)
- Robin Herbrechter
- Department of Biology and Biotechnology, Ruhr University Bochum, Universitätsstrasse 150, 44801, Bochum, Germany
| | - Nadine Hube
- Department of Biology and Biotechnology, Ruhr University Bochum, Universitätsstrasse 150, 44801, Bochum, Germany
| | - Raoul Buchholz
- Department of Biology and Biotechnology, Ruhr University Bochum, Universitätsstrasse 150, 44801, Bochum, Germany
| | - Andreas Reiner
- Department of Biology and Biotechnology, Ruhr University Bochum, Universitätsstrasse 150, 44801, Bochum, Germany.
| |
Collapse
|
11
|
Mazin PV, Khaitovich P, Cardoso-Moreira M, Kaessmann H. Alternative splicing during mammalian organ development. Nat Genet 2021; 53:925-934. [PMID: 33941934 PMCID: PMC8187152 DOI: 10.1038/s41588-021-00851-w] [Citation(s) in RCA: 101] [Impact Index Per Article: 25.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2021] [Accepted: 03/19/2021] [Indexed: 12/30/2022]
Abstract
Alternative splicing (AS) is pervasive in mammalian genomes, yet cross-species comparisons have been largely restricted to adult tissues and the functionality of most AS events remains unclear. We assessed AS patterns across pre- and postnatal development of seven organs in six mammals and a bird. Our analyses revealed that developmentally dynamic AS events, which are especially prevalent in the brain, are substantially more conserved than nondynamic ones. Cassette exons with increasing inclusion frequencies during development show the strongest signals of conserved and regulated AS. Newly emerged cassette exons are typically incorporated late in testis development, but those retained during evolution are predominantly brain specific. Our work suggests that an intricate interplay of programs controlling gene expression levels and AS is fundamental to organ development, especially for the brain and heart. In these regulatory networks, AS affords substantial functional diversification of genes through the generation of tissue- and time-specific isoforms from broadly expressed genes.
Collapse
Affiliation(s)
- Pavel V Mazin
- V. Zelman Center for Neurobiology and Brain Restoration, Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Philipp Khaitovich
- V. Zelman Center for Neurobiology and Brain Restoration, Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Margarida Cardoso-Moreira
- Center for Molecular Biology of Heidelberg University (ZMBH), DKFZ-ZMBH Alliance, Heidelberg, Germany.
- Evolutionary Developmental Biology Laboratory, The Francis Crick Institute, London, UK.
| | - Henrik Kaessmann
- Center for Molecular Biology of Heidelberg University (ZMBH), DKFZ-ZMBH Alliance, Heidelberg, Germany.
| |
Collapse
|
12
|
Abstract
BACKGROUND Eukaryotic protein-coding genes consist of exons and introns. Exon-intron borders are conserved between species and thus their changes might be observed only on quite long evolutionary distances. One of the rarest types of change, in which intron relocates over a short distance, is called "intron sliding", but the reality of this event has been debated for a long time. The main idea of a search for intron sliding is to use the most accurate genome annotation and genome sequence, as well as high-quality transcriptome data. We applied them in a search for sliding introns in mammals in order to widen knowledge about the presence or absence of such phenomena in this group. RESULTS We didn't find any significant evidence of intron sliding in the primate group (human, chimpanzee, rhesus macaque, crab-eating macaque, green monkey, marmoset). Only one possible intron sliding event supported by a set of high quality transcriptomes was observed between EIF1AX human and sheep gene orthologs. Also, we checked a list of previously observed intron sliding events in mammals and showed that most likely they are artifacts of genome annotations and are not shown in subsequent annotation versions as well as are not supported by transcriptomic data. CONCLUSIONS We assume that intron sliding is indeed a very rare evolutionary event if it exists at all. Every case of intron sliding needs a lot of supportive data for detection and confirmation.
Collapse
|
13
|
Martí-Gómez C, Lara-Pezzi E, Sánchez-Cabo F. dSreg: a Bayesian model to integrate changes in splicing and RNA-binding protein activity. Bioinformatics 2020; 36:2134-2141. [PMID: 31834368 PMCID: PMC7141860 DOI: 10.1093/bioinformatics/btz915] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2019] [Revised: 09/09/2019] [Accepted: 12/10/2019] [Indexed: 12/19/2022] Open
Abstract
MOTIVATION Alternative splicing (AS) is an important mechanism in the generation of transcript diversity across mammals. AS patterns are dynamically regulated during development and in response to environmental changes. Defects or perturbations in its regulation may lead to cancer or neurological disorders, among other pathological conditions. The regulatory mechanisms controlling AS in a given biological context are typically inferred using a two-step framework: differential AS analysis followed by enrichment methods. These strategies require setting rather arbitrary thresholds and are prone to error propagation along the analysis. RESULTS To overcome these limitations, we propose dSreg, a Bayesian model that integrates RNA-seq with data from regulatory features, e.g. binding sites of RNA-binding proteins. dSreg identifies the key underlying regulators controlling AS changes and quantifies their activity while simultaneously estimating the changes in exon inclusion rates. dSreg increased both the sensitivity and the specificity of the identified AS changes in simulated data, even at low read coverage. dSreg also showed improved performance when analyzing a collection of knock-down RNA-binding proteins' experiments from ENCODE, as opposed to traditional enrichment methods, such as over-representation analysis and gene set enrichment analysis. dSreg opens the possibility to integrate a large amount of readily available RNA-seq datasets at low coverage for AS analysis and allows more cost-effective RNA-seq experiments. AVAILABILITY AND IMPLEMENTATION dSreg was implemented in python using stan and is freely available to the community at https://bitbucket.org/cmartiga/dsreg. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Carlos Martí-Gómez
- Molecular Regulation of Heart Failure (CMG and ELP); Bioinformatics Unit (FSC), Centro Nacional de Investigaciones Cardiovasculares (CNIC), Madrid 28029, Spain
| | - Enrique Lara-Pezzi
- Molecular Regulation of Heart Failure (CMG and ELP); Bioinformatics Unit (FSC), Centro Nacional de Investigaciones Cardiovasculares (CNIC), Madrid 28029, Spain
| | - Fátima Sánchez-Cabo
- Molecular Regulation of Heart Failure (CMG and ELP); Bioinformatics Unit (FSC), Centro Nacional de Investigaciones Cardiovasculares (CNIC), Madrid 28029, Spain
| |
Collapse
|
14
|
Tan JY, Biasini A, Young RS, Marques AC. Splicing of enhancer-associated lincRNAs contributes to enhancer activity. Life Sci Alliance 2020; 3:3/4/e202000663. [PMID: 32086317 PMCID: PMC7035876 DOI: 10.26508/lsa.202000663] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Revised: 02/12/2020] [Accepted: 02/13/2020] [Indexed: 12/19/2022] Open
Abstract
Transcription is common at active mammalian enhancers sometimes giving rise to stable enhancer-associated long intergenic noncoding RNAs (elincRNAs). Expression of elincRNA is associated with changes in neighboring gene product abundance and local chromosomal topology, suggesting that transcription at these loci contributes to gene expression regulation in cis Despite the lack of evidence supporting sequence-dependent functions for most elincRNAs, splicing of these transcripts is unexpectedly common. Whether elincRNA splicing is a mere consequence of cognate enhancer activity or if it directly impacts enhancer function remains unresolved. Here, we investigate the association between elincRNA splicing and enhancer activity in mouse embryonic stem cells. We show that multi-exonic elincRNAs are enriched at conserved enhancers, and the efficient processing of elincRNAs is strongly associated with their cognate enhancer activity. This association is supported by their enrichment in enhancer-specific chromatin signatures; elevated binding of co-transcriptional regulators; increased local intra-chromosomal DNA contacts; and strengthened cis-regulation on target gene expression. Our results support the role of efficient RNA processing of enhancer-associated transcripts to cognate enhancer activity.
Collapse
Affiliation(s)
- Jennifer Y Tan
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
| | - Adriano Biasini
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
| | - Robert S Young
- Medical Research Council Human Genetics Unit, Medical Research Council Institute of Genetics & Molecular Medicine, University of Edinburgh, Edinburgh, UK
| | - Ana C Marques
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
| |
Collapse
|
15
|
Kuitche E, Jammali S, Ouangraoua A. SimSpliceEvol: alternative splicing-aware simulation of biological sequence evolution. BMC Bioinformatics 2019; 20:640. [PMID: 31842741 PMCID: PMC6916212 DOI: 10.1186/s12859-019-3207-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
Background It is now well established that eukaryotic coding genes have the ability to produce more than one type of transcript thanks to the mechanisms of alternative splicing and alternative transcription. Because of the lack of gold standard real data on alternative splicing, simulated data constitute a good option for evaluating the accuracy and the efficiency of methods developed for splice-aware sequence analysis. However, existing sequence evolution simulation methods do not model alternative splicing, and so they can not be used to test spliced sequence analysis methods. Results We propose a new method called SimSpliceEvol for simulating the evolution of sets of alternative transcripts along the branches of an input gene tree. In addition to traditional sequence evolution events, the simulation also includes gene exon-intron structure evolution events and alternative splicing events that modify the sets of transcripts produced from genes. SimSpliceEvol was implemented in Python. The source code is freely available at https://github.com/UdeS-CoBIUS/SimSpliceEvol. Conclusions Data generated using SimSpliceEvol are useful for testing spliced RNA sequence analysis methods such as methods for spliced alignment of cDNA and genomic sequences, multiple cDNA alignment, orthologous exons identification, splicing orthology inference, transcript phylogeny inference, which requires to know the real evolutionary relationships between the sequences.
Collapse
Affiliation(s)
- Esaie Kuitche
- Department of Computer Science, University of Sherbrooke, 2500 Boulevard de l'Université, Quebec, J1K2R1, Canada.
| | - Safa Jammali
- Department of Computer Science, University of Sherbrooke, 2500 Boulevard de l'Université, Quebec, J1K2R1, Canada.,Department of Biochemistry, University of Sherbrooke, 3001 12e avenue Nord, Quebec, J1H5N4, Canada
| | - Aïda Ouangraoua
- Department of Computer Science, University of Sherbrooke, 2500 Boulevard de l'Université, Quebec, J1K2R1, Canada
| |
Collapse
|
16
|
Fiszbein A, Krick KS, Begg BE, Burge CB. Exon-Mediated Activation of Transcription Starts. Cell 2019; 179:1551-1565.e17. [PMID: 31787377 DOI: 10.1016/j.cell.2019.11.002] [Citation(s) in RCA: 64] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2019] [Revised: 08/20/2019] [Accepted: 10/30/2019] [Indexed: 10/25/2022]
Abstract
The processing of RNA transcripts from mammalian genes occurs in proximity to their transcription. Here, we describe a phenomenon affecting thousands of genes that we call exon-mediated activation of transcription starts (EMATS), in which the splicing of internal exons impacts promoter choice and the expression level of the gene. We observed that evolutionary gain of internal exons is associated with gain of new transcription start sites (TSSs) nearby and increased gene expression. Inhibiting exon splicing reduced transcription from nearby promoters, and creation of new spliced exons activated transcription from cryptic promoters. The strongest effects occurred for weak promoters located proximal and upstream of efficiently spliced exons. Together, our findings support a model in which splicing recruits transcription machinery locally to influence TSS choice and identify exon gain, loss, and regulatory change as major contributors to the evolution of alternative promoters and gene expression in mammals.
Collapse
Affiliation(s)
- Ana Fiszbein
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02138, USA
| | - Keegan S Krick
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02138, USA
| | - Bridget E Begg
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02138, USA
| | - Christopher B Burge
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02138, USA.
| |
Collapse
|
17
|
Sequence and Evolutionary Features for the Alternatively Spliced Exons of Eukaryotic Genes. Int J Mol Sci 2019; 20:ijms20153834. [PMID: 31390737 PMCID: PMC6695735 DOI: 10.3390/ijms20153834] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Revised: 07/25/2019] [Accepted: 07/31/2019] [Indexed: 12/22/2022] Open
Abstract
Alternative splicing of pre-mRNAs is a crucial mechanism for maintaining protein diversity in eukaryotes without requiring a considerable increase of genes in the number. Due to rapid advances in high-throughput sequencing technologies and computational algorithms, it is anticipated that alternative splicing events will be more intensively studied to address different kinds of biological questions. The occurrences of alternative splicing mean that all exons could be classified to be either constitutively or alternatively spliced depending on whether they are virtually included into all mature mRNAs. From an evolutionary point of view, therefore, the alternatively spliced exons would have been associated with distinctive biological characteristics in comparison with constitutively spliced exons. In this paper, we first outline the representative types of alternative splicing events and exon classification, and then review sequence and evolutionary features for the alternatively spliced exons. The main purpose is to facilitate understanding of the biological implications of alternative splicing in eukaryotes. This knowledge is also helpful to establish computational approaches for predicting the splicing pattern of exons.
Collapse
|
18
|
Avgan N, Wang JI, Fernandez-Chamorro J, Weatheritt RJ. Multilayered control of exon acquisition permits the emergence of novel forms of regulatory control. Genome Biol 2019; 20:141. [PMID: 31315652 PMCID: PMC6637531 DOI: 10.1186/s13059-019-1757-5] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2019] [Accepted: 07/09/2019] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND The long introns of mammals are pools of evolutionary potential due to the multiplicity of sequences that permit the acquisition of novel exons. However, the permissibility of genes to this type of acquisition and its influence on the evolution of cell regulation is poorly understood. RESULTS Here, we observe that human genes are highly permissive to the inclusion of novel exonic regions permitting the emergence of novel regulatory features. Our analysis reveals the potential for novel exon acquisition to occur in over 30% of evaluated human genes. Regulatory processes including the rate of splicing efficiency and RNA polymerase II (RNAPII) elongation control this process by modulating the "window of opportunity" for spliceosomal recognition. DNA damage alters this window promoting the inclusion of repeat-derived novel exons that reduce the ribosomal engagement of cell cycle genes. Finally, we demonstrate that the inclusion of novel exons is suppressed in hematological cancer samples and can be reversed by drugs modulating the rate of RNAPII elongation. CONCLUSION Our work demonstrates that the inclusion of repeat-associated novel intronic regions is a tightly controlled process capable of expanding the regulatory capacity of cells.
Collapse
Affiliation(s)
- Nesli Avgan
- EMBL Australia, Garvan Institute of Medical Research, Sydney, NSW, 2010, Australia
| | - Juli I Wang
- EMBL Australia, Garvan Institute of Medical Research, Sydney, NSW, 2010, Australia
- St. Vincent's Clinical School, University of New South Wales, Sydney, NSW, 2010, Australia
| | | | - Robert J Weatheritt
- EMBL Australia, Garvan Institute of Medical Research, Sydney, NSW, 2010, Australia.
- St. Vincent's Clinical School, University of New South Wales, Sydney, NSW, 2010, Australia.
| |
Collapse
|
19
|
Attig J, Agostini F, Gooding C, Chakrabarti AM, Singh A, Haberman N, Zagalak JA, Emmett W, Smith CWJ, Luscombe NM, Ule J. Heteromeric RNP Assembly at LINEs Controls Lineage-Specific RNA Processing. Cell 2018; 174:1067-1081.e17. [PMID: 30078707 PMCID: PMC6108849 DOI: 10.1016/j.cell.2018.07.001] [Citation(s) in RCA: 103] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2017] [Revised: 04/23/2018] [Accepted: 07/01/2018] [Indexed: 12/30/2022]
Abstract
Long mammalian introns make it challenging for the RNA processing machinery to identify exons accurately. We find that LINE-derived sequences (LINEs) contribute to this selection by recruiting dozens of RNA-binding proteins (RBPs) to introns. This includes MATR3, which promotes binding of PTBP1 to multivalent binding sites within LINEs. Both RBPs repress splicing and 3' end processing within and around LINEs. Notably, repressive RBPs preferentially bind to evolutionarily young LINEs, which are located far from exons. These RBPs insulate the LINEs and the surrounding intronic regions from RNA processing. Upon evolutionary divergence, changes in RNA motifs within LINEs lead to gradual loss of their insulation. Hence, older LINEs are located closer to exons, are a common source of tissue-specific exons, and increasingly bind to RBPs that enhance RNA processing. Thus, LINEs are hubs for the assembly of repressive RBPs and also contribute to the evolution of new, lineage-specific transcripts in mammals. VIDEO ABSTRACT.
Collapse
Affiliation(s)
- Jan Attig
- The Francis Crick Institute, Midland Road 1, Kings Cross, London NW1 1AT, UK; Department of Molecular Neuroscience, UCL Institute of Neurology, Queen Square, London WC1N 3BG, UK.
| | - Federico Agostini
- The Francis Crick Institute, Midland Road 1, Kings Cross, London NW1 1AT, UK
| | - Clare Gooding
- Department of Biochemistry, University of Cambridge, Tennis Court Road, Cambridge CB2 1QW, UK
| | - Anob M Chakrabarti
- The Francis Crick Institute, Midland Road 1, Kings Cross, London NW1 1AT, UK; Department of Genetics, Environment and Evolution, UCL Genetics Institute, Gower Street, London WC1E 6BT, UK
| | - Aarti Singh
- Department of Molecular Neuroscience, UCL Institute of Neurology, Queen Square, London WC1N 3BG, UK; Department of Comparative Biomedical Sciences, The Royal Veterinary College, Royal College Street, London NW1 0TU, UK
| | - Nejc Haberman
- The Francis Crick Institute, Midland Road 1, Kings Cross, London NW1 1AT, UK; Department of Molecular Neuroscience, UCL Institute of Neurology, Queen Square, London WC1N 3BG, UK
| | - Julian A Zagalak
- The Francis Crick Institute, Midland Road 1, Kings Cross, London NW1 1AT, UK; Department of Molecular Neuroscience, UCL Institute of Neurology, Queen Square, London WC1N 3BG, UK
| | - Warren Emmett
- The Francis Crick Institute, Midland Road 1, Kings Cross, London NW1 1AT, UK; Department of Molecular Neuroscience, UCL Institute of Neurology, Queen Square, London WC1N 3BG, UK; Department of Genetics, Environment and Evolution, UCL Genetics Institute, Gower Street, London WC1E 6BT, UK
| | - Christopher W J Smith
- Department of Biochemistry, University of Cambridge, Tennis Court Road, Cambridge CB2 1QW, UK
| | - Nicholas M Luscombe
- The Francis Crick Institute, Midland Road 1, Kings Cross, London NW1 1AT, UK; Department of Genetics, Environment and Evolution, UCL Genetics Institute, Gower Street, London WC1E 6BT, UK; Okinawa Institute of Science and Technology Graduate University, 1919-1 Tancha, Onna-son, Kunigami-gun, Okinawa 904-0495, Japan
| | - Jernej Ule
- The Francis Crick Institute, Midland Road 1, Kings Cross, London NW1 1AT, UK; Department of Molecular Neuroscience, UCL Institute of Neurology, Queen Square, London WC1N 3BG, UK.
| |
Collapse
|
20
|
Abstract
Nucleosomal modifications have been implicated in fundamental epigenetic regulation, whereas the roles of nucleosome binding in shaping changes through evolution remain to be addressed. Here we performed a comparative study to clarify the roles of nucleosome occupancy in exon origination. By profiling a high-resolution, cross-species mononucleosome landscape for mammalian tissues, we found nucleosome occupancy profiles are conserved across tissues and species. Further, through a phylogenetic approach, we found that the feature of differential nucleosome occupancy appears prior to the origination of new exons and, presumably, facilitates the origin of new exons by increasing the splice strength of the ancestral nonexonic regions through driving a local difference in GC content, which suggests the function of nucleosome binding in exonization. Nucleosomal modifications have been implicated in fundamental epigenetic regulation, but the roles of nucleosome occupancy in shaping changes through evolution remain to be addressed. Here we present high-resolution nucleosome occupancy profiles for multiple tissues derived from human, macaque, tree shrew, mouse, and pig. Genome-wide comparison reveals conserved nucleosome occupancy profiles across both different species and tissue types. Notably, we found significantly higher levels of nucleosome occupancy in exons than in introns, a pattern correlated with the different exon–intron GC content. We then determined whether this biased occupancy may play roles in the origination of new exons through evolution, rather than being a downstream effect of exonization, through a comparative approach to sequentially trace the order of the exonization and biased nucleosome binding. By identifying recently evolved exons in human but not in macaque using matched RNA sequencing, we found that higher exonic nucleosome occupancy also existed in macaque regions orthologous to these exons. Presumably, such biased nucleosome occupancy facilitates the origination of new exons by increasing the splice strength of the ancestral nonexonic regions through driving a local difference in GC content. These data thus support a model that sites bound by nucleosomes are more likely to evolve into exons, which we term the “nucleosome-first” model.
Collapse
|
21
|
Kini RM. Accelerated evolution of toxin genes: Exonization and intronization in snake venom disintegrin/metalloprotease genes. Toxicon 2018; 148:16-25. [PMID: 29634956 DOI: 10.1016/j.toxicon.2018.04.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2018] [Revised: 03/21/2018] [Accepted: 04/01/2018] [Indexed: 12/20/2022]
Abstract
Toxin genes in animals undergo accelerated evolution compared to non-toxin genes to be effective and competitive in prey capture, as well as to enhance their predator defense. Several mechanisms have been proposed to explain this unusual phenomenon. These include (a) frequent mutations in exons compared to introns and nonsynonymous substitutions in exons; (b) high frequency of point mutations are due to the presence of more unstable triplets in exons compared to introns; (c) Accelerated Segment Switch in Exons to alter Targeting (ASSET); (d) Rapid Accumulation of Variations in Exposed Residues (RAVERs); (e) alteration in intron-exon boundary; (f) deletion of exon; and (g) loss/gain of domains through recombination. By systematic analyses of snake venom disintegrin/metalloprotease genes, I describe a new mechanism in the evolution of these genes through exonization and intronization. In the evolution of RTS/KTS disintegrins, a new exon (10a) is formed in intron 10 of the disintegrin/metalloprotease gene. Unlike more than 90% new exons that are from repetitive elements in introns, exon 10a originated from a non-repetitive element. To incorporate exon 10a, part of the exon 11 is intronized to retain the open reading frame. This is the first case of simultaneous exonization and intronization within a single gene. This new mechanism alters the function of toxins through drastic changes to the molecular surface via insertion of new exons and deletion of exons.
Collapse
Affiliation(s)
- R Manjunatha Kini
- Protein Science Laboratory, Department of Biological Sciences, Faculty of Science, National University of Singapore, Singapore, 117543, Singapore.
| |
Collapse
|
22
|
Auboeuf D. Alternative mRNA processing sites decrease genetic variability while increasing functional diversity. Transcription 2017; 9:75-87. [PMID: 29099315 PMCID: PMC5834221 DOI: 10.1080/21541264.2017.1373891] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022] Open
Abstract
Recent large-scale RNA sequencing efforts have revealed the extensive diversity of mRNA molecules produced from most eukaryotic coding genes, which arises from the usage of alternative, cryptic or non-canonical splicing and intronic polyadenylation sites. The prevailing view regarding the tremendous diversity of coding gene transcripts is that mRNA processing is a flexible and more-or-less noisy process leading to a diversity of proteins on which natural selection can act depending on protein-mediated cellular functions. However, this concept raises two main questions. First, do alternative mRNA processing pathways have a role other than generating mRNA and protein diversity? Second, is the cellular function of mRNA variants restricted to the biogenesis of functional protein isoforms? Here, I propose that the co-transcriptional use of alternative mRNA processing sites allows first, the resolution of co-transcriptional biophysical constraints that may otherwise result in DNA instability, and second, increases the diversity of cellular functions of mRNAs in a manner that is not restricted to protein synthesis.
Collapse
Affiliation(s)
- Didier Auboeuf
- a Univ Lyon, ENS de Lyon, Univ Claude Bernard, CNRS UMR 5239, INSERM U1210, Laboratory of Biology and Modelling of the Cell , 46 Allée d'Italie Site Jacques Monod, Lyon , France
| |
Collapse
|
23
|
Deveson IW, Hardwick SA, Mercer TR, Mattick JS. The Dimensions, Dynamics, and Relevance of the Mammalian Noncoding Transcriptome. Trends Genet 2017; 33:464-478. [PMID: 28535931 DOI: 10.1016/j.tig.2017.04.004] [Citation(s) in RCA: 151] [Impact Index Per Article: 18.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2017] [Accepted: 04/24/2017] [Indexed: 01/02/2023]
Abstract
The combination of pervasive transcription and prolific alternative splicing produces a mammalian transcriptome of great breadth and diversity. The majority of transcribed genomic bases are intronic, antisense, or intergenic to protein-coding genes, yielding a plethora of short and long non-protein-coding regulatory RNAs. Long noncoding RNAs (lncRNAs) share most aspects of their biogenesis, processing, and regulation with mRNAs. However, lncRNAs are typically expressed in more restricted patterns, frequently from enhancers, and exhibit almost universal alternative splicing. These features are consistent with their role as modular epigenetic regulators. We describe here the key studies and technological advances that have shaped our understanding of the dimensions, dynamics, and biological relevance of the mammalian noncoding transcriptome.
Collapse
Affiliation(s)
- Ira W Deveson
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Sydney, NSW, Australia; School of Biotechnology and Biomolecular Sciences, Faculty of Science, University of New South Wales, Sydney, NSW, Australia
| | - Simon A Hardwick
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Sydney, NSW, Australia; St Vincent's Clinical School, University of New South Wales, Sydney, NSW, Australia
| | - Tim R Mercer
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Sydney, NSW, Australia; St Vincent's Clinical School, University of New South Wales, Sydney, NSW, Australia
| | - John S Mattick
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Sydney, NSW, Australia; School of Biotechnology and Biomolecular Sciences, Faculty of Science, University of New South Wales, Sydney, NSW, Australia; St Vincent's Clinical School, University of New South Wales, Sydney, NSW, Australia.
| |
Collapse
|
24
|
Ramírez-Sánchez O, Pérez-Rodríguez P, Delaye L, Tiessen A. Plant Proteins Are Smaller Because They Are Encoded by Fewer Exons than Animal Proteins. GENOMICS, PROTEOMICS & BIOINFORMATICS 2016; 14:357-370. [PMID: 27998811 PMCID: PMC5200936 DOI: 10.1016/j.gpb.2016.06.003] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/22/2016] [Revised: 06/03/2016] [Accepted: 06/03/2016] [Indexed: 01/27/2023]
Abstract
Protein size is an important biochemical feature since longer proteins can harbor more domains and therefore can display more biological functionalities than shorter proteins. We found remarkable differences in protein length, exon structure, and domain count among different phylogenetic lineages. While eukaryotic proteins have an average size of 472 amino acid residues (aa), average protein sizes in plant genomes are smaller than those of animals and fungi. Proteins unique to plants are ∼81aa shorter than plant proteins conserved among other eukaryotic lineages. The smaller average size of plant proteins could neither be explained by endosymbiosis nor subcellular compartmentation nor exon size, but rather due to exon number. Metazoan proteins are encoded on average by ∼10 exons of small size [∼176 nucleotides (nt)]. Streptophyta have on average only ∼5.7 exons of medium size (∼230nt). Multicellular species code for large proteins by increasing the exon number, while most unicellular organisms employ rather larger exons (>400nt). Among subcellular compartments, membrane proteins are the largest (∼520aa), whereas the smallest proteins correspond to the gene ontology group of ribosome (∼240aa). Plant genes are encoded by half the number of exons and also contain fewer domains than animal proteins on average. Interestingly, endosymbiotic proteins that migrated to the plant nucleus became larger than their cyanobacterial orthologs. We thus conclude that plants have proteins larger than bacteria but smaller than animals or fungi. Compared to the average of eukaryotic species, plants have ∼34% more but ∼20% smaller proteins. This suggests that photosynthetic organisms are unique and deserve therefore special attention with regard to the evolutionary forces acting on their genomes and proteomes.
Collapse
Affiliation(s)
- Obed Ramírez-Sánchez
- Genetic Engineering Department, CINVESTAV Unidad Irapuato, Irapuato, CP 36821, Mexico
| | | | - Luis Delaye
- Genetic Engineering Department, CINVESTAV Unidad Irapuato, Irapuato, CP 36821, Mexico
| | - Axel Tiessen
- Genetic Engineering Department, CINVESTAV Unidad Irapuato, Irapuato, CP 36821, Mexico.
| |
Collapse
|
25
|
Widespread Shortening of 3' Untranslated Regions and Increased Exon Inclusion Are Evolutionarily Conserved Features of Innate Immune Responses to Infection. PLoS Genet 2016; 12:e1006338. [PMID: 27690314 PMCID: PMC5045211 DOI: 10.1371/journal.pgen.1006338] [Citation(s) in RCA: 74] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2016] [Accepted: 09/02/2016] [Indexed: 11/19/2022] Open
Abstract
The contribution of pre-mRNA processing mechanisms to the regulation of immune responses remains poorly studied despite emerging examples of their role as regulators of immune defenses. We sought to investigate the role of mRNA processing in the cellular responses of human macrophages to live bacterial infections. Here, we used mRNA sequencing to quantify gene expression and isoform abundances in primary macrophages from 60 individuals, before and after infection with Listeria monocytogenes and Salmonella typhimurium. In response to both bacteria we identified thousands of genes that significantly change isoform usage in response to infection, characterized by an overall increase in isoform diversity after infection. In response to both bacteria, we found global shifts towards (i) the inclusion of cassette exons and (ii) shorter 3’ UTRs, with near-universal shifts towards usage of more upstream polyadenylation sites. Using complementary data collected in non-human primates, we show that these features are evolutionarily conserved among primates. Following infection, we identify candidate RNA processing factors whose expression is associated with individual-specific variation in isoform abundance. Finally, by profiling microRNA levels, we show that 3’ UTRs with reduced abundance after infection are significantly enriched for target sites for particular miRNAs. These results suggest that the pervasive usage of shorter 3’ UTRs is a mechanism for particular genes to evade repression by immune-activated miRNAs. Collectively, our results suggest that dynamic changes in RNA processing may play key roles in the regulation of innate immune responses. Changes in gene regulation have long been known to play important roles in both innate and adaptive immune responses. While transcriptional responses to infection have been well-characterized, much less is known about the extent to which co-transcriptional mechanisms of mRNA processing are involved in the regulation of immune defenses. In this study, we sought to investigate the role of mRNA processing in the cellular responses of human macrophages to live bacterial infection. Using primary human macrophages derived from whole blood samples from 60 individuals, we sequenced mRNA both before and after infection with two live bacteria. We show that immune responses to infection are accompanied by pervasive changes in mRNA isoform usage, with systematic shifts towards increased cassette exon inclusion and shortening of Tandem 3’ UTRs post-infection. These patterns are conserved in nonhuman primates, supporting their functional importance across evolutionary time. Complementary microRNA profiling revealed that shortened 3’ UTRs are enriched for target sites of macrophage-expressed miRNAs, many of which are specifically activated after infection to regulate the innate immune response. Our results therefore provide the first genome-wide empirical support for the idea that actively regulated shifts towards shorter 3’ UTRs might allow specific genes to evade repression by immune-activated miRNAs.
Collapse
|
26
|
DNA methylation: conducting the orchestra from exposure to phenotype? Clin Epigenetics 2016; 8:92. [PMID: 27602172 PMCID: PMC5012062 DOI: 10.1186/s13148-016-0256-8] [Citation(s) in RCA: 126] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2016] [Accepted: 08/22/2016] [Indexed: 01/02/2023] Open
Abstract
DNA methylation, through 5-methyl- and 5-hydroxymethylcytosine (5mC and 5hmC), is considered to be one of the principal interfaces between the genome and our environment, and it helps explain phenotypic variations in human populations. Initial reports of large differences in methylation level in genomic regulatory regions, coupled with clear gene expression data in both imprinted genes and malignant diseases, provided easily dissected molecular mechanisms for switching genes on or off. However, a more subtle process is becoming evident, where small (<10 %) changes to intermediate methylation levels are associated with complex disease phenotypes. This has resulted in two clear methylation paradigms. The latter “subtle change” paradigm is rapidly becoming the epigenetic hallmark of complex disease phenotypes, although we are currently hampered by a lack of data addressing the true biological significance and meaning of these small differences. Our initial expectation of rapidly identifying mechanisms linking environmental exposure to a disease phenotype led to numerous observational/association studies being performed. Although this expectation remains unmet, there is now a growing body of literature on specific genes, suggesting wide ranging transcriptional and translational consequences of such subtle methylation changes. Data from the glucocorticoid receptor (NR3C1) has shown that a complex interplay between DNA methylation, extensive 5′UTR splicing, and microvariability gives rise to the overall level and relative distribution of total and N-terminal protein isoforms generated. Additionally, the presence of multiple AUG translation initiation codons throughout the complete, processed mRNA enables translation variability, hereby enhancing the translational isoforms and the resulting protein isoform diversity, providing a clear link between small changes in DNA methylation and significant changes in protein isoforms and cellular locations. Methylation changes in the NR3C1 CpG island alters the NR3C1 transcription and eventually protein isoforms in the tissues, resulting in subtle but visible physiological variability. This review addresses the current pathophysiological and clinical associations of such characteristically small DNA methylation changes, the ever-growing roles of DNA methylation and the evidence available, particularly from the glucocorticoid receptor of the cascade of events initiated by such subtle methylation changes, as well as addressing the underlying question as to what represents a genuine biologically significant difference in methylation.
Collapse
|
27
|
Kralovicova J, Vorechovsky I. Alternative splicing of U2AF1 reveals a shared repression mechanism for duplicated exons. Nucleic Acids Res 2016; 45:417-434. [PMID: 27566151 PMCID: PMC5224494 DOI: 10.1093/nar/gkw733] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2016] [Revised: 08/10/2016] [Accepted: 08/11/2016] [Indexed: 12/30/2022] Open
Abstract
The auxiliary factor of U2 small nuclear ribonucleoprotein (U2AF) facilitates branch point (BP) recognition and formation of lariat introns. The gene for the 35-kD subunit of U2AF gives rise to two protein isoforms (termed U2AF35a and U2AF35b) that are encoded by alternatively spliced exons 3 and Ab, respectively. The splicing recognition sequences of exon 3 are less favorable than exon Ab, yet U2AF35a expression is higher than U2AF35b across tissues. We show that U2AF35b repression is facilitated by weak, closely spaced BPs next to a long polypyrimidine tract of exon Ab. Each BP lacked canonical uridines at position -2 relative to the BP adenines, with efficient U2 base-pairing interactions predicted only for shifted registers reminiscent of programmed ribosomal frameshifting. The BP cluster was compensated by interactions involving unpaired cytosines in an upstream, EvoFold-predicted stem loop (termed ESL) that binds FUBP1/2. Exon Ab inclusion correlated with predicted free energies of mutant ESLs, suggesting that the ESL operates as a conserved rheostat between long inverted repeats upstream of each exon. The isoform-specific U2AF35 expression was U2AF65-dependent, required interactions between the U2AF-homology motif (UHM) and the α6 helix of U2AF35, and was fine-tuned by exon Ab/3 variants. Finally, we identify tandem homologous exons regulated by U2AF and show that their preferential responses to U2AF65-related proteins and SRSF3 are associated with unpaired pre-mRNA segments upstream of U2AF-repressed 3′ss. These results provide new insights into tissue-specific subfunctionalization of duplicated exons in vertebrate evolution and expand the repertoire of exon repression mechanisms that control alternative splicing.
Collapse
Affiliation(s)
- Jana Kralovicova
- University of Southampton, Faculty of Medicine, Southampton SO16 6YD, UK
| | - Igor Vorechovsky
- University of Southampton, Faculty of Medicine, Southampton SO16 6YD, UK
| |
Collapse
|
28
|
Lyons SM, Cunningham CH, Welch JD, Groh B, Guo AY, Wei B, Whitfield ML, Xiong Y, Marzluff WF. A subset of replication-dependent histone mRNAs are expressed as polyadenylated RNAs in terminally differentiated tissues. Nucleic Acids Res 2016; 44:9190-9205. [PMID: 27402160 PMCID: PMC5100578 DOI: 10.1093/nar/gkw620] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2015] [Accepted: 06/30/2016] [Indexed: 11/24/2022] Open
Abstract
Histone proteins are synthesized in large amounts during S-phase to package the newly replicated DNA, and are among the most stable proteins in the cell. The replication-dependent (RD)-histone mRNAs expressed during S-phase end in a conserved stem-loop rather than a polyA tail. In addition, there are replication-independent (RI)-histone genes that encode histone variants as polyadenylated mRNAs. Most variants have specific functions in chromatin, but H3.3 also serves as a replacement histone for damaged histones in long-lived terminally differentiated cells. There are no reported replacement histone genes for histones H2A, H2B or H4. We report that a subset of RD-histone genes are expressed in terminally differentiated tissues as polyadenylated mRNAs, likely serving as replacement histone genes in long-lived non-dividing cells. Expression of two genes, HIST2H2AA3 and HIST1H2BC, is conserved in mammals. They are expressed as polyadenylated mRNAs in fibroblasts differentiated in vitro, but not in serum starved fibroblasts, suggesting that their expression is part of the terminal differentiation program. There are two histone H4 genes and an H3 gene that encode mRNAs that are polyadenylated and expressed at 5- to 10-fold lower levels than the mRNAs from H2A and H2B genes, which may be replacement genes for the H3.1 and H4 proteins.
Collapse
Affiliation(s)
- Shawn M Lyons
- Department of Biology, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Clark H Cunningham
- Department of Biology, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Joshua D Welch
- Department of Computer Science, University of North Carolina, Chapel Hill, NC 27599
| | - Beezly Groh
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Andrew Y Guo
- Department of Biology, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Bruce Wei
- Department of Biology, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Michael L Whitfield
- Department of Genetics, Dartmouth Geisel School of Medicine, Hanover, NH 03755, USA
| | - Yue Xiong
- Department of Computer Science, University of North Carolina, Chapel Hill, NC 27599.,Integrative Program for Biological and Genome Sciences, University of North Carolina, Chapel Hill, NC 27599, USA
| | - William F Marzluff
- Department of Biology, University of North Carolina, Chapel Hill, NC 27599, USA .,Department of Computer Science, University of North Carolina, Chapel Hill, NC 27599.,Integrative Program for Biological and Genome Sciences, University of North Carolina, Chapel Hill, NC 27599, USA
| |
Collapse
|
29
|
Leenen FAD, Vernocchi S, Hunewald OE, Schmitz S, Molitor AM, Muller CP, Turner JD. Where does transcription start? 5'-RACE adapted to next-generation sequencing. Nucleic Acids Res 2016; 44:2628-45. [PMID: 26615195 PMCID: PMC4824077 DOI: 10.1093/nar/gkv1328] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2015] [Revised: 11/11/2015] [Accepted: 11/13/2015] [Indexed: 01/27/2023] Open
Abstract
The variability and complexity of the transcription initiation process was examined by adapting RNA ligase-mediated rapid amplification of 5' cDNA ends (5'-RACE) to Next-Generation Sequencing (NGS). We oligo-labelled 5'-m(7)G-capped mRNA from two genes, the simple mono-exonic Beta-2-Adrenoceptor (ADRB2R)and the complex multi-exonic Glucocorticoid Receptor (GR, NR3C1), and detected a variability in TSS location that has received little attention up to now. Transcription was not initiated at a fixed TSS, but from loci of 4 to 10 adjacent nucleotides. Individual TSSs had frequencies from <0.001% to 38.5% of the total gene-specific 5' m(7)G-capped transcripts. ADRB2R used a single locus consisting of 4 adjacent TSSs. Unstimulated, the GR used a total of 358 TSSs distributed throughout 38 loci, that were principally in the 5' UTRs and were spliced using established donor and acceptor sites. Complete demethylation of the epigenetically sensitive GR promoter with 5-azacytidine induced one new locus and 127 TSSs, 12 of which were unique. We induced GR transcription with dexamethasone and Interferon-γ, adding one new locus and 185 additional TSSs distributed throughout the promoter region. In-vitro the TSS microvariability regulated mRNA translation efficiency and the relative abundance of the different GRN-terminal protein isoform levels.
Collapse
Affiliation(s)
- Fleur A D Leenen
- Department of Infection and Immunity, Luxembourg Institute of Health, Esch-Sur-Alzette L-4354, Grand-Duchy of Luxembourg Department of Immunology, Research Institute of Psychobiology, University of Trier, Trier D-54290, Germany
| | - Sara Vernocchi
- Department of Infection and Immunity, Luxembourg Institute of Health, Esch-Sur-Alzette L-4354, Grand-Duchy of Luxembourg Department of Immunology, Research Institute of Psychobiology, University of Trier, Trier D-54290, Germany
| | - Oliver E Hunewald
- Department of Infection and Immunity, Luxembourg Institute of Health, Esch-Sur-Alzette L-4354, Grand-Duchy of Luxembourg
| | - Stephanie Schmitz
- Department of Infection and Immunity, Luxembourg Institute of Health, Esch-Sur-Alzette L-4354, Grand-Duchy of Luxembourg
| | - Anne M Molitor
- Department of Infection and Immunity, Luxembourg Institute of Health, Esch-Sur-Alzette L-4354, Grand-Duchy of Luxembourg
| | - Claude P Muller
- Department of Infection and Immunity, Luxembourg Institute of Health, Esch-Sur-Alzette L-4354, Grand-Duchy of Luxembourg Department of Immunology, Research Institute of Psychobiology, University of Trier, Trier D-54290, Germany
| | - Jonathan D Turner
- Department of Infection and Immunity, Luxembourg Institute of Health, Esch-Sur-Alzette L-4354, Grand-Duchy of Luxembourg
| |
Collapse
|
30
|
Raj B, Blencowe B. Alternative Splicing in the Mammalian Nervous System: Recent Insights into Mechanisms and Functional Roles. Neuron 2015; 87:14-27. [DOI: 10.1016/j.neuron.2015.05.004] [Citation(s) in RCA: 345] [Impact Index Per Article: 34.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|
31
|
RNA-Binding Proteins: Splicing Factors and Disease. Biomolecules 2015; 5:893-909. [PMID: 25985083 PMCID: PMC4496701 DOI: 10.3390/biom5020893] [Citation(s) in RCA: 65] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2015] [Revised: 04/22/2015] [Accepted: 04/29/2015] [Indexed: 12/12/2022] Open
Abstract
Pre-mRNA splicing is mediated by interactions of the Core Spliceosome and an array of accessory RNA binding proteins with cis-sequence elements. Splicing is a major regulatory component in higher eukaryotes. Disruptions in splicing are a major contributor to human disease. One in three hereditary disease alleles are believed to cause aberrant splicing. Hereditary disease alleles can alter splicing by disrupting a splicing element, creating a toxic RNA, or affecting splicing factors. One of the challenges of medical genetics is identifying causal variants from the thousands of possibilities discovered in a clinical sequencing experiment. Here we review the basic biochemistry of splicing, the mechanisms of splicing mutations, the methods for identifying splicing mutants, and the potential of therapeutic interventions.
Collapse
|