1
Telonis AG, Rigoutsos I. The transcriptional trajectories of pluripotency and differentiation comprise genes with antithetical architecture and repetitive-element content. BMC Biol 2021; 19:60. [PMID: 33765992 PMCID: PMC7995781 DOI: 10.1186/s12915-020-00928-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 08/18/2020] [Accepted: 11/18/2020] [Indexed: 12/12/2022] Open
Abstract
Background: Extensive molecular differences exist between proliferative and differentiated cells. Here, we conduct a meta-analysis of publicly available transcriptomic datasets from preimplantation and differentiation stages examining the architectural properties and content of genes whose abundance changes significantly across developmental time points.
Results: Analysis of preimplantation embryos from human and mouse showed that short genes whose introns are enriched in Alu (human) and B (mouse) elements, respectively, have higher abundance in the blastocyst compared to the zygote. These highly expressed genes encode ribosomal proteins or metabolic enzymes. On the other hand, long genes whose introns are depleted in repetitive elements have lower abundance in the blastocyst and include genes from signaling pathways. Additionally, the sequences of the genes that are differentially expressed between the blastocyst and the zygote contain distinct collections of pyknon motifs that differ between up- and down-regulated genes. Further examination of the genes that participate in the stem cell-specific protein interaction network shows that their introns are short and enriched in Alu (human) and B (mouse) elements. As organogenesis progresses, in both human and mouse, we find that the primarily short and repeat-rich expressed genes make way for primarily longer, repeat-poor genes. With that in mind, we used a machine learning-based approach to identify gene signatures able to classify human adult tissues: we find that the most discriminatory genes comprising these signatures have long introns that are repeat-poor and include transcription factors and signaling-cascade genes. The introns of widely expressed genes across human tissues, on the other hand, are short and repeat-rich, and coincide with those with the highest expression at the blastocyst stage.
Conclusions: Protein-coding genes that are characteristic of each trajectory, i.e., proliferation/pluripotency or differentiation, exhibit antithetical biases in their intronic and exonic lengths and in their repetitive-element content. While the respective human and mouse gene signatures are functionally and evolutionarily conserved, their introns and exons are enriched or depleted in organism-specific repetitive elements. We posit that these organism-specific repetitive sequences found in exons and introns are used to effect the corresponding genes’ regulation.
Supplementary Information: The online version contains supplementary material available at 10.1186/s12915-020-00928-8.
Affiliation(s)
- Aristeidis G Telonis
- Computational Medicine Center, Sidney Kimmel College of Medicine, Thomas Jefferson University, 1020 Locust Street, Suite M81, Philadelphia, PA, 19107, USA; Department of Human Genetics, Miller School of Medicine, University of Miami, Miami, FL, 33136, USA.
- Isidore Rigoutsos
- Computational Medicine Center, Sidney Kimmel College of Medicine, Thomas Jefferson University, 1020 Locust Street, Suite M81, Philadelphia, PA, 19107, USA.
2
Schaap P, Barrantes I, Minx P, Sasaki N, Anderson RW, Bénard M, Biggar KK, Buchler NE, Bundschuh R, Chen X, Fronick C, Fulton L, Golderer G, Jahn N, Knoop V, Landweber LF, Maric C, Miller D, Noegel AA, Peace R, Pierron G, Sasaki T, Schallenberg-Rüdinger M, Schleicher M, Singh R, Spaller T, Storey KB, Suzuki T, Tomlinson C, Tyson JJ, Warren WC, Werner ER, Werner-Felmayer G, Wilson RK, Winckler T, Gott JM, Glöckner G, Marwan W. The Physarum polycephalum Genome Reveals Extensive Use of Prokaryotic Two-Component and Metazoan-Type Tyrosine Kinase Signaling. Genome Biol Evol 2015; 8:109-25. [PMID: 26615215 PMCID: PMC4758236 DOI: 10.1093/gbe/evv237] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.3] [Accepted: 11/23/2015] [Indexed: 12/13/2022] Open
Abstract
Physarum polycephalum is a well-studied microbial eukaryote with unique experimental attributes relative to other experimental model organisms. It has a sophisticated life cycle with several distinct stages including amoebal, flagellated, and plasmodial cells. It is unusual in switching between open and closed mitosis according to specific life-cycle stages. Here we present the analysis of the genome of this enigmatic and important model organism and compare it with closely related species. The genome is littered with simple and complex repeats and the coding regions are frequently interrupted by introns with a mean size of 100 bases. Complemented with extensive transcriptome data, we define approximately 31,000 gene loci, providing unexpected insights into early eukaryote evolution. We describe extensive use of histidine kinase-based two-component systems and tyrosine kinase signaling, the presence of bacterial and plant type photoreceptors (phytochromes, cryptochrome, and phototropin) and of plant-type pentatricopeptide repeat proteins, as well as metabolic pathways, and a cell cycle control system typically found in more complex eukaryotes. Our analysis characterizes P. polycephalum as a prototypical eukaryote with features attributed to the last common ancestor of Amorphea, that is, the Amoebozoa and Opisthokonts. Specifically, the presence of tyrosine kinases in Acanthamoeba and Physarum as representatives of two distantly related subdivisions of Amoebozoa argues against the later emergence of tyrosine kinase signaling in the opisthokont lineage and also against the acquisition by horizontal gene transfer.
Affiliation(s)
- Pauline Schaap
- School of Life Sciences, University of Dundee, Dundee, United Kingdom
- Israel Barrantes
- Magdeburg Centre for Systems Biology and Institute for Biology, University of Magdeburg, Magdeburg, Germany
- Pat Minx
- The Genome Institute, Washington University School of Medicine, St Louis
- Narie Sasaki
- Department of Biological Sciences, Graduate School of Science, Nagoya University, Furocho, Chikusaku, Nagoya, Aichi, Japan
- Roger W Anderson
- Department of Molecular Biology and Biotechnology, University of Sheffield, Firth Court, Western Bank, Sheffield, United Kingdom
- Marianne Bénard
- UPMC Univ Paris 06, Institut de Biologie Paris-Seine (IBPS), CNRS UMR-7622, Paris, France
- Kyle K Biggar
- Biochemistry Department, Schulich School of Medicine and Dentistry, Western University, London, Ontario, Canada
- Nicolas E Buchler
- Department of Biology and Center for Genomic and Computational Biology, Duke University, Durham; Department of Physics, Duke University, Durham
- Ralf Bundschuh
- Department of Physics and Center for RNA Biology, The Ohio State University, Columbus; Department of Chemistry & Biochemistry, The Ohio State University, Columbus; Division of Hematology, Department of Internal Medicine, The Ohio State University, Columbus
- Xiao Chen
- Department of Ecology & Evolutionary Biology, Princeton University, Princeton
- Catrina Fronick
- The Genome Institute, Washington University School of Medicine, St Louis
- Lucinda Fulton
- The Genome Institute, Washington University School of Medicine, St Louis
- Georg Golderer
- Biological Chemistry, Biocenter, Innsbruck Medical University, Innsbruck, Austria
- Niels Jahn
- Genome Analysis, Leibniz Institute on Aging - Fritz Lipmann Institute (FLI), Jena, Germany
- Volker Knoop
- IZMB - Institut für Zelluläre und Molekulare Botanik, Universität Bonn, Bonn, Germany
- Laura F Landweber
- Department of Ecology & Evolutionary Biology, Princeton University, Princeton
- Chrystelle Maric
- Institut Jacques Monod, CNRS UMR7592, Université Paris Diderot Paris7, Paris, France
- Dennis Miller
- The University of Texas at Dallas, Biological Sciences, Richardson
- Angelika A Noegel
- Institute for Biochemistry I, Medical Faculty, University of Cologne, Cologne, Germany
- Rob Peace
- Carleton University, Ottawa, Ontario, Canada
- Gérard Pierron
- Institut Jacques Monod, CNRS UMR7592, Université Paris Diderot Paris7, Paris, France
- Taeko Sasaki
- Department of Biological Sciences, Graduate School of Science, Nagoya University, Furocho, Chikusaku, Nagoya, Aichi, Japan
- Michael Schleicher
- Institute for Anatomy III / Cell Biology, BioMedCenter, Ludwig-Maximilians-Universität, Planegg-Martinsried, Germany
- Reema Singh
- School of Life Sciences, University of Dundee, Dundee, United Kingdom
- Thomas Spaller
- Institut für Pharmazie, Friedrich-Schiller-Universität Jena, Jena, Germany
- Takamasa Suzuki
- Department of Biological Sciences, Graduate School of Science and JST ERATO Higashiyama Live-holonics Project, Nagoya University, Furocho, Chikusaku, Nagoya, Aichi, Japan
- Chad Tomlinson
- The Genome Institute, Washington University School of Medicine, St Louis
- John J Tyson
- Department of Biological Sciences, Virginia Polytechnic Institute and State University, Blacksburg
- Wesley C Warren
- The Genome Institute, Washington University School of Medicine, St Louis
- Ernst R Werner
- Biological Chemistry, Biocenter, Innsbruck Medical University, Innsbruck, Austria
- Richard K Wilson
- The Genome Institute, Washington University School of Medicine, St Louis
- Thomas Winckler
- Institut für Pharmazie, Friedrich-Schiller-Universität Jena, Jena, Germany
- Jonatha M Gott
- Center for RNA Molecular Biology, Case Western Reserve University, School of Medicine, Cleveland
- Gernot Glöckner
- Institute for Biochemistry I, Medical Faculty, University of Cologne, Cologne, Germany; Leibniz Institute of Freshwater Ecology and Inland Fisheries (IGB), Berlin, Germany
- Wolfgang Marwan
- Magdeburg Centre for Systems Biology and Institute for Biology, University of Magdeburg, Magdeburg, Germany
3
Zhou K, Salamov A, Kuo A, Aerts AL, Kong X, Grigoriev IV. Alternative splicing acting as a bridge in evolution. Stem Cell Investig 2015; 2:19. [PMID: 27358887 DOI: 10.3978/j.issn.2306-9759.2015.10.01] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Received: 10/12/2015] [Accepted: 10/15/2015] [Indexed: 12/15/2022]
Abstract
BACKGROUND: Alternative splicing (AS) regulates diverse cellular and developmental functions through alternative protein structures of different isoforms. Alternative exons dominate AS in vertebrates; however, very little is known about the extent and function of AS in lower eukaryotes. To understand the role of introns in gene evolution, we examined AS in one green algal and five fungal genomes using a novel EST-based gene-modeling algorithm (COMBEST).
METHODS: AS in each genome was classified with COMBEST, which maps EST sequences to genomes to build gene models. Various aspects of AS were analyzed through statistical methods. The interplay of intron 3n length, phase, coding property, and intron retention (RI) was examined with chi-square testing.
RESULTS: With 3 to 834 times EST coverage, we identified up to 73% of AS in intron-containing genes and found a preponderance of RI among 11 types of AS. The number of exons, expression level, and maximum intron length correlated with the number of AS per gene (NAG), and intron-rich genes suppressed AS. Genes with AS were more ancient, and AS was conserved among fungal genomes. Among stopless introns, non-retained introns (NRI) avoided, but major RI preferred, 3n length. In contrast, stop-containing introns showed a uniform distribution among 3n, 3n+1, and 3n+2 lengths. We found a clue to the intron phase enigma: it is the coding function of introns involved in AS that dictates the intron phase bias.
CONCLUSIONS: The majority of AS is non-functional, and the extent of AS is suppressed for intron-rich genes. RI through 3n length, stop codon, and phase bias bridges the transition from functionless to functional alternative isoforms.
Affiliation(s)
- Kemin Zhou
- 1 US Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA; 2 Roche Molecular Diagnostics, 4300 Hacienda Drive, Pleasanton, CA 94588, USA; 3 Department of Clinical Medicine, Kunming University of Science and Technology, Kunming 650031, China
- Asaf Salamov
- 1 US Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA; 2 Roche Molecular Diagnostics, 4300 Hacienda Drive, Pleasanton, CA 94588, USA; 3 Department of Clinical Medicine, Kunming University of Science and Technology, Kunming 650031, China
- Alan Kuo
- 1 US Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA; 2 Roche Molecular Diagnostics, 4300 Hacienda Drive, Pleasanton, CA 94588, USA; 3 Department of Clinical Medicine, Kunming University of Science and Technology, Kunming 650031, China
- Andrea L Aerts
- 1 US Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA; 2 Roche Molecular Diagnostics, 4300 Hacienda Drive, Pleasanton, CA 94588, USA; 3 Department of Clinical Medicine, Kunming University of Science and Technology, Kunming 650031, China
- Xiangyang Kong
- 1 US Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA; 2 Roche Molecular Diagnostics, 4300 Hacienda Drive, Pleasanton, CA 94588, USA; 3 Department of Clinical Medicine, Kunming University of Science and Technology, Kunming 650031, China
- Igor V Grigoriev
- 1 US Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA; 2 Roche Molecular Diagnostics, 4300 Hacienda Drive, Pleasanton, CA 94588, USA; 3 Department of Clinical Medicine, Kunming University of Science and Technology, Kunming 650031, China