1
Vedanayagam J. Small RNA-mediated suppression of sex chromosome meiotic conflicts during Drosophila male gametogenesis. Biochem Soc Trans 2025; 53:BST20240344. [PMID: 39918264 DOI: 10.1042/bst20240344] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/16/2025] [Revised: 01/09/2025] [Accepted: 01/16/2025] [Indexed: 02/23/2025]
Abstract
Meiosis is an evolutionarily conserved process in eukaryotes that ensures equal segregation of alleles and chromosomes during reproduction. Although parity in allelic transmission is the norm, selfish genes such as meiotic drivers can violate Mendel's first law of segregation. Sex chromosome drive is a form of meiotic drive that leads to unequal segregation of sex chromosomes, resulting in sex-ratio distortion and/or sterility in the offspring. Adverse fitness effects due to sex chromosome drive trigger the evolution of suppressors to restore Mendelian segregation. However, the molecular mechanisms by which suppressors emerge and counteract meiotic drive genes remain unclear. Recent studies from Drosophila have shed light on the critical roles of small RNA-mediated post-transcriptional silencing in mitigating sex chromosome meiotic conflicts. This review highlights the recruitment of two distinct small RNA pathways to combat intragenomic conflicts during male gametogenesis and seeks to reveal the impact of molecular arms races between meiotic drivers and their suppressors in shaping genome and sex chromosome evolution.
Affiliation(s)
- Jeffrey Vedanayagam
- Department of Neuroscience, Developmental and Regenerative Biology, University of Texas at San Antonio, San Antonio, TX 78249
2
Paul B, Siddaramappa S. Comparative analysis of the diversity of trinucleotide repeats in bacterial genomes. Genome 2024; 67:281-291. [PMID: 38593473 DOI: 10.1139/gen-2023-0097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/11/2024]
Abstract
The human gut is the most favorable niche for microbial populations, and few studies have explored the possibilities of horizontal gene transfer between host and pathogen. Trinucleotide repeat (TNR) expansion in humans can cause more than 40 neurodegenerative diseases. Further, TNRs are a type of microsatellite that, when located in coding regions, can contribute to the synthesis of homopolymeric amino acids. Hence, the present study aims to estimate the occurrence and diversity of TNRs in bacterial genomes available in the NCBI Genome database. Genome-wide analyses revealed that several bacterial genomes contain different types of uninterrupted TNRs. It was found that TNRs are abundant in the genomes of Alcaligenes faecalis, Mycoplasma gallisepticum, Mycoplasma genitalium, Sorangium cellulosum, and Thermus thermophilus. Interestingly, the genome of Bacillus thuringiensis strain YBT-1518 contained 169 uninterrupted ATT repeats. The genome of Leclercia adecarboxylata had 46 uninterrupted CAG repeats, which potentially translate into polyglutamine. In some instances, the TNRs were present in genes that potentially encode essential functions. Similar occurrences in human genes are known to cause genetic disorders. Further analysis of the occurrence of TNRs in bacterial genomes is likely to provide a better understanding of mismatch repair, genetic disorders, host-pathogen interaction, and homopolymeric amino acids.
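As a rough illustration of what counting "uninterrupted TNRs" involves, the Python sketch below scans a DNA string for back-to-back copies of a 3-bp unit. It is not the authors' pipeline: the function name `find_tnr_runs`, the `min_repeats` threshold, and the toy sequence are assumptions made for this example.

```python
import re

def find_tnr_runs(seq, min_repeats=5):
    """Return (unit, repeat_count, start) for each uninterrupted trinucleotide run.

    Hypothetical helper for illustration only. The reported unit depends on the
    frame in which the run starts, so CAG, AGC and GCA describe the same repeat.
    """
    seq = seq.upper()
    # ([ACGT]{3}) captures a 3-bp unit; the backreference \1 must then follow
    # at least (min_repeats - 1) more times with nothing in between.
    pattern = re.compile(r"([ACGT]{3})\1{%d,}" % (min_repeats - 1))
    runs = []
    for match in pattern.finditer(seq):
        unit = match.group(1)
        if len(set(unit)) == 1:
            continue  # AAA, CCC, ... are homopolymers, not trinucleotide repeats
        runs.append((unit, len(match.group(0)) // 3, match.start()))
    return runs

# Toy sequence: 46 uninterrupted CAG units flanked by TT on each side.
toy = "TT" + "CAG" * 46 + "TT"
print(find_tnr_runs(toy))
```

A single substitution inside the run splits it into two shorter runs, which is why "uninterrupted" counts are sensitive to the chosen threshold.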
Affiliation(s)
- Bobby Paul
- Department of Bioinformatics, Manipal School of Life Sciences, Manipal Academy of Higher Education, Manipal 576104, Karnataka, India
- Shivakumara Siddaramappa
- Institute of Bioinformatics and Applied Biotechnology, Biotech Park, Electronic City, Bengaluru 560100, Karnataka, India
3
Shukla HG, Chakraborty M, Emerson J. Genetic variation in recalcitrant repetitive regions of the Drosophila melanogaster genome. bioRxiv 2024:2024.06.11.598575. [PMID: 38915508 PMCID: PMC11195212 DOI: 10.1101/2024.06.11.598575] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/26/2024]
Abstract
Many essential functions of organisms are encoded in highly repetitive genomic regions, including histones involved in DNA packaging, centromeres that are core components of chromosome segregation, ribosomal RNA comprising the protein translation machinery, telomeres that ensure chromosome integrity, piRNA clusters encoding host defenses against selfish elements, and virtually the entire Y chromosome. These regions, formed by highly similar tandem arrays, pose significant challenges for experimental and informatic study, impeding sequence-level descriptions essential for understanding genetic variation. Here, we report the assembly and variation analysis of such repetitive regions in Drosophila melanogaster, offering significant improvements to the existing community reference assembly. Our work successfully recovers previously elusive segments, including complete reconstructions of the histone locus and the pericentric heterochromatin of the X chromosome, spanning the Stellate locus to the distal flank of the rDNA cluster. To infer structural changes in these regions where alignments are often not practicable, we introduce landmark anchors based on unique variants that are putatively orthologous. These regions display considerable structural variation between different D. melanogaster strains, exhibiting differences in copy number and organization of homologous repeat units between haplotypes. In the histone cluster, although we observe minimal genetic exchange indicative of crossing over, the variation patterns suggest mechanisms such as unequal sister chromatid exchange. We also examine the prevalence and scale of concerted evolution in the histone and Stellate clusters and discuss the mechanisms underlying these observed patterns.
Affiliation(s)
- Harsh G. Shukla
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California 92697, USA
- Graduate Program in Mathematical, Computational and Systems Biology, University of California Irvine, Irvine, California 92697, USA
- Mahul Chakraborty
- Department of Biology, Texas A&M University, College Station, Texas 77843, USA
- J.J. Emerson
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California 92697, USA
- Center for Complex Biological Systems, University of California Irvine, Irvine, California 92697, USA
4
Flynn JM, Yamashita YM. The implications of satellite DNA instability on cellular function and evolution. Semin Cell Dev Biol 2024; 156:152-159. [PMID: 37852904 DOI: 10.1016/j.semcdb.2023.10.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 04/26/2023] [Revised: 09/21/2023] [Accepted: 10/11/2023] [Indexed: 10/20/2023]
Abstract
Abundant tandemly repeated satellite DNA is present in most eukaryotic genomes. Previous limitations including a pervasive view that it was uninteresting junk DNA, combined with challenges in studying it, are starting to dissolve - and recent studies have found important functions for satellite DNAs. The observed rapid evolution and implied instability of satellite DNA now has important significance for their functions and maintenance within the genome. In this review, we discuss the processes that lead to satellite DNA copy number instability, and the importance of mechanisms to manage the potential negative effects of instability. Satellite DNA is vulnerable to challenges during replication and repair, since it forms difficult-to-process secondary structures and its homology within tandem arrays can result in various types of recombination. Satellite DNA instability may be managed by DNA or chromatin-binding proteins ensuring proper nuclear localization and repair, or by proteins that process aberrant structures that satellite DNAs tend to form. We also discuss the pattern of satellite DNA mutations from recent mutation accumulation (MA) studies that have tracked changes in satellite DNA for up to 1000 generations with minimal selection. Finally, we highlight examples of satellite evolution from studies that have characterized satellites across millions of years of Drosophila fruit fly evolution, and discuss possible ways that selection might act on the satellite DNA composition.
Affiliation(s)
- Jullien M Flynn
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA; Howard Hughes Medical Institute, Cambridge, MA, USA.
- Yukiko M Yamashita
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA; Howard Hughes Medical Institute, Cambridge, MA, USA; Massachusetts Institute of Technology, Cambridge, MA, USA.
5
Zhang Y, Chu J, Cheng H, Li H. De novo reconstruction of satellite repeat units from sequence data. Genome Res 2023; 33:1994-2001. [PMID: 37918962 PMCID: PMC10760446 DOI: 10.1101/gr.278005.123] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Received: 04/19/2023] [Accepted: 10/18/2023] [Indexed: 11/04/2023]
Abstract
Satellite DNA are long tandemly repeating sequences in a genome and may be organized as high-order repeats (HORs). They are enriched in centromeres and are challenging to assemble. Existing algorithms for identifying satellite repeats either require the complete assembly of satellites or only work for simple repeat structures without HORs. Here we describe Satellite Repeat Finder (SRF), a new algorithm for reconstructing satellite repeat units and HORs from accurate reads or assemblies without prior knowledge on repeat structures. Applying SRF to real sequence data, we show that SRF could reconstruct known satellites in human and well-studied model organisms. We also find satellite repeats are pervasive in various other species, accounting for up to 12% of their genome contents but are often underrepresented in assemblies. With the rapid progress in genome sequencing, SRF will help the annotation of new genomes and the study of satellite DNA evolution even if such repeats are not fully assembled.
Affiliation(s)
- Yujie Zhang
- Harvard School of Public Health, Boston, Massachusetts 02115, USA
- Justin Chu
- Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts 02115, USA
- Haoyu Cheng
- Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts 02115, USA
- Heng Li
- Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts 02115, USA
6
Courret C, Wei X, Larracuente AM. New perspectives on the causes and consequences of male meiotic drive. Curr Opin Genet Dev 2023; 83:102111. [PMID: 37704518 PMCID: PMC10842977 DOI: 10.1016/j.gde.2023.102111] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 03/14/2023] [Revised: 08/07/2023] [Accepted: 08/09/2023] [Indexed: 09/15/2023]
Abstract
Gametogenesis is vulnerable to selfish genetic elements that bias their transmission to the next generation by cheating meiosis. These so-called meiotic drivers are widespread in plants, animals, and fungi and can impact genome evolution. Here, we summarize recent progress on the causes and consequences of meiotic drive in males, where selfish elements attack vulnerabilities in spermatogenesis. Advances in genomics provide new insights into the organization and dynamics of driving chromosomes in natural populations. Common themes, including small RNAs, gene duplications, and heterochromatin, emerged from these studies. Interdisciplinary approaches combining evolutionary genomics with molecular and cell biology are beginning to unravel the mysteries of drive and suppression mechanisms. These approaches also provide insights into fundamental processes in spermatogenesis and chromatin regulation.
Affiliation(s)
- Cécile Courret
- Department of Biology, University of Rochester, Rochester, NY 14627, USA. https://twitter.com/@CecileCourret
- Xiaolu Wei
- Department of Biology, University of Rochester, Rochester, NY 14627, USA. https://twitter.com/@xiaolu_wei
7
Wierzbicki F, Kofler R. The composition of piRNA clusters in Drosophila melanogaster deviates from expectations under the trap model. BMC Biol 2023; 21:224. [PMID: 37858221 PMCID: PMC10588112 DOI: 10.1186/s12915-023-01727-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/06/2023] [Accepted: 10/06/2023] [Indexed: 10/21/2023]
Abstract
BACKGROUND: It is widely assumed that the invasion of a transposable element (TE) in mammals and invertebrates is stopped when a copy of the TE jumps into a piRNA cluster (i.e., the trap model). However, recent works, which for example showed that deletion of three major piRNA clusters has no effect on TE activity, cast doubt on the trap model.
RESULTS: Here, we test the trap model from a population genetics perspective. Our simulations show that the composition of regions that act as transposon traps (i.e., potentially piRNA clusters) ought to deviate from regions that have no effect on TE activity. We investigated TEs in five Drosophila melanogaster strains using three complementary approaches to test whether the composition of piRNA clusters matches these expectations. We found that the abundance of TE families inside and outside of piRNA clusters is highly correlated, although this is not expected under the trap model. Furthermore, the distribution of the number of TE insertions in piRNA clusters is also much broader than expected.
CONCLUSIONS: We found that the observed composition of piRNA clusters is not in agreement with expectations under the simple trap model. Dispersed piRNA-producing TE insertions and temporal as well as spatial heterogeneity of piRNA clusters may account for these deviations.
Affiliation(s)
- Filip Wierzbicki
- Institut für Populationsgenetik, Vetmeduni Vienna, Vienna, Austria
- Vienna Graduate School of Population Genetics, Vienna, Austria
- Robert Kofler
- Institut für Populationsgenetik, Vetmeduni Vienna, Vienna, Austria.
8
Ferraz ME, Ribeiro T, Sader M, Nascimento T, Pedrosa-Harand A. Comparative analysis of repetitive DNA in dysploid and non-dysploid Phaseolus beans. Chromosome Res 2023; 31:30. [PMID: 37812264 DOI: 10.1007/s10577-023-09739-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/11/2023] [Revised: 08/31/2023] [Accepted: 09/15/2023] [Indexed: 10/10/2023]
Abstract
Structural karyotype changes result from ectopic recombination events frequently associated with repetitive DNA. Although most Phaseolus species present relatively stable karyotypes with 2n = 22 chromosomes, the karyotypes of species of the Leptostachyus group show high rates of structural rearrangements, including a nested chromosome fusion that led to the dysploid chromosome number of the group (2n = 20). We examined the roles of repetitive landscapes in the rearrangements of species of the Leptostachyus group using genome-skimming data to characterize the repeatome in a range of Phaseolus species and compared them to species of that group (P. leptostachyus and P. macvaughii). LTR retrotransposons, especially the Ty3/gypsy lineage Chromovirus, were the most abundant elements in the genomes. Differences in the abundance of Tekay, Retand, and SIRE elements between P. macvaughii and P. leptostachyus were reflected in their total amounts of Ty3/gypsy and Ty1/copia. The satellite DNA fraction was the most divergent among the species, varying both in abundance and distribution, even between P. leptostachyus and P. macvaughii. The rapid turnover of repeats in the Leptostachyus group may be associated with the several rearrangements observed.
Affiliation(s)
- Maria Eduarda Ferraz
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Biosciences Centre, Federal University of Pernambuco, Recife, PE, Brazil
- Tiago Ribeiro
- Integrative Plant Research Lab, Department of Botany and Ecology, Institute of Biosciences, Federal University of Mato Grosso, Cuiabá, MT, Brazil
- Mariela Sader
- Multidisciplinary Institute of Plant Biology, National Council for Scientific and Technical Research, National University of Córdoba, Córdoba, Argentina
- Thiago Nascimento
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Biosciences Centre, Federal University of Pernambuco, Recife, PE, Brazil
- Andrea Pedrosa-Harand
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Biosciences Centre, Federal University of Pernambuco, Recife, PE, Brazil.
9
Zhang Y, Chu J, Cheng H, Li H. De novo reconstruction of satellite repeat units from sequence data. arXiv 2023:arXiv:2304.09729v1. [PMID: 37131874 PMCID: PMC10153287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/04/2023]
Abstract
Satellite DNA are long tandemly repeating sequences in a genome and may be organized as high-order repeats (HORs). They are enriched in centromeres and are challenging to assemble. Existing algorithms for identifying satellite repeats either require the complete assembly of satellites or only work for simple repeat structures without HORs. Here we describe Satellite Repeat Finder (SRF), a new algorithm for reconstructing satellite repeat units and HORs from accurate reads or assemblies without prior knowledge on repeat structures. Applying SRF to real sequence data, we showed that SRF could reconstruct known satellites in human and well-studied model organisms. We also found satellite repeats are pervasive in various other species, accounting for up to 12% of their genome contents but are often underrepresented in assemblies. With the rapid progress on genome sequencing, SRF will help the annotation of new genomes and the study of satellite DNA evolution even if such repeats are not fully assembled.
Affiliation(s)
- Yujie Zhang
- Harvard School of Public Health, 677 Huntington Avenue, Boston, MA 02115, USA
- Justin Chu
- Department of Data Science, Dana-Farber Cancer Institute, 450 Brookline Ave, Boston, MA 02215, USA
- Department of Biomedical Informatics, Harvard Medical School, 10 Shattuck St, Boston, MA 02115, USA
- Haoyu Cheng
- Department of Data Science, Dana-Farber Cancer Institute, 450 Brookline Ave, Boston, MA 02215, USA
- Department of Biomedical Informatics, Harvard Medical School, 10 Shattuck St, Boston, MA 02115, USA
- Heng Li
- Department of Data Science, Dana-Farber Cancer Institute, 450 Brookline Ave, Boston, MA 02215, USA
- Department of Biomedical Informatics, Harvard Medical School, 10 Shattuck St, Boston, MA 02115, USA
10
Wierzbicki F, Kofler R, Signor S. Evolutionary dynamics of piRNA clusters in Drosophila. Mol Ecol 2023; 32:1306-1322. [PMID: 34878692 DOI: 10.1111/mec.16311] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Received: 08/30/2021] [Revised: 11/24/2021] [Accepted: 12/01/2021] [Indexed: 12/21/2022]
Abstract
Small RNAs produced from transposable element (TE)-rich sections of the genome, termed piRNA clusters, are a crucial component in the genomic defence against selfish DNA. In animals, it is thought the invasion of a TE is stopped when a copy of the TE inserts into a piRNA cluster, triggering the production of cognate small RNAs that silence the TE. Despite this importance for TE control, little is known about the evolutionary dynamics of piRNA clusters, mostly because these repeat-rich regions are difficult to assemble and compare. Here, we establish a framework for studying the evolution of piRNA clusters quantitatively. Previously introduced quality metrics and a newly developed software for multiple alignments of repeat annotations (Manna) allow us to estimate the level of polymorphism segregating in piRNA clusters and the divergence among homologous piRNA clusters. By studying 20 conserved piRNA clusters in multiple assemblies of four Drosophila species, we show that piRNA clusters are evolving rapidly. While 70%-80% of the clusters are conserved within species, the clusters share almost no similarity between species as closely related as D. melanogaster and D. simulans. Furthermore, abundant insertions and deletions are segregating within the Drosophila species. We show that the evolution of clusters is mainly driven by large insertions of recently active TEs and smaller deletions mostly in older TEs. The effect of these forces is so rapid that homologous clusters often do not contain insertions from the same TE families.
Affiliation(s)
- Filip Wierzbicki
- Institut für Populationsgenetik, Vetmeduni Vienna, Vienna, Austria
- Vienna Graduate School of Population Genetics, Vienna, Austria
- Robert Kofler
- Institut für Populationsgenetik, Vetmeduni Vienna, Vienna, Austria
- Sarah Signor
- Biological Sciences, North Dakota State University, Fargo, North Dakota, USA
11
Silva BSML, Picorelli ACR, Kuhn GCS. In Silico Identification and Characterization of Satellite DNAs in 23 Drosophila Species from the Montium Group. Genes (Basel) 2023; 14:300. [PMID: 36833227 PMCID: PMC9957191 DOI: 10.3390/genes14020300] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/06/2022] [Revised: 01/13/2023] [Accepted: 01/16/2023] [Indexed: 01/24/2023]
Abstract
Satellite DNA (satDNA) is a class of tandemly repeated non-protein coding DNA sequences which can be found in abundance in eukaryotic genomes. They can be functional, impact the genomic architecture in many ways, and their rapid evolution has consequences for species diversification. We took advantage of the recent availability of sequenced genomes from 23 Drosophila species from the montium group to study their satDNA landscape. For this purpose, we used publicly available whole-genome sequencing Illumina reads and the TAREAN (tandem repeat analyzer) pipeline. We provide the characterization of 101 non-homologous satDNA families in this group, 93 of which are described here for the first time. Their repeat units vary in size from 4 bp to 1897 bp, but most satDNAs show repeat units < 100 bp long and, among them, repeats ≤ 10 bp are the most frequent ones. The genomic contribution of the satDNAs ranges from ~1.4% to 21.6%. There is no significant correlation between satDNA content and genome sizes in the 23 species. We also found that at least one satDNA originated from an expansion of the central tandem repeats (CTRs) present inside a Helitron transposon. Finally, some satDNAs may be useful as taxonomic markers for the identification of species or subgroups within the group.
Affiliation(s)
- Gustavo C. S. Kuhn
- Department of Genetics, Ecology and Evolution, Federal University of Minas Gerais, Belo Horizonte 31270-901, Brazil
12
Huang Y, Shukla H, Lee YCG. Species-specific chromatin landscape determines how transposable elements shape genome evolution. eLife 2022; 11:81567. [PMID: 35997258 PMCID: PMC9398452 DOI: 10.7554/elife.81567] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Received: 07/04/2022] [Accepted: 07/15/2022] [Indexed: 11/30/2022]
Abstract
Transposable elements (TEs) are selfish genetic parasites that increase their copy number at the expense of host fitness. The ‘success’, or genome-wide abundance, of TEs differs widely between species. Deciphering the causes for this large variety in TE abundance has remained a central question in evolutionary genomics. We previously proposed that species-specific TE abundance could be driven by the inadvertent consequences of host-directed epigenetic silencing of TEs: the spreading of repressive epigenetic marks from silenced TEs into adjacent sequences. Here, we compared this TE-mediated local enrichment of repressive marks, or ‘the epigenetic effect of TEs’, in six species in the Drosophila melanogaster subgroup to dissect step-by-step the role of such effect in determining genomic TE abundance. We found that TE-mediated local enrichment of repressive marks is prevalent and substantially varies across and even within species. While this TE-mediated effect alters the epigenetic states of adjacent genes, we surprisingly discovered that the transcription of neighboring genes could reciprocally impact this spreading. Importantly, our multi-species analysis provides the power and appropriate phylogenetic resolution to connect species-specific host chromatin regulation, TE-mediated epigenetic effects, the strength of natural selection against TEs, and genomic TE abundance unique to individual species. Our findings point toward the importance of host chromatin landscapes in shaping genome evolution through the epigenetic effects of a selfish genetic parasite.
All the instructions required for life are encoded in the set of DNA present in a cell. It therefore seems natural to think that every bit of this genetic information should serve the organism. And yet most species carry parasitic ‘transposable’ sequences, or transposons, whose only purpose is to multiply and insert themselves at other positions in the genome.
It is possible for cells to suppress these selfish elements. Chemical marks can be deposited onto the DNA to temporarily ‘silence’ transposons and prevent them from being able to move and replicate. However, this sometimes comes at a cost: the repressive chemical modifications can spread to nearby genes that are essential for the organism and perturb their function. Strangely, the prevalence of transposons varies widely across the tree of life. These sequences form the majority of the genome of certain species – in fact, they represent about half of the human genetic information. But their abundance is much lower in other organisms, forming a measly 6% of the genome of puffer fish for instance. Even amongst fruit fly species, the prevalence of transposable elements can range between 2% and 25%. What explains such differences? Huang et al. set out to examine this question through the lens of transposon silencing, systematically comparing how this process impacts nearby regions in six species of fruit flies. This revealed variations in the strength of the side effects associated with transposon silencing, resulting in different levels of perturbation on neighbouring genes. A stronger impact was associated with the species having fewer transposons in its genome, suggesting that an evolutionary pressure is at work to keep the abundance of transposons at a low level in these species. Further analyses showed that the genes which determine how silencing marks are distributed may also be responsible for the variations in the impact of transposon silencing. They could therefore be the ones driving differences in the abundance of transposons between species. Overall, this work sheds light on the complex mechanisms shaping the evolution of genomes, and it may help to better understand how transposons are linked to processes such as aging and cancer.
Affiliation(s)
- Yuheng Huang
- Department of Ecology and Evolutionary Biology, University of California, Irvine, Irvine, United States
- Harsh Shukla
- Department of Ecology and Evolutionary Biology, University of California, Irvine, Irvine, United States
- Yuh Chwen G Lee
- Department of Ecology and Evolutionary Biology, University of California, Irvine, Irvine, United States
13
New Developments and Possibilities in Reanalysis and Reinterpretation of Whole Exome Sequencing Datasets for Unsolved Rare Diseases Using Machine Learning Approaches. Int J Mol Sci 2022; 23:ijms23126792. [PMID: 35743235 PMCID: PMC9224427 DOI: 10.3390/ijms23126792] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 04/30/2022] [Revised: 06/13/2022] [Accepted: 06/15/2022] [Indexed: 11/21/2022]
Abstract
Rare diseases impact the lives of 300 million people in the world. Rapid advances in bioinformatics and genomic technologies have enabled the discovery of causes of 20–30% of rare diseases. However, most rare diseases have remained as unsolved enigmas to date. Newer tools and availability of high throughput sequencing data have enabled the reanalysis of previously undiagnosed patients. In this review, we have systematically compiled the latest developments in the discovery of the genetic causes of rare diseases using machine learning methods. Importantly, we have detailed methods available to reanalyze existing whole exome sequencing data of unsolved rare diseases. We have identified different reanalysis methodologies to solve problems associated with sequence alterations/mutations, variation re-annotation, protein stability, splice isoform malfunctions and oligogenic analysis. In addition, we give an overview of new developments in the field of rare disease research using whole genome sequencing data and other omics.
14
Wei KHC, Mai D, Chatla K, Bachtrog D. Dynamics and Impacts of Transposable Element Proliferation in the Drosophila nasuta Species Group Radiation. Mol Biol Evol 2022; 39:msac080. [PMID: 35485457 PMCID: PMC9075770 DOI: 10.1093/molbev/msac080] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Indexed: 11/14/2022]
Abstract
Transposable element (TE) mobilization is a constant threat to genome integrity. Eukaryotic organisms have evolved robust defensive mechanisms to suppress their activity, yet TEs can escape suppression and proliferate, creating strong selective pressure for host defense to adapt. This genomic conflict fuels a never-ending arms race that drives the rapid evolution of TEs and recurrent positive selection of genes involved in host defense; the latter has been shown to contribute to postzygotic hybrid incompatibility. However, how TE proliferation impacts genome and regulatory divergence remains poorly understood. Here, we report the highly complete and contiguous (N50 = 33.8-38.0 Mb) genome assemblies of seven closely related Drosophila species that belong to the nasuta species group, a poorly studied group of flies that radiated in the last 2 My. We constructed a high-quality de novo TE library and gathered germline RNA-seq data, which allowed us to comprehensively annotate and compare TE insertion patterns between the species, and infer the evolutionary forces controlling their spread. We find a strong negative association between TE insertion frequency and expression of genes nearby; this likely reflects survivor bias from reduced fitness impact of TEs inserting near lowly expressed, nonessential genes, with limited TE-induced epigenetic silencing. Phylogenetic analyses of insertions of 147 TE families reveal that 53% of them show recent amplification in at least one species. The most highly amplified TE is a nonautonomous DNA element (Drosophila INterspersed Element; DINE) which has gone through multiple bouts of expansions with thousands of full-length copies littered throughout each genome. Across all TEs, we find that TE expansions are significantly associated with high expression in the expanded species, consistent with suppression escape. Thus, whereas horizontal transfer followed by the invasion of a naïve genome has been highlighted to explain the long-term survival of TEs, our analysis suggests that evasion of host suppression of resident TEs is a major strategy to persist over evolutionary times. Altogether, our results shed light on the heterogenous and context-dependent nature in which TEs affect gene regulation and the dynamics of rampant TE proliferation amidst a recently radiated species group.
Affiliation(s)
- Kevin H.-C. Wei
- Department of Integrative Biology, University of California Berkeley, Berkeley, CA 94720, USA
| | - Dat Mai
- Department of Integrative Biology, University of California Berkeley, Berkeley, CA 94720, USA
| | - Kamalakar Chatla
- Department of Integrative Biology, University of California Berkeley, Berkeley, CA 94720, USA
| | - Doris Bachtrog
- Department of Integrative Biology, University of California Berkeley, Berkeley, CA 94720, USA
| |
|
15
|
Navarro-Dominguez B, Chang CH, Brand CL, Muirhead CA, Presgraves DC, Larracuente AM. Epistatic selection on a selfish Segregation Distorter supergene - drive, recombination, and genetic load. eLife 2022; 11:e78981. [PMID: 35486424 PMCID: PMC9122502 DOI: 10.7554/elife.78981] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 04/03/2022] [Accepted: 04/20/2022] [Indexed: 11/13/2022] Open
Abstract
Meiotic drive supergenes are complexes of alleles at linked loci that together subvert Mendelian segregation resulting in preferential transmission. In males, the most common mechanism of drive involves the disruption of sperm bearing one of a pair of alternative alleles. While at least two loci are important for male drive (the driver and the target), linked modifiers can enhance drive, creating selection pressure to suppress recombination. In this work, we investigate the evolution and genomic consequences of an autosomal, multilocus, male meiotic drive system, Segregation Distorter (SD) in the fruit fly, Drosophila melanogaster. In African populations, the predominant SD chromosome variant, SD-Mal, is characterized by two overlapping, paracentric inversions on chromosome arm 2R and nearly perfect (~100%) transmission. We study the SD-Mal system in detail, exploring its components, chromosomal structure, and evolutionary history. Our findings reveal a recent chromosome-scale selective sweep mediated by strong epistatic selection for haplotypes carrying Sd, the main driving allele, and one or more factors within the double inversion. While most SD-Mal chromosomes are homozygous lethal, SD-Mal haplotypes can recombine with other, complementing haplotypes via crossing over, and with wildtype chromosomes via gene conversion. SD-Mal chromosomes have nevertheless accumulated lethal mutations, excess non-synonymous mutations, and excess transposable element insertions. Therefore, SD-Mal haplotypes evolve as a small, semi-isolated subpopulation with a history of strong selection. These results may explain the evolutionary turnover of SD haplotypes in different populations around the world and have implications for supergene evolution broadly.
Affiliation(s)
| | - Ching-Ho Chang
- Department of Biology, University of Rochester, Rochester, United States
| | - Cara L Brand
- Department of Biology, University of Rochester, Rochester, United States
| | - Christina A Muirhead
- Department of Biology, University of Rochester, Rochester, United States
- Ronin Institute, Montclair, United States
| | | | | |
|
16
|
Rech GE, Radío S, Guirao-Rico S, Aguilera L, Horvath V, Green L, Lindstadt H, Jamilloux V, Quesneville H, González J. Population-scale long-read sequencing uncovers transposable elements associated with gene expression variation and adaptive signatures in Drosophila. Nat Commun 2022; 13:1948. [PMID: 35413957 PMCID: PMC9005704 DOI: 10.1038/s41467-022-29518-8] [Citation(s) in RCA: 51] [Impact Index Per Article: 17.0] [Received: 03/22/2021] [Accepted: 03/15/2022] [Indexed: 12/16/2022] Open
Abstract
High-quality reference genomes are crucial to understanding genome function, structure and evolution. The availability of reference genomes has allowed us to start inferring the role of genetic variation in biology, disease, and biodiversity conservation. However, analyses across organisms demonstrate that a single reference genome is not enough to capture the global genetic diversity present in populations. In this work, we generate 32 high-quality reference genomes for the well-known model species D. melanogaster and focus on the identification and analysis of transposable element variation as they are the most common type of structural variant. We show that integrating the genetic variation across natural populations from five climatic regions increases the number of detected insertions by 58%. Moreover, 26% to 57% of the insertions identified using long reads were missed by short-read methods. We also identify hundreds of transposable elements associated with gene expression variation and new TE variants likely to contribute to adaptive evolution in this species. Our results highlight the importance of incorporating the genetic variation present in natural populations to genomic studies, which is essential if we are to understand how genomes function and evolve.
Affiliation(s)
- Gabriel E Rech
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Santiago Radío
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Sara Guirao-Rico
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Laura Aguilera
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Vivien Horvath
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Llewellyn Green
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Hannah Lindstadt
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | | | | | - Josefa González
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain.
| |
|
17
|
Affiliation(s)
| | - Francisco J. Ruiz-Ruano
- Department of Organismal Biology – Systematic Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
- School of Biological Sciences, Norwich Research Park University of East Anglia, Norwich, UK
| |
|
18
|
Vedanayagam J, Lin CJ, Lai EC. Rapid evolutionary dynamics of an expanding family of meiotic drive factors and their hpRNA suppressors. Nat Ecol Evol 2021; 5:1613-1623. [PMID: 34862477 PMCID: PMC8665063 DOI: 10.1038/s41559-021-01592-z] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Received: 04/30/2021] [Accepted: 10/19/2021] [Indexed: 11/25/2022]
Abstract
Meiotic drivers are a class of selfish genetic elements whose existence is frequently hidden due to concomitant suppressor systems. Accordingly, we know little of their evolutionary breadth and molecular mechanisms. Here, we trace the evolution of the Dox meiotic drive system in Drosophila simulans, which affects male-female balance (sex ratio). Dox emerged via stepwise mobilization and acquisition of multiple D. melanogaster gene segments including from protamine, which mediates compaction of sperm chromatin. Moreover, we reveal novel Dox homologs and massive amplification of Dox superfamily genes on X chromosomes of its closest sisters D. mauritiana and D. sechellia. Emergence of Dox loci is tightly associated with 359-class satellite repeats that flank de novo genomic copies. In concert, we find coordinated diversification of autosomal hairpin RNA-class siRNA loci that target subsets of Dox superfamily genes. Overall, we reveal fierce genetic arms races between meiotic drive factors and siRNA suppressors associated with recent speciation.
Affiliation(s)
- Jeffrey Vedanayagam
- Developmental Biology Program, Sloan Kettering Institute, New York, NY, USA.
| | - Ching-Jung Lin
- Developmental Biology Program, Sloan Kettering Institute, New York, NY, USA
- Weill Graduate School of Medical Sciences, Weill Cornell Medical College, New York, NY, USA
| | - Eric C Lai
- Developmental Biology Program, Sloan Kettering Institute, New York, NY, USA.
- Weill Graduate School of Medical Sciences, Weill Cornell Medical College, New York, NY, USA.
| |
|
19
|
Kuhn GCS, Heringer P, Dias GB. Structure, Organization, and Evolution of Satellite DNAs: Insights from the Drosophila repleta and D. virilis Species Groups. Progress in Molecular and Subcellular Biology 2021; 60:27-56. [PMID: 34386871 DOI: 10.1007/978-3-030-74889-0_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/25/2022]
Abstract
The fact that satellite DNAs (satDNAs) in eukaryotes are abundant genomic components, can perform functional roles, but can also change rapidly across species while being homogenous within a species, makes them an intriguing and fascinating genomic component to study. It is also becoming clear that satDNAs represent an important piece in genome architecture and that changes in their structure, organization, and abundance can affect the evolution of genomes and species in many ways. Since the discovery of satDNAs more than 50 years ago, species from the Drosophila genus have continuously been used as models to study several aspects of satDNA biology. These studies have been largely concentrated in D. melanogaster and closely related species from the Sophophora subgenus, even though the vast majority of all Drosophila species belong to the Drosophila subgenus. This chapter highlights some studies on the satDNA structure, organization, and evolution in two species groups from the Drosophila subgenus: the repleta and virilis groups. We also discuss and review the classification of other abundant tandem repeats found in these species in the light of the current information available.
Affiliation(s)
- Gustavo C S Kuhn
- Departamento de Genética, Ecologia e Evolução, Universidade Federal de Minas Gerais (UFMG), Belo Horizonte, MG, Brazil.
| | - Pedro Heringer
- Departamento de Genética, Ecologia e Evolução, Universidade Federal de Minas Gerais (UFMG), Belo Horizonte, MG, Brazil
| | - Guilherme Borges Dias
- Department of Genetics and Institute of Bioinformatics, University of Georgia, Athens, GA, USA
| |
|
20
|
Brashear WA, Bredemeyer KR, Murphy WJ. Genomic architecture constrained placental mammal X Chromosome evolution. Genome Res 2021; 31:1353-1365. [PMID: 34301625 PMCID: PMC8327908 DOI: 10.1101/gr.275274.121] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Received: 01/18/2021] [Accepted: 06/22/2021] [Indexed: 01/02/2023]
Abstract
Susumu Ohno proposed that the gene content of the mammalian X Chromosome should remain highly conserved due to dosage compensation. X Chromosome linkage (gene order) conservation is widespread in placental mammals but does not fall within the scope of Ohno's prediction and may be an indirect result of selection on gene content or selection against rearrangements that might disrupt X-Chromosome inactivation (XCI). Previous comparisons between the human and mouse X Chromosome sequences have suggested that although single-copy X Chromosome genes are conserved between species, most ampliconic genes were independently acquired. To better understand the evolutionary and functional constraints on X-linked gene content and linkage conservation in placental mammals, we aligned a new, high-quality, long-read X Chromosome reference assembly from the domestic cat (incorporating 19.3 Mb of targeted BAC clone sequence) to the pig, human, and mouse assemblies. A comprehensive analysis of annotated X-linked orthologs in public databases demonstrated that the majority of ampliconic gene families were present on the ancestral placental X Chromosome. We generated a domestic cat Hi-C contact map from an F1 domestic cat/Asian leopard cat hybrid and demonstrated the formation of the bipartite structure found in primate and rodent inactivated X Chromosomes. Conservation of gene order and recombination patterns is attributable to strong selective constraints on three-dimensional genomic architecture necessary for superloop formation. Species with rearranged X Chromosomes retain the ancestral order and relative spacing of loci critical for superloop formation during XCI, with compensatory inversions evolving to maintain these long-range physical interactions.
Affiliation(s)
- Wesley A Brashear
- Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas 77843, USA; Interdisciplinary Program in Genetics, Texas A&M University, College Station, Texas 77843, USA
| | - Kevin R Bredemeyer
- Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas 77843, USA; Interdisciplinary Program in Genetics, Texas A&M University, College Station, Texas 77843, USA
| | - William J Murphy
- Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas 77843, USA; Interdisciplinary Program in Genetics, Texas A&M University, College Station, Texas 77843, USA
| |
|
21
|
Wei X, Eickbush DG, Speece I, Larracuente AM. Heterochromatin-dependent transcription of satellite DNAs in the Drosophila melanogaster female germline. eLife 2021; 10:e62375. [PMID: 34259629 PMCID: PMC8321551 DOI: 10.7554/elife.62375] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Received: 08/22/2020] [Accepted: 07/08/2021] [Indexed: 12/15/2022] Open
Abstract
Large blocks of tandemly repeated DNAs, known as satellite DNAs (satDNAs), play important roles in heterochromatin formation and chromosome segregation. We know little about how satDNAs are regulated; however, their misregulation is associated with genomic instability and human diseases. We use the Drosophila melanogaster germline as a model to study the regulation of satDNA transcription and chromatin. Here we show that complex satDNAs (>100-bp repeat units) are transcribed into long noncoding RNAs and processed into piRNAs (PIWI interacting RNAs). This satDNA piRNA production depends on the Rhino-Deadlock-Cutoff complex and the transcription factor Moonshiner, a previously described non-canonical pathway that licenses heterochromatin-dependent transcription of dual-strand piRNA clusters. We show that this pathway is important for establishing heterochromatin at satDNAs. Therefore, satDNAs are regulated by piRNAs originating from their own genomic loci. This novel mechanism of satDNA regulation provides insight into the role of piRNA pathways in heterochromatin formation and genome stability.
Affiliation(s)
- Xiaolu Wei
- Department of Biomedical Genetics, University of Rochester Medical Center, Rochester, United States
| | - Danna G Eickbush
- Department of Biology, University of Rochester, Rochester, United States
| | - Iain Speece
- Department of Biology, University of Rochester, Rochester, United States
| | | |
|
22
|
Herbette M, Wei X, Chang CH, Larracuente AM, Loppin B, Dubruille R. Distinct spermiogenic phenotypes underlie sperm elimination in the Segregation Distorter meiotic drive system. PLoS Genet 2021; 17:e1009662. [PMID: 34228705 PMCID: PMC8284685 DOI: 10.1371/journal.pgen.1009662] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Received: 03/22/2021] [Revised: 07/16/2021] [Accepted: 06/10/2021] [Indexed: 12/28/2022] Open
Abstract
Segregation Distorter (SD) is a male meiotic drive system in Drosophila melanogaster. Males heterozygous for a selfish SD chromosome rarely transmit the homologous SD+ chromosome. It is well established that distortion results from an interaction between Sd, the primary distorting locus on the SD chromosome and its target, a satellite DNA called Rsp, on the SD+ chromosome. However, the molecular and cellular mechanisms leading to post-meiotic SD+ sperm elimination remain unclear. Here we show that SD/SD+ males of different genotypes but with similarly strong degrees of distortion have distinct spermiogenic phenotypes. In some genotypes, SD+ spermatids fail to fully incorporate protamines after the removal of histones, and degenerate during the individualization stage of spermiogenesis. In contrast, in other SD/SD+ genotypes, protamine incorporation appears less disturbed, yet spermatid nuclei are abnormally compacted, and mature sperm nuclei are eventually released in the seminal vesicle. Our analyses of different SD+ chromosomes suggest that the severity of the spermiogenic defects associates with the copy number of the Rsp satellite. We propose that when Rsp copy number is very high (> 2000), spermatid nuclear compaction defects reach a threshold that triggers a checkpoint controlling sperm chromatin quality to eliminate abnormal spermatids during individualization.
Affiliation(s)
- Marion Herbette
- Laboratoire de Biologie et Modélisation de la Cellule, CNRS UMR 5239, École Normale Supérieure de Lyon, University of Lyon, Lyon, France
| | - Xiaolu Wei
- University of Rochester Medical Center, Department of Biomedical Genetics, Rochester, New York, United States of America
| | - Ching-Ho Chang
- University of Rochester Department of Biology, Rochester, New York, United States of America
| | - Amanda M. Larracuente
- University of Rochester Department of Biology, Rochester, New York, United States of America
| | - Benjamin Loppin
- Laboratoire de Biologie et Modélisation de la Cellule, CNRS UMR 5239, École Normale Supérieure de Lyon, University of Lyon, Lyon, France
| | - Raphaëlle Dubruille
- Laboratoire de Biologie et Modélisation de la Cellule, CNRS UMR 5239, École Normale Supérieure de Lyon, University of Lyon, Lyon, France
| |
|
23
|
A de novo transcriptional atlas in Danaus plexippus reveals variability in dosage compensation across tissues. Commun Biol 2021; 4:791. [PMID: 34172835 PMCID: PMC8233437 DOI: 10.1038/s42003-021-02335-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Received: 12/11/2020] [Accepted: 06/09/2021] [Indexed: 02/06/2023] Open
Abstract
A detailed knowledge of gene function in the monarch butterfly is still lacking. Here we generate a genome assembly from a Mexican nonmigratory population and use RNA-seq data from 14 biological samples for gene annotation and to construct an atlas portraying the breadth of gene expression during most of the monarch life cycle. Two thirds of the genes show expression changes, with long noncoding RNAs being particularly finely regulated during adulthood, and male-biased expression being four times more common than female-biased. The two portions of the monarch heterochromosome Z, one ancestral to the Lepidoptera and the other resulting from a chromosomal fusion, display distinct association with sex-biased expression, reflecting sample-dependent incompleteness or absence of dosage compensation in the ancestral but not the novel portion of the Z. This study presents extended genomic and transcriptomic resources that will facilitate a better understanding of the monarch's adaptation to a changing environment.
|
24
|
Chen P, Kotov AA, Godneeva BK, Bazylev SS, Olenina LV, Aravin AA. piRNA-mediated gene regulation and adaptation to sex-specific transposon expression in D. melanogaster male germline. Genes Dev 2021; 35:914-935. [PMID: 33985970 PMCID: PMC8168559 DOI: 10.1101/gad.345041.120] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Received: 09/21/2020] [Accepted: 04/08/2021] [Indexed: 12/19/2022]
Abstract
Small noncoding piRNAs act as sequence-specific guides to repress complementary targets in Metazoa. Prior studies in Drosophila ovaries have demonstrated the function of the piRNA pathway in transposon silencing and therefore genome defense. However, the ability of the piRNA program to respond to different transposon landscapes and the role of piRNAs in regulating host gene expression remain poorly understood. Here, we comprehensively analyzed piRNA expression and defined the repertoire of their targets in Drosophila melanogaster testes. Comparison of piRNA programs between sexes revealed sexual dimorphism in piRNA programs that parallel sex-specific transposon expression. Using a novel bioinformatic pipeline, we identified new piRNA clusters and established complex satellites as dual-strand piRNA clusters. While sharing most piRNA clusters, the two sexes employ them differentially to combat the sex-specific transposon landscape. We found two piRNA clusters that produce piRNAs antisense to four host genes in testis, including CG12717/pirate, a SUMO protease gene. piRNAs encoded on the Y chromosome silence pirate, but not its paralog, to exert sex- and paralog-specific gene regulation. Interestingly, pirate is targeted by endogenous siRNAs in a sibling species, Drosophila mauritiana, suggesting distinct but related silencing strategies invented in recent evolution to regulate a conserved protein-coding gene.
Affiliation(s)
- Peiwei Chen
- California Institute of Technology, Division of Biology and Biological Engineering, Pasadena, California 91125, USA
| | - Alexei A Kotov
- Institute of Molecular Genetics of National Research Center "Kurchatov Institute," Moscow 123182, Russia
| | - Baira K Godneeva
- California Institute of Technology, Division of Biology and Biological Engineering, Pasadena, California 91125, USA
| | - Sergei S Bazylev
- Institute of Molecular Genetics of National Research Center "Kurchatov Institute," Moscow 123182, Russia
| | - Ludmila V Olenina
- Institute of Molecular Genetics of National Research Center "Kurchatov Institute," Moscow 123182, Russia
| | - Alexei A Aravin
- California Institute of Technology, Division of Biology and Biological Engineering, Pasadena, California 91125, USA
| |
|
25
|
Chakraborty M, Chang CH, Khost DE, Vedanayagam J, Adrion JR, Liao Y, Montooth KL, Meiklejohn CD, Larracuente AM, Emerson JJ. Evolution of genome structure in the Drosophila simulans species complex. Genome Res 2021; 31:380-396. [PMID: 33563718 PMCID: PMC7919458 DOI: 10.1101/gr.263442.120] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Received: 03/12/2020] [Accepted: 12/28/2020] [Indexed: 12/25/2022]
Abstract
The rapid evolution of repetitive DNA sequences, including satellite DNA, tandem duplications, and transposable elements, underlies phenotypic evolution and contributes to hybrid incompatibilities between species. However, repetitive genomic regions are fragmented and misassembled in most contemporary genome assemblies. We generated highly contiguous de novo reference genomes for the Drosophila simulans species complex (D. simulans, D. mauritiana, and D. sechellia), which speciated ∼250,000 yr ago. Our assemblies are comparable in contiguity and accuracy to the current D. melanogaster genome, allowing us to directly compare repetitive sequences between these four species. We find that at least 15% of the D. simulans complex species genomes fail to align uniquely to D. melanogaster owing to structural divergence, twice the number of single-nucleotide substitutions. We also find rapid turnover of satellite DNA and extensive structural divergence in heterochromatic regions, whereas the euchromatic gene content is mostly conserved. Despite the overall preservation of gene synteny, euchromatin in each species has been shaped by clade- and species-specific inversions, transposable elements, expansions and contractions of satellite and tRNA tandem arrays, and gene duplications. We also find rapid divergence among Y-linked genes, including copy number variation and recent gene duplications from autosomes. Our assemblies provide a valuable resource for studying genome evolution and its consequences for phenotypic evolution in these genetic model species.
Affiliation(s)
- Mahul Chakraborty
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California 92697, USA
| | - Ching-Ho Chang
- Department of Biology, University of Rochester, Rochester, New York 14627, USA
| | - Danielle E Khost
- Department of Biology, University of Rochester, Rochester, New York 14627, USA
- FAS Informatics and Scientific Applications, Harvard University, Cambridge, Massachusetts 02138, USA
| | - Jeffrey Vedanayagam
- Department of Developmental Biology, Memorial Sloan-Kettering Cancer Center, New York, New York 10065, USA
| | - Jeffrey R Adrion
- Institute of Ecology and Evolution, University of Oregon, Eugene, Oregon 97403, USA
| | - Yi Liao
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California 92697, USA
| | - Kristi L Montooth
- School of Biological Sciences, University of Nebraska-Lincoln, Lincoln, Nebraska 68502, USA
| | - Colin D Meiklejohn
- School of Biological Sciences, University of Nebraska-Lincoln, Lincoln, Nebraska 68502, USA
| | | | - J J Emerson
- Department of Ecology and Evolutionary Biology, University of California Irvine, Irvine, California 92697, USA
| |
|
26
|
dos Santos RZ, Calegari RM, Silva DMZDA, Ruiz-Ruano FJ, Melo S, Oliveira C, Foresti F, Uliano-Silva M, Porto-Foresti F, Utsunomia R. A Long-Term Conserved Satellite DNA That Remains Unexpanded in Several Genomes of Characiformes Fish Is Actively Transcribed. Genome Biol Evol 2021; 13:evab002. [PMID: 33502491 PMCID: PMC8210747 DOI: 10.1093/gbe/evab002] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Accepted: 01/03/2021] [Indexed: 12/12/2022] Open
Abstract
Eukaryotic genomes contain large amounts of repetitive DNA sequences, such as tandemly repeated satellite DNAs (satDNAs). These sequences are highly dynamic and tend to be genus- or species-specific due to their particular evolutionary pathways, although there are a few unusual cases of satDNAs conserved over long periods of time. Here, we used multiple approaches to reveal that a satDNA named CharSat01-52 originated in the last common ancestor of Characoidei fish, a superfamily within the Characiformes order, ∼140-78 Ma, whereas its nucleotide composition has remained considerably conserved in several taxa. We show that 14 distantly related species within Characoidei share the presence of this satDNA, which is highly amplified and clustered in subtelomeric regions in a single species (Characidium gomesi), while remaining organized as small clusters in all the other species. Defying predictions of the molecular drive of satellite evolution, CharSat01-52 shows similar values of intra- and interspecific divergence. Although we did not provide evidence for a specific functional role of CharSat01-52, its transcriptional activity was demonstrated in different species. In addition, we identified short tandem arrays of CharSat01-52 embedded within single-molecule real-time long reads of Astyanax paranae (536 bp-3.1 kb) and A. mexicanus (501 bp-3.9 kb). Such arrays consisted of head-to-tail repeats and could be found interspersed with other sequences, inverted sequences, or neighbored by other satellites. Our results provide a detailed characterization of an old and conserved satDNA, challenging general predictions of satDNA evolution.
Affiliation(s)
- Rodrigo Zeni dos Santos
- Departamento de Ciências Biológicas, Faculdade de Ciências, Universidade Estadual Paulista, UNESP, Campus de Bauru, Bauru, Sao Paulo, Brazil
| | - Rodrigo Milan Calegari
- Departamento de Ciências Biológicas, Faculdade de Ciências, Universidade Estadual Paulista, UNESP, Campus de Bauru, Bauru, Sao Paulo, Brazil
| | | | - Francisco J Ruiz-Ruano
- Department of Organismal Biology – Systematic Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Silvana Melo
- Departamento de Biologia Estrutural e Funcional, Instituto de Biociências de Botucatu, Universidade Estadual Paulista, UNESP, Botucatu, Sao Paulo, Brazil
| | - Claudio Oliveira
- Departamento de Biologia Estrutural e Funcional, Instituto de Biociências de Botucatu, Universidade Estadual Paulista, UNESP, Botucatu, Sao Paulo, Brazil
| | - Fausto Foresti
- Departamento de Biologia Estrutural e Funcional, Instituto de Biociências de Botucatu, Universidade Estadual Paulista, UNESP, Botucatu, Sao Paulo, Brazil
| | | | - Fábio Porto-Foresti
- Departamento de Ciências Biológicas, Faculdade de Ciências, Universidade Estadual Paulista, UNESP, Campus de Bauru, Bauru, Sao Paulo, Brazil
| | - Ricardo Utsunomia
- Departamento de Ciências Biológicas, Faculdade de Ciências, Universidade Estadual Paulista, UNESP, Campus de Bauru, Bauru, Sao Paulo, Brazil
- Departamento de Genética, Instituto de Ciências Biológicas e da Saúde, ICBS, Universidade Federal Rural do Rio de Janeiro, Seropédica, Rio de Janeiro, Brazil
| |
|
27
|
Greif G, Rodriguez M, Bontempi I, Robello C, Alvarez-Valin F. Different kinetoplast degradation patterns in American Trypanosoma vivax strains: Multiple independent origins or fast evolution? Genomics 2021; 113:843-853. [PMID: 33418079 DOI: 10.1016/j.ygeno.2020.12.037] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 07/03/2020] [Revised: 12/05/2020] [Accepted: 12/28/2020] [Indexed: 01/15/2023]
Abstract
We analyzed the kinetoplast (mitochondrial genome) of Trypanosoma vivax strains from America and Africa to determine their precise architecture and to understand their adaptive response to mechanical transmission. The use of long-read based assemblies that retain individuality of tandem repeats, without erasing inter-copy variability, allowed us to investigate the evolutionary dynamics of repetitive kinetoplast-DNA. This analysis revealed that repeat elements located in edges of repeat clusters are less active in terms of renewal, whereas internal copies appear to undergo a permanent process of birth-and-death. Comparing different American strains with the African Y486 strain, we found that in the former, protein coding genes from the maxicircle contain several function disrupting mutations that with very few exceptions are present in one or the other American strain but not in both, suggesting the absence of common ancestry for most of the genomic changes that led to their loss of oxidative phosphorylation capacity. Analysis of another component of kinetoplast, the minicircles, revealed great loss of diversity, and loss of their encoded guideRNAs. Both groups of American strains retain minimal sets required to edit the still functional A6-ATPase and RPS12 genes. The extensive maxi- and minicircle divergence suggests a history of multiple introduction events in America of strains that probably started to degrade their kinetoplast in Africa. The notion that kinetoplast degradation began after incursion in America would imply a pace of accumulation of genetic changes considerably faster than other trypanosomatids.
Affiliation(s)
- Gonzalo Greif
- Laboratorio de Interacciones Hospedero-Patógeno/Unidad de Biología Molecular, Institut Pasteur de Montevideo, Montevideo, Uruguay.
| | - Matias Rodriguez
- Sección Biomatemática-Laboratorio de Genómica Evolutiva, Facultad de Ciencias, Universidad de la República Uruguay, Montevideo, Uruguay; Institute of Bioinformatics, University of Münster, Germany
| | - Ivan Bontempi
- Laboratorio de Tecnología Inmunológica, Facultad de Bioquímica y Ciencias Biológicas, Universidad Nacional del Litoral, Santa Fe, Argentina
| | - Carlos Robello
- Laboratorio de Interacciones Hospedero-Patógeno/Unidad de Biología Molecular, Institut Pasteur de Montevideo, Montevideo, Uruguay; Departamento de Bioquímica, Facultad de Medicina, Universidad de la República Uruguay, Montevideo, Uruguay
| | - Fernando Alvarez-Valin
- Sección Biomatemática-Laboratorio de Genómica Evolutiva, Facultad de Ciencias, Universidad de la República Uruguay, Montevideo, Uruguay.
| |
|
28
|
Lauria Sneideman MP, Meller VH. Drosophila Satellite Repeats at the Intersection of Chromatin, Gene Regulation and Evolution. Prog Mol Subcell Biol 2021; 60:1-26. [PMID: 34386870 DOI: 10.1007/978-3-030-74889-0_1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 01/24/2023]
Abstract
Satellite repeats make up a large fraction of the genomes of many higher eukaryotes. Until recently these sequences were viewed as molecular parasites with few functions. Drosophila melanogaster and related species have a wealth of diverse satellite repeats. Comparative studies of Drosophilids have been instrumental in understanding how these rapidly evolving sequences change and move. Remarkably, satellite repeats have been found to modulate gene expression and mediate genetic conflicts between chromosomes and between closely related fly species. This suggests that satellites play a key role in speciation. We have taken advantage of the depth of research on satellite repeats in flies to review the known functions of these sequences and consider their central role in evolution and gene expression.
Affiliation(s)
| | - Victoria H Meller
- Department of Biological Sciences, Wayne State University, Detroit, MI, USA.
| |
|
29
|
Palacios-Gimenez OM, Koelman J, Palmada-Flores M, Bradford TM, Jones KK, Cooper SJB, Kawakami T, Suh A. Comparative analysis of morabine grasshopper genomes reveals highly abundant transposable elements and rapidly proliferating satellite DNA repeats. BMC Biol 2020; 18:199. [PMID: 33349252 PMCID: PMC7754599 DOI: 10.1186/s12915-020-00925-x] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Received: 08/22/2020] [Accepted: 11/10/2020] [Indexed: 12/17/2022]
Abstract
BACKGROUND Repetitive DNA sequences, including transposable elements (TEs) and tandemly repeated satellite DNA (satDNAs), collectively called the "repeatome", are found in high proportion in organisms across the Tree of Life. Grasshoppers have large genomes, averaging 9 Gb, that contain a high proportion of repetitive DNA, which has hampered progress in assembling reference genomes. Here we combined linked-read genomics with transcriptomics to assemble, characterize, and compare the structure of repetitive DNA sequences in four chromosomal races of the morabine grasshopper Vandiemenella viatica species complex and determine their contribution to genome evolution. RESULTS We obtained linked-read genome assemblies of 2.73-3.27 Gb from estimated genome sizes of 4.26-5.07 Gb DNA per haploid genome of the four chromosomal races of V. viatica. These constitute the third largest insect genomes assembled so far. Combining complementary annotation tools and manual curation, we found a large diversity of TEs and satDNAs, constituting 66 to 75% per genome assembly. A comparison of sequence divergence within the TE classes revealed massive accumulation of recent TEs in all four races (314-463 Mb per assembly), indicating that their large genome sizes are likely due to similar rates of TE accumulation. Transcriptome sequencing showed more biased TE expression in reproductive tissues than somatic tissues, implying permissive transcription in gametogenesis. Out of 129 satDNA families, 102 satDNA families were shared among the four chromosomal races, which likely represent a diversity of satDNA families in the ancestor of the V. viatica chromosomal races. Notably, 50 of these shared satDNA families underwent differential proliferation since the recent diversification of the V. viatica species complex. CONCLUSION This in-depth annotation of the repeatome in morabine grasshoppers provided new insights into the genome evolution of Orthoptera. 
Our TE analysis revealed a massive recent accumulation of TEs equivalent to the size of entire Drosophila genomes, which likely explains the large genome sizes in grasshoppers. Despite an overall high similarity of the TE and satDNA diversity between races, the patterns of TE expression and satDNA proliferation suggest rapid evolution of grasshopper genomes on recent timescales.
Affiliation(s)
- Octavio M Palacios-Gimenez
- Department of Ecology and Genetics - Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, SE-752 36, Uppsala, Sweden.
- Department of Organismal Biology - Systematic Biology, Evolutionary Biology Centre, Uppsala University, SE-752 36, Uppsala, Sweden.
| | - Julia Koelman
- Department of Ecology and Genetics - Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, SE-752 36, Uppsala, Sweden
| | - Marc Palmada-Flores
- Department of Ecology and Genetics - Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, SE-752 36, Uppsala, Sweden
| | - Tessa M Bradford
- Evolutionary Biology Unit, South Australian Museum, Adelaide, SA, 5000, Australia
- School of Biological Sciences and Australian Centre for Evolutionary Biology and Biodiversity, The University of Adelaide, Adelaide, SA, 5005, Australia
| | - Karl K Jones
- Evolutionary Biology Unit, South Australian Museum, Adelaide, SA, 5000, Australia
| | - Steven J B Cooper
- Evolutionary Biology Unit, South Australian Museum, Adelaide, SA, 5000, Australia
- School of Biological Sciences and Australian Centre for Evolutionary Biology and Biodiversity, The University of Adelaide, Adelaide, SA, 5005, Australia
| | - Takeshi Kawakami
- Department of Ecology and Genetics - Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, SE-752 36, Uppsala, Sweden.
- Embark Veterinary, Inc., Boston, MA, USA.
| | - Alexander Suh
- Department of Ecology and Genetics - Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, SE-752 36, Uppsala, Sweden.
- Department of Organismal Biology - Systematic Biology, Evolutionary Biology Centre, Uppsala University, SE-752 36, Uppsala, Sweden.
- School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich, NR4 7TU, UK.
| |
|
30
|
de Lima LG, Hanlon SL, Gerton JL. Origins and Evolutionary Patterns of the 1.688 Satellite DNA Family in Drosophila Phylogeny. G3 (Bethesda) 2020; 10:4129-4146. [PMID: 32934018 PMCID: PMC7642928 DOI: 10.1534/g3.120.401727] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Received: 06/06/2020] [Accepted: 09/09/2020] [Indexed: 12/11/2022]
Abstract
Satellite DNAs (satDNAs) are a ubiquitous feature of eukaryotic genomes and are usually the major components of constitutive heterochromatin. The 1.688 satDNA, also known as the 359 bp satellite, is one of the most abundant repetitive sequences in Drosophila melanogaster and has been linked to several different biological functions. We investigated the presence and evolution of the 1.688 satDNA in 16 Drosophila genomes. We find that the 1.688 satDNA family is much more ancient than previously appreciated, being shared among part of the melanogaster group that diverged from a common ancestor ∼27 Mya. We found that the 1.688 satDNA family has two major subfamilies spread throughout Drosophila phylogeny (∼360 bp and ∼190 bp). Phylogenetic analysis of ∼10,000 repeats extracted from 14 of the species revealed that the 1.688 satDNA family is present within heterochromatin and euchromatin. A high number of euchromatic repeats are gene proximal, suggesting the potential for local gene regulation. Notably, heterochromatic copies display concerted evolution and a species-specific pattern, whereas euchromatic repeats display a more typical evolutionary pattern, suggesting that chromatin domains may influence the evolution of these sequences. Overall, our data indicate the 1.688 satDNA as the most perduring satDNA family described in Drosophila phylogeny to date. Our study provides a strong foundation for future work on the functional roles of 1.688 satDNA across many Drosophila species.
Affiliation(s)
| | - Stacey L Hanlon
- Stowers Institute for Medical Research, Kansas City, Missouri 64110
| | | |
|
31
|
Lin ZJ, Wang X, Wang J, Tan Y, Tang X, Werren JH, Zhang D, Wang X. Comparative analysis reveals the expansion of mitochondrial DNA control region containing unusually high G-C tandem repeat arrays in Nasonia vitripennis. Int J Biol Macromol 2020; 166:1246-1257. [PMID: 33159940 DOI: 10.1016/j.ijbiomac.2020.11.007] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Received: 09/27/2020] [Revised: 11/01/2020] [Accepted: 11/02/2020] [Indexed: 11/25/2022]
Abstract
Insect mitochondrial DNA (mtDNA) ranges from 14 to 19 kbp, and the size difference is attributed to the AT-rich control region. Jewel wasps have a parasitoid lifestyle, which may affect mitochondria function and evolution. We sequenced, assembled, and annotated mitochondrial genomes in Nasonia and outgroup species. Gene composition and order are conserved within Nasonia, but they differ from other parasitoids by two large inversion events that were not reported before. We observed a much higher substitution rate relative to the nuclear genome and mitochondrial introgression between N. giraulti and N. oneida, which is consistent with previous studies. Most strikingly, N. vitripennis mtDNA has an extremely long control region (7665 bp), containing twenty-nine 217 bp tandem repeats and can fold into a super-cruciform structure. In contrast to tandem repeats commonly found in other mitochondria, these high-copy repeats are highly conserved (98.7% sequence identity), much longer in length (approximately 8 Kb), extremely GC-rich (50.7%), and CpG-rich (percent CpG 19.4% vs. 1.1% in coding region), resulting in a 23 kbp mtDNA beyond the typical size range in insects. These N. vitripennis-specific mitochondrial repeats are not related to any known sequences in insect mitochondria. Their evolutionary origin and functional consequences warrant further investigations.
Affiliation(s)
- Zi Jie Lin
- Department of Chemistry, Columbus State University, Columbus, GA 31909, United States of America
| | - Xiaozhu Wang
- Department of Pathobiology, College of Veterinary Medicine, Auburn University, Auburn, AL 36849, United States of America
| | - Jinbin Wang
- Institute of Biotechnology Research, Shanghai Academy of Agricultural Sciences, Shanghai 201106, China
| | - Yongjun Tan
- Department of Biology, College of Arts & Sciences, Saint Louis University, St. Louis, MO 63103, United States of America
| | - Xueming Tang
- Institute of Biotechnology Research, Shanghai Academy of Agricultural Sciences, Shanghai 201106, China
| | - John H Werren
- Department of Biology, University of Rochester, Rochester, NY 14627, United States of America
| | - Dapeng Zhang
- Department of Biology, College of Arts & Sciences, Saint Louis University, St. Louis, MO 63103, United States of America
| | - Xu Wang
- Department of Pathobiology, College of Veterinary Medicine, Auburn University, Auburn, AL 36849, United States of America; HudsonAlpha Institute for Biotechnology, Huntsville, AL 35806, United States of America; Alabama Agricultural Experiment Station, Auburn University, Auburn, AL 36849, United States of America; Department of Entomology and Plant Pathology, Auburn University, Auburn, AL 36849, United States of America.
| |
|
32
|
Yang J, Yuan B, Wu Y, Li M, Li J, Xu D, Gao ZH, Ma G, Zhou Y, Zuo Y, Wang J, Guo Y. The wide distribution and horizontal transfers of beta satellite DNA in eukaryotes. Genomics 2020; 112:5295-5304. [PMID: 33065245 DOI: 10.1016/j.ygeno.2020.10.006] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Received: 05/23/2020] [Revised: 09/08/2020] [Accepted: 10/10/2020] [Indexed: 01/12/2023]
Abstract
Beta satellite DNAs (satDNAs), also known as Sau3A sequences, are repetitive DNA sequences reported in human and primate genomes. It was previously thought that beta satDNAs originated in Old World monkeys and expanded rapidly in great apes. In this study, we searched 7821 genome assemblies of 3767 eukaryotic species and found that beta satDNAs are widely distributed across eukaryotes. The four major branches of eukaryotes (animals, fungi, plants and Harosa/SAR) all have multiple clades containing beta satDNAs. These results were also confirmed by searching whole genome sequencing data (SRA) and by PCR assay. Beta satDNA sequences were found in all the primate clades, as well as in Dermoptera and Scandentia, indicating that the beta satDNAs in primates might have originated in the common ancestor of Primatomorpha or Euarchonta. In contrast, the wide but patchy distribution of beta satDNAs across eukaryotes presents a typical scenario of multiple horizontal transfers.
Affiliation(s)
- Jiawen Yang
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Medical Research Center, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, China; State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, China.
| | - Bin Yuan
- Institute of Plant Protection and Soil Fertilizer, Hubei Academy of Agricultural Sciences, Wuhan, China
| | - Yu Wu
- Department of Parasitology, Zhongshan School of Medicine, Sun Yat-sen University, China.
| | - Meiyu Li
- Key Laboratory of Tropical Disease Control, Sun Yat-Sen University; Ministry of Education Experimental Teaching Center, Zhongshan School of Medicine, Sun Yat-sen University, China.
| | - Jian Li
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Medical Research Center, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, China.
| | - Donglin Xu
- Guangzhou Academy of Agricultural Sciences, Guangzhou, China
| | - Zeng-Hong Gao
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Medical Research Center, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, China.
| | - Guangwei Ma
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Medical Research Center, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, China
| | - Yiting Zhou
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Medical Research Center, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, China.
| | - Yachao Zuo
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Medical Research Center, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, China; State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, China.
| | - Jin Wang
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Medical Research Center, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, China.
| | - Yabin Guo
- Guangdong Provincial Key Laboratory of Malignant Tumor Epigenetics and Gene Regulation, Medical Research Center, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, China.
| |
|
33
|
Vojvoda Zeljko T, Pavlek M, Meštrović N, Plohl M. Satellite DNA-like repeats are dispersed throughout the genome of the Pacific oyster Crassostrea gigas carried by Helentron non-autonomous mobile elements. Sci Rep 2020; 10:15107. [PMID: 32934255 PMCID: PMC7492417 DOI: 10.1038/s41598-020-71886-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Received: 06/16/2020] [Accepted: 08/11/2020] [Indexed: 01/31/2023]
Abstract
Satellite DNAs (satDNAs) are long arrays of tandem repeats that are typically located in heterochromatin and span the centromeres of eukaryotic chromosomes. Despite the wealth of knowledge about satDNAs, little is known about a fraction of short, satDNA-like arrays dispersed throughout the genome. Our survey of the sequenced genome of the Pacific oyster Crassostrea gigas revealed a genome assembly replete with satDNA-like tandem repeats. We focused on the most abundant arrays, grouped according to sequence similarity into 13 clusters, and explored their flanking sequences. Structural analysis showed that arrays of all 13 clusters represent central repeats of 11 non-autonomous elements named Cg_HINE, which are classified into the Helentron superfamily of DNA transposons. Each of the described elements is formed by a unique combination of flanking sequences and satDNA-like central repeats, coming from one or, exceptionally, two clusters in consecutive order. While some of the detected Cg_HINE elements are related according to sequence similarities in flanking and repetitive modules, others evidently arose in independent events. In addition, some of the Cg_HINE central repeats are related to the classical C. gigas satDNA, interconnecting mobile elements and satDNAs. The genome-wide distribution of Cg_HINE implies that non-autonomous Helentrons are a dynamic system prone to efficiently propagating tandem repeats in the C. gigas genome.
Affiliation(s)
- Tanja Vojvoda Zeljko
- Division of Molecular Biology, Ruđer Bošković Institute, Bijenička 54, 10 000, Zagreb, Croatia
| | - Martina Pavlek
- Division of Molecular Biology, Ruđer Bošković Institute, Bijenička 54, 10 000, Zagreb, Croatia
| | - Nevenka Meštrović
- Division of Molecular Biology, Ruđer Bošković Institute, Bijenička 54, 10 000, Zagreb, Croatia
| | - Miroslav Plohl
- Division of Molecular Biology, Ruđer Bošković Institute, Bijenička 54, 10 000, Zagreb, Croatia.
| |
|
34
|
Heitkam T, Weber B, Walter I, Liedtke S, Ost C, Schmidt T. Satellite DNA landscapes after allotetraploidization of quinoa (Chenopodium quinoa) reveal unique A and B subgenomes. Plant J 2020; 103:32-52. [PMID: 31981259 DOI: 10.1111/tpj.14705] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Received: 10/04/2019] [Revised: 12/10/2019] [Accepted: 01/17/2020] [Indexed: 06/10/2023]
Abstract
If two related plant species hybridize, their genomes may be combined and duplicated within a single nucleus, thereby forming an allotetraploid. How the emerging plant balances two co-evolved genomes is still a matter of ongoing research. Here, we focus on satellite DNA (satDNA), the fastest turn-over sequence class in eukaryotes, aiming to trace its emergence, amplification, and loss during plant speciation and allopolyploidization. As a model, we used Chenopodium quinoa Willd. (quinoa), an allopolyploid crop with 2n = 4x = 36 chromosomes. Quinoa originated by hybridization of an unknown female American Chenopodium diploid (AA genome) with an unknown male Old World diploid species (BB genome), dating back 3.3-6.3 million years. Applying short read clustering to quinoa (AABB), C. pallidicaule (AA), and C. suecicum (BB) whole genome shotgun sequences, we classified their repetitive fractions, and identified and characterized seven satDNA families, together with the 5S rDNA model repeat. We show unequal satDNA amplification (two families) and exclusive occurrence (four families) in the AA and BB diploids by read mapping as well as Southern, genomic, and fluorescent in situ hybridization. Whereas the satDNA distributions support C. suecicum as possible parental species, we were able to exclude C. pallidicaule as progenitor due to unique repeat profiles. Using quinoa long reads and scaffolds, we detected only limited evidence of intergenomic homogenization of satDNA after allopolyploidization, but were able to exclude dispersal of 5S rRNA genes between subgenomes. Our results exemplify the complex route of tandem repeat evolution through Chenopodium speciation and allopolyploidization, and may provide sequence targets for the identification of quinoa's progenitors.
Affiliation(s)
- Tony Heitkam
- Institute of Botany, Technische Universität Dresden, 01069, Dresden, Germany
| | - Beatrice Weber
- Institute of Botany, Technische Universität Dresden, 01069, Dresden, Germany
| | - Ines Walter
- Institute of Botany, Technische Universität Dresden, 01069, Dresden, Germany
| | - Susan Liedtke
- Institute of Botany, Technische Universität Dresden, 01069, Dresden, Germany
| | - Charlotte Ost
- Institute of Botany, Technische Universität Dresden, 01069, Dresden, Germany
- Institute of Biology, Martin-Luther-Universität Halle-Wittenberg, 06120, Halle (Saale), Germany
| | - Thomas Schmidt
- Institute of Botany, Technische Universität Dresden, 01069, Dresden, Germany
| |
|
35
|
Kolesnikova TD, Kolodyazhnaya AV, Pokholkova GV, Schubert V, Dovgan VV, Romanenko SA, Prokopov DY, Zhimulev IF. Effects of Mutations in the Drosophila melanogaster Rif1 Gene on the Replication and Underreplication of Pericentromeric Heterochromatin in Salivary Gland Polytene Chromosomes. Cells 2020; 9:cells9061501. [PMID: 32575592 PMCID: PMC7349278 DOI: 10.3390/cells9061501] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Received: 05/28/2020] [Revised: 06/15/2020] [Accepted: 06/16/2020] [Indexed: 01/09/2023]
Abstract
In Drosophila salivary gland polytene chromosomes, a substantial portion of heterochromatin is underreplicated. The combination of mutations SuUR[ES] and Su(var)3-9[06] results in the polytenization of a substantial fraction of unique and moderately repeated sequences but has almost no effect on satellite DNA replication. The Rap1 interacting factor 1 (Rif1) protein is a conserved regulator of replication timing, and in Drosophila, it affects underreplication in polytene chromosomes. We compared the morphology of pericentromeric regions and labeling patterns of in situ hybridization of heterochromatin-specific DNA probes between wild-type salivary gland polytene chromosomes and the chromosomes of Rif1 mutants and SuUR Su(var)3-9[06] double mutants. We show that, despite general similarities, heterochromatin zones exist that are polytenized only in the Rif1 mutants, and that there are zones that are under specific control of Su(var)3-9. In the Rif1 mutants, we found additional polytenization of the largest blocks of satellite DNA (in particular, satellite 1.688 of chromosome X and simple satellites in chromosomes X and 4) as well as partial polytenization of chromosome Y. Data on pulsed incorporation of 5-ethynyl-2′-deoxyuridine (EdU) into polytene chromosomes indicated that in the Rif1 mutants, just as in the wild type, most of the heterochromatin becomes replicated during the late S phase. Nevertheless, a significantly increased number of heterochromatin replicons was noted. These results suggest that Rif1 regulates the activation probability of heterochromatic origins in the satellite DNA region.
Affiliation(s)
- Tatyana D. Kolesnikova
- Institute of Molecular and Cellular Biology, Siberian Branch of Russian Academy of Sciences, 630090 Novosibirsk, Russia
- Laboratory of Structural, Functional and Comparative Genomics, Novosibirsk State University, 630090 Novosibirsk, Russia
| | - Alexandra V. Kolodyazhnaya
- Institute of Molecular and Cellular Biology, Siberian Branch of Russian Academy of Sciences, 630090 Novosibirsk, Russia
- Department of Natural Sciences, Novosibirsk State University, 630090 Novosibirsk, Russia
| | - Galina V. Pokholkova
- Institute of Molecular and Cellular Biology, Siberian Branch of Russian Academy of Sciences, 630090 Novosibirsk, Russia
| | - Veit Schubert
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, D-06466 Seeland, Germany
| | - Viktoria V. Dovgan
- Institute of Molecular and Cellular Biology, Siberian Branch of Russian Academy of Sciences, 630090 Novosibirsk, Russia
- Department of Natural Sciences, Novosibirsk State University, 630090 Novosibirsk, Russia
| | - Svetlana A. Romanenko
- Institute of Molecular and Cellular Biology, Siberian Branch of Russian Academy of Sciences, 630090 Novosibirsk, Russia
| | - Dmitry Yu. Prokopov
- Institute of Molecular and Cellular Biology, Siberian Branch of Russian Academy of Sciences, 630090 Novosibirsk, Russia
| | - Igor F. Zhimulev
- Institute of Molecular and Cellular Biology, Siberian Branch of Russian Academy of Sciences, 630090 Novosibirsk, Russia
- Laboratory of Structural, Functional and Comparative Genomics, Novosibirsk State University, 630090 Novosibirsk, Russia
| |
|
36
|
Shatskikh AS, Kotov AA, Adashev VE, Bazylev SS, Olenina LV. Functional Significance of Satellite DNAs: Insights From Drosophila. Front Cell Dev Biol 2020; 8:312. [PMID: 32432114 PMCID: PMC7214746 DOI: 10.3389/fcell.2020.00312] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Received: 01/19/2020] [Accepted: 04/08/2020] [Indexed: 12/12/2022]
Abstract
Since their discovery more than 60 years ago, satellite repeats have remained one of the most enigmatic parts of eukaryotic genomes. Being non-coding DNA, satellites were earlier considered to be non-functional “junk,” but recently this concept has been extensively revised. Satellite DNA contributes to the essential processes of formation of crucial chromosome structures, heterochromatin establishment, dosage compensation, reproductive isolation, genome stability and development. Genomic abundance of satellites is under stabilizing selection owing to their role in the maintenance of vital regions of the genome – centromeres, pericentromeric regions, and telomeres. Many satellites are transcribed with the generation of long or small non-coding RNAs. Misregulation of their expression has been found to lead to various defects in the maintenance of genomic architecture, chromosome segregation and gametogenesis. This review summarizes our current knowledge concerning satellite functions and the mechanisms of regulation and evolution of satellites, focusing on recent findings in Drosophila. We discuss here experimental and bioinformatics data obtained in Drosophila in recent years, suggesting relevance of our analysis to a wide range of eukaryotic organisms.
Affiliation(s)
- Aleksei S Shatskikh
- Laboratory of Analysis of Clinical and Model Tumor Pathologies on the Organismal Level, Institute of Molecular Genetics, Russian Academy of Sciences, Moscow, Russia
| | - Alexei A Kotov
- Laboratory of Biochemical Genetics of Animals, Institute of Molecular Genetics, Russian Academy of Sciences, Moscow, Russia
| | - Vladimir E Adashev
- Laboratory of Biochemical Genetics of Animals, Institute of Molecular Genetics, Russian Academy of Sciences, Moscow, Russia
| | - Sergei S Bazylev
- Laboratory of Biochemical Genetics of Animals, Institute of Molecular Genetics, Russian Academy of Sciences, Moscow, Russia
| | - Ludmila V Olenina
- Laboratory of Biochemical Genetics of Animals, Institute of Molecular Genetics, Russian Academy of Sciences, Moscow, Russia
| |
|
37
|
Ellison CE, Cao W. Nanopore sequencing and Hi-C scaffolding provide insight into the evolutionary dynamics of transposable elements and piRNA production in wild strains of Drosophila melanogaster. Nucleic Acids Res 2020; 48:290-303. [PMID: 31754714 PMCID: PMC6943127 DOI: 10.1093/nar/gkz1080] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Received: 07/01/2019] [Revised: 10/29/2019] [Accepted: 11/01/2019] [Indexed: 01/29/2023]
Abstract
Illumina sequencing has allowed for population-level surveys of transposable element (TE) polymorphism via split alignment approaches, which has provided important insight into the population dynamics of TEs. However, such approaches are not able to identify insertions of uncharacterized TEs, nor can they assemble the full sequence of inserted elements. Here, we use nanopore sequencing and Hi-C scaffolding to produce de novo genome assemblies for two wild strains of Drosophila melanogaster from the Drosophila Genetic Reference Panel (DGRP). Ovarian piRNA populations and Illumina split-read TE insertion profiles have been previously produced for both strains. We find that nanopore sequencing with Hi-C scaffolding produces highly contiguous, chromosome-length scaffolds, and we identify hundreds of TE insertions that were missed by Illumina-based methods, including a novel micropia-like element that has recently invaded the DGRP population. We also find hundreds of piRNA-producing loci that are specific to each strain. Some of these loci are created by strain-specific TE insertions, while others appear to be epigenetically controlled. Our results suggest that Illumina approaches reveal only a portion of the repetitive sequence landscape of eukaryotic genomes and that population-level resequencing using long reads is likely to provide novel insight into the evolutionary dynamics of repetitive elements.
Affiliation(s)
- Christopher E Ellison
- Department of Genetics, Human Genetics Institute of New Jersey, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Weihuan Cao
- Department of Genetics, Human Genetics Institute of New Jersey, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| |
|
38
|
Louzada S, Lopes M, Ferreira D, Adega F, Escudeiro A, Gama-Carvalho M, Chaves R. Decoding the Role of Satellite DNA in Genome Architecture and Plasticity-An Evolutionary and Clinical Affair. Genes (Basel) 2020; 11:E72. [PMID: 31936645 PMCID: PMC7017282 DOI: 10.3390/genes11010072] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Received: 12/16/2019] [Revised: 12/29/2019] [Accepted: 01/08/2020] [Indexed: 12/11/2022]
Abstract
Repetitive DNA is a major organizational component of eukaryotic genomes, being intrinsically related with their architecture and evolution. Tandemly repeated satellite DNAs (satDNAs) can be found clustered in specific heterochromatin-rich chromosomal regions, building vital structures like functional centromeres and also dispersed within euchromatin. Interestingly, despite their association to critical chromosomal structures, satDNAs are widely variable among species due to their high turnover rates. This dynamic behavior has been associated with genome plasticity and chromosome rearrangements, leading to the reshaping of genomes. Here we present the current knowledge regarding satDNAs in the light of new genomic technologies, and the challenges in the study of these sequences. Furthermore, we discuss how these sequences, together with other repeats, influence genome architecture, impacting its evolution and association with disease.
Affiliation(s)
- Sandra Louzada
- Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisboa, 1749-016 Lisbon, Portugal
| | - Mariana Lopes
- Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisboa, 1749-016 Lisbon, Portugal
| | - Daniela Ferreira
- Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisboa, 1749-016 Lisbon, Portugal
| | - Filomena Adega
- Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisboa, 1749-016 Lisbon, Portugal
| | - Ana Escudeiro
- Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisboa, 1749-016 Lisbon, Portugal
| | - Margarida Gama-Carvalho
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisboa, 1749-016 Lisbon, Portugal
| | - Raquel Chaves
- Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisboa, 1749-016 Lisbon, Portugal
| |
|
39
|
Vondrak T, Ávila Robledillo L, Novák P, Koblížková A, Neumann P, Macas J. Characterization of repeat arrays in ultra-long nanopore reads reveals frequent origin of satellite DNA from retrotransposon-derived tandem repeats. The Plant Journal: For Cell and Molecular Biology 2020; 101:484-500. [PMID: 31559657 PMCID: PMC7004042 DOI: 10.1111/tpj.14546] [Citation(s) in RCA: 60] [Impact Index Per Article: 12.0] [Received: 07/03/2019] [Revised: 09/09/2019] [Accepted: 09/12/2019] [Indexed: 05/21/2023]
Abstract
Amplification of monomer sequences into long contiguous arrays is the main feature distinguishing satellite DNA from other tandem repeats, yet it is also the main obstacle in its investigation because these arrays are in principle difficult to assemble. Here we explore an alternative, assembly-free approach that utilizes ultra-long Oxford Nanopore reads to infer the length distribution of satellite repeat arrays, their association with other repeats and the prevailing sequence periodicities. Using the satellite DNA-rich legume plant Lathyrus sativus as a model, we demonstrated this approach by analyzing 11 major satellite repeats using a set of nanopore reads ranging from 30 to over 200 kb in length and representing 0.73× genome coverage. We found surprising differences between the analyzed repeats because only two of them were predominantly organized in long arrays typical for satellite DNA. The remaining nine satellites were found to be derived from short tandem arrays located within LTR-retrotransposons that occasionally expanded in length. While the corresponding LTR-retrotransposons were dispersed across the genome, this array expansion occurred mainly in the primary constrictions of the L. sativus chromosomes, which suggests that these genome regions are favourable for satellite DNA accumulation.
Affiliation(s)
- Tihana Vondrak
- Biology Centre, Czech Academy of Sciences, Branišovská 31, České Budějovice, CZ‐37005, Czech Republic
- Faculty of Science, University of South Bohemia, České Budějovice, Czech Republic
| | - Laura Ávila Robledillo
- Biology Centre, Czech Academy of Sciences, Branišovská 31, České Budějovice, CZ‐37005, Czech Republic
- Faculty of Science, University of South Bohemia, České Budějovice, Czech Republic
| | - Petr Novák
- Biology Centre, Czech Academy of Sciences, Branišovská 31, České Budějovice, CZ‐37005, Czech Republic
| | - Andrea Koblížková
- Biology Centre, Czech Academy of Sciences, Branišovská 31, České Budějovice, CZ‐37005, Czech Republic
| | - Pavel Neumann
- Biology Centre, Czech Academy of Sciences, Branišovská 31, České Budějovice, CZ‐37005, Czech Republic
| | - Jiří Macas
- Biology Centre, Czech Academy of Sciences, Branišovská 31, České Budějovice, CZ‐37005, Czech Republic
| |
|
40
|
Silva BSML, Heringer P, Dias GB, Svartman M, Kuhn GCS. De novo identification of satellite DNAs in the sequenced genomes of Drosophila virilis and D. americana using the RepeatExplorer and TAREAN pipelines. PLoS One 2019; 14:e0223466. [PMID: 31856171 PMCID: PMC6922343 DOI: 10.1371/journal.pone.0223466] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Received: 09/19/2019] [Accepted: 11/26/2019] [Indexed: 01/10/2023] Open
Abstract
Satellite DNAs are among the most abundant repetitive DNAs found in eukaryote genomes, where they participate in a variety of biological roles, from being components of important chromosome structures to gene regulation. Experimental methodologies used before the genomic era were insufficient, too laborious and time-consuming to recover the collection of all satDNAs from a genome. Today, the availability of whole sequenced genomes combined with the development of specific bioinformatic tools are expected to foster the identification of virtually all the "satellitome" of a particular species. While whole genome assemblies are important to obtain a global view of genome organization, most of them are incomplete and lack repetitive regions. We applied short-read sequencing and similarity clustering in order to perform a de novo identification of the most abundant satellite families in two Drosophila species from the virilis group: Drosophila virilis and D. americana, using the Tandem Repeat Analyzer (TAREAN) and RepeatExplorer pipelines. These species were chosen because they have been used as models to understand satDNA biology since the early 70's. We combined the computational approach with data from the literature and chromosome mapping to obtain an overview of the major tandem repeat sequences of these species. The fact that all of the abundant tandem repeats (TRs) we detected were previously identified in the literature allowed us to evaluate the efficiency of TAREAN in correctly identifying true satDNAs. Our results indicate that raw sequencing reads can be efficiently used to detect satDNAs, but that abundant tandem repeats present in dispersed arrays or associated with transposable elements are frequent false positives. We demonstrate that TAREAN with its parent method RepeatExplorer may be used as resources to detect tandem repeats associated with transposable elements and also to reveal families of dispersed tandem repeats.
Affiliation(s)
- Bráulio S. M. L. Silva
- Departamento de Genética, Ecologia e Evolução, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brasil
| | - Pedro Heringer
- Departamento de Genética, Ecologia e Evolução, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brasil
| | - Guilherme B. Dias
- Departamento de Genética, Ecologia e Evolução, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brasil
| | - Marta Svartman
- Departamento de Genética, Ecologia e Evolução, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brasil
| | - Gustavo C. S. Kuhn
- Departamento de Genética, Ecologia e Evolução, Universidade Federal de Minas Gerais, Belo Horizonte, Minas Gerais, Brasil
| |
|
41
|
Structural variants exhibit widespread allelic heterogeneity and shape variation in complex traits. Nat Commun 2019; 10:4872. [PMID: 31653862 PMCID: PMC6814777 DOI: 10.1038/s41467-019-12884-1] [Citation(s) in RCA: 104] [Impact Index Per Article: 17.3] [Received: 12/27/2018] [Accepted: 09/25/2019] [Indexed: 12/11/2022] Open
Abstract
It has been hypothesized that individually-rare hidden structural variants (SVs) could account for a significant fraction of variation in complex traits. Here we identified more than 20,000 euchromatic SVs from 14 Drosophila melanogaster genome assemblies, of which ~40% are invisible to high specificity short-read genotyping approaches. SVs are common, with 31.5% of diploid individuals harboring a SV in genes larger than 5kb, and 24% harboring multiple SVs in genes larger than 10kb. SV minor allele frequencies are rarer than amino acid polymorphisms, suggesting that SVs are more deleterious. We show that a number of functionally important genes harbor previously hidden structural variants likely to affect complex phenotypes. Furthermore, SVs are overrepresented in candidate genes associated with quantitative trait loci mapped using the Drosophila Synthetic Population Resource. We conclude that SVs are ubiquitous, frequently constitute a heterogeneous allelic series, and can act as rare alleles of large effect.
|
42
|
Bracewell R, Chatla K, Nalley MJ, Bachtrog D. Dynamic turnover of centromeres drives karyotype evolution in Drosophila. eLife 2019; 8:e49002. [PMID: 31524597 PMCID: PMC6795482 DOI: 10.7554/elife.49002] [Citation(s) in RCA: 53] [Impact Index Per Article: 8.8] [Received: 06/03/2019] [Accepted: 09/12/2019] [Indexed: 12/21/2022] Open
Abstract
Centromeres are the basic unit for chromosome inheritance, but their evolutionary dynamics is poorly understood. We generate high-quality reference genomes for multiple Drosophila obscura group species to reconstruct karyotype evolution. All chromosomes in this lineage were ancestrally telocentric and the creation of metacentric chromosomes in some species was driven by de novo seeding of new centromeres at ancestrally gene-rich regions, independently of chromosomal rearrangements. The emergence of centromeres resulted in a drastic size increase due to repeat accumulation, and dozens of genes previously located in euchromatin are now embedded in pericentromeric heterochromatin. Metacentric chromosomes secondarily became telocentric in the pseudoobscura subgroup through centromere repositioning and a pericentric inversion. The former (peri)centric sequences left behind shrunk dramatically in size after their inactivation, yet contain remnants of their evolutionary past, including increased repeat-content and heterochromatic environment. Centromere movements are accompanied by rapid turnover of the major satellite DNA detected in (peri)centromeric regions.
Affiliation(s)
- Ryan Bracewell
- Department of Integrative Biology, University of California, Berkeley, Berkeley, United States
| | - Kamalakar Chatla
- Department of Integrative Biology, University of California, Berkeley, Berkeley, United States
| | - Matthew J Nalley
- Department of Integrative Biology, University of California, Berkeley, Berkeley, United States
| | - Doris Bachtrog
- Department of Integrative Biology, University of California, Berkeley, Berkeley, United States
| |
|
43
|
Tusso S, Nieuwenhuis BPS, Sedlazeck FJ, Davey JW, Jeffares DC, Wolf JBW. Ancestral Admixture Is the Main Determinant of Global Biodiversity in Fission Yeast. Mol Biol Evol 2019; 36:1975-1989. [PMID: 31225876 PMCID: PMC6736153 DOI: 10.1093/molbev/msz126] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Indexed: 12/12/2022] Open
Abstract
Mutation and recombination are key evolutionary processes governing phenotypic variation and reproductive isolation. We here demonstrate that biodiversity within all globally known strains of Schizosaccharomyces pombe arose through admixture between two divergent ancestral lineages. Initial hybridization was inferred to have occurred ∼20-60 sexual outcrossing generations ago consistent with recent, human-induced migration at the onset of intensified transcontinental trade. Species-wide heritable phenotypic variation was explained near-exclusively by strain-specific arrangements of alternating ancestry components with evidence for transgressive segregation. Reproductive compatibility between strains was likewise predicted by the degree of shared ancestry. To assess the genetic determinants of ancestry block distribution across the genome, we characterized the type, frequency, and position of structural genomic variation using nanopore and single-molecule real-time sequencing. Despite being associated with double-strand break initiation points, over 800 segregating structural variants exerted overall little influence on the introgression landscape or on reproductive compatibility between strains. In contrast, we found strong ancestry disequilibrium consistent with negative epistatic selection shaping genomic ancestry combinations during the course of hybridization. This study provides a detailed, experimentally tractable example that genomes of natural populations are mosaics reflecting different evolutionary histories. Exploiting genome-wide heterogeneity in the history of ancestral recombination and lineage-specific mutations sheds new light on the population history of S. pombe and highlights the importance of hybridization as a creative force in generating biodiversity.
Affiliation(s)
- Sergio Tusso
- Division of Evolutionary Biology, Faculty of Biology, LMU Munich, Planegg-Martinsried, Germany
- Department of Evolutionary Biology, Science for Life Laboratories, Uppsala University, Uppsala, Sweden
| | - Bart P S Nieuwenhuis
- Division of Evolutionary Biology, Faculty of Biology, LMU Munich, Planegg-Martinsried, Germany
| | - Fritz J Sedlazeck
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX
| | - John W Davey
- Bioscience Technology Facility, Department of Biology, University of York, York, United Kingdom
| | - Daniel C Jeffares
- Department of Biology, University of York, York, United Kingdom
- York Biomedical Research Institute (YBRI), University of York, York, United Kingdom
| | - Jochen B W Wolf
- Division of Evolutionary Biology, Faculty of Biology, LMU Munich, Planegg-Martinsried, Germany
- Department of Evolutionary Biology, Science for Life Laboratories, Uppsala University, Uppsala, Sweden
| |
|
44
|
Hartmann M, Umbanhowar J, Sekelsky J. Centromere-Proximal Meiotic Crossovers in Drosophila melanogaster Are Suppressed by Both Highly Repetitive Heterochromatin and Proximity to the Centromere. Genetics 2019; 213:113-125. [PMID: 31345993 PMCID: PMC6727794 DOI: 10.1534/genetics.119.302509] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Received: 05/10/2019] [Accepted: 07/19/2019] [Indexed: 11/18/2022] Open
Abstract
Crossovers are essential in meiosis of most organisms to ensure the proper segregation of chromosomes, but improper placement of crossovers can result in nondisjunction and aneuploidy in progeny. In particular, crossovers near the centromere can cause nondisjunction. Centromere-proximal crossovers are suppressed by what is termed the centromere effect, but the mechanism is unknown. Here, we investigate contributions to centromere-proximal crossover suppression in Drosophila melanogaster. We mapped a large number of centromere-proximal crossovers, and find that crossovers are essentially absent from the highly repetitive (HR)-heterochromatin surrounding the centromere but occur at a low frequency within the less-repetitive (LR)-heterochromatic region and adjacent euchromatin. Previous research suggested that flies that lack the Bloom syndrome helicase (Blm) lose meiotic crossover patterning, including the centromere effect. Mapping of centromere-proximal crossovers in Blm mutants reveals that the suppression within the HR-heterochromatin is intact, but the distance-dependent centromere effect is lost. We conclude that centromere-proximal crossovers are suppressed by two separable mechanisms: an HR-heterochromatin effect that completely suppresses crossovers in the HR-heterochromatin, and the centromere effect, which suppresses crossovers with a dissipating effect with distance from the centromere.
Affiliation(s)
- Michaelyn Hartmann
- Curriculum in Genetics and Molecular Biology, University of North Carolina, Chapel Hill, North Carolina 27599
| | - James Umbanhowar
- Environment, Ecology and Energy Program, University of North Carolina, Chapel Hill, North Carolina 27599
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599
| | - Jeff Sekelsky
- Curriculum in Genetics and Molecular Biology, University of North Carolina, Chapel Hill, North Carolina 27599
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina 27599
- Integrative Program in Biological and Genome Sciences, University of North Carolina, Chapel Hill, North Carolina 27599
| |
|
45
|
Chang CH, Chavan A, Palladino J, Wei X, Martins NMC, Santinello B, Chen CC, Erceg J, Beliveau BJ, Wu CT, Larracuente AM, Mellone BG. Islands of retroelements are major components of Drosophila centromeres. PLoS Biol 2019; 17:e3000241. [PMID: 31086362 PMCID: PMC6516634 DOI: 10.1371/journal.pbio.3000241] [Citation(s) in RCA: 96] [Impact Index Per Article: 16.0] [Received: 10/17/2018] [Accepted: 04/08/2019] [Indexed: 12/24/2022] Open
Abstract
Centromeres are essential chromosomal regions that mediate kinetochore assembly and spindle attachments during cell division. Despite their functional conservation, centromeres are among the most rapidly evolving genomic regions and can shape karyotype evolution and speciation across taxa. Although significant progress has been made in identifying centromere-associated proteins, the highly repetitive centromeres of metazoans have been refractory to DNA sequencing and assembly, leaving large gaps in our understanding of their functional organization and evolution. Here, we identify the sequence composition and organization of the centromeres of Drosophila melanogaster by combining long-read sequencing, chromatin immunoprecipitation for the centromeric histone CENP-A, and high-resolution chromatin fiber imaging. Contrary to previous models that heralded satellite repeats as the major functional components, we demonstrate that functional centromeres form on islands of complex DNA sequences enriched in retroelements that are flanked by large arrays of satellite repeats. Each centromere displays distinct size and arrangement of its DNA elements but is similar in composition overall. We discover that a specific retroelement, G2/Jockey-3, is the most highly enriched sequence in CENP-A chromatin and is the only element shared among all centromeres. G2/Jockey-3 is also associated with CENP-A in the sister species D. simulans, revealing an unexpected conservation despite the reported turnover of centromeric satellite DNA. Our work reveals the DNA sequence identity of the active centromeres of a premier model organism and implicates retroelements as conserved features of centromeric DNA.
Affiliation(s)
- Ching-Ho Chang
- Department of Biology, University of Rochester, Rochester, New York, United States of America
| | - Ankita Chavan
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, United States of America
| | - Jason Palladino
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, United States of America
| | - Xiaolu Wei
- Department of Biomedical Genetics, University of Rochester Medical Center, Rochester, New York, United States of America
| | - Nuno M. C. Martins
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Bryce Santinello
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, United States of America
| | - Chin-Chi Chen
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, United States of America
| | - Jelena Erceg
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Brian J. Beliveau
- Wyss Institute for Biologically Inspired Engineering, Harvard Medical School, Boston, Massachusetts, United States of America
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, United States of America
- Department of Genome Sciences, University of Washington Seattle, Seattle, Washington, United States of America
| | - Chao-Ting Wu
- Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Amanda M. Larracuente
- Department of Biology, University of Rochester, Rochester, New York, United States of America
| | - Barbara G. Mellone
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, United States of America
- Institute for Systems Genomics, University of Connecticut, Storrs, Connecticut, United States of America
| |
|
46
|
Bravo GA, Antonelli A, Bacon CD, Bartoszek K, Blom MPK, Huynh S, Jones G, Knowles LL, Lamichhaney S, Marcussen T, Morlon H, Nakhleh LK, Oxelman B, Pfeil B, Schliep A, Wahlberg N, Werneck FP, Wiedenhoeft J, Willows-Munro S, Edwards SV. Embracing heterogeneity: coalescing the Tree of Life and the future of phylogenomics. PeerJ 2019; 7:e6399. [PMID: 30783571 PMCID: PMC6378093 DOI: 10.7717/peerj.6399] [Citation(s) in RCA: 67] [Impact Index Per Article: 11.2] [Received: 02/05/2018] [Accepted: 01/07/2019] [Indexed: 12/23/2022] Open
Abstract
Building the Tree of Life (ToL) is a major challenge of modern biology, requiring advances in cyberinfrastructure, data collection, theory, and more. Here, we argue that phylogenomics stands to benefit by embracing the many heterogeneous genomic signals emerging from the first decade of large-scale phylogenetic analysis spawned by high-throughput sequencing (HTS). Such signals include those most commonly encountered in phylogenomic datasets, such as incomplete lineage sorting, but also those reticulate processes emerging with greater frequency, such as recombination and introgression. Here we focus specifically on how phylogenetic methods can accommodate the heterogeneity incurred by such population genetic processes; we do not discuss phylogenetic methods that ignore such processes, such as concatenation or supermatrix approaches or supertrees. We suggest that methods of data acquisition and the types of markers used in phylogenomics will remain restricted until a posteriori methods of marker choice are made possible with routine whole-genome sequencing of taxa of interest. We discuss limitations and potential extensions of a model supporting innovation in phylogenomics today, the multispecies coalescent model (MSC). Macroevolutionary models that use phylogenies, such as character mapping, often ignore the heterogeneity on which building phylogenies increasingly rely and suggest that assimilating such heterogeneity is an important goal moving forward. Finally, we argue that an integrative cyberinfrastructure linking all steps of the process of building the ToL, from specimen acquisition in the field to publication and tracking of phylogenomic data, as well as a culture that values contributors at each step, are essential for progress.
Affiliation(s)
- Gustavo A. Bravo
- Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, Harvard University, Cambridge, MA, USA
| | - Alexandre Antonelli
- Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, Harvard University, Cambridge, MA, USA
- Gothenburg Global Biodiversity Centre, Göteborg, Sweden
- Department of Biological and Environmental Sciences, University of Gothenburg, Göteborg, Sweden
- Gothenburg Botanical Garden, Göteborg, Sweden
| | - Christine D. Bacon
- Gothenburg Global Biodiversity Centre, Göteborg, Sweden
- Department of Biological and Environmental Sciences, University of Gothenburg, Göteborg, Sweden
| | - Krzysztof Bartoszek
- Department of Computer and Information Science, Linköping University, Linköping, Sweden
| | - Mozes P. K. Blom
- Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Stockholm, Sweden
| | - Stella Huynh
- Institut de Biologie, Université de Neuchâtel, Neuchâtel, Switzerland
| | - Graham Jones
- Department of Biological and Environmental Sciences, University of Gothenburg, Göteborg, Sweden
| | - L. Lacey Knowles
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA
| | - Sangeet Lamichhaney
- Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, Harvard University, Cambridge, MA, USA
| | - Thomas Marcussen
- Centre for Ecological and Evolutionary Synthesis, University of Oslo, Oslo, Norway
| | - Hélène Morlon
- Institut de Biologie, Ecole Normale Supérieure de Paris, Paris, France
| | - Luay K. Nakhleh
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Bengt Oxelman
- Gothenburg Global Biodiversity Centre, Göteborg, Sweden
- Department of Biological and Environmental Sciences, University of Gothenburg, Göteborg, Sweden
| | - Bernard Pfeil
- Department of Biological and Environmental Sciences, University of Gothenburg, Göteborg, Sweden
| | - Alexander Schliep
- Department of Computer Science and Engineering, Chalmers University of Technology and University of Gothenburg, Göteborg, Sweden
| | | | - Fernanda P. Werneck
- Coordenação de Biodiversidade, Programa de Coleções Científicas Biológicas, Instituto Nacional de Pesquisa da Amazônia, Manaus, AM, Brazil
| | - John Wiedenhoeft
- Department of Computer Science and Engineering, Chalmers University of Technology and University of Gothenburg, Göteborg, Sweden
- Department of Computer Science, Rutgers University, Piscataway, NJ, USA
| | - Sandi Willows-Munro
- School of Life Sciences, University of Kwazulu-Natal, Pietermaritzburg, South Africa
| | - Scott V. Edwards
- Department of Organismic and Evolutionary Biology, Museum of Comparative Zoology, Harvard University, Cambridge, MA, USA
- Gothenburg Centre for Advanced Studies in Science and Technology, Chalmers University of Technology and University of Gothenburg, Göteborg, Sweden
| |
|
47
|
Heterochromatin-Enriched Assemblies Reveal the Sequence and Organization of the Drosophila melanogaster Y Chromosome. Genetics 2018; 211:333-348. [PMID: 30420487 PMCID: PMC6325706 DOI: 10.1534/genetics.118.301765] [Citation(s) in RCA: 76] [Impact Index Per Article: 10.9] [Received: 07/04/2018] [Accepted: 11/05/2018] [Indexed: 12/21/2022] Open
Abstract
Heterochromatic regions of the genome are repeat-rich and poor in protein coding genes, and are therefore underrepresented in even the best genome assemblies. Sex-limited chromosomes are among the most difficult regions of the genome to assemble. The Drosophila melanogaster Y chromosome is entirely heterochromatic, yet has wide-ranging effects on male fertility, fitness, and genome-wide gene expression. The genetic basis of this phenotypic variation is difficult to study, in part because we do not know the detailed organization of the Y chromosome. To study Y chromosome organization in D. melanogaster, we develop an assembly strategy involving the in silico enrichment of heterochromatic long single-molecule reads and use these reads to create targeted de novo assemblies of heterochromatic sequences. We assigned contigs to the Y chromosome using Illumina reads to identify male-specific sequences. Our pipeline extends the D. melanogaster reference genome by 11.9 Mb, closes 43.8% of the gaps, and improves overall contiguity. The addition of 10.6 Mb of Y-linked sequence permitted us to study the organization of repeats and genes along the Y chromosome. We detected a high rate of duplication to the pericentric regions of the Y chromosome from other regions in the genome. Most of these duplicated genes exist in multiple copies. We detail the evolutionary history of one sex-linked gene family, crystal-Stellate. While the Y chromosome does not undergo crossing over, we observed high gene conversion rates within and between members of the crystal-Stellate gene family, Su(Ste), and PCKR, compared to genome-wide estimates. Our results suggest that gene conversion and gene duplication play an important role in the evolution of Y-linked genes.
|
48
|
Roach MJ, Johnson DL, Bohlmann J, van Vuuren HJJ, Jones SJM, Pretorius IS, Schmidt SA, Borneman AR. Population sequencing reveals clonal diversity and ancestral inbreeding in the grapevine cultivar Chardonnay. PLoS Genet 2018; 14:e1007807. [PMID: 30458008 PMCID: PMC6279053 DOI: 10.1371/journal.pgen.1007807] [Citation(s) in RCA: 47] [Impact Index Per Article: 6.7] [Received: 08/09/2018] [Revised: 12/04/2018] [Accepted: 11/02/2018] [Indexed: 01/08/2023] Open
Abstract
Chardonnay is the basis of some of the world's most iconic wines and its success is underpinned by a historic program of clonal selection. There are numerous clones of Chardonnay available that exhibit differences in key viticultural and oenological traits that have arisen from the accumulation of somatic mutations during centuries of asexual propagation. However, the genetic variation that underlies these differences remains largely unknown. To address this knowledge gap, a high-quality, diploid-phased Chardonnay genome assembly was produced from single-molecule real time sequencing, and combined with re-sequencing data from 15 different Chardonnay clones. There were 1620 markers identified that distinguish the 15 clones. These markers were reliably used for clonal identification of independently sourced genomic material, as well as in identifying a potential genetic basis for some clonal phenotypic differences. The predicted parentage of the Chardonnay haplomes was elucidated by mapping sequence data from the predicted parents of Chardonnay (Gouais blanc and Pinot noir) against the Chardonnay reference genome. This enabled the detection of instances of heterosis, with differentially-expanded gene families being inherited from the parents of Chardonnay. Most surprisingly however, the patterns of nucleotide variation present in the Chardonnay genome indicate that Pinot noir and Gouais blanc share an extremely high degree of kinship that has resulted in the Chardonnay genome displaying characteristics that are indicative of inbreeding.
Affiliation(s)
- Michael J. Roach
  - The Australian Wine Research Institute, Glen Osmond, South Australia, Australia
- Daniel L. Johnson
  - The Australian Wine Research Institute, Glen Osmond, South Australia, Australia
- Joerg Bohlmann
  - Michael Smith Laboratories, The University of British Columbia, Vancouver, British Columbia, Canada
  - Wine Research Centre, Faculty of Land and Food Systems, University of British Columbia, Vancouver, British Columbia, Canada
- Hennie J. J. van Vuuren
  - Michael Smith Laboratories, The University of British Columbia, Vancouver, British Columbia, Canada
  - Wine Research Centre, Faculty of Land and Food Systems, University of British Columbia, Vancouver, British Columbia, Canada
- Steven J. M. Jones
  - Michael Smith Genome Sciences Centre, British Columbia Cancer Research Centre, Vancouver, British Columbia, Canada
- Isak S. Pretorius
  - Chancellery, Macquarie University, Sydney, New South Wales, Australia
- Simon A. Schmidt
  - The Australian Wine Research Institute, Glen Osmond, South Australia, Australia
- Anthony R. Borneman
  - The Australian Wine Research Institute, Glen Osmond, South Australia, Australia
  - Department of Genetics and Evolution, University of Adelaide, South Australia, Australia
49
Brashear WA, Raudsepp T, Murphy WJ. Evolutionary conservation of Y Chromosome ampliconic gene families despite extensive structural variation. Genome Res 2018; 28:1841-1851. [PMID: 30381290 PMCID: PMC6280758 DOI: 10.1101/gr.237586.118] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Received: 03/26/2018] [Accepted: 10/27/2018] [Indexed: 12/20/2022]
Abstract
Despite claims that the mammalian Y Chromosome is on a path to extinction, comparative sequence analysis of primate Y Chromosomes has shown the decay of the ancestral single-copy genes has all but ceased in this eutherian lineage. The suite of single-copy Y-linked genes is highly conserved among the majority of eutherian Y Chromosomes due to strong purifying selection to retain dosage-sensitive genes. In contrast, the ampliconic regions of the Y Chromosome, which contain testis-specific genes that encode the majority of the transcripts on eutherian Y Chromosomes, are rapidly evolving and are thought to undergo species-specific turnover. However, ampliconic genes are known from only a handful of species, limiting insights into their long-term evolutionary dynamics. We used a clone-based sequencing approach employing both long- and short-read sequencing technologies to assemble ∼2.4 Mb of representative ampliconic sequence dispersed across the domestic cat Y Chromosome, and identified the major ampliconic gene families and repeat units. We analyzed fluorescence in situ hybridization, qPCR, and whole-genome sequence data from 20 cat species and revealed that ampliconic gene families are conserved across the cat family Felidae but show high transcript diversity, copy number variation, and structural rearrangement. Our analysis of ampliconic gene evolution unveils a complex pattern of long-term gene content stability despite extensive structural variation on a nonrecombining background.
Affiliation(s)
- Wesley A Brashear
  - Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas 77843, USA
  - Interdisciplinary Program in Genetics, Texas A&M University, College Station, Texas 77843, USA
- Terje Raudsepp
  - Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas 77843, USA
  - Interdisciplinary Program in Genetics, Texas A&M University, College Station, Texas 77843, USA
- William J Murphy
  - Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas 77843, USA
  - Interdisciplinary Program in Genetics, Texas A&M University, College Station, Texas 77843, USA
50
Palacios-Gimenez OM, Bardella VB, Lemos B, Cabral-de-Mello DC. Satellite DNAs are conserved and differentially transcribed among Gryllus cricket species. DNA Res 2018; 25:137-147. [PMID: 29096008 PMCID: PMC5909420 DOI: 10.1093/dnares/dsx044] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Received: 06/19/2017] [Accepted: 10/19/2017] [Indexed: 11/21/2022] Open
Abstract
Satellite DNA (satDNA) is an abundant class of non-coding repetitive DNA that is preferentially found as tandemly repeated arrays in gene-poor heterochromatin but is also present in gene-rich euchromatin. Here, we used DNA- and RNA-seq from Gryllus assimilis to address the content and transcriptional patterns of satDNAs. We also mapped RNA-seq libraries for other Gryllus species against the satDNAs found in G. assimilis and G. bimaculatus genomes to investigate their evolutionary conservation and transcriptional profiles in Gryllus. Through DNA-seq read clustering analysis using RepeatExplorer, dotplots analysis and fluorescence in situ hybridization mapping, we found that ∼4% of the G. assimilis genome is represented by 11 well-defined A + T-rich satDNA families. These are mainly located in heterochromatic areas, with some repeats able to form high-order repeat structures. By in silico transcriptional analysis we identified satDNAs that are conserved in Gryllus but differentially transcribed. The data regarding satDNA presence in G. assimilis genome were discussed in an evolutionary context, with transcriptional data enabling comparisons between sexes and across tissues when possible. We discuss hypotheses for the conservation and transcription of satDNAs in Gryllus, which might result from their role in sexual differentiation at the chromatin level, heterochromatin formation and centromeric function.
Affiliation(s)
- Octavio Manuel Palacios-Gimenez
  - Departamento de Biologia, Instituto de Biociências/IB, UNESP-Univ Estadual Paulista, Rio Claro, São Paulo, Brazil
  - Program in Molecular and Integrative Physiological Sciences, Department of Environmental Health, Harvard University T. H. Chan School of Public Health, Boston, MA 02115, USA
- Vanessa Bellini Bardella
  - Departamento de Biologia, Instituto de Biociências/IB, UNESP-Univ Estadual Paulista, Rio Claro, São Paulo, Brazil
- Bernardo Lemos
  - Program in Molecular and Integrative Physiological Sciences, Department of Environmental Health, Harvard University T. H. Chan School of Public Health, Boston, MA 02115, USA