1
|
FREQ-Seq2: a method for precise high-throughput combinatorial quantification of allele frequencies. G3 (BETHESDA, MD.) 2023; 13:jkad162. [PMID: 37494033 PMCID: PMC10542570 DOI: 10.1093/g3journal/jkad162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/26/2023] [Revised: 01/26/2023] [Accepted: 07/14/2023] [Indexed: 07/27/2023]
Abstract
The accurate determination of allele frequencies is crucially important across a wide range of problems in genetics, such as developing population genetic models, making inferences from genome-wide association studies, determining genetic risk for diseases, as well as other scientific and medical applications. Furthermore, understanding how allele frequencies change over time in populations is central to ascertaining their evolutionary dynamics. We present a precise, efficient, and economical method (FREQ-Seq2) for quantifying the relative frequencies of different alleles at loci of interest in mixed population samples. Through the creative use of paired barcode sequences, we exponentially increased the throughput of the original FREQ-Seq method from 48 to 2,304 samples. FREQ-Seq2 can be targeted to specific genomic regions of interest, which are amplified using universal barcoded adapters to generate Illumina sequencing libraries. Our enhanced method, available as a kit along with open-source software for analyzing sequenced libraries, enables the detection and removal of errors that are undetectable in the original FREQ-Seq method as well as other conventional methods for allele frequency quantification. Finally, we validated the performance of our sequencing-based approach with a highly multiplexed set of control samples as well as a competitive evolution experiment in Escherichia coli and compare the latter to estimates derived from manual colony counting. Our analyses demonstrate that FREQ-Seq2 is flexible, inexpensive, and produces large amounts of data with low error, low noise, and desirable statistical properties. In summary, FREQ-Seq2 is a powerful method for quantifying allele frequency that provides a versatile approach for profiling mixed populations.
Collapse
|
2
|
Sex-linked gene traffic underlies the acquisition of sexually dimorphic UV color vision in Heliconius butterflies. Proc Natl Acad Sci U S A 2023; 120:e2301411120. [PMID: 37552755 PMCID: PMC10438391 DOI: 10.1073/pnas.2301411120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2023] [Accepted: 06/16/2023] [Indexed: 08/10/2023] Open
Abstract
The acquisition of novel sexually dimorphic traits poses an evolutionary puzzle: How do new traits arise and become sex-limited? Recently acquired color vision, sexually dimorphic in animals like primates and butterflies, presents a compelling model for understanding how traits become sex-biased. For example, some Heliconius butterflies uniquely possess UV (ultraviolet) color vision, which correlates with the expression of two differentially tuned UV-sensitive rhodopsins, UVRh1 and UVRh2. To discover how such traits become sexually dimorphic, we studied Heliconius charithonia, which exhibits female-specific UVRh1 expression. We demonstrate that females, but not males, discriminate different UV wavelengths. Through whole-genome shotgun sequencing and assembly of the H. charithonia genome, we discovered that UVRh1 is present on the W chromosome, making it obligately female-specific. By knocking out UVRh1, we show that UVRh1 protein expression is absent in mutant female eye tissue, as in wild-type male eyes. A PCR survey of UVRh1 sex-linkage across the genus shows that species with female-specific UVRh1 expression lack UVRh1 gDNA in males. Thus, acquisition of sex linkage is sufficient to achieve female-specific expression of UVRh1, though this does not preclude other mechanisms, like cis-regulatory evolution from also contributing. Moreover, both this event, and mutations leading to differential UV opsin sensitivity, occurred early in the history of Heliconius. These results suggest a path for acquiring sexual dimorphism distinct from existing mechanistic models. We propose a model where gene traffic to heterosomes (the W or the Y) genetically partitions a trait by sex before a phenotype shifts (spectral tuning of UV sensitivity).
Collapse
|
3
|
Author Correction: Hidden genomic features of an invasive malaria vector, Anopheles stephensi, revealed by a chromosome-level genome assembly. BMC Biol 2022; 20:96. [PMID: 35501892 PMCID: PMC9059397 DOI: 10.1186/s12915-022-01314-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
|
4
|
Abstract
Studying how novel phenotypes originate and evolve is fundamental to the field of evolutionary biology as it allows us to understand how organismal diversity is generated and maintained. However, determining the basis of novel phenotypes is challenging as it involves orchestrated changes at multiple biological levels. Here, we aim to overcome this challenge by using a comparative species framework combining behavioral, gene expression, and genomic analyses to understand the evolutionary novel egg-laying substrate-choice behavior of the invasive pest species Drosophila suzukii. First, we used egg-laying behavioral assays to understand the evolution of ripe fruit oviposition preference in D. suzukii compared with closely related species D. subpulchrella and D. biarmipes as well as D. melanogaster. We show that D. subpulchrella and D. biarmipes lay eggs on both ripe and rotten fruits, suggesting that the transition to ripe fruit preference was gradual. Second, using two-choice oviposition assays, we studied how D. suzukii, D. subpulchrella, D. biarmipes, and D. melanogaster differentially process key sensory cues distinguishing ripe from rotten fruit during egg-laying. We found that D. suzukii's preference for ripe fruit is in part mediated through a species-specific preference for stiff substrates. Last, we sequenced and annotated a high-quality genome for D. subpulchrella. Using comparative genomic approaches, we identified candidate genes involved in D. suzukii's ability to seek out and target ripe fruits. Our results provide detail to the stepwise evolution of pest activity in D. suzukii, indicating important cues used by this species when finding a host, and the molecular mechanisms potentially underlying their adaptation to a new ecological niche.
Collapse
|
5
|
Evolution of genome structure in the Drosophila simulans species complex. Genome Res 2021; 31:380-396. [PMID: 33563718 PMCID: PMC7919458 DOI: 10.1101/gr.263442.120] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2020] [Accepted: 12/28/2020] [Indexed: 12/25/2022]
Abstract
The rapid evolution of repetitive DNA sequences, including satellite DNA, tandem duplications, and transposable elements, underlies phenotypic evolution and contributes to hybrid incompatibilities between species. However, repetitive genomic regions are fragmented and misassembled in most contemporary genome assemblies. We generated highly contiguous de novo reference genomes for the Drosophila simulans species complex (D. simulans, D. mauritiana, and D. sechellia), which speciated ∼250,000 yr ago. Our assemblies are comparable in contiguity and accuracy to the current D. melanogaster genome, allowing us to directly compare repetitive sequences between these four species. We find that at least 15% of the D. simulans complex species genomes fail to align uniquely to D. melanogaster owing to structural divergence-twice the number of single-nucleotide substitutions. We also find rapid turnover of satellite DNA and extensive structural divergence in heterochromatic regions, whereas the euchromatic gene content is mostly conserved. Despite the overall preservation of gene synteny, euchromatin in each species has been shaped by clade- and species-specific inversions, transposable elements, expansions and contractions of satellite and tRNA tandem arrays, and gene duplications. We also find rapid divergence among Y-linked genes, including copy number variation and recent gene duplications from autosomes. Our assemblies provide a valuable resource for studying genome evolution and its consequences for phenotypic evolution in these genetic model species.
Collapse
|
6
|
Hidden genomic features of an invasive malaria vector, Anopheles stephensi, revealed by a chromosome-level genome assembly. BMC Biol 2021; 19:28. [PMID: 33568145 PMCID: PMC7876825 DOI: 10.1186/s12915-021-00963-z] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Accepted: 01/19/2021] [Indexed: 12/27/2022] Open
Abstract
BACKGROUND The mosquito Anopheles stephensi is a vector of urban malaria in Asia that recently invaded Africa. Studying the genetic basis of vectorial capacity and engineering genetic interventions are both impeded by limitations of a vector's genome assembly. The existing assemblies of An. stephensi are draft-quality and contain thousands of sequence gaps, potentially missing genetic elements important for its biology and evolution. RESULTS To access previously intractable genomic regions, we generated a reference-grade genome assembly and full transcript annotations that achieve a new standard for reference genomes of disease vectors. Here, we report novel species-specific transposable element (TE) families and insertions in functional genetic elements, demonstrating the widespread role of TEs in genome evolution and phenotypic variation. We discovered 29 previously hidden members of insecticide resistance genes, uncovering new candidate genetic elements for the widespread insecticide resistance observed in An. stephensi. We identified 2.4 Mb of the Y chromosome and seven new male-linked gene candidates, representing the most extensive coverage of the Y chromosome in any mosquito. By tracking full-length mRNA for > 15 days following blood feeding, we discover distinct roles of previously uncharacterized genes in blood metabolism and female reproduction. The Y-linked heterochromatin landscape reveals extensive accumulation of long-terminal repeat retrotransposons throughout the evolution and degeneration of this chromosome. Finally, we identify a novel Y-linked putative transcription factor that is expressed constitutively throughout male development and adulthood, suggesting an important role. CONCLUSION Collectively, these results and resources underscore the significance of previously hidden genomic elements in the biology of malaria mosquitoes and will accelerate the development of genetic control strategies of malaria transmission.
Collapse
|
7
|
Topologically associating domains and their role in the evolution of genome structure and function in Drosophila. Genome Res 2021; 31:397-410. [PMID: 33563719 PMCID: PMC7919452 DOI: 10.1101/gr.266130.120] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2020] [Accepted: 12/24/2020] [Indexed: 12/18/2022]
Abstract
Topologically associating domains (TADs) were recently identified as fundamental units of three-dimensional eukaryotic genomic organization, although our knowledge of the influence of TADs on genome evolution remains preliminary. To study the molecular evolution of TADs in Drosophila species, we constructed a new reference-grade genome assembly and accompanying high-resolution TAD map for D. pseudoobscura Comparison of D. pseudoobscura and D. melanogaster, which are separated by ∼49 million years of divergence, showed that ∼30%-40% of their genomes retain conserved TADs. Comparative genomic analysis of 17 Drosophila species revealed that chromosomal rearrangement breakpoints are enriched at TAD boundaries but depleted within TADs. Additionally, genes within conserved TADs show lower expression divergence than those located in nonconserved TADs. Furthermore, we found that a substantial proportion of long genes (>50 kbp) in D. melanogaster (42%) and D. pseudoobscura (26%) constitute their own TADs, implying transcript structure may be one of the deterministic factors for TAD formation. By using structural variants (SVs) identified from 14 D. melanogaster strains, its three closest sibling species from the D. simulans species complex, and two obscura clade species, we uncovered evidence of selection acting on SVs at TAD boundaries, but with the nature of selection differing between SV types. Deletions are depleted at TAD boundaries in both divergent and polymorphic SVs, suggesting purifying selection, whereas divergent tandem duplications are enriched at TAD boundaries relative to polymorphism, suggesting they are adaptive. Our findings highlight how important TADs are in shaping the acquisition and retention of structural mutations that fundamentally alter genome organization.
Collapse
|
8
|
Abstract
Structural variants (SVs) are a largely unstudied feature of plant genome evolution, despite the fact that SVs contribute substantially to phenotypes. In this study, we discovered SVs across a population sample of 347 high-coverage, resequenced genomes of Asian rice (Oryza sativa) and its wild ancestor (O. rufipogon). In addition to this short-read data set, we also inferred SVs from whole-genome assemblies and long-read data. Comparisons among data sets revealed different features of genome variability. For example, genome alignment identified a large (∼4.3 Mb) inversion in indica rice varieties relative to japonica varieties, and long-read analyses suggest that ∼9% of genes from the outgroup (O. longistaminata) are hemizygous. We focused, however, on the resequencing sample to investigate the population genomics of SVs. Clustering analyses with SVs recapitulated the rice cultivar groups that were also inferred from SNPs. However, the site-frequency spectrum of each SV type-which included inversions, duplications, deletions, translocations, and mobile element insertions-was skewed toward lower frequency variants than synonymous SNPs, suggesting that SVs may be predominantly deleterious. Among transposable elements, SINE and mariner insertions were found at especially low frequency. We also used SVs to study domestication by contrasting between rice and O. rufipogon. Cultivated genomes contained ∼25% more derived SVs and mobile element insertions than O. rufipogon, indicating that SVs contribute to the cost of domestication in rice. Peaks of SV divergence were enriched for known domestication genes, but we also detected hundreds of genes gained and lost during domestication, some of which were enriched for traits of agronomic interest.
Collapse
|
9
|
Efficient population modification gene-drive rescue system in the malaria mosquito Anopheles stephensi. Nat Commun 2020; 11:5553. [PMID: 33144570 PMCID: PMC7609566 DOI: 10.1038/s41467-020-19426-0] [Citation(s) in RCA: 80] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2020] [Accepted: 10/13/2020] [Indexed: 12/27/2022] Open
Abstract
Cas9/gRNA-mediated gene-drive systems have advanced development of genetic technologies for controlling vector-borne pathogen transmission. These technologies include population suppression approaches, genetic analogs of insecticidal techniques that reduce the number of insect vectors, and population modification (replacement/alteration) approaches, which interfere with competence to transmit pathogens. Here, we develop a recoded gene-drive rescue system for population modification of the malaria vector, Anopheles stephensi, that relieves the load in females caused by integration of the drive into the kynurenine hydroxylase gene by rescuing its function. Non-functional resistant alleles are eliminated via a dominantly-acting maternal effect combined with slower-acting standard negative selection, and rare functional resistant alleles do not prevent drive invasion. Small cage trials show that single releases of gene-drive males robustly result in efficient population modification with ≥95% of mosquitoes carrying the drive within 5-11 generations over a range of initial release ratios.
Collapse
|
10
|
Genomic and biochemical evidence of dietary adaptation in a marine herbivorous fish. Proc Biol Sci 2020; 287:20192327. [PMID: 32070255 PMCID: PMC7062031 DOI: 10.1098/rspb.2019.2327] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Accepted: 01/26/2020] [Indexed: 01/30/2023] Open
Abstract
Adopting a new diet is a significant evolutionary change, and can profoundly affect an animal's physiology, biochemistry, ecology and genome. To study this evolutionary transition, we investigated the physiology and genomics of digestion of a derived herbivorous fish, Cebidichthys violaceus. We sequenced and assembled its genome (N50 = 6.7 Mb) and digestive transcriptome, and revealed the molecular changes related to digestive enzymes (carbohydrases, proteases and lipases), finding abundant evidence of molecular adaptation. Specifically, two gene families experienced expansion in copy number and adaptive amino acid substitutions: amylase and carboxyl ester lipase (cel), which are involved in the digestion of carbohydrates and lipids, respectively. Both show elevated levels of gene expression and increased enzyme activity. Because carbohydrates are abundant in the prickleback's diet and lipids are rare, these findings suggest that such dietary specialization involves both exploiting abundant resources and scavenging rare ones, especially essential nutrients, like essential fatty acids.
Collapse
|
11
|
Rapid Low-Cost Assembly of the Drosophila melanogaster Reference Genome Using Low-Coverage, Long-Read Sequencing. G3 (BETHESDA, MD.) 2018; 8:3143-3154. [PMID: 30018084 PMCID: PMC6169397 DOI: 10.1534/g3.118.200162] [Citation(s) in RCA: 52] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/16/2018] [Accepted: 07/03/2018] [Indexed: 12/27/2022]
Abstract
Accurate and comprehensive characterization of genetic variation is essential for deciphering the genetic basis of diseases and other phenotypes. A vast amount of genetic variation stems from large-scale sequence changes arising from the duplication, deletion, inversion, and translocation of sequences. In the past 10 years, high-throughput short reads have greatly expanded our ability to assay sequence variation due to single nucleotide polymorphisms. However, a recent de novo assembly of a second Drosophila melanogaster reference genome has revealed that short read genotyping methods miss hundreds of structural variants, including those affecting phenotypes. While genomes assembled using high-coverage long reads can achieve high levels of contiguity and completeness, concerns about cost, errors, and low yield have limited widespread adoption of such sequencing approaches. Here we resequenced the reference strain of D. melanogaster (ISO1) on a single Oxford Nanopore MinION flow cell run for 24 hr. Using only reads longer than 1 kb or with at least 30x coverage, we assembled a highly contiguous de novo genome. The addition of inexpensive paired reads and subsequent scaffolding using an optical map technology achieved an assembly with completeness and contiguity comparable to the D. melanogaster reference assembly. Comparison of our assembly to the reference assembly of ISO1 uncovered a number of structural variants (SVs), including novel LTR transposable element insertions and duplications affecting genes with developmental, behavioral, and metabolic functions. Collectively, these SVs provide a snapshot of the dynamics of genome evolution. Furthermore, our assembly and comparison to the D. melanogaster reference genome demonstrates that high-quality de novo assembly of reference genomes and comprehensive variant discovery using such assemblies are now possible by a single lab for under $1,000 (USD).
Collapse
|
12
|
Hidden genetic variation shapes the structure of functional elements in Drosophila. Nat Genet 2018; 50:20-25. [PMID: 29255259 PMCID: PMC5742068 DOI: 10.1038/s41588-017-0010-y] [Citation(s) in RCA: 109] [Impact Index Per Article: 18.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2017] [Accepted: 11/10/2017] [Indexed: 12/31/2022]
Abstract
Mutations that add, subtract, rearrange, or otherwise refashion genome structure often affect phenotypes, although the fragmented nature of most contemporary assemblies obscures them. To discover such mutations, we assembled the first new reference-quality genome of Drosophila melanogaster since its initial sequencing. By comparing this new genome to the existing D. melanogaster assembly, we created a structural variant map of unprecedented resolution and identified extensive genetic variation that has remained hidden until now. Many of these variants constitute candidates underlying phenotypic variation, including tandem duplications and a transposable element insertion that amplifies the expression of detoxification-related genes associated with nicotine resistance. The abundance of important genetic variation that still evades discovery highlights how crucial high-quality reference genomes are to deciphering phenotypes.
Collapse
|
13
|
Contiguous and accurate de novo assembly of metazoan genomes with modest long read coverage. Nucleic Acids Res 2016; 44:e147. [PMID: 27458204 PMCID: PMC5100563 DOI: 10.1093/nar/gkw654] [Citation(s) in RCA: 185] [Impact Index Per Article: 23.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2015] [Accepted: 07/09/2016] [Indexed: 01/19/2023] Open
Abstract
Genome assemblies that are accurate, complete and contiguous are essential for identifying important structural and functional elements of genomes and for identifying genetic variation. Nevertheless, most recent genome assemblies remain incomplete and fragmented. While long molecule sequencing promises to deliver more complete genome assemblies with fewer gaps, concerns about error rates, low yields, stringent DNA requirements and uncertainty about best practices may discourage many investigators from adopting this technology. Here, in conjunction with the platinum standard Drosophila melanogaster reference genome, we analyze recently published long molecule sequencing data to identify what governs completeness and contiguity of genome assemblies. We also present a hybrid meta-assembly approach that achieves remarkable assembly contiguity for both Drosophila and human assemblies with only modest long molecule sequencing coverage. Our results motivate a set of preliminary best practices for obtaining accurate and contiguous assemblies, a ‘missing manual’ that guides key decisions in building high quality de novo genome assemblies, from DNA isolation to polishing the assembly.
Collapse
|
14
|
Comparative sex chromosome genomics in snakes: differentiation, evolutionary strata, and lack of global dosage compensation. PLoS Biol 2013; 11:e1001643. [PMID: 24015111 PMCID: PMC3754893 DOI: 10.1371/journal.pbio.1001643] [Citation(s) in RCA: 221] [Impact Index Per Article: 20.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2013] [Accepted: 07/19/2013] [Indexed: 01/06/2023] Open
Abstract
Snakes exhibit genetic sex determination, with female heterogametic sex chromosomes (ZZ males, ZW females). Extensive cytogenetic work has suggested that the level of sex chromosome heteromorphism varies among species, with Boidae having entirely homomorphic sex chromosomes, Viperidae having completely heteromorphic sex chromosomes, and Colubridae showing partial differentiation. Here, we take a genomic approach to compare sex chromosome differentiation in these three snake families. We identify homomorphic sex chromosomes in boas (Boidae), but completely heteromorphic sex chromosomes in both garter snakes (Colubridae) and pygmy rattlesnake (Viperidae). Detection of W-linked gametologs enables us to establish the presence of evolutionary strata on garter and pygmy rattlesnake sex chromosomes where recombination was abolished at different time points. Sequence analysis shows that all strata are shared between pygmy rattlesnake and garter snake, i.e., recombination was abolished between the sex chromosomes before the two lineages diverged. The sex-biased transmission of the Z and its hemizygosity in females can impact patterns of molecular evolution, and we show that rates of evolution for Z-linked genes are increased relative to their pseudoautosomal homologs, both at synonymous and amino acid sites (even after controlling for mutational biases). This demonstrates that mutation rates are male-biased in snakes (male-driven evolution), but also supports faster-Z evolution due to differential selective effects on the Z. Finally, we perform a transcriptome analysis in boa and pygmy rattlesnake to establish baseline levels of sex-biased expression in homomorphic sex chromosomes, and show that heteromorphic ZW chromosomes in rattlesnakes lack chromosome-wide dosage compensation. Our study provides the first full scale overview of the evolution of snake sex chromosomes at the genomic level, thus greatly expanding our knowledge of reptilian and vertebrate sex chromosomes evolution.
Collapse
|
15
|
Inheritance of gene expression level and selective constraints on trans- and cis-regulatory changes in yeast. Mol Biol Evol 2013; 30:2121-33. [PMID: 23793114 DOI: 10.1093/molbev/mst114] [Citation(s) in RCA: 67] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Gene expression evolution can be caused by changes in cis- or trans-regulatory elements or both. As cis and trans regulation operate through different molecular mechanisms, cis and trans mutations may show different inheritance patterns and may be subjected to different selective constraints. To investigate these issues, we obtained and analyzed gene expression data from two Saccharomyces cerevisiae strains and their hybrid, using high-throughput sequencing. Our data indicate that compared with other types of genes, those with antagonistic cis-trans interactions are more likely to exhibit over- or underdominant inheritance of expression level. Moreover, in accordance with previous studies, genes with trans variants tend to have a dominant inheritance pattern, whereas cis variants are enriched for additive inheritance. In addition, cis regulatory differences contribute more to expression differences between species than within species, whereas trans regulatory differences show a stronger association between divergence and polymorphism. Our data indicate that in the trans component of gene expression differences genes subjected to weaker selective constraints tend to have an excess of polymorphism over divergence compared with those subjected to stronger selective constraints. In contrast, in the cis component, this difference between genes under stronger and weaker selective constraint is mostly absent. To explain these observations, we propose that purifying selection more strongly shapes trans changes than cis changes and that positive selection may have significantly contributed to cis regulatory divergence.
Collapse
|
16
|
Drosophila duplication hotspots are associated with late-replicating regions of the genome. PLoS Genet 2011; 7:e1002340. [PMID: 22072977 PMCID: PMC3207856 DOI: 10.1371/journal.pgen.1002340] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2011] [Accepted: 08/26/2011] [Indexed: 12/22/2022] Open
Abstract
Duplications play a significant role in both extremes of the phenotypic spectrum of newly arising mutations: they can have severe deleterious effects (e.g. duplications underlie a variety of diseases) but can also be highly advantageous. The phenotypic potential of newly arisen duplications has stimulated wide interest in both the mutational and selective processes shaping these variants in the genome. Here we take advantage of the Drosophila simulans–Drosophila melanogaster genetic system to further our understanding of both processes. Regarding mutational processes, the study of two closely related species allows investigation of the potential existence of shared duplication hotspots, and the similarities and differences between the two genomes can be used to dissect its underlying causes. Regarding selection, the difference in the effective population size between the two species can be leveraged to ask questions about the strength of selection acting on different classes of duplications. In this study, we conducted a survey of duplication polymorphisms in 14 different lines of D. simulans using tiling microarrays and combined it with an analogous survey for the D. melanogaster genome. By integrating the two datasets, we identified duplication hotspots conserved between the two species. However, unlike the duplication hotspots identified in mammalian genomes, Drosophila duplication hotspots are not associated with sequences of high sequence identity capable of mediating non-allelic homologous recombination. Instead, Drosophila duplication hotspots are associated with late-replicating regions of the genome, suggesting a link between DNA replication and duplication rates. We also found evidence supporting a higher effectiveness of selection on duplications in D. simulans than in D. melanogaster. This is also true for duplications segregating at high frequency, where we find evidence in D. simulans that a sizeable fraction of these mutations is being driven to fixation by positive selection. DNA duplications are important contributors to the phenotypic differences observed between individuals. These mutations can disrupt the normal functioning of genes and so are often associated with disease. But because they can add genetic information they can also lead to evolutionary change. Understanding how selection and non-random mutation processes shape the distribution of duplications throughout the genome is important to elucidate both the medical and evolutionary impacts of these mutations. Here, we examined the roles of selection and mutation in shaping patterns of duplication polymorphisms across the genomes of the fruit fly Drosophila melanogaster and its sister species, D. simulans. We found that selection is pervasive in both genomes but is more efficient in D. simulans than in D. melanogaster. We also found that these two species have shared duplication hotspots, i.e. orthologous regions experiencing high rates of duplication in the two genomes. After excluding the hypothesis that Drosophila duplication hotspots are associated with regions of the genome rich in segmental duplications (as observed for mammalian genomes), we show that they are associated with late-replicating regions of the genome. Our work therefore proposes a link between DNA replication and rates of duplication across the genome.
Collapse
|
17
|
The genetic basis of evolutionary change in gene expression levels. Philos Trans R Soc Lond B Biol Sci 2010; 365:2581-90. [PMID: 20643748 DOI: 10.1098/rstb.2010.0005] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
The regulation of gene expression is an important determinant of organismal phenotype and evolution. However, the widespread recognition of this fact occurred long after the synthesis of evolution and genetics. Here, we give a brief sketch of thoughts regarding gene regulation in the history of evolution and genetics. We then review the development of genome-wide studies of gene regulatory variation in the context of the location and mode of action of the causative genetic changes. In particular, we review mapping of the genetic basis of expression variation through expression quantitative trait locus studies and measuring the cis/trans component of expression variation in allele-specific expression studies. We conclude by proposing a systematic integration of ideas that combines global mapping studies, cis/trans tests and modern population genetics methodologies, in order to directly estimate the forces acting on regulatory variation within and between species.
Collapse
|
18
|
Abstract
Gene expression is regulated both by cis elements, which are DNA segments closely linked to the genes they regulate, and by trans factors, which are usually proteins capable of diffusing to unlinked genes. Understanding the patterns and sources of regulatory variation is crucial for understanding phenotypic and genome evolution. Here, we measure genome-wide allele-specific expression by deep sequencing to investigate the patterns of cis and trans expression variation between two strains of Saccharomyces cerevisiae. We propose a statistical modeling framework based on the binomial distribution that simultaneously addresses normalization of read counts derived from different parents and estimating the cis and trans expression variation parameters. We find that expression polymorphism in yeast is common for both cis and trans, though trans variation is more common. Constraint in expression evolution is correlated with other hallmarks of constraint, including gene essentiality, number of protein interaction partners, and constraint in amino acid substitution, indicating that both cis and trans polymorphism are clearly under purifying selection, though trans variation appears to be more sensitive to selective constraint. Comparing interspecific expression divergence between S. cerevisiae and S. paradoxus to our intraspecific variation suggests a significant departure from a neutral model of molecular evolution. A further examination of correlation between polymorphism and divergence within each category suggests that cis divergence is more frequently mediated by positive Darwinian selection than is trans divergence.
Collapse
|
19
|
Natural selection shapes genome-wide patterns of copy-number polymorphism in Drosophila melanogaster. Science 2008; 320:1629-31. [PMID: 18535209 DOI: 10.1126/science.1158078] [Citation(s) in RCA: 205] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
The role that natural selection plays in governing the locations and early evolution of copy-number mutations remains largely unexplored. We used high-density full-genome tiling arrays to create a fine-scale genomic map of copy-number polymorphisms (CNPs) in Drosophila melanogaster. We inferred a total of 2658 independent CNPs, 56% of which overlap genes. These include CNPs that are likely to be under positive selection, most notably high-frequency duplications encompassing toxin-response genes. The locations and frequencies of CNPs are strongly shaped by purifying selection, with deletions under stronger purifying selection than duplications. Among duplications, those overlapping exons or introns, as well as those falling on the X chromosome, seem to be subject to stronger purifying selection.
Collapse
|
20
|
Genetic, genomic, and functional analysis of the granule lattice proteins in Tetrahymena secretory granules. Mol Biol Cell 2005; 16:4046-60. [PMID: 15958493 PMCID: PMC1196318 DOI: 10.1091/mbc.e05-01-0028] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2005] [Revised: 04/27/2005] [Accepted: 06/03/2005] [Indexed: 11/11/2022] Open
Abstract
In some cells, the polypeptides stored in dense core secretory granules condense as ordered arrays. In ciliates such as Tetrahymena thermophila, the resulting crystals function as projectiles, expanding upon exocytosis. Isolation of granule contents previously defined five Granule lattice (Grl) proteins as abundant core constituents, whereas a functional screen identified a sixth family member. We have now expanded this screen to identify the nonredundant components required for projectile assembly. The results, further supported by gene disruption experiments, indicate that six Grl proteins define the core structure. Both in vivo and in vitro data indicate that core assembly begins in the endoplasmic reticulum with formation of specific hetero-oligomeric Grl proprotein complexes. Four additional GRL-like genes were found in the T. thermophila genome. Grl2p and Grl6p are targeted to granules, but the transcripts are present at low levels and neither is essential for core assembly. The DeltaGRL6 cells nonetheless showed a subtle change in granule morphology and a marked reduction in granule accumulation. Epistasis analysis suggests this results from accelerated loss of DeltaGRL6 granules, rather than from decreased synthesis. Our results not only provide insight into the organization of Grl-based granule cores but also imply that the functions of Grl proteins extend beyond core assembly.
Collapse
|
21
|
Abstract
Plotkin et al. introduce a method to detect selection that is based on an index called codon volatility and that uses only the sequence of a single genome, claiming that this method is applicable to a large range of sequenced organisms. Volatility for a given codon is the ratio of non-synonymous codons to all sense codons accessible by one point mutation. The significance of each gene's volatility is assessed by comparison with a simulated distribution of 10(6) synonymous versions of each gene, with synonymous codons drawn randomly from average genome frequencies. Here we re-examine their method and data and find that codon volatility does not detect selection, and that, even if it did, the genomes of Mycobacterium tuberculosis and Plasmodium falciparum, as well as those of most sequenced organisms, do not meet the assumptions necessary for application of their method.
Collapse
|
22
|
Abstract
The fourth chromosome of Drosophila melanogaster and its sister species are believed to be nonrecombining and have been a model system for testing predictions of the effects of selection on linked, neutral variation. We recently examined nucleotide variation along the chromosome of D. melanogaster and revealed that a low average level of recombination could be associated with considerably high levels of nucleotide variation. In this report, we further investigate the variation along the fourth chromosome of D. simulans. We sequenced 12 gene regions evenly distributed along the fourth chromosome for a worldwide collection of 11 isofemale lines and 5 gene regions in a local population of 10 isofemale lines from South America. In contrast to predictions for regions of very low recombination, these data reveal that the variation levels in many gene regions, including an intron region of the ci gene, vary considerably along the fourth chromosome. Nucleotide diversity ranged from 0.0010 to 0.0074 in 9 gene regions interspersed with several regions of greatly reduced variation. Tests of recombination indicate that the recombination level is not as low as previously thought, likely an order of magnitude higher than that in D. melanogaster. Finally, estimates of the recombination parameters are shown to support a crossover-plus-conversion model.
Collapse
|
23
|
Sex chromosomes and male functions: where do new genes go? Cell Cycle 2004; 3:873-5. [PMID: 15190200] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/29/2023] Open
Abstract
The position of a gene in the genome may have important consequences for its function. Therefore, when a new duplicate gene arises, its location may be critical in determining its fate. Our recent work in humans, mouse, and Drosophila provided a test by studying the patterns of duplication in sex chromosome evolution. We revealed a bias in the generation and recruitment of new gene copies involving the X chromosome that has been shaped largely by selection for male germline functions. The gene movement patterns we observed reflect an ongoing process as some of the new genes are very young while others were present before the divergence of humans and mouse. This suggests a continuing redistribution of male-related genes to achieve a more efficient allocation of male functions. This notion should be further tested in organisms employing other sex determination systems or in organisms differing in germline sex chromosome inactivation. It is likely that the selective forces that were detected in these studies are also acting on other types of duplicate genes. As a result, future work elucidating sex chromosome differentiation by other mutational mechanisms will shed light on this important process.
Collapse
|
24
|
Abstract
AbstractThe fourth chromosome of Drosophila melanogaster and its sister species are believed to be nonrecombining and have been a model system for testing predictions of the effects of selection on linked, neutral variation. We recently examined nucleotide variation along the chromosome of D. melanogaster and revealed that a low average level of recombination could be associated with considerably high levels of nucleotide variation. In this report, we further investigate the variation along the fourth chromosome of D. simulans. We sequenced 12 gene regions evenly distributed along the fourth chromosome for a worldwide collection of 11 isofemale lines and 5 gene regions in a local population of 10 isofemale lines from South America. In contrast to predictions for regions of very low recombination, these data reveal that the variation levels in many gene regions, including an intron region of the ci gene, vary considerably along the fourth chromosome. Nucleotide diversity ranged from 0.0010 to 0.0074 in 9 gene regions interspersed with several regions of greatly reduced variation. Tests of recombination indicate that the recombination level is not as low as previously thought, likely an order of magnitude higher than that in D. melanogaster. Finally, estimates of the recombination parameters are shown to support a crossover-plus-conversion model.
Collapse
|
25
|
Abstract
Mammalian sex chromosomes have undergone profound changes since evolving from ancestral autosomes. By examining retroposed genes in the human and mouse genomes, we demonstrate that, during evolution, the mammalian X chromosome has generated and recruited a disproportionately high number of functional retroposed genes, whereas the autosomes experienced lower gene turnover. Most autosomal copies originating from X-linked genes exhibited testis-biased expression. Such export is incompatible with mutational bias and is likely driven by natural selection to attain male germline function. However, the excess recruitment is consistent with a combination of both natural selection and mutational bias.
Collapse
|
26
|
Abstract
Although coevolution is complicated, in that the interacting species evolve in response to each other, such evolutionary dynamics are amenable to mathematical modeling. In this article, we briefly review models and data on coevolution between plants and the pathogens and herbivores that attack them. We focus on "arms races," in which trait values in the plant and its enemies escalate to more and more extreme values. Untested key assumptions in many of the models are the relationships between costs and benefits of resistance in the plant and the level of resistance, as well as how costs of virulence or detoxification ability in the enemy change with levels of these traits. A preliminary assessment of these assumptions finds only mixed support for the models. What is needed are models that are more closely tailored to particular plant-enemy interactions, as well as experiments that are expressly designed to test existing models.
Collapse
|