1
|
Clark E, Akam M. Odd-paired controls frequency doubling in Drosophila segmentation by altering the pair-rule gene regulatory network. eLife 2016; 5:e18215. [PMID: 27525481 PMCID: PMC5035143 DOI: 10.7554/elife.18215] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2016] [Accepted: 08/14/2016] [Indexed: 01/08/2023] Open
Abstract
The Drosophila embryo transiently exhibits a double-segment periodicity, defined by the expression of seven 'pair-rule' genes, each in a pattern of seven stripes. At gastrulation, interactions between the pair-rule genes lead to frequency doubling and the patterning of 14 parasegment boundaries. In contrast to earlier stages of Drosophila anteroposterior patterning, this transition is not well understood. By carefully analysing the spatiotemporal dynamics of pair-rule gene expression, we demonstrate that frequency-doubling is precipitated by multiple coordinated changes to the network of regulatory interactions between the pair-rule genes. We identify the broadly expressed but temporally patterned transcription factor, Odd-paired (Opa/Zic), as the cause of these changes, and show that the patterning of the even-numbered parasegment boundaries relies on Opa-dependent regulatory interactions. Our findings indicate that the pair-rule gene regulatory network has a temporally modulated topology, permitting the pair-rule genes to play stage-specific patterning roles.
Collapse
Affiliation(s)
- Erik Clark
- Laboratory for Development and Evolution, Department of Zoology, University of Cambridge, Cambridge, United Kingdom
| | - Michael Akam
- Laboratory for Development and Evolution, Department of Zoology, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
2
|
Ren J, Song K, Sun F, Deng M, Reinert G. Multiple alignment-free sequence comparison. Bioinformatics 2013; 29:2690-8. [PMID: 23990418 DOI: 10.1093/bioinformatics/btt462] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Recently, a range of new statistics have become available for the alignment-free comparison of two sequences based on k-tuple word content. Here, we extend these statistics to the simultaneous comparison of more than two sequences. Our suite of statistics contains, first, C(*)1 and C(S)1, extensions of statistics for pairwise comparison of the joint k-tuple content of all the sequences, and second, C(*)2, C(S)2 and C(geo)2, averages of sums of pairwise comparison statistics. The two tasks we consider are, first, to identify sequences that are similar to a set of target sequences, and, second, to measure the similarity within a set of sequences. RESULTS Our investigation uses both simulated data as well as cis-regulatory module data where the task is to identify cis-regulatory modules with similar transcription factor binding sites. We find that although for real data, all of our statistics show a similar performance, on simulated data the Shepp-type statistics are in some instances outperformed by star-type statistics. The multiple alignment-free statistics are more sensitive to contamination in the data than the pairwise average statistics. AVAILABILITY Our implementation of the five statistics is available as R package named 'multiAlignFree' at be http://www-rcf.usc.edu/∼fsun/Programs/multiAlignFree/multiAlignFreemain.html. CONTACT reinert@stats.ox.ac.uk. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jie Ren
- School of Mathematics, Peking University, Beijing 100871, PR China, Molecular and Computational Biology, University of Southern California, Los Angeles, CA 90089-2910, USA, MOE Key Laboratory of Bioinformatics and Bioinformatics Division, TNLIST/Department of Automation, Tsinghua University, Beijing 100084, PR China and Department of Statistics, University of Oxford, 1 South Parks Road, Oxford OX1 3TG, UK
| | | | | | | | | |
Collapse
|
3
|
Arunachalam M, Jayasurya K, Tomancak P, Ohler U. An alignment-free method to identify candidate orthologous enhancers in multiple Drosophila genomes. ACTA ACUST UNITED AC 2010; 26:2109-15. [PMID: 20624780 DOI: 10.1093/bioinformatics/btq358] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
MOTIVATION Evolutionarily conserved non-coding genomic sequences represent a potentially rich source for the discovery of gene regulatory region such as transcriptional enhancers. However, detecting orthologous enhancers using alignment-based methods in higher eukaryotic genomes is particularly challenging, as regulatory regions can undergo considerable sequence changes while maintaining their functionality. RESULTS We have developed an alignment-free method which identifies conserved enhancers in multiple diverged species. Our method is based on similarity metrics between two sequences based on the co-occurrence of sequence patterns regardless of their order and orientation, thus tolerating sequence changes observed in non-coding evolution. We show that our method is highly successful in detecting orthologous enhancers in distantly related species without requiring additional information such as knowledge about transcription factors involved, or predicted binding sites. By estimating the significance of similarity scores, we are able to discriminate experimentally validated functional enhancers from seemingly equally conserved candidates without function. We demonstrate the effectiveness of this approach on a wide range of enhancers in Drosophila, and also present encouraging results to detect conserved functional regions across large evolutionary distances. Our work provides encouraging steps on the way to ab initio unbiased enhancer prediction to complement ongoing experimental efforts. AVAILABILITY The software, data and the results used in this article are available at http://www.genome.duke.edu/labs/ohler/research/transcription/fly_enhancer/.
Collapse
|
4
|
Weirauch MT, Hughes TR. Conserved expression without conserved regulatory sequence: the more things change, the more they stay the same. Trends Genet 2010; 26:66-74. [PMID: 20083321 DOI: 10.1016/j.tig.2009.12.002] [Citation(s) in RCA: 119] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2009] [Revised: 12/09/2009] [Accepted: 12/09/2009] [Indexed: 12/28/2022]
Abstract
Regulatory regions with similar transcriptional output often have little overt sequence similarity, both within and between genomes. Although cis- and trans-regulatory changes can contribute to sequence divergence without dramatically altering gene expression outputs, heterologous DNA often functions similarly in organisms that share little regulatory sequence similarities (e.g. human DNA in fish), indicating that trans-regulatory mechanisms tend to diverge more slowly and can accommodate a variety of cis-regulatory configurations. This capacity to 'tinker' with regulatory DNA probably relates to the complexity, robustness and evolvability of regulatory systems, but cause-and-effect relationships among evolutionary processes and properties of regulatory systems remain a topic of debate. The challenge of understanding the concrete mechanisms underlying cis-regulatory evolution - including the conservation of function without the conservation of sequence - relates to the challenge of understanding the function of regulatory systems in general. Currently, we are largely unable to recognize functionally similar regulatory DNA.
Collapse
Affiliation(s)
- Matthew T Weirauch
- Banting and Best Department of Medical Research and Donnelly Centre for Cellular and Biomolecular Research, Ontario, Canada
| | | |
Collapse
|
5
|
Duncan EJ, Wilson MJ, Smith JM, Dearden PK. Evolutionary origin and genomic organisation of runt-domain containing genes in arthropods. BMC Genomics 2008; 9:558. [PMID: 19032778 PMCID: PMC2631020 DOI: 10.1186/1471-2164-9-558] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2008] [Accepted: 11/25/2008] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Gene clusters, such as the Hox gene cluster, are known to have critical roles in development. In eukaryotes gene clusters arise primarily by tandem gene duplication and divergence. Genes within a cluster are often co-regulated, providing selective pressure to maintain the genome organisation, and this co-regulation can result in temporal or spatial co-linearity of gene expression. It has been previously noted that in Drosophila melanogaster, three of the four runt-domain (RD) containing genes are found in a relatively tight cluster on chromosome 1, raising the possibility of a putative functional RD gene cluster in D. melanogaster. RESULTS To investigate the possibility of such a gene cluster, orthologues of the Drosophila melanogaster RD genes were identified in several endopterygotan insects, two exopterygotan insects and two non-insect arthropods. In all insect species four RD genes were identified and orthology was assigned to the Drosophila sequences by phylogenetic analyses. Although four RD genes were found in the crustacean D. pulex, orthology could not be assigned to the insect sequences, indicating independent gene duplications from a single ancestor following the split of the hexapod lineage from the crustacean lineage.In insects, two chromosomal arrangements of these genes was observed; the first a semi-dispersed cluster, such as in Drosophila, where lozenge is separated from the core cluster of three RD genes often by megabases of DNA. The second arrangement was a tight cluster of the four RD genes, such as in Apis mellifera.This genomic organisation, particularly of the three core RD genes, raises the possibility of shared regulatory elements. In situ hybridisation of embryonic expression of the four RD genes in Drosophila melanogaster and the honeybee A. mellifera shows no evidence for either spatial or temporal co-linearity of expression during embryogenesis. CONCLUSION All fully sequenced insect genomes contain four RD genes and orthology can be assigned to these genes based on similarity to the D. melanogaster protein sequences. Examination of the genomic organisation of these genes provides evidence for a functional RD gene cluster. RD genes from non-insect arthropods are also clustered, however the lack of orthology between these and insect RD genes suggests this cluster is likely to have resulted from a duplication event independent from that which created the insect RD gene cluster. Analysis of embryonic RD gene expression in two endopterygotan insects, A. mellifera and D. melanogaster, did not show evidence for coordinated gene expression, therefore while the functional significance of this gene cluster remains unknown its maintenance during insect evolution implies some functional significance to the cluster.
Collapse
Affiliation(s)
- Elizabeth J Duncan
- Laboratory for Evolution and Development, University of Otago, PO Box 56, Dunedin, Aotearoa-New Zealand
| | | | | | | |
Collapse
|
6
|
Li L, Zhu Q, He X, Sinha S, Halfon MS. Large-scale analysis of transcriptional cis-regulatory modules reveals both common features and distinct subclasses. Genome Biol 2008; 8:R101. [PMID: 17550599 PMCID: PMC2394749 DOI: 10.1186/gb-2007-8-6-r101] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2007] [Revised: 05/23/2007] [Accepted: 06/05/2007] [Indexed: 02/01/2023] Open
Abstract
Analysis of 280 experimentally-verified cis-regulatory modules from Drosophila reveal features both common to all and unique to distinct subclasses of modules. Background Transcriptional cis-regulatory modules (for example, enhancers) play a critical role in regulating gene expression. While many individual regulatory elements have been characterized, they have never been analyzed as a class. Results We have performed the first such large-scale study of cis-regulatory modules in order to determine whether they have common properties that might aid in their identification and contribute to our understanding of the mechanisms by which they function. A total of 280 individual, experimentally verified cis-regulatory modules from Drosophila were analyzed for a range of sequence-level and functional properties. We report here that regulatory modules do indeed share common properties, among them an elevated GC content, an increased level of interspecific sequence conservation, and a tendency to be transcribed into RNA. However, we find that dense clustering of transcription factor binding sites, especially homotypic clustering, which is commonly believed to be a general characteristic of regulatory modules, is rather a feature that belongs chiefly to a specific subclass. This has important implications for current computational approaches, many of which are biased toward this subset. We explore two new strategies to assess binding site clustering and gauge their performances with respect to their ability to detect all 280 modules and various functionally coherent subsets. Conclusion Our findings demonstrate that cis-regulatory modules share common features that help to define them as a class and that may lead to new insights into mechanisms of gene regulation. However, these properties alone may not be sufficient to reliably distinguish regulatory from non-regulatory sequences. We also demonstrate that there are distinct subclasses of cis-regulatory modules that are more amenable to in silico detection than others and that these differences must be taken into account when attempting genome-wide regulatory element discovery.
Collapse
Affiliation(s)
- Long Li
- Department of Biochemistry, State University of New York at Buffalo, Buffalo, NY 14214, USA
| | - Qianqian Zhu
- Department of Biochemistry, State University of New York at Buffalo, Buffalo, NY 14214, USA
| | - Xin He
- Department of Computer Science, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA
| | - Saurabh Sinha
- Department of Computer Science, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA
| | - Marc S Halfon
- Department of Biochemistry, State University of New York at Buffalo, Buffalo, NY 14214, USA
- Department of Biological Sciences, State University of New York at Buffalo, Buffalo, NY 14214, USA
- New York State Center of Excellence in Bioinformatics and the Life Sciences, Buffalo, NY 14203, USA
- Department of Molecular and Cellular Biology, Roswell Park Cancer Institute, Buffalo, NY 14263, USA
| |
Collapse
|
7
|
Chabot A, Shrit RA, Blekhman R, Gilad Y. Using reporter gene assays to identify cis regulatory differences between humans and chimpanzees. Genetics 2007; 176:2069-76. [PMID: 17565944 PMCID: PMC1950614 DOI: 10.1534/genetics.107.073429] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Most phenotypic differences between human and chimpanzee are likely to result from differences in gene regulation, rather than changes to protein-coding regions. To date, however, only a handful of human-chimpanzee nucleotide differences leading to changes in gene regulation have been identified. To hone in on differences in regulatory elements between human and chimpanzee, we focused on 10 genes that were previously found to be differentially expressed between the two species. We then designed reporter gene assays for the putative human and chimpanzee promoters of the 10 genes. Of seven promoters that we found to be active in human liver cell lines, human and chimpanzee promoters had significantly different activity in four cases, three of which recapitulated the gene expression difference seen in the microarray experiment. For these three genes, we were therefore able to demonstrate that a change in cis influences expression differences between humans and chimpanzees. Moreover, using site-directed mutagenesis on one construct, the promoter for the DDA3 gene, we were able to identify three nucleotides that together lead to a cis regulatory difference between the species. High-throughput application of this approach can provide a map of regulatory element differences between humans and our close evolutionary relatives.
Collapse
Affiliation(s)
- Adrien Chabot
- Department of Human Genetics, University of Chicago, 920 E. 58th Street, Chicago, IL 60637, USA
| | | | | | | |
Collapse
|
8
|
van Opijnen T, Boerlijst MC, Berkhout B. Effects of random mutations in the human immunodeficiency virus type 1 transcriptional promoter on viral fitness in different host cell environments. J Virol 2006; 80:6678-85. [PMID: 16775355 PMCID: PMC1488947 DOI: 10.1128/jvi.02547-05] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
A mutation's effect on fitness or phenotype may in part depend on the interaction of the mutation with the environment. The resulting phenotype or fitness is important, since it determines the adaptive potential of a species. To date, most studies have focused on alterations to protein-coding regions of the genome and their consequential fitness effects. Non-protein-coding regulatory regions have been largely neglected, although they make up a large and important part of an organism's genome. Here, we use human immunodeficiency virus type 1 as a model system to investigate fitness effects of random mutations in noncoding DNA-binding sites of the transcriptional promoter. We determined 242 fitness values for 35 viral promoter mutants with one, two, or three mutations across seven distinct cellular environments and identified that (i) all mutants have an effect in at least one cellular environment; (ii) fitness effects are highly dependent on the cellular environment; (iii) disadvantageous and advantageous mutations occur at high and similar frequencies; and (iv) epistatic effects of multiple mutations are rare. Our results underline the evolutionary potential of regulatory regions and indicate that DNA-binding sites evolve under strong selection, while at the same time, they are very plastic to environmental change.
Collapse
Affiliation(s)
- Tim van Opijnen
- Department of Human Retrovirology, Academic Medical Centre, University of Amsterdam, Meibergdreef 15, 1105 AZ Amsterdam, The Netherlands
| | | | | |
Collapse
|
9
|
Abstract
Cis-regulatory sequences direct patterns of gene expression essential for development and physiology. Evolutionary changes in these sequences contribute to phenotypic divergence. Despite their importance, cis-regulatory regions remain one of the most enigmatic features of the genome. Patterns of sequence evolution can be used to identify cis-regulatory elements, but the power of this approach depends upon the relationship between sequence and function. Comparative studies of gene regulation among Diptera reveal that divergent sequences can underlie conserved expression, and that expression differences can evolve despite largely similar sequences. This complex structure-function relationship is the primary impediment for computational identification and interpretation of cis-regulatory sequences. Biochemical characterization and in vivo assays of cis-regulatory sequences on a genomic-scale will relieve this barrier.
Collapse
Affiliation(s)
- P J Wittkopp
- Department of Ecology and Evolutionary Biology, University of Michigan, 1061 Natural Science Building, 830 North University Ave., Ann Arbor, MI 48109-1048, USA.
| |
Collapse
|
10
|
Bazykin GA, Kondrashov AS. Rate of promoter class turn-over in yeast evolution. BMC Evol Biol 2006; 6:14. [PMID: 16472383 PMCID: PMC1457003 DOI: 10.1186/1471-2148-6-14] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2005] [Accepted: 02/10/2006] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Phylogenetic conservation at the DNA level is routinely used as evidence of molecular function, under the assumption that locations and sequences of functional DNA segments remain invariant in evolution. In particular, short DNA segments participating in initiation and regulation of transcription are often conserved between related species. However, transcription of a gene can evolve, and this evolution may involve changes of even such conservative DNA segments. Genes of yeast Saccharomyces have promoters of two classes, class 1 (TATA-containing) and class 2 (non-TATA-containing). RESULTS Comparison of upstream non-coding regions of orthologous genes from the five species of Saccharomyces sensu stricto group shows that among 212 genes which very likely have class 1 promoters in S. cerevisiae, 17 probably have class 2 promoters in one or more other species. Conversely, among 322 genes which very likely have class 2 promoters in S. cerevisiae, 44 probably have class 1 promoters in one or more other species. Also, for at least 2 genes from the set of 212 S. cerevisiae genes with class 1 promoters, the locations of the TATA consensus sequences are substantially different between the species. CONCLUSION Our results indicate that, in the course of yeast evolution, a promoter switches its class with the probability at least approximately 0.1 per time required for the accumulation of one nucleotide substitution at a non-coding site. Thus, key sequences involved in initiation of transcription evolve with substantial rates in yeast.
Collapse
Affiliation(s)
- Georgii A Bazykin
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, New Jersey 08544, USA
| | - Alexey S Kondrashov
- National Center for Biotechnology Information, NIH, Bethesda, Maryland 20894, USA
| |
Collapse
|
11
|
Emberly E, Rajewsky N, Siggia ED. Conservation of regulatory elements between two species of Drosophila. BMC Bioinformatics 2003; 4:57. [PMID: 14629780 PMCID: PMC302112 DOI: 10.1186/1471-2105-4-57] [Citation(s) in RCA: 80] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2003] [Accepted: 11/20/2003] [Indexed: 12/19/2022] Open
Abstract
Background One of the important goals in the post-genomic era is to determine the regulatory elements within the non-coding DNA of a given organism's genome. The identification of functional cis-regulatory modules has proven difficult since the component factor binding sites are small and the rules governing their arrangement are poorly understood. However, the genomes of suitably diverged species help to predict regulatory elements based on the generally accepted assumption that conserved blocks of genomic sequence are likely to be functional. To judge the efficacy of strategies that prefilter by sequence conservation it is important to know to what extent the converse assumption holds, namely that functional elements common to both species will fall within these conserved blocks. The recently completed sequence of a second Drosophila species provides an opportunity to test this assumption for one of the experimentally best studied regulatory networks in multicellular organisms, the body patterning of the fly embryo. Results We find that 50%–70% of known binding sites reside in conserved sequence blocks, but these percentages are not greatly enriched over what is expected by chance. Finally, a computational genome-wide search in both species for regulatory modules based on clusters of binding sites suggests that genes central to the regulatory network are consistently recovered. Conclusions Our results indicate that binding sites remain clustered for these "core modules" while not necessarily residing in conserved blocks. This is an important clue as to how regulatory information is encoded in the genome and how modules evolve.
Collapse
Affiliation(s)
- Eldon Emberly
- Center for Studies in Physics and Biology, The Rockefeller University, 1230 York Avenue, New York, NY, USA
| | - Nikolaus Rajewsky
- Department of Biology, New York University, 1009 Main Building, 100 Washington Square East, New York, NY, USA
| | - Eric D Siggia
- Center for Studies in Physics and Biology, The Rockefeller University, 1230 York Avenue, New York, NY, USA
| |
Collapse
|
12
|
Arnosti DN. Analysis and function of transcriptional regulatory elements: insights from Drosophila. ANNUAL REVIEW OF ENTOMOLOGY 2001; 48:579-602. [PMID: 12359740 DOI: 10.1146/annurev.ento.48.091801.112749] [Citation(s) in RCA: 61] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
Analysis of gene expression is assuming an increasingly important role in elucidating the molecular basis of insect biology. Transcriptional regulation of gene expression is directed by a variety of cis-acting DNA elements that control spatial and temporal patterns of expression. This review summarizes current knowledge about properties of transcriptional regulatory elements, based largely on research in Drosophila melanogaster, and outlines ways that new technologies are providing tools to facilitate the study of transcriptional regulatory elements in other insects.
Collapse
Affiliation(s)
- David N Arnosti
- Department of Biochemistry and Molecular Biology and Program in Genetics, Michigan State University, East Lansing, Michigan 48824-1319, USA.
| |
Collapse
|
13
|
Kim J. Macro-evolution of the hairy enhancer in Drosophila species. THE JOURNAL OF EXPERIMENTAL ZOOLOGY 2001; 291:175-85. [PMID: 11479916 DOI: 10.1002/jez.1067] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
It has been suggested that many of the changes in the developmental program might be in the cis-acting promoters and enhancer regions. Here I study the macro-evolutionary changes of an enhancer region for the early developmental gene hairy in Drosophila melanogaster, D. simulans, D. pseudoobscura, D. willistoni, D. nebulosa, D. hydei, and D. virilis. The enhancer region is characterized by small, highly conserved blocks interspersed among highly variable regions. Nevertheless, species phylogenies constructed by the enhancer sequences agree with the widely accepted phylogeny of these species. The evolution of the variable regions is consistent with a molecular clock, while the evolution of the conserved blocks is significantly different from a clock. In particular, the D. pseudoobscura lineage shows the highest degree of species-specific change consistent with changes in expression timing reported in an earlier study. It has been suggested that the variation in sequence length between highly conserved blocks may play a role in the coordination of regulatory processes, such as protein-protein interactions; thus, stabilizing selection has been suggested to act on the length variations. Here I develop a test for stabilizing selection on length variation and show that the hairy enhancer does not show statistically significant evidence for stabilizing selection. J. Exp. Zool. (Mol. Dev. Evol.) 291:175-185, 2001.
Collapse
Affiliation(s)
- J Kim
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06520, USA.
| |
Collapse
|
14
|
Bergman CM, Kreitman M. Analysis of conserved noncoding DNA in Drosophila reveals similar constraints in intergenic and intronic sequences. Genome Res 2001; 11:1335-45. [PMID: 11483574 DOI: 10.1101/gr.178701] [Citation(s) in RCA: 129] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Comparative genomic approaches to gene and cis-regulatory prediction are based on the principle that differential DNA sequence conservation reflects variation in functional constraint. Using this principle, we analyze noncoding sequence conservation in Drosophila for 40 loci with known or suspected cis-regulatory function encompassing >100 kb of DNA. We estimate the fraction of noncoding DNA conserved in both intergenic and intronic regions and describe the length distribution of ungapped conserved noncoding blocks. On average, 22%-26% of noncoding sequences surveyed are conserved in Drosophila, with median block length approximately 19 bp. We show that point substitution in conserved noncoding blocks exhibits transition bias as well as lineage effects in base composition, and occurs more than an order of magnitude more frequently than insertion/deletion (indel) substitution. Overall, patterns of noncoding DNA structure and evolution differ remarkably little between intergenic and intronic conserved blocks, suggesting that the effects of transcription per se contribute minimally to the constraints operating on these sequences. The results of this study have implications for the development of alignment and prediction algorithms specific to noncoding DNA, as well as for models of cis-regulatory DNA sequence evolution.
Collapse
Affiliation(s)
- C M Bergman
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois 60637, USA.
| | | |
Collapse
|