1
|
Roman BE, Santana DJ, Prediger C, Madi-Ravazzi L. Phylogeny of Drosophila saltans group (Diptera: Drosophilidae) based on morphological and molecular evidence. PLoS One 2022; 17:e0266710. [PMID: 35390108 PMCID: PMC8989330 DOI: 10.1371/journal.pone.0266710] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2021] [Accepted: 03/25/2022] [Indexed: 11/19/2022] Open
Abstract
Drosophila saltans group belongs to the subgenus Sophophora (family Drosophilidae), and it is subdivided into five subgroups, with 23 species. The species in this group are widely distributed in the Americas, primarily in the Neotropics. In the literature, the phylogenetic reconstruction of this group has been performed with various markers, but many inconsistencies remain. Here, we present a phylogenetic reconstruction of the saltans group with a greater number of species, 16 species, which is the most complete to date for the saltans group and includes all subgroups, in a combined analysis with morphological and molecular markers. We incorporated 48 morphological characters of male terminalia, the highest number used to date, and molecular markers based on mitochondrial genes COI and COII. Based on the results, which have recovered the five subgroups as distinct lineages, we propose a new hypothesis regarding the phylogenetic relationships among the subgroups of the saltans group. The relationships of the species within the sturtevanti and elliptica subgroups were well supported. The saltans subgroup showed several polytomies, but the relationship between the sibling species D. austrosaltans and D. saltans and their close relation with D. nigrosaltans were well supported in the molecular and total evidence analyses. The morphological analysis additionally supported the formation of the clade D. nigrosaltans—D. pseudosaltans. The observed polytomies may represent synchronous radiations or have resulted from speciation rates that have been too fast relative to the pace of substitution accumulation.
Collapse
Affiliation(s)
- Bruna Emilia Roman
- Departament of Biology, Institute of Biosciences, Humanities and Exact Sciences, São Paulo State University (UNESP), São José do Rio Preto, São Paulo, Brazil
- * E-mail: (BER); (LMR)
| | - Diego J. Santana
- Biosciences Institute, Federal University of Mato Grosso do Sul, Campo Grande, Mato Grosso do Sul, Brazil
| | - Carolina Prediger
- Departament of Biology, Institute of Biosciences, Humanities and Exact Sciences, São Paulo State University (UNESP), São José do Rio Preto, São Paulo, Brazil
| | - Lilian Madi-Ravazzi
- Departament of Biology, Institute of Biosciences, Humanities and Exact Sciences, São Paulo State University (UNESP), São José do Rio Preto, São Paulo, Brazil
- * E-mail: (BER); (LMR)
| |
Collapse
|
2
|
Matsuo Y. The Adenine/Thymine Deleterious Selection Model for GC Content Evolution at the Third Codon Position of the Histone Genes in Drosophila. Genes (Basel) 2021; 12:721. [PMID: 34065869 PMCID: PMC8150595 DOI: 10.3390/genes12050721] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Revised: 05/07/2021] [Accepted: 05/07/2021] [Indexed: 12/02/2022] Open
Abstract
The evolution of the GC (guanine cytosine) content at the third codon position of the histone genes (H1, H2A, H2B, H3, H4, H2AvD, H3.3A, H3.3B, and H4r) in 12 or more Drosophila species is reviewed. For explaining the evolution of the GC content at the third codon position of the genes, a model assuming selection with a deleterious effect for adenine/thymine and a size effect is presented. The applicability of the model to whole-genome genes is also discussed.
Collapse
Affiliation(s)
- Yoshinori Matsuo
- Division of Science and Technology, Tokushima University, 2-1 Minamijosanjima-cho, Tokushima 770-8506, Japan
| |
Collapse
|
3
|
Nakashima Y, Higashiyama A, Ushimaru A, Nagoda N, Matsuo Y. Evolution of GC content in the histone gene repeating units from Drosophila lutescens, D. takahashii and D. pseudoobscura. Genes Genet Syst 2016; 91:27-36. [PMID: 27021916 DOI: 10.1266/ggs.15-00018] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
A subset of histone genes (H1, H2A, H2B and H4), which are encoded along with H3 within repeating units, were analyzed in Drosophila lutescens, D. takahashii and D. pseudoobscura to investigate the evolutionary mechanisms influencing this multigene family and its GC content. Nucleotide divergence among species was more marked in the less functional regions. A strong inverse relationship was observed between the extent of evolutionary divergence and GC content within the repeating units; this finding indicated that the functional constraint on a region must be associated with both divergence and GC content. The GC content at 3(rd) codon positions in the histone genes from D. lutescens and D. takahashii was higher than that from D. melanogaster, while that from D. pseudoobscura was similar. These evolutionary patterns were similar to those of H3 gene regions. Based on these findings, we propose that the evolutionary mechanisms governing nucleotide content at 3(rd) codon positions tend to eliminate A and T nucleotides more frequently than G and C nucleotides. These changes might be the consequence of negative selection and would result in GC-rich 3(rd) codon positions. In addition, interspecific differences in GC content, which exhibited the same pattern for all histone genes, could be explained by different selection efficiencies that result from changes in population size.
Collapse
Affiliation(s)
- Yuko Nakashima
- Laboratory of Adaptive Evolution, Institute of Socio-Arts and Sciences, Tokushima University
| | | | | | | | | |
Collapse
|
4
|
Poh YP, Ting CT, Fu HW, Langley CH, Begun DJ. Population genomic analysis of base composition evolution in Drosophila melanogaster. Genome Biol Evol 2013; 4:1245-55. [PMID: 23160062 PMCID: PMC3542573 DOI: 10.1093/gbe/evs097] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
The relative importance of mutation, selection, and biased gene conversion to patterns of base composition variation in Drosophila melanogaster, and to a lesser extent, D. simulans, has been investigated for many years. However, genomic data from sufficiently large samples to thoroughly characterize patterns of base composition polymorphism within species have been lacking. Here, we report a genome-wide analysis of coding and noncoding polymorphism in a large sample of inbred D. melanogaster strains from Raleigh, North Carolina. Consistent with previous results, we observed that AT mutations fix more frequently than GC mutations in D. melanogaster. Contrary to predictions of previous models of codon usage in D. melanogaster, we found that synonymous sites segregating for derived AT polymorphisms were less skewed toward low frequencies compared with sites segregating a derived GC polymorphism. However, no such pattern was observed for comparable base composition polymorphisms in noncoding DNA. These results suggest that AT-ending codons could currently be favored by natural selection in the D. melanogaster lineage.
Collapse
Affiliation(s)
- Yu-Ping Poh
- Institute of Molecular and Cellular Biology, National Tsing Hua University, Taiwan, Republic of China.
| | | | | | | | | |
Collapse
|
5
|
Gao JJ, Hu YG, Toda MJ, Katoh T, Tamura K. Phylogenetic relationships between Sophophora and Lordiphosa, with proposition of a hypothesis on the vicariant divergences of tropical lineages between the Old and New Worlds in the family Drosophilidae. Mol Phylogenet Evol 2011; 60:98-107. [DOI: 10.1016/j.ympev.2011.04.012] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2010] [Revised: 04/11/2011] [Accepted: 04/18/2011] [Indexed: 10/18/2022]
|
6
|
DuMont VLB, Singh ND, Wright MH, Aquadro CF. Locus-specific decoupling of base composition evolution at synonymous sites and introns along the Drosophila melanogaster and Drosophila sechellia lineages. Genome Biol Evol 2009; 1:67-74. [PMID: 20333178 PMCID: PMC2817403 DOI: 10.1093/gbe/evp008] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/15/2009] [Indexed: 12/20/2022] Open
Abstract
Selection is thought to be partially responsible for patterns of molecular evolution at synonymous sites within numerous Drosophila species. Recently, “per-site” and likelihood methods have been developed to detect loci for which positive selection is a major component of synonymous site evolution. An underlying assumption of these methods, however, is a homogeneous mutation process. To address this potential shortcoming, we perform a complementary analysis making gene-by-gene comparisons of paired synonymous site and intron substitution rates toward and away from the nucleotides G and C because preferred codons are G or C ending in Drosophila. This comparison may reduce both the false-positive rate (due to broadscale heterogeneity in mutation) and false-negative rate (due to lack of power comparing small numbers of sites) of the per-site and likelihood methods. We detect loci with patterns of evolution suggestive of synonymous site selection pressures predominately favoring unpreferred and preferred codons along the Drosophila melanogaster and Drosophila sechellia lineages, respectively. Intron selection pressures do not appear sufficient to explain all these results as the magnitude of the difference in synonymous and intron evolution is dependent on recombination environment and chromosomal location in a direction supporting the hypothesis of selectively driven synonymous fixations. This comparison identifies 101 loci with an apparent switch in codon preference between D. melanogaster and D. sechellia, a pattern previously only observed at the Notch locus.
Collapse
|
7
|
Singh ND, Arndt PF, Clark AG, Aquadro CF. Strong evidence for lineage and sequence specificity of substitution rates and patterns in Drosophila. Mol Biol Evol 2009; 26:1591-605. [PMID: 19351792 DOI: 10.1093/molbev/msp071] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Rates of single nucleotide substitution in Drosophila are highly variable within the genome, and several examples illustrate that evolutionary rates differ among Drosophila species as well. Here, we use a maximum likelihood method to quantify lineage-specific substitutional patterns and apply this method to 4-fold degenerate synonymous sites and introns from more than 8,000 genes aligned in the Drosophila melanogaster group. We find that within species, different classes of sequence evolve at different rates, with long introns evolving most slowly and short introns evolving most rapidly. Relative rates of individual single nucleotide substitutions vary approximately 3-fold among lineages, yielding patterns of substitution that are comparatively less GC-biased in the melanogaster species complex relative to Drosophila yakuba and Drosophila erecta. These results are consistent with a model coupling a mutational shift toward reduced GC content, or a shift in mutation-selection balance, in the D. melanogaster species complex, with variation in selective constraint among different classes of DNA sequence. Finally, base composition of coding and intronic sequences is not at equilibrium with respect to substitutional patterns, which primarily reflects the slow rate of the substitutional process. These results thus support the view that mutational and/or selective processes are labile on an evolutionary timescale and that if the process is indeed selection driven, then the distribution of selective constraint is variable across the genome.
Collapse
Affiliation(s)
- Nadia D Singh
- Department of Molecular Biology and Genetics, Cornell University.
| | | | | | | |
Collapse
|
8
|
de Setta N, Loreto ELS, Carareto CMA. Is the evolutionary history of the O-type P element in the saltans and willistoni groups of Drosophila similar to that of the canonical P element? J Mol Evol 2007; 65:715-24. [PMID: 18034216 DOI: 10.1007/s00239-007-9051-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2006] [Revised: 07/26/2007] [Accepted: 10/15/2007] [Indexed: 11/26/2022]
Abstract
We studied the occurrence of O-type P elements in at least one species of each subgroup of the saltans group, in order to better understand the phylogenetic relationships among the elements within the saltans group and with those of species belonging to the willistoni group. We found that the O-type subfamily has a patchy distribution within the saltans group (it does not occur in D. neocordata and D. emarginata), low sequence divergence among species of the saltans group as well as in relation to species of the willistoni group, a lower rate of synonymous substitution for coding sequences compared to Adh, and phylogenetic incongruities. These findings suggest that the evolutionary history of the O-type subfamily within the saltans and willistoni groups follows the same model proposed for the canonical subfamily of P elements, i.e., events of horizontal transfer between species of the saltans and willistoni groups.
Collapse
Affiliation(s)
- Nathalia de Setta
- UNESP-Universidade Estadual Paulista, São José do Rio Preto, SP, Brazil
| | | | | |
Collapse
|
9
|
Drosophila 12 Genomes Consortium, Clark AG, Eisen MB, Smith DR, Bergman CM, Oliver B, Markow TA, Kaufman TC, Kellis M, Gelbart W, Iyer VN, Pollard DA, Sackton TB, Larracuente AM, Singh ND, Abad JP, Abt DN, Adryan B, Aguade M, Akashi H, Anderson WW, Aquadro CF, Ardell DH, Arguello R, Artieri CG, Barbash DA, Barker D, Barsanti P, Batterham P, Batzoglou S, Begun D, Bhutkar A, Blanco E, Bosak SA, Bradley RK, Brand AD, Brent MR, Brooks AN, Brown RH, Butlin RK, Caggese C, Calvi BR, Bernardo de Carvalho A, Caspi A, Castrezana S, Celniker SE, Chang JL, Chapple C, Chatterji S, Chinwalla A, Civetta A, Clifton SW, Comeron JM, Costello JC, Coyne JA, Daub J, David RG, Delcher AL, Delehaunty K, Do CB, Ebling H, Edwards K, Eickbush T, Evans JD, Filipski A, Findeiss S, Freyhult E, Fulton L, Fulton R, Garcia ACL, Gardiner A, Garfield DA, Garvin BE, Gibson G, Gilbert D, Gnerre S, Godfrey J, Good R, Gotea V, Gravely B, Greenberg AJ, Griffiths-Jones S, Gross S, Guigo R, Gustafson EA, Haerty W, Hahn MW, Halligan DL, Halpern AL, Halter GM, Han MV, Heger A, Hillier L, Hinrichs AS, Holmes I, Hoskins RA, Hubisz MJ, Hultmark D, Huntley MA, Jaffe DB, et alDrosophila 12 Genomes Consortium, Clark AG, Eisen MB, Smith DR, Bergman CM, Oliver B, Markow TA, Kaufman TC, Kellis M, Gelbart W, Iyer VN, Pollard DA, Sackton TB, Larracuente AM, Singh ND, Abad JP, Abt DN, Adryan B, Aguade M, Akashi H, Anderson WW, Aquadro CF, Ardell DH, Arguello R, Artieri CG, Barbash DA, Barker D, Barsanti P, Batterham P, Batzoglou S, Begun D, Bhutkar A, Blanco E, Bosak SA, Bradley RK, Brand AD, Brent MR, Brooks AN, Brown RH, Butlin RK, Caggese C, Calvi BR, Bernardo de Carvalho A, Caspi A, Castrezana S, Celniker SE, Chang JL, Chapple C, Chatterji S, Chinwalla A, Civetta A, Clifton SW, Comeron JM, Costello JC, Coyne JA, Daub J, David RG, Delcher AL, Delehaunty K, Do CB, Ebling H, Edwards K, Eickbush T, Evans JD, Filipski A, Findeiss S, Freyhult E, Fulton L, Fulton R, Garcia ACL, Gardiner A, Garfield DA, Garvin BE, Gibson G, Gilbert D, Gnerre S, Godfrey J, Good R, Gotea V, Gravely B, Greenberg AJ, Griffiths-Jones S, Gross S, Guigo R, Gustafson EA, Haerty W, Hahn MW, Halligan DL, Halpern AL, Halter GM, Han MV, Heger A, Hillier L, Hinrichs AS, Holmes I, Hoskins RA, Hubisz MJ, Hultmark D, Huntley MA, Jaffe DB, Jagadeeshan S, Jeck WR, Johnson J, Jones CD, Jordan WC, Karpen GH, Kataoka E, Keightley PD, Kheradpour P, Kirkness EF, Koerich LB, Kristiansen K, Kudrna D, Kulathinal RJ, Kumar S, Kwok R, Lander E, Langley CH, Lapoint R, Lazzaro BP, Lee SJ, Levesque L, Li R, Lin CF, Lin MF, Lindblad-Toh K, Llopart A, Long M, Low L, Lozovsky E, Lu J, Luo M, Machado CA, Makalowski W, Marzo M, Matsuda M, Matzkin L, McAllister B, McBride CS, McKernan B, McKernan K, Mendez-Lago M, Minx P, Mollenhauer MU, Montooth K, Mount SM, Mu X, Myers E, Negre B, Newfeld S, Nielsen R, Noor MAF, O'Grady P, Pachter L, Papaceit M, Parisi MJ, Parisi M, Parts L, Pedersen JS, Pesole G, Phillippy AM, Ponting CP, Pop M, Porcelli D, Powell JR, Prohaska S, Pruitt K, Puig M, Quesneville H, Ram KR, Rand D, Rasmussen MD, Reed LK, Reenan R, Reily A, Remington KA, Rieger TT, Ritchie MG, Robin C, Rogers YH, Rohde C, Rozas J, Rubenfield MJ, Ruiz A, Russo S, Salzberg SL, Sanchez-Gracia A, Saranga DJ, Sato H, Schaeffer SW, Schatz MC, Schlenke T, Schwartz R, Segarra C, Singh RS, Sirot L, Sirota M, Sisneros NB, Smith CD, Smith TF, Spieth J, Stage DE, Stark A, Stephan W, Strausberg RL, Strempel S, Sturgill D, Sutton G, Sutton GG, Tao W, Teichmann S, Tobari YN, Tomimura Y, Tsolas JM, Valente VLS, Venter E, Venter JC, Vicario S, Vieira FG, Vilella AJ, Villasante A, Walenz B, Wang J, Wasserman M, Watts T, Wilson D, Wilson RK, Wing RA, Wolfner MF, Wong A, Wong GKS, Wu CI, Wu G, Yamamoto D, Yang HP, Yang SP, Yorke JA, Yoshida K, Zdobnov E, Zhang P, Zhang Y, Zimin AV, Baldwin J, Abdouelleil A, Abdulkadir J, Abebe A, Abera B, Abreu J, Acer SC, Aftuck L, Alexander A, An P, Anderson E, Anderson S, Arachi H, Azer M, Bachantsang P, Barry A, Bayul T, Berlin A, Bessette D, Bloom T, Blye J, Boguslavskiy L, Bonnet C, Boukhgalter B, Bourzgui I, Brown A, Cahill P, Channer S, Cheshatsang Y, Chuda L, Citroen M, Collymore A, Cooke P, Costello M, D'Aco K, Daza R, De Haan G, DeGray S, DeMaso C, Dhargay N, Dooley K, Dooley E, Doricent M, Dorje P, Dorjee K, Dupes A, Elong R, Falk J, Farina A, Faro S, Ferguson D, Fisher S, Foley CD, Franke A, Friedrich D, Gadbois L, Gearin G, Gearin CR, Giannoukos G, Goode T, Graham J, Grandbois E, Grewal S, Gyaltsen K, Hafez N, Hagos B, Hall J, Henson C, Hollinger A, Honan T, Huard MD, Hughes L, Hurhula B, Husby ME, Kamat A, Kanga B, Kashin S, Khazanovich D, Kisner P, Lance K, Lara M, Lee W, Lennon N, Letendre F, LeVine R, Lipovsky A, Liu X, Liu J, Liu S, Lokyitsang T, Lokyitsang Y, Lubonja R, Lui A, MacDonald P, Magnisalis V, Maru K, Matthews C, McCusker W, McDonough S, Mehta T, Meldrim J, Meneus L, Mihai O, Mihalev A, Mihova T, Mittelman R, Mlenga V, Montmayeur A, Mulrain L, Navidi A, Naylor J, Negash T, Nguyen T, Nguyen N, Nicol R, Norbu C, Norbu N, Novod N, O'Neill B, Osman S, Markiewicz E, Oyono OL, Patti C, Phunkhang P, Pierre F, Priest M, Raghuraman S, Rege F, Reyes R, Rise C, Rogov P, Ross K, Ryan E, Settipalli S, Shea T, Sherpa N, Shi L, Shih D, Sparrow T, Spaulding J, Stalker J, Stange-Thomann N, Stavropoulos S, Stone C, Strader C, Tesfaye S, Thomson T, Thoulutsang Y, Thoulutsang D, Topham K, Topping I, Tsamla T, Vassiliev H, Vo A, Wangchuk T, Wangdi T, Weiand M, Wilkinson J, Wilson A, Yadav S, Young G, Yu Q, Zembek L, Zhong D, Zimmer A, Zwirko Z, Jaffe DB, Alvarez P, Brockman W, Butler J, Chin C, Gnerre S, Grabherr M, Kleber M, Mauceli E, MacCallum I. Evolution of genes and genomes on the Drosophila phylogeny. Nature 2007; 450:203-18. [PMID: 17994087 DOI: 10.1038/nature06341] [Show More Authors] [Citation(s) in RCA: 1537] [Impact Index Per Article: 85.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2007] [Accepted: 10/05/2007] [Indexed: 12/11/2022]
Abstract
Comparative analysis of multiple genomes in a phylogenetic framework dramatically improves the precision and sensitivity of evolutionary inference, producing more robust results than single-genome analyses can provide. The genomes of 12 Drosophila species, ten of which are presented here for the first time (sechellia, simulans, yakuba, erecta, ananassae, persimilis, willistoni, mojavensis, virilis and grimshawi), illustrate how rates and patterns of sequence divergence across taxa can illuminate evolutionary processes on a genomic scale. These genome sequences augment the formidable genetic tools that have made Drosophila melanogaster a pre-eminent model for animal genetics, and will further catalyse fundamental research on mechanisms of development, cell biology, genetics, disease, neurobiology, behaviour, physiology and evolution. Despite remarkable similarities among these Drosophila species, we identified many putatively non-neutral changes in protein-coding genes, non-coding RNA genes, and cis-regulatory regions. These may prove to underlie differences in the ecology and behaviour of these diverse species.
Collapse
|
10
|
Akashi H, Goel P, John A. Ancestral inference and the study of codon bias evolution: implications for molecular evolutionary analyses of the Drosophila melanogaster subgroup. PLoS One 2007; 2:e1065. [PMID: 17957249 PMCID: PMC2020436 DOI: 10.1371/journal.pone.0001065] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2007] [Accepted: 09/21/2007] [Indexed: 11/18/2022] Open
Abstract
Reliable inference of ancestral sequences can be critical to identifying both patterns and causes of molecular evolution. Robustness of ancestral inference is often assumed among closely related species, but tests of this assumption have been limited. Here, we examine the performance of inference methods for data simulated under scenarios of codon bias evolution within the Drosophila melanogaster subgroup. Genome sequence data for multiple, closely related species within this subgroup make it an important system for studying molecular evolutionary genetics. The effects of asymmetric and lineage-specific substitution rates (i.e., varying levels of codon usage bias and departures from equilibrium) on the reliability of ancestral codon usage was investigated. Maximum parsimony inference, which has been widely employed in analyses of Drosophila codon bias evolution, was compared to an approach that attempts to account for uncertainty in ancestral inference by weighting ancestral reconstructions by their posterior probabilities. The latter approach employs maximum likelihood estimation of rate and base composition parameters. For equilibrium and most non-equilibrium scenarios that were investigated, the probabilistic method appears to generate reliable ancestral codon bias inferences for molecular evolutionary studies within the D. melanogaster subgroup. These reconstructions are more reliable than parsimony inference, especially when codon usage is strongly skewed. However, inference biases are considerable for both methods under particular departures from stationarity (i.e., when adaptive evolution is prevalent). Reliability of inference can be sensitive to branch lengths, asymmetry in substitution rates, and the locations and nature of lineage-specific processes within a gene tree. Inference reliability, even among closely related species, can be strongly affected by (potentially unknown) patterns of molecular evolution in lineages ancestral to those of interest.
Collapse
Affiliation(s)
- Hiroshi Akashi
- Institute of Molecular Evolutionary Genetics, Department of Biology, Pennsylvania State University, State College, Pennsylvania, United States of America.
| | | | | |
Collapse
|
11
|
Díaz-Castillo C, Golic KG. Evolution of gene sequence in response to chromosomal location. Genetics 2007; 177:359-74. [PMID: 17890366 PMCID: PMC2013720 DOI: 10.1534/genetics.107.077081] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2007] [Accepted: 06/06/2007] [Indexed: 12/26/2022] Open
Abstract
Evolutionary forces acting on the repetitive DNA of heterochromatin are not constrained by the same considerations that apply to protein-coding genes. Consequently, such sequences are subject to rapid evolutionary change. By examining the Troponin C gene family of Drosophila melanogaster, which has euchromatic and heterochromatic members, we find that protein-coding genes also evolve in response to their chromosomal location. The heterochromatic members of the family show a reduced CG content and increased variation in DNA sequence. We show that the CG reduction applies broadly to the protein-coding sequences of genes located at the heterochromatin:euchromatin interface, with a very strong correlation between CG content and the distance from centric heterochromatin. We also observe a similar trend in the transition from telomeric heterochromatin to euchromatin. We propose that the methylation of DNA is one of the forces driving this sequence evolution.
Collapse
|
12
|
Abstract
Research into the origins of introns is at a critical juncture in the resolution of theories on the evolution of early life (which came first, RNA or DNA?), the identity of LUCA (the last universal common ancestor, was it prokaryotic- or eukaryotic-like?), and the significance of noncoding nucleotide variation. One early notion was that introns would have evolved as a component of an efficient mechanism for the origin of genes. But alternative theories emerged as well. From the debate between the "introns-early" and "introns-late" theories came the proposal that introns arose before the origin of genetically encoded proteins and DNA, and the more recent "introns-first" theory, which postulates the presence of introns at that early evolutionary stage from a reconstruction of the "RNA world." Here we review seminal and recent ideas about intron origins. Recent discoveries about the patterns and causes of intron evolution make this one of the most hotly debated and exciting topics in molecular evolutionary biology today.
Collapse
Affiliation(s)
- Francisco Rodríguez-Trelles
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525, USA.
| | | | | |
Collapse
|
13
|
Singh ND, Arndt PF, Petrov DA. Minor shift in background substitutional patterns in the Drosophila saltans and willistoni lineages is insufficient to explain GC content of coding sequences. BMC Biol 2006; 4:37. [PMID: 17049096 PMCID: PMC1626080 DOI: 10.1186/1741-7007-4-37] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2006] [Accepted: 10/18/2006] [Indexed: 11/10/2022] Open
Abstract
Background Several lines of evidence suggest that codon usage in the Drosophila saltans and D. willistoni lineages has shifted towards a less frequent use of GC-ending codons. Introns in these lineages show a parallel shift toward a lower GC content. These patterns have been alternatively ascribed to either a shift in mutational patterns or changes in the definition of preferred and unpreferred codons in these lineages. Results and discussion To gain additional insight into this question, we quantified background substitutional patterns in the saltans/willistoni group using inactive copies of a novel, Q-like retrotransposable element. We demonstrate that the pattern of background substitutions in the saltans/willistoni lineage has shifted to a significant degree, primarily due to changes in mutational biases. These differences predict a lower equilibrium GC content in the genomes of the saltans/willistoni species compared with that in the D. melanogaster species group. The magnitude of the difference can readily account for changes in intronic GC content, but it appears insufficient to explain changes in codon usage within the saltans/willistoni lineage. Conclusion We suggest that the observed changes in codon usage in the saltans/willistoni clade reflects either lineage-specific changes in the definitions of preferred and unpreferred codons, or a weaker selective pressure on codon bias in this lineage.
Collapse
Affiliation(s)
- Nadia D Singh
- Department of Biological Sciences, Stanford University, 371 Serra Mall, Stanford, CA 94305, USA
| | - Peter F Arndt
- Max Planck for Molecular Genetics, 14195 Berlin, Germany
| | - Dmitri A Petrov
- Department of Biological Sciences, Stanford University, 371 Serra Mall, Stanford, CA 94305, USA
| |
Collapse
|
14
|
Ko WY, Piao S, Akashi H. Strong regional heterogeneity in base composition evolution on the Drosophila X chromosome. Genetics 2006; 174:349-62. [PMID: 16547109 PMCID: PMC1569809 DOI: 10.1534/genetics.105.054346] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2005] [Accepted: 05/08/2006] [Indexed: 11/18/2022] Open
Abstract
Fluctuations in base composition appear to be prevalent in Drosophila and mammal genome evolution, but their timescale, genomic breadth, and causes remain obscure. Here, we study base composition evolution within the X chromosomes of Drosophila melanogaster and five of its close relatives. Substitutions were inferred on six extant and two ancestral lineages for 14 near-telomeric and 9 nontelomeric genes. GC content evolution is highly variable both within the genome and within the phylogenetic tree. In the lineages leading to D. yakuba and D. orena, GC content at silent sites has increased rapidly near telomeres, but has decreased in more proximal (nontelomeric) regions. D. orena shows a 17-fold excess of GC-increasing vs. AT-increasing synonymous changes within a small (approximately 130-kb) region close to the telomeric end. Base composition changes within introns are consistent with changes in mutation patterns, but stronger GC elevation at synonymous sites suggests contributions of natural selection or biased gene conversion. The Drosophila yakuba lineage shows a less extreme elevation of GC content distributed over a wider genetic region (approximately 1.2 Mb). A lack of change in GC content for most introns within this region suggests a role of natural selection in localized base composition fluctuations.
Collapse
Affiliation(s)
- Wen-Ya Ko
- Institute of Molecular Evolutionary Genetics and Department of Biology, Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | | | | |
Collapse
|
15
|
Rodríguez-Trelles F, Tarrío R, Ayala FJ. Models of spliceosomal intron proliferation in the face of widespread ectopic expression. Gene 2006; 366:201-8. [PMID: 16288838 DOI: 10.1016/j.gene.2005.09.004] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2005] [Revised: 08/04/2005] [Accepted: 09/02/2005] [Indexed: 11/27/2022]
Abstract
It is now certain that today living organisms can acquire new spliceosomal introns in their genes. The proposed sources of spliceosomal introns are exons, transposons, and other introns, including spliceosomal and group II self-splicing introns. Spliceosomal introns are thought to be the most likely source, because the inserted sequence would immediately be endowed with the essential set of intron recognition sequences, thereby preventing the deleterious effects associated with incorrect splicing. The most obvious spliceosomal intron duplication pathways involve an RNA transcript intermediate step. Therefore, for a spliceosomal intron to be originated by duplication, either the source gene from which the novel intron is derived, or that gene and the recipient gene, which contains the novel intron, would need to be expressed in the germ line. Intron proliferation surveys indicate that putative intron duplicate-containing genes do not always match detectable expression in the germ line, which casts doubt on the generality of the duplication model. However, judging mechanisms of intron gain (or loss) from present-day gene expression profiles could be erroneous, if expression patterns were different at the time the introns arose. In fact, this may likely be so in most cases. Ectopic expression, i.e., the expression of genes at times and locations where the target gene is not known to have a function, is a much more common phenomenon than previously realized. We conclude with a speculation on a possible interplay between spliceosomal introns and ectopic expression at the origin of multicellularity.
Collapse
Affiliation(s)
- Francisco Rodríguez-Trelles
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525, USA.
| | | | | |
Collapse
|
16
|
Simmons MP, Carr TG, O'Neill K. Relative character-state space, amount of potential phylogenetic information, and heterogeneity of nucleotide and amino acid characters. Mol Phylogenet Evol 2005; 32:913-26. [PMID: 15288066 DOI: 10.1016/j.ympev.2004.04.011] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2003] [Revised: 03/10/2004] [Indexed: 11/16/2022]
Abstract
We examined a broad selection of protein-coding loci from a diverse array of clades and genomes to quantify three factors that determine whether nucleotide or amino acid characters should be preferred for phylogenetic inference. First, we quantified the difference in observed character-state space between nucleotides and amino acids. Second, we quantified the loss of potential phylogenetic signal from silent substitutions when amino acids are used. Third, we used the disparity index to quantify the relative compositional heterogeneity of nucleotides and amino acids and then determined how commonly convergent (rather than unique) shifts in nucleotide and amino acid composition occur in a phylogenetic context. The greater potential phylogenetic signal for nucleotide characters was found to be enormous (on average 440% that of amino acids), whereas the greater observed character-state space for amino acids was less impressive (on average 150.4% that of nucleotides). While matrices of amino acid sequences had less compositional heterogeneity than their corresponding nucleotide sequences, heterogeneity in amino acid composition may be more homoplasious than heterogeneity in nucleotide composition. Given the ability of increased taxon sampling to better utilize the greater potential phylogenetic signal of nucleotide characters and decrease the potential for artifacts caused by heterogeneous nucleotide composition among taxa, we suggest that increased taxon sampling be performed whenever possible instead of restricting analyses to amino acid characters.
Collapse
Affiliation(s)
- Mark P Simmons
- Department of Biology, Colorado State University, Fort Collins, CO 80523, USA.
| | | | | |
Collapse
|
17
|
Polachini de Castro J, Carareto CMA. P elements in the saltans group of Drosophila: a new evaluation of their distribution and number of genomic insertion sites. Mol Phylogenet Evol 2005; 32:383-7. [PMID: 15186822 DOI: 10.1016/j.ympev.2004.01.005] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2003] [Revised: 01/02/2004] [Indexed: 11/28/2022]
Abstract
Few are studies on P elements that have addressed the saltans group. These studies had shown that species from the cordata and elliptica subgroups were devoid of any discernible P homologous sequences, while species from the parasaltans, sturtevanti, and saltans subgroups all contain P element sequences. Our analyses showed the presence of one to 15 P element insertion sites in species of the saltans group, including Drosophila neocordata and Drosophila emarginata (cordata and elliptica subgroups, respectively). From these species, only those from the parasaltans, sturtevanti, and saltans subgroups harbor canonical P elements and, only those of the last two subgroups seem to harbor putative full-sized elements. Due to the low similarity of the sequences found in D. neocordata and D. emarginata to those earlier described, we suggest that these sequences might be rudimental P element derivatives that were present in the ancestral of the subgenus Sophophora.
Collapse
Affiliation(s)
- Juliana Polachini de Castro
- Departamento de Biologia, Universidade Estadual Paulista-UNESP, Rua Cristóvão Colombo 2265, CEP 15054-000, São José do Rio Preto-SP, Brazil
| | | |
Collapse
|
18
|
Costas J, Pereira PS, Vieira CP, Pinho S, Vieira J, Casares F. Dynamics and function of intron sequences of the wingless gene during the evolution of the Drosophila genus. Evol Dev 2004; 6:325-35. [PMID: 15330865 DOI: 10.1111/j.1525-142x.2004.04040.x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
To understand the function and evolution of genes with complex patterns of expression, such as the Drosophila wingless gene, it is essential to know how their transcription is regulated. However, extracting the relevant regulatory information from a genome is still a complex task. We used a combination of comparative genomics and functional approaches to identify putative regulatory sequences in two introns (1 and 3) of the wingless gene and to infer their evolution. Comparison of the sequences obtained from several Drosophila species revealed colinear and well-conserved sequence blocks in both introns. Drosophila willistoni showed a rate of evolution, in both introns, faster than expected from its phylogenetic position. Intron 3 appeared to be composed of two separate modules, one of them lost in the willistoni group. We tested whether sequence conservation in noncoding regions is a reliable indicator of regulatory function and, if this function is conserved, by analyzing D. melanogaster transgenic reporter lines harboring intron 3 sequences from D. melanogaster (Sophophora subgenus) and the species from the Drosophila subgenus presenting the most divergent sequence, D. americana. The analysis indicated that intron 3 contains pupal enhancers conserved during the evolution of the genus, despite the fact that only 30% of the D. melanogaster intron 3 sequences lie in conserved blocks. Additional analysis of D. melanogaster transgenic reporter lines harboring intron 3 sequences from D. willistoni revealed the absence of an abdomen-specific expression pattern, probably due to the above-mentioned loss of a regulatory module in this species.
Collapse
Affiliation(s)
- J Costas
- Instituto de Biologia Molecular e Celular (IBMC), Universidade do Porto, Rua do Campo Alegre 823, Porto 4150-180, Portugal
| | | | | | | | | | | |
Collapse
|
19
|
Rodríguez-Trelles F, Tarrío R, Ayala FJ. Convergent neofunctionalization by positive Darwinian selection after ancient recurrent duplications of the xanthine dehydrogenase gene. Proc Natl Acad Sci U S A 2003; 100:13413-7. [PMID: 14576276 PMCID: PMC263828 DOI: 10.1073/pnas.1835646100] [Citation(s) in RCA: 111] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Gene duplication is a primary source of molecular substrate for the emergence of evolutionary novelties. The chances for redundant gene sequences to evolve new functions are small compared with the probability that the copies become disabled by deleterious mutations. Functional divergence after gene duplication can result in two alternative evolutionary fates: one copy acquires a novel function (neofunctionalization), or each copy adopts part of the tasks of their parental gene (subfunctionalization). The relative prevalence of each outcome is unknown. Similarly unknown is the relative importance of positive selection versus random fixation of neutral mutations. Aldehyde oxidase (Ao) and xanthine dehydrogenase (Xdh) genes encode two complex members of the xanthine oxidase family of molybdo-flavoenzymes that carry different functions. Ao is known to have originated from a duplicate of an Xdh gene in eukaryotes, before the origin of multicellularity. We show that (i) Ao evolved independently twice from two different Xdh paralogs, the second time in the chordates, before the diversification of vertebrates; (ii) after each duplication, the Ao duplicate underwent a period of rapid evolution during which identical sites across the two molecules, involving the flavin adenine dinucleotide and substrate-binding pockets, were subjected to intense positive Darwinian selection; and (iii) the second Ao gene likely endured two periods of redundancy, initially as a duplicate of Xdh and later as a functional equivalent of the old Ao, which is currently absent from the vertebrate genome. Caution is appropriate in structural genomics when using sequence similarity for assigning protein function.
Collapse
|
20
|
Tarrio R, Rodríguez-Trelles F, Ayala FJ. A new Drosophila spliceosomal intron position is common in plants. Proc Natl Acad Sci U S A 2003; 100:6580-3. [PMID: 12750476 PMCID: PMC164489 DOI: 10.1073/pnas.0731952100] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The 25-year-old debate about the origin of introns between proponents of "introns early" and "introns late" has yielded significant advances, yet important questions remain to be ascertained. One question concerns the density of introns in the last common ancestor of the three multicellular kingdoms. Approaches to this issue thus far have relied on counts of the numbers of identical intron positions across present-day taxa on the assumption that the introns at those sites are orthologous. However, dismissing parallel intron gain for those sites may be unwarranted, because various factors can potentially constrain the site of intron insertion. Demonstrating parallel intron gain is severely handicapped, because intron sequences often evolve exceedingly fast and intron phylogenetic distributions are usually ambiguous, such that alternative loss and gain scenarios cannot be clearly distinguished. We have identified an intron position that was gained independently in animals and plants in the xanthine dehydrogenase gene. The extremely disjointed phylogenetic distribution of the intron argues strongly for separate gain rather than recurrent loss. If the observed phylogenetic pattern had resulted from recurrent loss, all observational support previously gathered for the introns-late theory of intron origins based on the phylogenetic distribution of introns would be invalidated.
Collapse
Affiliation(s)
- Rosa Tarrio
- Department of Ecology and Evolutionary Biology, University of California, Irvine 92697-2525, USA
| | | | | |
Collapse
|
21
|
Matsuo Y. Evolution of the GC content of the histone 3 gene in seven Drosophila species. Genes Genet Syst 2003; 78:309-18. [PMID: 14532710 DOI: 10.1266/ggs.78.309] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
The molecular evolution of the histone multigene family was studied by cloning and determining the nucleotide sequences of the histone 3 genes in seven Drosophila species, D. takahashii, D. lutescens, D. ficusphila, D. persimilis, D.pseudoobscura, D. americana and D. immigrans. CT repeats, a TATA box and an AGTG motif in the 5' region, and a hairpin loop and purine-rich motifs (CAA(T/G)GAGA) in the 3' region were conserved even in distantly related species. In D. hydei and D.americana, the GC content at the third codon position in the protein coding region was relatively low (49% and 45%), while in D. takahashii and D. lutescens it was relatively high (64% and 65%). The non- significant correlation between the GC contents in the 3' region and at the third codon position as well as the evidence of less constraint in the 3' region suggested that mutational bias may not be the major mechanism responsible for the biased nucleotide change at the third codon position or for codon usage bias.
Collapse
Affiliation(s)
- Yoshinori Matsuo
- Laboratory of Adaptive Evolution, Faculty of Integrated Arts and Sciences, The University of Tokushima, Tokushima, Japan.
| |
Collapse
|
22
|
Begun DJ, Whitley P. Molecular population genetics of Xdh and the evolution of base composition in Drosophila. Genetics 2002; 162:1725-35. [PMID: 12524344 PMCID: PMC1462376 DOI: 10.1093/genetics/162.4.1725] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Few loci have been measured for DNA polymorphism and divergence in several species. Here we report such data from the protein-coding region of xanthine dehydrogenase (Xdh) in 22 species of Drosophila. Many of our samples were from closely related species, allowing us to confidently assign substitutions to individual lineages. Surprisingly, Xdh appears to be fixing more A/T mutations than G/C mutations in most lineages, leading to evolution of higher A/T content in the recent past. We found no compelling evidence for selection on protein variation, though some aspects of the data support the notion that a significant fraction of amino acid polymorphisms are slightly deleterious. Finally, we found no convincing evidence that levels of silent heterozygosity are associated with rates of protein evolution.
Collapse
Affiliation(s)
- David J Begun
- Section of Integrative Biology, University of Texas, Austin, Texas 78712, USA.
| | | |
Collapse
|
23
|
Wright SI, Lauga B, Charlesworth D. Rates and patterns of molecular evolution in inbred and outbred Arabidopsis. Mol Biol Evol 2002; 19:1407-20. [PMID: 12200469 DOI: 10.1093/oxfordjournals.molbev.a004204] [Citation(s) in RCA: 167] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The evolution of self-fertilization is associated with a large reduction in the effective rate of recombination and a corresponding decline in effective population size. If many spontaneous mutations are slightly deleterious, this shift in the breeding system is expected to lead to a reduced efficacy of natural selection and genome-wide changes in the rates of molecular evolution. Here, we investigate the effects of the breeding system on molecular evolution in the highly self-fertilizing plant Arabidopsis thaliana by comparing its coding and noncoding genomic regions with those of its close outcrossing relative, the self-incompatible A. lyrata. More distantly related species in the Brassicaceae are used as outgroups to polarize the substitutions along each lineage. In contrast to expectations, no significant difference in the rates of protein evolution is observed between selfing and outcrossing Arabidopsis species. Similarly, no consistent overall difference in codon bias is observed between the species, although for low-biased genes A. lyrata shows significantly higher major codon usage. There is also evidence of intron size evolution in A. thaliana, which has consistently smaller introns than its outcrossing congener, potentially reflecting directional selection on intron size. The results are discussed in the context of heterogeneity in selection coefficients across loci and the effects of life history and population structure on rates of molecular evolution. Using estimates of substitution rates in coding regions and approximate estimates of divergence and generation times, the genomic deleterious mutation rate (U) for amino acid substitutions in Arabidopsis is estimated to be approximately 0.2-0.6 per generation.
Collapse
Affiliation(s)
- Stephen I Wright
- Institute of Cell, Animal, and Population Biology, Ashworth Laboratories, University of Edinburgh.
| | | | | |
Collapse
|
24
|
Bergman CM, Pfeiffer BD, Rincón-Limas DE, Hoskins RA, Gnirke A, Mungall CJ, Wang AM, Kronmiller B, Pacleb J, Park S, Stapleton M, Wan K, George RA, de Jong PJ, Botas J, Rubin GM, Celniker SE. Assessing the impact of comparative genomic sequence data on the functional annotation of the Drosophila genome. Genome Biol 2002; 3:RESEARCH0086. [PMID: 12537575 PMCID: PMC151188 DOI: 10.1186/gb-2002-3-12-research0086] [Citation(s) in RCA: 103] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2002] [Revised: 11/25/2002] [Accepted: 12/05/2002] [Indexed: 11/15/2022] Open
Abstract
BACKGROUND It is widely accepted that comparative sequence data can aid the functional annotation of genome sequences; however, the most informative species and features of genome evolution for comparison remain to be determined. RESULTS We analyzed conservation in eight genomic regions (apterous, even-skipped, fushi tarazu, twist, and Rhodopsins 1, 2, 3 and 4) from four Drosophila species (D. erecta, D. pseudoobscura, D. willistoni, and D. littoralis) covering more than 500 kb of the D. melanogaster genome. All D. melanogaster genes (and 78-82% of coding exons) identified in divergent species such as D. pseudoobscura show evidence of functional constraint. Addition of a third species can reveal functional constraint in otherwise non-significant pairwise exon comparisons. Microsynteny is largely conserved, with rearrangement breakpoints, novel transposable element insertions, and gene transpositions occurring in similar numbers. Rates of amino-acid substitution are higher in uncharacterized genes relative to genes that have previously been studied. Conserved non-coding sequences (CNCSs) tend to be spatially clustered with conserved spacing between CNCSs, and clusters of CNCSs can be used to predict enhancer sequences. CONCLUSIONS Our results provide the basis for choosing species whose genome sequences would be most useful in aiding the functional annotation of coding and cis-regulatory sequences in Drosophila. Furthermore, this work shows how decoding the spatial organization of conserved sequences, such as the clustering of CNCSs, can complement efforts to annotate eukaryotic genomes on the basis of sequence conservation alone.
Collapse
Affiliation(s)
- Casey M Bergman
- Berkeley Drosophila Genome Project, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA
- These authors contributed equally to this work
| | - Barret D Pfeiffer
- Berkeley Drosophila Genome Project, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA
- These authors contributed equally to this work
| | - Diego E Rincón-Limas
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
- Current address: Departamento de Biologia Molecular, Universidad Autonoma de Tamaulipas-UAMRA, Reynosa, CP 88740, Mexico
| | - Roger A Hoskins
- Berkeley Drosophila Genome Project, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA
| | | | - Chris J Mungall
- Howard Hughes Medical Institute, Department of Molecular and Cellular Biology, University of California, Berkeley, CA 94720, USA
| | - Adrienne M Wang
- Berkeley Drosophila Genome Project, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA
- Current address: Department of Physiology, University of California, San Francisco, CA 94143, USA
| | - Brent Kronmiller
- Berkeley Drosophila Genome Project, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA
- Current address: Department of Bioinformatics and Computational Biology, Iowa State University, Ames, IA 50011, USA
| | - Joanne Pacleb
- Berkeley Drosophila Genome Project, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA
| | - Soo Park
- Berkeley Drosophila Genome Project, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA
| | - Mark Stapleton
- Berkeley Drosophila Genome Project, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA
| | - Kenneth Wan
- Berkeley Drosophila Genome Project, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA
| | - Reed A George
- Berkeley Drosophila Genome Project, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA
| | - Pieter J de Jong
- Children's Hospital and Research Center at Oakland, Oakland, CA 94609, USA
| | - Juan Botas
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Gerald M Rubin
- Berkeley Drosophila Genome Project, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA
- Howard Hughes Medical Institute, Department of Molecular and Cellular Biology, University of California, Berkeley, CA 94720, USA
| | - Susan E Celniker
- Berkeley Drosophila Genome Project, Lawrence Berkeley National Laboratory, One Cyclotron Road, Berkeley, CA 94720, USA
| |
Collapse
|
25
|
Tsunemoto K, Matsuo Y. Molecular evolutionary analysis of a histone gene repeating unit from Drosophila simulans. Genes Genet Syst 2001; 76:355-61. [PMID: 11922104 DOI: 10.1266/ggs.76.355] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
A repeating unit of the histone gene cluster from Drosophila simulans containing the H1, H2A, H2B and H4 genes (the H3 gene region has already been analyzed) was cloned and analyzed. A nucleotide sequence of about 4.6 kbp was determined to study the nucleotide divergence and molecular evolution of the histone gene cluster. Comparison of the structure and nucleotide sequence with those of Drosophila melanogaster showed that the four histone genes were located at identical positions and in the same directions. The proportion of different nucleotide sites was 6.3% in total. The amino acid sequence of H1 was divergent, with a 5.1% difference. However, no amino acid change has been observed for the other three histone proteins. Analysis of the GC contents and the base substitution patterns in the two lineages, D. melanogaster and D. simulans, with a common ancestor showed the following. 1) A strong negative correlation was found between the GC content and the nucleotide divergence in the whole repeating unit. 2) The mode of molecular evolution previously found for the H3 gene was also observed for the whole repeating unit of histone genes; the nucleotide substitutions were stationary in the 3' and spacer regions, and there was a directional change of the codon usage to the AT-rich codons. 3) No distinct difference in the mode or pattern of molecular evolution was detected for the histone gene repeating unit in the D. melanogaster and D. simulans lineages. These results suggest that selectional pressure for the coding regions of histones, which eliminate A and T, is less effective in the D. melanogaster and D. simulans lineages than in the other GC-rich species.
Collapse
Affiliation(s)
- K Tsunemoto
- Laboratory of Adaptive Evolution, Faculty of Integrated Arts and Sciences, The University of Tokushima, Japan
| | | |
Collapse
|
26
|
Tarrío R, Rodríguez-Trelles F, Ayala FJ. Shared nucleotide composition biases among species and their impact on phylogenetic reconstructions of the Drosophilidae. Mol Biol Evol 2001; 18:1464-73. [PMID: 11470837 DOI: 10.1093/oxfordjournals.molbev.a003932] [Citation(s) in RCA: 62] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Compositional changes are a major feature of genome evolution. Overlooking nucleotide composition differences among sequences can seriously mislead phylogenetic reconstructions. Large compositional variation exists among the members of the family Drosophilidae. Until now, however, base composition differences have been largely neglected in the formulations of the nucleotide substitution process used to reconstruct the phylogeny of this important group of species. The present study adopts a maximum-likelihood framework of phylogenetic inference in order to analyze five nuclear gene regions and shows that (1) the pattern of compositional variation in the Drosophilidae does not match the phylogeny of the species; (2) accounting for the heterogeneous GC content with Galtier and Gouy's nucleotide substitution model leads to a tree that differs in significant aspects from the tree inferred when the nucleotide composition differences are ignored, even though both phylogenetic hypotheses attain strong nodal support in the bootstrap analyses; and (3) the LogDet distance correction cannot completely overcome the distorting effects of the compositional variation that exists among the species of the Drosophilidae. Our analyses confidently place the Chymomyza genus as an outgroup closer than the genus Scaptodrosophila to the Drosophila genus and conclusively support the monophyly of the Sophophora subgenus.
Collapse
Affiliation(s)
- R Tarrío
- Department of Ecology and Evolutionary Biology, University of California at Irvine, 92697-2525, USA
| | | | | |
Collapse
|
27
|
Takano-Shimizu T. Local changes in GC/AT substitution biases and in crossover frequencies on Drosophila chromosomes. Mol Biol Evol 2001; 18:606-19. [PMID: 11264413 DOI: 10.1093/oxfordjournals.molbev.a003841] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
I present here evidence of remarkable local changes in GC/AT substitution biases and in crossover frequencies on Drosophila chromosomes. The substitution pattern at 10 loci in the telomeric region of the X chromosome was studied for four species of the Drosophila melanogaster species subgroup. Drosophila orena and Drosophila erecta are clearly the most closely related species pair (the erecta complex) among the four species studied; however, the overall data at the 10 loci revealed a clear dichotomy in the silent substitution patterns between the AT-biased- substitution melanogaster and erecta lineages and the GC-biased-substitution yakuba and orena lineages, suggesting two or more independent changes in GC/AT substitution biases. More importantly, the results indicated a between- loci heterogeneity in GC/AT substitution bias in this small region independently in the yakuba and orena lineages. Indeed, silent substitutions in the orena lineage were significantly biased toward G and C at the consecutive yellow, lethal of scute, and asense loci, but they were significantly biased toward A and T at sta. The substitution bias toward G and C was centered in different areas in yakuba (significantly biased at EG:165H7.3, EG:171D11.2, and suppressor of sable). The similar silent substitution patterns in coding and noncoding regions, furthermore, suggested mutational biases as a cause of the substitution biases. On the other hand, previous study reveals that Drosophila yakuba has about 20-fold higher crossover frequencies in the telomeric region of the X chromosome than does D. melanogaster; this study revealed that the total genetic map length of the yakuba X chromosome was only about 1.5 times as large as that of melanogaster and that the map length of the X-telomeric y-sta region did not differ between Drosophila yakuba and D. erecta. Taken together, the data strongly suggested that an approximately 20- fold reduction in the X-telomeric crossover frequencies occurred in the ancestral population of D. melanogaster after the melanogaster-yakuba divergence but before the melanogaster-simulans divergence.
Collapse
Affiliation(s)
- T Takano-Shimizu
- Department of Population Genetics, National Institute of Genetics, Mishima, Shizuoka-ken, Japan.
| |
Collapse
|
28
|
Gailey DA, Ho SK, Ohshima S, Liu JH, Eyassu M, Washington MA, Yamamoto D, Davis T. A phylogeny of the Drosophilidae using the sex-behaviour gene fruitless. Hereditas 2001; 133:81-3. [PMID: 11206858 DOI: 10.1111/j.1601-5223.2000.00081.x] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open
Affiliation(s)
- D A Gailey
- Department of Biological Sciences, California State University, Hayward 94542, USA
| | | | | | | | | | | | | | | |
Collapse
|
29
|
Dunn KA, Bielawski JP, Yang Z. Substitution rates in Drosophila nuclear genes: implications for translational selection. Genetics 2001; 157:295-305. [PMID: 11139510 PMCID: PMC1461466 DOI: 10.1093/genetics/157.1.295] [Citation(s) in RCA: 58] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The relationships between synonymous and nonsynonymous substitution rates and between synonymous rate and codon usage bias are important to our understanding of the roles of mutation and selection in the evolution of Drosophila genes. Previous studies used approximate estimation methods that ignore codon bias. In this study we reexamine those relationships using maximum-likelihood methods to estimate substitution rates, which accommodate the transition/transversion rate bias and codon usage bias. We compiled a sample of homologous DNA sequences at 83 nuclear loci from Drosophila melanogaster and at least one other species of Drosophila. Our analysis was consistent with previous studies in finding that synonymous rates were positively correlated with nonsynonymous rates. Our analysis differed from previous studies, however, in that synonymous rates were unrelated to codon bias. We therefore conducted a simulation study to investigate the differences between approaches. The results suggested that failure to properly account for multiple substitutions at the same site and for biased codon usage by approximate methods can lead to an artifactual correlation between synonymous rate and codon bias. Implications of the results for translational selection are discussed.
Collapse
Affiliation(s)
- K A Dunn
- Department of Biology, Galton Laboratory, University College, London NW1 2HE, United Kingdom.
| | | | | |
Collapse
|
30
|
Abstract
Study of the nucleotide composition in Drosophila, focusing on the saltans and willistoni groups, has revealed unanticipated differences in nucleotide composition among lineages. Compositional differences are associated with an accelerated rate of nucleotide substitution in functionally less constrained regions. These observations have been set forth against the extended opinion that the pattern of point mutation has remained constant during the evolution of the genus. A crucial assumption has been that the most recent common ancestor of the subgenus Sophophora had an elevated GC content. Until now, this assumption has been supported by indirect arguments, consisting of extrapolations from closely related outgroups and limited by the robustness of mathematical descriptions concerning the extensive nucleotide composition differences among sequences. The present study seeks to test the assumption of a high ancestral GC content using realistic representations of the nucleotide substitution process to account for potential biases induced by the heterogeneous GC content of the taxa. The analysis of eight nuclear genes unambiguously corroborates that the common ancestor of Sophophora had an elevated GC content.
Collapse
Affiliation(s)
- F Rodríguez-Trelles
- Department of Ecology and Evolutionary Biology, University of California at Irvine, Irvine, CA 92697-2525, USA.
| | | | | |
Collapse
|
31
|
Singer GA, Hickey DA. Nucleotide bias causes a genomewide bias in the amino acid composition of proteins. Mol Biol Evol 2000; 17:1581-8. [PMID: 11070046 DOI: 10.1093/oxfordjournals.molbev.a026257] [Citation(s) in RCA: 183] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We analyzed the nucleotide contents of several completely sequenced genomes, and we show that nucleotide bias can have a dramatic effect on the amino acid composition of the encoded proteins. By surveying the genes in 21 completely sequenced eubacterial and archaeal genomes, along with the entire Saccharomyces cerevisiae genome and two Plasmodium falciparum chromosomes, we show that biased DNA encodes biased proteins on a genomewide scale. The predicted bias affects virtually all genes within the genome, and it could be clearly seen even when we limited the analysis to sets of homologous gene sequences. Parallel patterns of compositional bias were found within the archaea and the eubacteria. We also found a positive correlation between the degree of amino acid bias and the magnitude of protein sequence divergence. We conclude that mutational bias can have a major effect on the molecular evolution of proteins. These results could have important implications for the interpretation of protein-based molecular phylogenies and for the inference of functional protein adaptation from comparative sequence data.
Collapse
Affiliation(s)
- G A Singer
- Department of Biology, University of Ottawa, Ottawa, Ontario, Canada
| | | |
Collapse
|
32
|
Matsuo Y. Molecular evolution of the histone 3 multigene family in the Drosophila melanogaster species subgroup. Mol Phylogenet Evol 2000; 16:339-43. [PMID: 10991787 DOI: 10.1006/mpev.2000.0810] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Molecular evolution of the histone multigene family was studied by cloning and sequencing regions of the histone 3 gene in the Drosophila melanogaster species subgroup. Analysis of the nucleotide substitution pattern showed that in the coding region synonymous changes occurred more frequently to A or T in contrast to the GC-rich base composition, while in the 3' region the nucleotide substitutions were most likely in equilibrium. These results suggested that the base composition at the third codon position of the H3 gene, i.e., codon usage, has been changing to A or T in the Drosophila melanogaster species subgroup.
Collapse
Affiliation(s)
- Y Matsuo
- Laboratory of Adaptive Evolution, Department of Mathematical and Natural Sciences, Faculty of Integrated Arts and Sciences, The University of Tokushima, Tokushima 770-8502, Japan.
| |
Collapse
|
33
|
Tarrío R, Rodríguez-Trelles F, Ayala FJ. Tree rooting with outgroups when they differ in their nucleotide composition from the ingroup: the Drosophila saltans and willistoni groups, a case study. Mol Phylogenet Evol 2000; 16:344-9. [PMID: 10991788 DOI: 10.1006/mpev.2000.0813] [Citation(s) in RCA: 45] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Rooting is frequently the most precarious step in any phylogenetic analysis. Outgroups can become useless for rooting if they are too distantly related to the ingroup. Specifically, little attention has been paid to scenarios where outgroups have evolved different nucleotide frequencies from the ingroup. We investigate one empirical example that arose seeking to determine the phylogenetic relationship between the saltans and the willistoni groups of Drosophila (subgenus Sophophora). We have analyzed 2085 coding nucleotides from the xanthine dehydrogenase (Xdh) gene in 14 species, 6 from the saltans group and 8 from the willistoni group. We adopt a two-step strategy: (1) we investigate the phylogeny without outgroups, rooting the network by the midpoint method; (2) we reinvestigate the rooting of this phylogeny using predefined outgroups in both a parsimony- and a model-based maximum-likelihood framework. A satisfactory description of the substitution process along the Xdh region calls for six substitution types and substitution rate variation among codon positions. When the ingroup sequences are considered alone, the phylogeny obtained using this description corroborates the known relationships derived from anatomical criteria. Inclusion of the outgroups makes the root unstable, apparently because of differences between ingroups and outgroups in the substitution processes; these differences are better accounted for by a simplified model of evolution than by more complex, realistic descriptions of the substitution process.
Collapse
Affiliation(s)
- R Tarrío
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525, USA
| | | | | |
Collapse
|
34
|
Abstract
Base composition varies at all levels of the phylogenetic hierarchy and throughout the genome, and can be caused by active selection or passive mutation pressure. This variation can make reconstructing trees difficult. However, recent tree-based analyses have shed light on the forces responsible for the evolution of base composition, forces that might be very general. More explicit tree-based work is encouraged.
Collapse
|
35
|
Conticello SG, Pilpel Y, Glusman G, Fainzilber M. Position-specific codon conservation in hypervariable gene families. Trends Genet 2000; 16:57-9. [PMID: 10652528 DOI: 10.1016/s0168-9525(99)01956-3] [Citation(s) in RCA: 40] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- S G Conticello
- Laboratory of Molecular Neurobiology, Department of Biological Chemistry, Weizmann Institute of Science, 76100 Rehovot, Israel.
| | | | | | | |
Collapse
|