1
Adhikari TB, Ingram T, Halterman D, Louws FJ. Gene Genealogies Reveal High Nucleotide Diversity and Admixture Haplotypes Within Three Alternaria Species Associated with Tomato and Potato. PHYTOPATHOLOGY 2020; 110:1449-1464. [PMID: 32202481 DOI: 10.1094/phyto-12-19-0487-r] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Indexed: 06/10/2023]
Abstract
Early blight (EB) and leaf blight are two destructive diseases of tomato in North Carolina (NC), caused by Alternaria linariae and A. alternata, respectively. During the last decade, EB caused by A. solani has increased in potato-producing areas in Wisconsin (WI). We collected 152 isolates of three Alternaria spp. associated with tomato and potato in NC and WI and used the gene genealogical approach to compare the genetic relationships among them. Two nuclear genes, glyceraldehyde-3-phosphate dehydrogenase (GPDH) and RNA polymerase second largest subunit (RPB2), and the rDNA internal transcribed spacer (ITS) region of these isolates were sequenced. In addition, sequences of the GPDH locus from international isolates described in previous studies were included for comparison. A set of single nucleotide polymorphisms was assembled to identify locus-specific and species-specific haplotypes. Nucleotide diversity varied among gene sequences and species analyzed. For example, the estimates of nucleotide diversity and Watterson's theta were higher in A. alternata than in A. linariae and A. solani. There were few or no polymorphisms in the ITS sequences, which restricted haplotype placement. The RPB2 sequences were less informative for detecting haplotype diversity in A. linariae and A. solani, yet six haplotypes were detected in A. alternata. The GPDH sequences enabled strongly supported phylogenetic inferences with the highest haplotype diversity and belonged to five haplotypes (AaH1 to AaH5), which consisted of only A. alternata from NC. However, 13 haplotypes were identified within and among A. linariae and A. solani sequences. Among them, six (AsAlH1 to AsAlH6) were identical to previously reported haplotypes in global samples and the remaining were new haplotypes. The most divergent haplotypes were AaH1, AsAlH2/AsAlH3, and AsAlH4 and consisted exclusively of A. alternata, A. linariae, and A. solani, respectively. Neutrality tests suggested an excess of mutations and population expansion, and selection may play an important role in nucleotide diversity of Alternaria spp.
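The two diversity statistics named in this abstract, nucleotide diversity and Watterson's theta, can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names and the toy alignment are hypothetical, and the code assumes equal-length, gap-free sequences with no ambiguous bases.

```python
from itertools import combinations


def segregating_sites(seqs):
    """Count alignment columns that contain more than one distinct base."""
    return sum(1 for column in zip(*seqs) if len(set(column)) > 1)


def nucleotide_diversity(seqs):
    """Per-site nucleotide diversity (pi): mean pairwise differences / length."""
    length = len(seqs[0])
    pairs = list(combinations(seqs, 2))
    total_diffs = sum(sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs)
    return total_diffs / (len(pairs) * length)


def watterson_theta(seqs):
    """Per-site Watterson estimator: S / (a_n * L), a_n = sum of 1/i for i < n."""
    n = len(seqs)
    a_n = sum(1.0 / i for i in range(1, n))
    return segregating_sites(seqs) / (a_n * len(seqs[0]))


# Hypothetical toy alignment (illustrative only, not data from the study).
toy_alignment = ["ACGTACGTAC", "ACGTACGTAT", "ACGAACGTAC", "ACGTACGTAC"]
pi = nucleotide_diversity(toy_alignment)
theta_w = watterson_theta(toy_alignment)
```

Both estimators target the same population parameter under neutrality, so a gap between them is what neutrality tests such as Tajima's D examine.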
Affiliation(s)
- Tika B Adhikari
  - Department of Entomology and Plant Pathology, North Carolina State University, Raleigh, NC 27695
- Thomas Ingram
  - Department of Entomology and Plant Pathology, North Carolina State University, Raleigh, NC 27695
- Dennis Halterman
  - United States Department of Agriculture-Agricultural Research Service, Vegetable Crops Research Unit, Madison, WI 53706
- Frank J Louws
  - Department of Entomology and Plant Pathology, North Carolina State University, Raleigh, NC 27695
  - Department of Horticultural Science, North Carolina State University, Raleigh, NC 27695
2
Frequent nonallelic gene conversion on the human lineage and its effect on the divergence of gene duplicates. Proc Natl Acad Sci U S A 2017; 114:12779-12784. [PMID: 29138319 DOI: 10.1073/pnas.1708151114] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Indexed: 12/18/2022]
Abstract
Gene conversion is the copying of a genetic sequence from a "donor" region to an "acceptor." In nonallelic gene conversion (NAGC), the donor and the acceptor are at distinct genetic loci. Despite the role NAGC plays in various genetic diseases and the concerted evolution of gene families, the parameters that govern NAGC are not well characterized. Here, we survey duplicate gene families and identify converted tracts in 46% of them. These conversions reflect a large GC bias of NAGC. We develop a sequence evolution model that leverages substantially more information in duplicate sequences than used by previous methods and use it to estimate the parameters that govern NAGC in humans: a mean converted tract length of 250 bp and a probability of [Formula: see text] per generation for a nucleotide to be converted (an order of magnitude higher than the point mutation rate). Despite this high baseline rate, we show that NAGC slows down as duplicate sequences diverge, until an eventual "escape" of the sequences from its influence. As a result, NAGC has a small average effect on the sequence divergence of duplicates. This work improves our understanding of the NAGC mechanism and the role that it plays in the evolution of gene duplicates.
3
The Rate and Tract Length of Gene Conversion between Duplicated Genes. Genes (Basel) 2011; 2:313-31. [PMID: 24710193 PMCID: PMC3924818 DOI: 10.3390/genes2020313] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Received: 02/17/2011] [Revised: 03/11/2011] [Accepted: 03/17/2011] [Indexed: 11/26/2022]
Abstract
Interlocus gene conversion occurs such that a certain length of DNA fragment is non-reciprocally transferred (copied and pasted) between paralogous regions. To understand the rate and tract length of gene conversion, there are two major approaches. One is based on mutation-accumulation experiments, and the other uses natural DNA sequence variation. In this review, we overview the two major approaches and discuss their advantages and disadvantages. In addition, to demonstrate the importance of statistical analysis of empirical and evolutionary data for estimating tract length, we apply a maximum likelihood method to several data sets.
4
Kaye AC, Moyer JW, Parks EJ, Carbone I, Cubeta MA. Population genetic analysis of Tomato spotted wilt virus on peanut in North Carolina and Virginia. PHYTOPATHOLOGY 2011; 101:147-153. [PMID: 20839960 DOI: 10.1094/phyto-01-10-0035] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Indexed: 05/27/2023]
Abstract
Exploring the genetic diversity and evolutionary history of plant viruses is critical to understanding their ecology and epidemiology. In this study, maximum-likelihood and population genetics-based methods were used to investigate the population structure, genetic diversity, and sources of genetic variation in field isolates of Tomato spotted wilt virus (TSWV) from peanut in North Carolina and Virginia. Selected regions of the nucleocapsid, movement, and RNA-dependent RNA polymerase genes were amplified and sequenced to identify haplotypes and infer genetic relationships between isolates of TSWV with heuristic methods. The haplotype structure of each locus consisted of 1 or 2 predominant haplotypes and >100 haplotypes represented by a single isolate. No specific haplotypes were associated with geographic area, peanut cultivar, or year of isolation. The population was panmictic at the regional level and high levels of genetic diversity were observed among isolates. There was evidence for positive selection on single amino acids in each gene on a background of predominant purifying selection acting upon each locus. The results of compatibility analyses and the persistence of specific gene sequences in isolates collected over three field seasons suggest that recombination was occurring in the population. Estimates of the population mutation rate suggest that mutation has had a significant effect on the shaping of this population and, together with purifying selection, these forces have been the predominant evolutionary forces influencing the TSWV population in peanut in North Carolina and Virginia.
Affiliation(s)
- A C Kaye
- Department of Plant Pathology, P.O. Box 7616, North Carolina State University, Raleigh 27695, USA.
5
Lawson MJ, Jiao J, Fan W, Zhang L. A pattern analysis of gene conversion literature. Comp Funct Genomics 2010; 2009:761512. [PMID: 20148076 PMCID: PMC2817553 DOI: 10.1155/2009/761512] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Received: 09/25/2009] [Accepted: 11/15/2009] [Indexed: 11/18/2022]
Abstract
Gene conversion is an important biological process that involves the transfer of genetic (sequence) information from one gene to another. This can have a variety of effects on an organism, both short-term and long-term and both positive and detrimental. In an effort to better understand this process, we searched through over 3,000 abstracts that contain research on gene conversions, tagging the important data and analyzing what we extracted. Through this we established trends that give a better insight into gene conversion research and genetic research in general. Our results show the importance of the process and the importance of continuing gene conversion research.
Affiliation(s)
- Mark J. Lawson
  - Department of Computer Science, Virginia Tech, Blacksburg, VA 24061, USA
- Jian Jiao
  - Department of Computer Science, Virginia Tech, Blacksburg, VA 24061, USA
- Weiguo Fan
  - Department of Computer Science, Virginia Tech, Blacksburg, VA 24061, USA
- Liqing Zhang
  - Department of Computer Science, Virginia Tech, Blacksburg, VA 24061, USA
6
Abstract
Interlocus gene conversion can homogenize DNA sequences of duplicated regions with high homology. Such nonvertical events sometimes cause a misleading evolutionary interpretation of data when the effect of gene conversion is ignored. To avoid this problem, it is crucial to test the data for the presence of gene conversion. Here, we performed extensive simulations to compare four major methods to detect gene conversion. One might expect that the power increases with increase of the gene conversion rate. However, we found this is true for only two methods. For the other two, limited power is expected when gene conversion is too frequent. We suggest using multiple methods to minimize the chance of missing the footprint of gene conversion.
7
Rosendahl S, McGee P, Morton JB. Lack of global population genetic differentiation in the arbuscular mycorrhizal fungus Glomus mosseae suggests a recent range expansion which may have coincided with the spread of agriculture. Mol Ecol 2009; 18:4316-29. [PMID: 19765226 DOI: 10.1111/j.1365-294x.2009.04359.x] [Citation(s) in RCA: 74] [Impact Index Per Article: 4.9] [Indexed: 11/27/2022]
Abstract
The arbuscular mycorrhizal fungus Glomus mosseae is commonly found in agricultural fields. The cosmopolitan species is found in Africa, Europe, America, Asia and Australia. Three hypotheses may explain this worldwide distribution: First, speciation occurred before the continents separated 120 Ma; second, the distribution is a result of human-mediated dispersal related to agriculture and finally, the morphologically defined species may encompass several local endemic species. To test these hypotheses, three genes were sequenced from 82 isolates of G. mosseae originating from six continents and the resulting sequences analysed for geographical subdivision and estimation of migration between continents. Coalescent analyses estimated divergence and age of mutations. Bayesian coalescent modelling was used to reveal important past population changes in the global population. The sequence data showed no geographical structure, with identical genotypes found on different continents. Coalescence analyses indicated a recent diversification in the species, and the data could be explained by a recent population expansion in G. mosseae. The results of this study suggest that speciation and the range expansion happened much later than continental spread and that human activity may have had a major impact on the dispersal and the population structure of the fungus.
Affiliation(s)
- Søren Rosendahl
- Department of Biology, University of Copenhagen, DK-1353 Copenhagen K, Denmark.
8
Lourenço V, Moya A, González-Candelas F, Carbone I, Maffia LA, Mizubuti ESG. Molecular diversity and evolutionary processes of Alternaria solani in Brazil inferred using genealogical and coalescent approaches. PHYTOPATHOLOGY 2009; 99:765-774. [PMID: 19453237 DOI: 10.1094/phyto-99-6-0765] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Indexed: 05/27/2023]
Abstract
Alternaria spp. form a heterogeneous group of saprophytic and plant-pathogenic fungi widespread in temperate and tropical regions. However, the relationship between evolutionary processes and genetic diversity with epidemics is unknown for several plant-pathogenic Alternaria spp. The interaction of Alternaria solani populations with potato and tomato plants is an interesting case study for addressing questions related to molecular evolution of an asexual fungus. Gene genealogies based on the coalescent process were used to infer evolutionary processes that shape the A. solani population. Sequences of the rDNA internal transcribed spacer (ITS) region and the genes which encode the allergenic protein alt a 1 (Alt a 1) and glyceraldehyde-3-phosphate dehydrogenase (Gpd) were used to estimate haplotype and nucleotide diversity as well as for the coalescent analyses. The highest number of parsimony informative sites (n = 14), nucleotide diversity (0.007), and the average number of nucleotide differences (3.20) were obtained for Alt a 1. Although the highest number of haplotypes (n = 7) was generated for ITS, haplotype diversity was the lowest (0.148) for this region. Recombination was not detected. Subdivision was inferred from populations associated with hosts but there was no evidence of geographic subdivision, and gene flow is occurring among subpopulations. In the analysis of the Alt a 1, balancing selection and population expansion or purifying selection could have occurred in A. solani subpopulations associated with potato and tomato plants, respectively. There is strong evidence that the subpopulation of A. solani that causes early blight in potato is genetically distinct from the subpopulation that causes early blight in tomato. The population of A. solani is clonal, and gene flow and mutation are the main evolutionary processes shaping its genetic structure.
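The abstract's observation that ITS yielded the most haplotypes but the lowest haplotype diversity follows from how haplotype diversity weights frequencies. The sketch below uses Nei's estimator, H = n/(n-1) * (1 - sum of squared haplotype frequencies), which is an assumption here because the abstract does not name the estimator. The function name and both toy samples are hypothetical.

```python
from collections import Counter


def haplotype_diversity(haplotypes):
    """Nei's haplotype diversity: n/(n-1) * (1 - sum of squared frequencies)."""
    n = len(haplotypes)
    counts = Counter(haplotypes)
    sum_sq_freq = sum((count / n) ** 2 for count in counts.values())
    return n / (n - 1) * (1.0 - sum_sq_freq)


# Hypothetical sample A: four haplotypes, one dominant plus three singletons.
skewed_sample = ["H1"] * 17 + ["H2", "H3", "H4"]
# Hypothetical sample B: only two haplotypes, at equal frequency.
even_sample = ["H1"] * 10 + ["H2"] * 10

h_skewed = haplotype_diversity(skewed_sample)
h_even = haplotype_diversity(even_sample)
```

In this toy case the sample with more haplotypes has the lower diversity, because nearly all isolates share one dominant haplotype.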
Affiliation(s)
- Valdir Lourenço
- Departamento de Fitopatologia, Universidade Federal de Viçosa, 36570-000 Viçosa, MG, Brazil
9
Lawson MJ, Zhang L. Sexy gene conversions: locating gene conversions on the X-chromosome. Nucleic Acids Res 2009; 37:4570-9. [PMID: 19487239 PMCID: PMC2724270 DOI: 10.1093/nar/gkp421] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Indexed: 11/13/2022]
Abstract
Gene conversion can have a profound impact on both the short- and long-term evolution of genes and genomes. Here, we examined the gene families that are located on the X-chromosomes of human (Homo sapiens), chimpanzee (Pan troglodytes), mouse (Mus musculus) and rat (Rattus norvegicus) for evidence of gene conversion. We identified seven gene families (WD repeat protein family, Ferritin Heavy Chain family, RAS-related Protein RAB-40 family, Diphosphoinositol polyphosphate phosphohydrolase family, Transcription Elongation Factor A family, LDOC1-related family, Zinc Finger Protein ZIC, and GLI family) that show evidence of gene conversion. Through phylogenetic analyses and synteny evidence, we show that gene conversion has played an important role in the evolution of these gene families and that gene conversion has occurred independently in both primates and rodents. Comparing the results with those of two gene conversion prediction programs (GENECONV and Partimatrix), we found that both GENECONV and Partimatrix have very high false negative rates (i.e. failed to predict gene conversions), which leads to many undetected gene conversions. The combination of phylogenetic analyses with physical synteny evidence exhibits high resolution in the detection of gene conversions.
10
Hyman IT, Ho SY, Jermiin LS. Molecular phylogeny of Australian Helicarionidae, Euconulidae and related groups (Gastropoda: Pulmonata: Stylommatophora) based on mitochondrial DNA. Mol Phylogenet Evol 2007; 45:792-812. [DOI: 10.1016/j.ympev.2007.08.018] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Received: 10/12/2006] [Revised: 07/27/2007] [Accepted: 08/07/2007] [Indexed: 10/22/2022]
11
Corbin CJ, Hughes AL, Heffelfinger JR, Berger T, Waltzek TB, Roser JF, Santos TC, Miglino MA, Oliveira MF, Braga FC, Meirelles FV, Conley AJ. Evolution of suiform aromatases: ancestral duplication with conservation of tissue-specific expression in the collared peccary (Pecari tayassu). J Mol Evol 2007; 65:403-12. [PMID: 17912474 DOI: 10.1007/s00239-007-9021-0] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Received: 08/28/2006] [Revised: 04/24/2007] [Accepted: 07/31/2007] [Indexed: 11/25/2022]
Abstract
Aromatase cytochrome P450 (P450arom), the enzyme that catalyzes estrogen synthesis, is required for successful reproduction and is encoded by a single copy gene (CYP19) in most mammals. However, pigs and their distant suiform relatives the peccaries experienced CYP19 duplication. Here, the evolutionary origin of CYP19 duplication, and the evolution of the gene paralogs, was explored further in collared peccaries (Pecari tayassu). Exons IV and V, and the intervening intron, representing duplicated CYP19 genes, were cloned and sequenced from collared peccary, pig, and hippopotamus. Sequence alignment and analysis identified a gene conversion in collared peccary with a breakpoint 102 base pairs (bp) upstream of exon V. Phylogenetic analyses of nucleotide and amino acid sequence upstream of the breakpoint supported a tree in which one peccary sequence was orthologous with the porcine gonadal gene. Cloning and sequencing of tissue transcripts, using reverse-transcriptase polymerase chain reaction techniques (RT-PCR), confirmed that the gonadal ortholog was expressed in collared peccary testis. Orthology of the other genomic sequence with the porcine placental gene was not resolved, but its placenta-specific expression in collared peccary was confirmed by similar transcript analysis. Immunoblot and enzyme activity in collared peccary testes demonstrated much lower levels of P450arom than in pig testis. Collared peccary placental P450arom expression also seemed much lower than in pigs. Thus, suiform CYP19 genes arose from an ancestral duplication that has maintained gonad- and placenta-specific expression, but at lower levels in peccaries than in pigs, perhaps facilitating the emergence of different reproductive strategies as Suiformes diverged and evolved.
Affiliation(s)
- C J Corbin
- Department of Population Health and Reproduction, University of California, Davis, CA, USA
12
Quesada H, Posada D, Caballero A, Morán P, Rolán-Alvarez E. Phylogenetic evidence for multiple sympatric ecological diversification in a marine snail. Evolution 2007; 61:1600-12. [PMID: 17598743 DOI: 10.1111/j.1558-5646.2007.00135.x] [Citation(s) in RCA: 81] [Impact Index Per Article: 4.8] [Indexed: 11/30/2022]
Abstract
Parallel speciation can occur when traits determining reproductive isolation evolve independently in different populations that experience a similar range of environments. However, a common problem in studies of parallel evolution is to distinguish this hypothesis from an alternative one in which different ecotypes arose only once in allopatry and now share a sympatric scenario with substantial gene flow between them. Here we show that the combination of a phylogenetic approach with life-history data is able to disentangle both hypotheses in the case of the intertidal marine snail Littorina saxatilis on the rocky shores of Galicia in northwestern Spain. In this system, numerous phenotypic and genetic differences have evolved between two sympatric ecotypes spanning a sharp ecological gradient, and as a side effect of the former have produced partial reproductive isolation. A mitochondrial phylogeny of these populations strongly suggests that the two sympatric ecotypes have originated independently several times. Building upon earlier work demonstrating size-based assortative mating as the main contributor to reproductive isolation among ecotypes, our analysis provides strong evidence that divergent selection across a sharp ecological gradient promoted the parallel divergence of body size and shape between two sympatric ecotypes. Thus, divergent selection occurring independently in different populations has produced the marine equivalent of host races, which may represent the first step in speciation.
Affiliation(s)
- Humberto Quesada
- Departamento de Bioquímica, Genética e Inmunología, Facultad de Biología, Universidad de Vigo, Campus As Lagoas-Marcosende, 36310 Vigo, Spain.
13
Malvárez G, Carbone I, Grünwald NJ, Subbarao KV, Schafer M, Kohn LM. New Populations of Sclerotinia sclerotiorum from Lettuce in California and Peas and Lentils in Washington. PHYTOPATHOLOGY 2007; 97:470-483. [PMID: 18943288 DOI: 10.1094/phyto-97-4-0470] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Indexed: 05/26/2023]
Abstract
Four populations of Sclerotinia sclerotiorum in North America were inferred previously, based on analyses of both rapidly evolving markers (DNA fingerprint and mycelial compatibility), and multilocus DNA sequence spanning the range between fast and slow evolution. Each population was defined as an interbreeding unit of conspecific individuals sharing a common recent ancestor and arising in a unique evolutionary event. The present study applies this standard to extend characterization of S. sclerotiorum populations to the Western United States. Isolates of S. sclerotiorum (N = 294) were determined to represent three genetically differentiated populations: California (CA, lettuce), Washington (WA, pea/lentil), and Ontario (ON, lettuce). CA was the most diverse population yet sampled in North America. Clonality was detected in ON and WA. No DNA fingerprints were common among the populations. The index of association (I(A)), based on fingerprint, was closer to zero (0) for CA than it was for the other populations. High diversity and lack of association of markers in California are consistent either with genetic exchange and recombination, or with large population size and high standing genetic variation. Intra- and interlocus conflict among three DNA sequence loci was consistent with recombination. The coalescent IGS genealogy confirmed subdivision and showed CA to be older than WA or ON. The Nearest Neighbor statistic on combined data confirmed subdivision among all present and previously defined populations. All isolates had both MAT1-1 and MAT1-2, consistent with uniform homothallism.
14
Hu JM, Fu HC, Lin CH, Su HJ, Yeh HH. Reassortment and concerted evolution in banana bunchy top virus genomes. J Virol 2007; 81:1746-61. [PMID: 17135318 PMCID: PMC1797577 DOI: 10.1128/jvi.01390-06] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.1] [Received: 07/03/2006] [Accepted: 11/16/2006] [Indexed: 12/12/2022]
Abstract
The nanovirus Banana bunchy top virus (BBTV) has six standard components in its genome and occasionally contains components encoding additional Rep (replication initiation protein) genes. Phylogenetic network analysis of coding sequences of DNA 1 and 3 confirmed the two major groups of BBTV, a Pacific and an Asian group, but showed evidence of web-like phylogenies for some genes. Phylogenetic analysis of 102 major common regions (CR-Ms) from all six components showed a possible concerted evolution within the Pacific group, which is likely due to recombination in this region. The CR-M of additional Rep genes is close to that of DNA 1 and 2. Comparison of tree topologies constructed with DNA 1 and DNA 3 coding sequences of 14 BBTV isolates showed distinct phylogenetic histories based on Kishino-Hasegawa and Shimodaira-Hasegawa tests. The results of principal component analysis of amino acid and codon usages indicate that DNA 1 and 3 have a codon bias different from that of all other genes of nanoviruses, including all currently known additional Rep genes of BBTV, which suggests a possible ancient genome reassortment event between distinctive nanoviruses.
Affiliation(s)
- Jer-Ming Hu
- Institute of Ecology and Evolutionary Biology, National Taiwan University, 1, Sec. 4, Roosevelt Road, Taipei 106, Taiwan.
15
Lin YS, Byrnes JK, Hwang JK, Li WH. Codon-usage bias versus gene conversion in the evolution of yeast duplicate genes. Proc Natl Acad Sci U S A 2006; 103:14412-6. [PMID: 16971485 PMCID: PMC1599977 DOI: 10.1073/pnas.0606348103] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.5] [Indexed: 11/18/2022]
Abstract
Many Saccharomyces cerevisiae duplicate genes that were derived from an ancient whole-genome duplication (WGD) unexpectedly show a small synonymous divergence (K(S)), a higher sequence similarity to each other than to orthologues in Saccharomyces bayanus, or slow evolution compared with the orthologue in Kluyveromyces waltii, a non-WGD species. This decelerated evolution was attributed to gene conversion between duplicates. Using approximately 300 WGD gene pairs in four species and their orthologues in non-WGD species, we show that codon-usage bias and protein-sequence conservation are two important causes for decelerated evolution of duplicate genes, whereas gene conversion is effective only in the presence of strong codon-usage bias or protein-sequence conservation. Furthermore, we find that change in mutation pattern or in tDNA copy number changed codon-usage bias and increased the K(S) distance between K. waltii and S. cerevisiae. Intriguingly, some proteins showed fast evolution before the radiation of WGD species but little or no sequence divergence between orthologues and paralogues thereafter, indicating that functional conservation after the radiation may also be responsible for decelerated evolution in duplicates.
Affiliation(s)
- Yeong-Shin Lin
  - Department of Ecology and Evolution, University of Chicago, 1101 East 57th Street, Chicago, IL 60637
  - Department of Biological Science and Technology, National Chiao Tung University, Hsinchu 300, Taiwan
- Jake K. Byrnes
  - Department of Ecology and Evolution, University of Chicago, 1101 East 57th Street, Chicago, IL 60637
- Jenn-Kang Hwang
  - Department of Biological Science and Technology, National Chiao Tung University, Hsinchu 300, Taiwan
- Wen-Hsiung Li
  - Department of Ecology and Evolution, University of Chicago, 1101 East 57th Street, Chicago, IL 60637
16
Chan CX, Beiko RG, Ragan MA. Detecting recombination in evolving nucleotide sequences. BMC Bioinformatics 2006; 7:412. [PMID: 16978423 PMCID: PMC1592127 DOI: 10.1186/1471-2105-7-412] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.9] [Received: 05/03/2006] [Accepted: 09/18/2006] [Indexed: 11/10/2022]
Abstract
BACKGROUND: Genetic recombination can produce heterogeneous phylogenetic histories within a set of homologous genes. These recombination events can be obscured by subsequent residue substitutions, which consequently complicate their detection. While there are many algorithms for the identification of recombination events, little is known about the effects of subsequent substitutions on the accuracy of available recombination-detection approaches. RESULTS: We assessed the effect of subsequent substitutions on the detection of simulated recombination events within sets of four nucleotide sequences under a homogeneous evolutionary model. The amount of subsequent substitutions per site, prior evolutionary history of the sequences, and reciprocality or non-reciprocality of the recombination event all affected the accuracy of the recombination-detecting programs examined. Bayesian phylogenetic-based approaches showed high accuracy in detecting evidence of recombination events and in identifying recombination breakpoints. These approaches were less sensitive to parameter settings than other methods we tested, making them easier to apply to various data sets in a consistent manner. CONCLUSION: Post-recombination substitutions tend to diminish the predictive accuracy of recombination-detecting programs. The best method for detecting recombined regions is not necessarily the most accurate in identifying recombination breakpoints. For difficult detection problems involving highly divergent sequences or large data sets, different types of approach can be run in succession to increase efficiency, and can potentially yield better predictive accuracy than any single method used in isolation.
Affiliation(s)
- Cheong Xin Chan
  - ARC Centre in Bioinformatics and Institute for Molecular Bioscience, the University of Queensland, Brisbane, QLD 4072, Australia
- Robert G Beiko
  - ARC Centre in Bioinformatics and Institute for Molecular Bioscience, the University of Queensland, Brisbane, QLD 4072, Australia
- Mark A Ragan
  - ARC Centre in Bioinformatics and Institute for Molecular Bioscience, the University of Queensland, Brisbane, QLD 4072, Australia
17
Jackson MS, Oliver K, Loveland J, Humphray S, Dunham I, Rocchi M, Viggiano L, Park JP, Hurles ME, Santibanez-Koref M. Evidence for widespread reticulate evolution within human duplicons. Am J Hum Genet 2005; 77:824-40. [PMID: 16252241 PMCID: PMC1271390 DOI: 10.1086/497704] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.6] [Received: 07/07/2004] [Accepted: 08/25/2004] [Indexed: 11/04/2022]
Abstract
Approximately 5% of the human genome consists of segmental duplications that can cause genomic mutations and may play a role in gene innovation. Reticulate evolutionary processes, such as unequal crossing-over and gene conversion, are known to occur within specific duplicon families, but the broader contribution of these processes to the evolution of human duplications remains poorly characterized. Here, we use phylogenetic profiling to analyze multiple alignments of 24 human duplicon families that span >8 Mb of DNA. Our results indicate that none of them are evolving independently, with all alignments showing sharp discontinuities in phylogenetic signal consistent with reticulation. To analyze these results in more detail, we have developed a quartet method that estimates the relative contribution of nucleotide substitution and reticulate processes to sequence evolution. Our data indicate that most of the duplications show a highly significant excess of sites consistent with reticulate evolution, compared with the number expected by nucleotide substitution alone, with 15 of 30 alignments showing a >20-fold excess over that expected. Using permutation tests, we also show that at least 5% of the total sequence shares 100% sequence identity because of reticulation, a figure that includes 74 independent tracts of perfect identity >2 kb in length. Furthermore, analysis of a subset of alignments indicates that the density of reticulation events is as high as 1 every 4 kb. These results indicate that phylogenetic relationships within recently duplicated human DNA can be rapidly disrupted by reticulate evolution. This finding has important implications for efforts to finish the human genome sequence, complicates comparative sequence analysis of duplicon families, and could profoundly influence the tempo of gene-family evolution.
Affiliation(s)
- Michael S Jackson
- Institute of Human Genetics, University of Newcastle upon Tyne, International Centre for Life, Newcastle upon Tyne, United Kingdom.
|
18
|
Sukhnanand S, Alcaine S, Warnick LD, Su WL, Hof J, Craver MPJ, McDonough P, Boor KJ, Wiedmann M. DNA sequence-based subtyping and evolutionary analysis of selected Salmonella enterica serotypes. J Clin Microbiol 2005; 43:3688-98. [PMID: 16081897 PMCID: PMC1233967 DOI: 10.1128/jcm.43.8.3688-3698.2005] [Citation(s) in RCA: 89] [Impact Index Per Article: 4.7] [Indexed: 11/20/2022]
Abstract
While serotyping and phage typing have been used widely to characterize Salmonella isolates, sensitive subtyping methods that allow for evolutionary analyses are essential for examining Salmonella transmission, ecology, and evolution. A set of 25 Salmonella enterica isolates, representing five clinically relevant serotypes (serotypes Agona, Heidelberg, Schwarzengrund, Typhimurium, and Typhimurium var. Copenhagen) was initially used to develop a multilocus sequence typing (MLST) scheme for Salmonella targeting seven housekeeping and virulence genes (panB, fimA, aceK, mdh, icdA, manB, and spaN). A total of eight MLST types were found among the 25 isolates sequenced. A good correlation between MLST types and Salmonella serotypes was observed; only one serotype Typhimurium var. Copenhagen isolate displayed an MLST type otherwise typical for serotype Typhimurium isolates. Since manB, fimA, and mdh allowed for the highest subtype discrimination among the initial 25 isolates, we chose these three genes to perform DNA sequencing of an additional 41 Salmonella isolates representing a larger diversity of serotypes. This "three-gene sequence typing scheme" allowed discrimination of 25 sequence types (STs) among a total of 66 isolates; STs correlated well with serotypes and allowed within-serotype differentiation for 9 of the 12 serotypes characterized. Phylogenetic analyses showed that serotypes Kentucky and Newport could each be separated into two distinct, statistically well supported evolutionary lineages. Our results show that a three-gene sequence typing scheme allows for accurate serotype prediction and for limited subtype discrimination among clinically relevant serotypes of Salmonella. Three-gene sequence typing also supports the notion that Salmonella serotypes represent both monophyletic and polyphyletic lineages.
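As a rough illustration of how a sequence type (ST) can be assigned in a scheme like the three-gene one described above, the sketch below gives every distinct combination of allele sequences its own ST number. The isolate names and the short sequences are hypothetical placeholders, not data from the study, and the numbering rule (first seen, first numbered) is an assumption made for this example only.

```python
def assign_sequence_types(profiles):
    """Assign an ST number to each isolate.

    profiles maps an isolate name to a tuple of allele sequences,
    one per locus (here assumed to be manB, fimA, mdh, in that order).
    Isolates that are identical at every locus share an ST.
    """
    st_of_profile = {}  # allele combination -> ST number
    st_of_isolate = {}
    for isolate, alleles in profiles.items():
        key = tuple(alleles)
        if key not in st_of_profile:
            # New combination: give it the next unused ST number.
            st_of_profile[key] = len(st_of_profile) + 1
        st_of_isolate[isolate] = st_of_profile[key]
    return st_of_isolate


# Hypothetical toy input: three isolates, three loci each.
toy_profiles = {
    "isolate_A": ("ACGT", "GGCA", "TTAG"),
    "isolate_B": ("ACGT", "GGCA", "TTAG"),  # identical to A at all loci
    "isolate_C": ("ACGA", "GGCA", "TTAG"),  # differs from A at the first locus
}
toy_sts = assign_sequence_types(toy_profiles)
print(toy_sts)
```

In this toy input the first two isolates share an ST and the third receives a new one, which is the sense in which a typing scheme "discriminates" isolates.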
Affiliation(s)
- Sharinne Sukhnanand
- Department of Food Science, Department of Population Medicine and Diagnostic Sciences, Cornell University, Ithaca, New York 14853
- Sam Alcaine
- Department of Food Science, Department of Population Medicine and Diagnostic Sciences, Cornell University, Ithaca, New York 14853
- Lorin D. Warnick
- Department of Food Science, Department of Population Medicine and Diagnostic Sciences, Cornell University, Ithaca, New York 14853
- Wan-Lin Su
- Department of Food Science, Department of Population Medicine and Diagnostic Sciences, Cornell University, Ithaca, New York 14853
- Jessica Hof
- Department of Food Science, Department of Population Medicine and Diagnostic Sciences, Cornell University, Ithaca, New York 14853
- Mary Pat J. Craver
- Department of Food Science, Department of Population Medicine and Diagnostic Sciences, Cornell University, Ithaca, New York 14853
- Patrick McDonough
- Department of Food Science, Department of Population Medicine and Diagnostic Sciences, Cornell University, Ithaca, New York 14853
- Kathryn J. Boor
- Department of Food Science, Department of Population Medicine and Diagnostic Sciences, Cornell University, Ithaca, New York 14853
- Martin Wiedmann
- Department of Food Science, Department of Population Medicine and Diagnostic Sciences, Cornell University, Ithaca, New York 14853
- Corresponding author. Mailing address: Department of Food Science, 412 Stocking Hall, Cornell University, Ithaca, NY 14853. Phone: (607) 254-2838. Fax: (607) 254-4868. E-mail:
19
|
Aguileta G, Bielawski JP, Yang Z. Gene conversion and functional divergence in the beta-globin gene family. J Mol Evol 2004; 59:177-89. [PMID: 15486692 DOI: 10.1007/s00239-004-2612-0] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.6] [Received: 07/17/2003] [Accepted: 02/16/2004] [Indexed: 11/26/2022]
Abstract
Different models of gene family evolution have been proposed to explain the mechanism whereby gene copies created by gene duplications are maintained and diverge in function. Ohta proposed a model which predicts a burst of nonsynonymous substitutions following gene duplication and the preservation of duplicates through positive selection. An alternative model, the duplication-degeneration-complementation (DDC) model, does not explicitly require the action of positive Darwinian selection for the maintenance of duplicated gene copies, although purifying selection is assumed to continue to act on both copies. A potential outcome of the DDC model is heterogeneity in purifying selection among the gene copies, due to partitioning of subfunctions which complement each other. By using the dN/dS (ω) rate ratio to measure selection pressure, we can distinguish between these two very different evolutionary scenarios. In this study we investigated these scenarios in the beta-globin family of genes, a textbook example of evolution by gene duplication. We assembled a comprehensive dataset of 72 vertebrate beta-globin sequences. The estimated phylogeny suggested multiple gene duplication and gene conversion events. By using different programs to detect recombination, we confirmed several cases of gene conversion and detected two new cases. We tested evolutionary scenarios derived from Ohta's model and the DDC model by examining selective pressures along lineages in a phylogeny of beta-globin genes in eutherian mammals. We did not find significant evidence for an increase in the ω ratio following major duplication events in this family. However, one exception to this pattern was the duplication of gamma-globin in simian primates, after which a few sites were identified to be under positive selection. Overall, our results suggest that following gene duplications, paralogous copies of beta-globin genes evolved under a nonepisodic process of functional divergence.
Affiliation(s)
- Gabriela Aguileta
- Department of Biology, University College London, Darwin Building, Gower Street, WC1E 6BT, London, England
|
20
|
Tsaousis AD, Martin DP, Ladoukakis ED, Posada D, Zouros E. Widespread Recombination in Published Animal mtDNA Sequences. Mol Biol Evol 2005; 22:925-33. [PMID: 15647518 DOI: 10.1093/molbev/msi084] [Citation(s) in RCA: 132] [Impact Index Per Article: 6.9] [Indexed: 11/14/2022]
Abstract
Mitochondrial DNA (mtDNA) recombination has been observed in several animal species, but there are doubts as to whether it is common or only occurs under special circumstances. Animal mtDNA sequences retrieved from public databases were unambiguously aligned and rigorously tested for evidence of recombination. At least 30 recombination events were detected among 186 alignments examined. Recombinant sequences were found in invertebrates and vertebrates, including primates. It appears that mtDNA recombination may occur regularly in the animal cell but rarely produces new haplotypes because of homoplasmy. Common animal mtDNA recombination would necessitate a reexamination of phylogenetic and biohistorical inference based on the assumption of clonal mtDNA transmission. Recombination may also have an important role in producing and purging mtDNA mutations and thus in mtDNA-based diseases and senescence.
Affiliation(s)
- A D Tsaousis
- Department of Biology, University of Crete, Iraklio, Crete, Greece
|
21
|
Carbone I, Liu YC, Hillman BI, Milgroom MG. Recombination and migration of Cryphonectria hypovirus 1 as inferred from gene genealogies and the coalescent. Genetics 2004; 166:1611-29. [PMID: 15126384 PMCID: PMC1470819 DOI: 10.1534/genetics.166.4.1611] [Citation(s) in RCA: 79] [Impact Index Per Article: 4.2] [Indexed: 11/18/2022]
Abstract
Genealogy-based methods were used to estimate migration of the fungal virus Cryphonectria hypovirus 1 between vegetative compatibility types of the host fungus, Cryphonectria parasitica, as a means of estimating horizontal transmission within two host populations. Vegetative incompatibility is a self/non-self recognition system that inhibits virus transmission under laboratory conditions but its effect on transmission in nature has not been clearly demonstrated. Recombination within and among different loci in the virus genome restricted the genealogical analyses to haplotypes with common mutation and recombinational histories. The existence of recombination necessitated that we also use genealogical approaches that can take advantage of both the mutation and recombinational histories of the sample. Virus migration between populations was significantly restricted. In contrast, estimates of migration between vegetative compatibility types were relatively high within populations despite previous evidence that transmission in the laboratory was restricted. The discordance between laboratory estimates and migration estimates from natural populations highlights the challenges in estimating pathogen transmission rates. Genealogical analyses inferred migration patterns throughout the entire coalescent history of one viral region in natural populations and not just recent patterns of migration or laboratory transmission. This application of genealogical analyses provides markedly stronger inferences on overall transmission rates than laboratory estimates do.
Affiliation(s)
- Ignazio Carbone
- Department of Plant Pathology, North Carolina State University, Raleigh, North Carolina 27695, USA
|
22
|
Abstract
Alignments of nucleotide or amino acid sequences may contain a variety of different signals, one of which is the historical signal that we often try to recover by phylogenetic analysis. Other signals, such as those arising due to compositional heterogeneities, among-lineage and among-site rate heterogeneities, invariant sites, and covariotides, may interfere adversely with the recovery of the historical signal. The effect of the interaction of these signals on phylogenetic inference is not well understood and may, in many cases, even be underappreciated. In this study, we investigate this matter and present results based on Monte Carlo simulations. We explored the success of four phylogenetic methods in recovering the true tree from data that had evolved under conditions where the equilibrium base frequencies and substitution rates were allowed to vary among lineages. Seven scenarios with increasingly complex conditions were investigated. All of the methods tested, with the exception of neighbor-joining using LogDet distances, were sensitive to compositional convergence in nonsister lineages. Maximum parsimony was also susceptible to attraction between long edges. In many cases, however, phylogenetic inference methods can still recover the true tree when misleading signals are present, in some instances even when the historical signal is no longer dominant. These results highlight the growing need for simple methods to detect violation of the phylogenetic assumptions.
Affiliation(s)
- Simon Y Ho
- School of Biological Sciences, University of Sydney, NSW 2006, Australia
|
23
|
Carbone I, Liu YC, Hillman BI, Milgroom MG. Recombination and Migration of Cryphonectria hypovirus 1 as Inferred From Gene Genealogies and the Coalescent. Genetics 2004. [DOI: 10.1093/genetics/166.4.1611] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Indexed: 11/13/2022]
Abstract
Genealogy-based methods were used to estimate migration of the fungal virus Cryphonectria hypovirus 1 between vegetative compatibility types of the host fungus, Cryphonectria parasitica, as a means of estimating horizontal transmission within two host populations. Vegetative incompatibility is a self/non-self recognition system that inhibits virus transmission under laboratory conditions but its effect on transmission in nature has not been clearly demonstrated. Recombination within and among different loci in the virus genome restricted the genealogical analyses to haplotypes with common mutation and recombinational histories. The existence of recombination necessitated that we also use genealogical approaches that can take advantage of both the mutation and recombinational histories of the sample. Virus migration between populations was significantly restricted. In contrast, estimates of migration between vegetative compatibility types were relatively high within populations despite previous evidence that transmission in the laboratory was restricted. The discordance between laboratory estimates and migration estimates from natural populations highlights the challenges in estimating pathogen transmission rates. Genealogical analyses inferred migration patterns throughout the entire coalescent history of one viral region in natural populations and not just recent patterns of migration or laboratory transmission. This application of genealogical analyses provides markedly stronger inferences on overall transmission rates than laboratory estimates do.
Affiliation(s)
- Ignazio Carbone
- Center for Integrated Fungal Research, Department of Plant Pathology, North Carolina State University, Raleigh, North Carolina 27695
- Yir-Chung Liu
- Department of Plant Pathology, Cornell University, Ithaca, New York 14853
- Bradley I Hillman
- Department of Plant Pathology, Rutgers University, New Brunswick, New Jersey 08901
- Michael G Milgroom
- Department of Plant Pathology, Cornell University, Ithaca, New York 14853
|
24
|
|
25
|
Abstract
Recombination can be a dominant force in shaping genomes and associated phenotypes. To better understand the impact of recombination on genomic evolution, we need to be able to identify recombination in aligned sequences. We review bioinformatic approaches for detecting recombination and measuring recombination rates. We also examine the impact of recombination on the reconstruction of evolutionary histories and the estimation of population genetic parameters. Finally, we review the role of recombination in the evolutionary history of bacteria, viruses, and human mitochondria. We conclude by highlighting a number of areas for future development of tools to help quantify the role of recombination in genomic evolution.
Affiliation(s)
- David Posada
- Variagenics Inc. Cambridge, Massachusetts 02139, USA.
|
26
|
Nagawa F, Yoshihara SI, Tsuboi A, Serizawa S, Itoh K, Sakano H. Genomic analysis of the murine odorant receptor MOR28 cluster: a possible role of gene conversion in maintaining the olfactory map. Gene 2002; 292:73-80. [PMID: 12119101 DOI: 10.1016/s0378-1119(02)00670-4] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.0] [Indexed: 10/27/2022]
Abstract
Genomic analysis was performed for the murine odorant receptor (OR) genes. The MOR28 cluster on chromosome 14 was extensively studied. It contains six OR genes, MOR28, 10, 83, 29A, 29B and 30. The human homolog of this cluster is located on the human chromosome 14, and contains five OR genes, HOR28/10, 83, 29A, 29B and 30. Sequence comparison of these OR gene paralogs and orthologs suggests that the coding homologies are accounted for not only by recent gene duplication, but also by gene conversion among the coding sequences within the cluster. A possible role of gene conversion in the olfactory system is discussed in the context of the olfactory map.
Affiliation(s)
- Fumikiyo Nagawa
- Department of Biophysics and Biochemistry, Graduate School of Science, The University of Tokyo, 2-11-16 Yayoi, Bunkyo-ku, Tokyo 113-0032, Japan.
|
27
|
Abstract
The performance of 14 different recombination detection methods was evaluated by analyzing several empirical data sets where the presence of recombination has been suggested or where recombination is assumed to be absent. In general, recombination methods seem to be more powerful with increasing levels of divergence, but different methods showed distinct performance. Substitution methods using summary statistics gave more accurate inferences than most phylogenetic methods. However, definitive conclusions about the presence of recombination should not be derived on the basis of a single method. Performance patterns observed from the analysis of real data sets coincided very well with previous computer simulation results. Previous recombination inferences from some of the data sets analyzed here should be reconsidered. In particular, recombination in HIV-1 seems to be much more widespread than previously thought. This finding might have serious implications on vaccine development and on the reliability of previous inferences of HIV-1 evolutionary history and dynamics.
Affiliation(s)
- David Posada
- Department of Zoology, Brigham Young University, Provo, Utah, USA.
|
28
|
Posada D, Crandall KA. Evaluation of methods for detecting recombination from DNA sequences: computer simulations. Proc Natl Acad Sci U S A 2001; 98:13757-62. [PMID: 11717435 PMCID: PMC61114 DOI: 10.1073/pnas.241370698] [Citation(s) in RCA: 1074] [Impact Index Per Article: 46.7] [Received: 07/18/2001] [Indexed: 11/18/2022]
Abstract
Recombination is a key evolutionary process that shapes the architecture of genomes and the genetic structure of populations. Although many statistical methods are available for the detection of recombination from DNA sequences, their absolute and relative performance is still unknown. Here we evaluated the performance of 14 different recombination detection algorithms. We used the coalescent with recombination to simulate DNA sequences with different levels of recombination, genetic diversity, and rate variation among sites. Recombination detection methods were applied to these data sets, and whether they detected or not recombination was recorded. Different recombination methods showed distinct performance depending on the amount of recombination, genetic diversity, and rate variation among sites. The model of nucleotide substitution under which the data were generated did not seem to have a significant effect. Most methods increase power with more sequence divergence. In general, recombination detection methods seem to capture the presence of recombination, but they are not very powerful. Methods that use substitution patterns or incompatibility among sites were more powerful than methods based on phylogenetic incongruence. Most methods do not seem to infer more false positives than expected by chance. Especially depending on the amount of diversity in the data, different methods could be used to attain maximum power while minimizing false positives. Results shown here will provide some guidance in the selection of the most appropriate method/s for the analysis of the particular data at hand.
Affiliation(s)
- D Posada
- Department of Zoology, Brigham Young University, Provo, UT 84602, USA.
|
29
|
Canavez F, Young NT, Guethlein LA, Rajalingam R, Khakoo SI, Shum BP, Parham P. Comparison of chimpanzee and human leukocyte Ig-like receptor genes reveals framework and rapidly evolving genes. J Immunol 2001; 167:5786-94. [PMID: 11698452 DOI: 10.4049/jimmunol.167.10.5786] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.1] [Indexed: 11/19/2022]
Abstract
The leukocyte receptor complex (LRC) on human chromosome 19 contains related Ig superfamily killer cell Ig-like receptor (KIR) and leukocyte Ig-like receptor (LIR) genes. Previously, we discovered much difference in the KIR genes between humans and chimpanzees, primate species estimated to have approximately 98.8% genomic sequence similarity. Here, the common chimpanzee LIR genes are identified, characterized, and compared with their human counterparts. From screening a chimpanzee splenocyte cDNA library, clones corresponding to nine different chimpanzee LIRs were isolated and sequenced. Analysis of genomic DNA from 48 unrelated chimpanzees showed 42 to have all nine LIR genes, and six animals to lack just one of the genes. In structural diversity and functional type, the chimpanzee LIRs cover the range of human LIRs. Although both species have the same number of inhibitory LIRs, humans have more activating receptors, a trend also seen for KIRs. Four chimpanzee LIRs are clearly orthologs of human LIRs. Five other chimpanzee LIRs have paralogous relationships with clusters of human LIRs and have undergone much recombination. Like the human genes, chimpanzee LIR genes appear to be organized into two duplicated blocks, each block containing two orthologous genes. This organization provides a conserved framework within which there are clusters of faster evolving genes. Human and chimpanzee KIR genes have an analogous arrangement. Whereas both KIR and LIR genes can exhibit greater interspecies differences than the genome average, within each species the LIR gene family is more conserved than the KIR gene family.
Affiliation(s)
- F Canavez
- Department of Structural Biology, Stanford University School of Medicine, Stanford, CA 94305, USA
|
30
|
Wiuf C, Christensen T, Hein J. A simulation study of the reliability of recombination detection methods. Mol Biol Evol 2001; 18:1929-39. [PMID: 11557798 DOI: 10.1093/oxfordjournals.molbev.a003733] [Citation(s) in RCA: 74] [Impact Index Per Article: 3.2] [Indexed: 11/14/2022]
Abstract
There exist many methods to detect recombination or mosaic structure in a sample of DNA sequences. But how reliable are they? Four methods were investigated with respect to their power to detect recombination in simulated samples with different amounts of recombination and mutation. In addition, we investigated the impact of the shape of the underlying genealogy on their performances. We found that the methods detected far fewer recombinations than were theoretically possible and that methods based on the principle of incompatibility in general had more power than methods that did not make use of this principle explicitly. This seemed, in particular, to be the case for phylogenies generated under population expansion scenarios which result in long branches at the tips and small deep branches. In addition to the results obtained through simulations, a series of new theoretical results on recombination is presented.
Affiliation(s)
- C Wiuf
- Department of Statistics, University of Oxford, United Kingdom.
|
31
|
Templeton AR, Clark AG, Weiss KM, Nickerson DA, Boerwinkle E, Sing CF. Recombinational and mutational hotspots within the human lipoprotein lipase gene. Am J Hum Genet 2000; 66:69-83. [PMID: 10631137 PMCID: PMC1288350 DOI: 10.1086/302699] [Citation(s) in RCA: 130] [Impact Index Per Article: 5.4] [Indexed: 11/03/2022]
Abstract
Here an analysis is presented of the roles of recombination and mutation in shaping previously determined haplotype variation in 9.7 kb of genomic DNA sequence from the human lipoprotein lipase gene (LPL), scored in 71 individuals from three populations: 24 African Americans, 24 Finns, and 23 non-Hispanic whites. Recombination and gene-conversion events inferred from data on 88 haplotypes that were defined by 69 variable sites were tested. The analysis revealed 29 statistically significant recombination events and one gene-conversion event. The recombination events were concentrated in a 1.9-kb region, near the middle of the segment, that contains a microsatellite and a pair of tandem and complementary mononucleotide runs; both the microsatellite and the runs show length variation. An analysis of site variation revealed that 9.6% of the nucleotides at CpG sites were variable, as were 3% of the nucleotides found in mononucleotide runs of ≥5 nucleotides, 3% of the nucleotides found ≤3 bp from certain putative polymerase alpha-arrest sites, and 0.5% of the remaining nucleotides. This nonhomogeneous distribution of variation suggests that multiple mutational hits at certain sites are common, an observation that challenges the fundamental assumption of the infinite-sites-mutation model. The nonrandom patterns of recombination and mutation suggest that randomly chosen single-nucleotide polymorphisms may not be optimal for disequilibrium mapping of this gene. Overall, these results indicate that both recombinational and mutational hotspots have played significant roles in shaping the haplotype variation at the LPL locus.
Affiliation(s)
- A R Templeton
- Department of Biology, Washington University, St. Louis, MO 63130-4899, USA.
|
32
|
Abstract
Although exchange of genetic information by recombination plays a role in the evolution of viruses, the extent to which it generates diversity is not clear. We analyzed genomes of geminiviruses for recombination using a new statistical procedure developed to detect gene conversions. Geminiviruses (family, Geminiviridae) are a group of plant viruses characterized by a genome of circular single-stranded DNA (approximately 2700 nucleotides in length) encapsidated in twinned quasi-isometric particles. Complete nucleotide sequences of geminiviruses were aligned, and recombination events were detected by searching pairs of viruses for sequences that are significantly more similar than expected based on random distribution of polymorphic sites. The analyses revealed that recombination is very frequent and occurs between species and within and across genera. Tests identified 420 statistically significant recombinant fragments distributed across the genome. The results suggest that recombination is a significant contributor to geminivirus evolution. The high rate of recombination may be contributing to the recent emergence of new geminivirus diseases.
Affiliation(s)
- M Padidam
- Rohm and Haas Company, 727 Norristown Road, Spring House, Pennsylvania 19477, USA
|
33
|
Abstract
There are two types of recombination that we may wish to detect: rare recombinants between members of different populations or species and repeated recombination within a population. Methods appropriate in the former context are inappropriate in the latter because they depend on recognizing the existence of runs of nucleotides with similar ancestry. If recombination is sufficiently frequent, no such runs will be present. Several methods, including the homoplasy test and the incompatibility test, are described that are appropriate for detecting repeated recombination and for measuring its importance, relative to mutation, in causing genetic change. The sensitivity of these tests is investigated by simulating populations with varying frequencies of mutation and recombination and calculating the various statistics on samples.
Affiliation(s)
- J M Smith
- School of Biological Sciences, University of Sussex, Brighton BN1 9QG, United Kingdom
|
34
|
|
35
|
Clark AG, Weiss KM, Nickerson DA, Taylor SL, Buchanan A, Stengård J, Salomaa V, Vartiainen E, Perola M, Boerwinkle E, Sing CF. Haplotype structure and population genetic inferences from nucleotide-sequence variation in human lipoprotein lipase. Am J Hum Genet 1998; 63:595-612. [PMID: 9683608 PMCID: PMC1377318 DOI: 10.1086/301977] [Citation(s) in RCA: 326] [Impact Index Per Article: 12.5] [Indexed: 11/04/2022]
Abstract
Allelic variation in 9.7 kb of genomic DNA sequence from the human lipoprotein lipase gene (LPL) was scored in 71 healthy individuals (142 chromosomes) from three populations: African Americans (24) from Jackson, MS; Finns (24) from North Karelia, Finland; and non-Hispanic Whites (23) from Rochester, MN. The sequences had a total of 88 variable sites, with a nucleotide diversity (site-specific heterozygosity) of 0.002 ± 0.001 across this 9.7-kb region. The frequency spectrum of nucleotide variation exhibited a slight excess of heterozygosity, but, in general, the data fit expectations of the infinite-sites model of mutation and genetic drift. Allele-specific PCR helped resolve linkage phases, and a total of 88 distinct haplotypes were identified. For 1,410 (64%) of the 2,211 site pairs, all four possible gametes were present in these haplotypes, reflecting a rich history of past recombination. Despite the strong evidence for recombination, extensive linkage disequilibrium was observed. The number of haplotypes generally is much greater than the number expected under the infinite-sites model, but there was sufficient multisite linkage disequilibrium to reveal two major clades, which appear to be very old. Variation in this region of LPL may depart from the variation expected under a simple, neutral model, owing to complex historical patterns of population founding, drift, selection, and recombination. These data suggest that the design and interpretation of disease-association studies may not be as straightforward as often is assumed.
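The site-pair count in the abstract above rests on the four-gamete criterion: under the infinite-sites model without recombination, two biallelic sites should never show all four allele combinations. The following is a minimal sketch of that count, assuming haplotypes are coded as equal-length strings of 0/1; the function name and the toy haplotypes are hypothetical and are not taken from the study.

```python
from itertools import combinations


def four_gamete_pairs(haplotypes):
    """Count site pairs at which all four gametes (00, 01, 10, 11) occur.

    haplotypes: list of equal-length strings over {'0', '1'},
    one string per chromosome, one character per biallelic site.
    Returns (pairs_with_four_gametes, total_site_pairs).
    """
    n_sites = len(haplotypes[0])
    hits = 0
    total = 0
    for i, j in combinations(range(n_sites), 2):
        total += 1
        # Collect the distinct two-site allele combinations observed.
        gametes = {(h[i], h[j]) for h in haplotypes}
        if len(gametes) == 4:
            hits += 1
    return hits, total


# Hypothetical toy sample: five chromosomes, three sites.
toy_haplotypes = ["000", "010", "100", "110", "111"]
toy_hits, toy_total = four_gamete_pairs(toy_haplotypes)
print(toy_hits, toy_total)

# Arithmetic check of the percentage quoted in the abstract.
reported_percent = round(100 * 1410 / 2211)
print(reported_percent)
```

In the toy sample only the first two sites show all four gametes, so one of three pairs is flagged. The quadratic loop over site pairs is fine for tens or hundreds of sites; larger data sets would need a vectorized version.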
Affiliation(s)
- A G Clark
- Institute of Molecular Evolutionary Genetics, Department of Biology, Pennsylvania State University, University Park, PA 16802, USA.
|