101
|
Monteiro FA, Barrett TV, Fitzpatrick S, Cordon-Rosales C, Feliciangeli D, Beard CB. Molecular phylogeography of the Amazonian Chagas disease vectors Rhodnius prolixus and R. robustus. Mol Ecol 2003; 12:997-1006. [PMID: 12753218 DOI: 10.1046/j.1365-294x.2003.01802.x] [Citation(s) in RCA: 136] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
The phylogeographical structure of the closely related species Rhodnius prolixus and R. robustus is presented based on a 663-base pair (bp) fragment of the mitochondrial cytochrome b gene. Twenty haplotypes were recovered from 84 samples examined, representing 26 populations from seven Latin American countries. The resulting phylogenetic tree is composed of five major reciprocally monophyletic clades, one representing R. prolixus and four representing R. robustus. While R. prolixus is a very homogeneous assemblage, R. robustus has deeper clades and is paraphyletic, with the clade comprising R. robustus from Venezuela (Orinoco region) more closely related to the R. prolixus clade than to the other R. robustus populations from the Amazon region. The R. robustus paraphyly was supported further by the analysis of a nuclear gene (D2 region of the 28S RNA) for a subset of specimens. The data support the view that R. robustus represents a species complex. Levels of sequence divergence between clades within each region are compatible with a Pleistocene origin. Nucleotide diversity (pi) for all R. prolixus populations was extremely low (0.0008), suggesting that this species went through a recent bottleneck, and was subsequently dispersed by man.
Collapse
Affiliation(s)
- Fernando A Monteiro
- Division of Parasitic Diseases, Centers for Disease Control and Prevention, Atlanta, USA.
| | | | | | | | | | | |
Collapse
|
102
|
Machulla HKG, Batnasan D, Steinborn F, Uyar FA, Saruhan-Direskeneli G, Oguz FS, Carin MN, Dorak MT. Genetic affinities among Mongol ethnic groups and their relationship to Turks. TISSUE ANTIGENS 2003; 61:292-9. [PMID: 12753667 DOI: 10.1034/j.1399-0039.2003.00043.x] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
The central Asian country Mongolia is home to more than 20 tribes and ethnic groups, some of which are related to neighboring Turkic populations. The main Mongolian people, Khalkha, live in central and eastern Mongolia while the Tsaatan minority lives in the north of the country. The Oold minority is from the western Altai mountain region and live in close proximity with Turkic people. We have typed the HLA-A, -B, -Cw, -DRB1 and -DQB1 loci by PCR-SSP in these three Mongolian populations as well as a sample of the German population. To examine their genetic relationships, a sample of the Turkish population already typed at the HLA-A, -B and -DRB1 loci were used. Altogether five populations were analyzed: Khalkha (n = 100), Tsaatan (n = 72), Oold (n = 52), German (n = 260) and (Anatolian) Turkish (n = 498). Nei's unbiased genetic identity (GI) and genetic distance (GD) were estimated from genotypes using PopGene v1.31, and dendrograms were constructed using phylip. The results suggested a close relationship of the Khalkha to the Tsaatan. The Turks and Germans were equally distant to all three Mongolian populations. These results confirmed the lack of strong genetic relationship between the Mongols and the Turks despite the close relationship of their languages (Altaic group) and shared historical neighborhood. This study has provided useful population data for genetic and anthropologic studies bridging eastern and western populations.
Collapse
Affiliation(s)
- H K G Machulla
- Interbranch HLA Laboratory, Martin-Luther University, Halle, Germany.
| | | | | | | | | | | | | | | |
Collapse
|
103
|
O’Neill R, Snowdon R, Köhler W. Population Genetics: Aspects of Biodiversity. PROGRESS IN BOTANY 2003. [DOI: 10.1007/978-3-642-55819-1_7] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
|
104
|
Martin W, Rujan T, Richly E, Hansen A, Cornelsen S, Lins T, Leister D, Stoebe B, Hasegawa M, Penny D. Evolutionary analysis of Arabidopsis, cyanobacterial, and chloroplast genomes reveals plastid phylogeny and thousands of cyanobacterial genes in the nucleus. Proc Natl Acad Sci U S A 2002; 99:12246-51. [PMID: 12218172 PMCID: PMC129430 DOI: 10.1073/pnas.182432999] [Citation(s) in RCA: 754] [Impact Index Per Article: 32.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Chloroplasts were once free-living cyanobacteria that became endosymbionts, but the genomes of contemporary plastids encode only approximately 5-10% as many genes as those of their free-living cousins, indicating that many genes were either lost from plastids or transferred to the nucleus during the course of plant evolution. Previous estimates have suggested that between 800 and perhaps as many as 2,000 genes in the Arabidopsis genome might come from cyanobacteria, but genome-wide phylogenetic surveys that could provide direct estimates of this number are lacking. We compared 24,990 proteins encoded in the Arabidopsis genome to the proteins from three cyanobacterial genomes, 16 other prokaryotic reference genomes, and yeast. Of 9,368 Arabidopsis proteins sufficiently conserved for primary sequence comparison, 866 detected homologues only among cyanobacteria and 834 other branched with cyanobacterial homologues in phylogenetic trees. Extrapolating from these conserved proteins to the whole genome, the data suggest that approximately 4,500 of Arabidopsis protein-coding genes ( approximately 18% of the total) were acquired from the cyanobacterial ancestor of plastids. These proteins encompass all functional classes, and the majority of them are targeted to cell compartments other than the chloroplast. Analysis of 15 sequenced chloroplast genomes revealed 117 nuclear-encoded proteins that are also still present in at least one chloroplast genome. A phylogeny of chloroplast genomes inferred from 41 proteins and 8,303 amino acids sites indicates that at least two independent secondary endosymbiotic events have occurred involving red algae and that amino acid composition bias in chloroplast proteins strongly affects plastid genome phylogeny.
Collapse
Affiliation(s)
- William Martin
- Institut für Botanik, Heinrich Heine Universität, Universitätsstrasse 1, 40225 Düsseldorf, Germany.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
105
|
Lipovich L, Hughes AL, King MC, Abkowitz JL, Quigley JG. Genomic structure and evolutionary context of the human feline leukemia virus subgroup C receptor (hFLVCR) gene: evidence for block duplications and de novo gene formation within duplicons of the hFLVCR locus. Gene 2002; 286:203-13. [PMID: 11943475 DOI: 10.1016/s0378-1119(02)00457-2] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
In this paper we sought to analyze the genomic structure and context of human feline leukemia virus subgroup C receptor (hFLVCR), a human glucarate transporter-like gene at chromosome 1q31, and compare it to that of a paralog (FLVCR14q) at chromosome 14q24. Splicing, polyadenylation, and expression patterns, as estimated by in silico analysis, differed between the two FLVCR genes despite their similar genomic structures, suggesting active and independent evolution of transcriptional and messenger RNA processing patterns after gene duplication. Promoter activity was bi-directional for hFLVCR, but not for its 14q paralog. The upstream 1q transcribed sequences were determined to comprise a novel gene of unknown function, LQK1. Annotation of contigs centered at hFLVCR and FLVCRL14q also revealed highly conserved gene clusters on chromosomes 1 and 14, inferred to result from a duplication. The clusters contained members of the FLVCR, Angel (KIAA0759), JDP, p21SNFT, and TGF- families, as well as two uncharacterized families. The genome-wide locations of both previously recognized and four de novo in silico predicted genes belonging to these seven families were determined. Phylogenetic analyses of these families were consistent with the hypothesis that the 1q/14q duplication occurred early within, or immediately prior to the vertebrate divergence, after the protostome-deuterostome divergence but before the amniote-amphibian divergence.
Collapse
MESH Headings
- 3' Untranslated Regions/genetics
- Alternative Splicing
- Animals
- Cats
- Chromosomes, Human, Pair 1/genetics
- Chromosomes, Human, Pair 14/genetics
- DNA, Complementary/chemistry
- DNA, Complementary/genetics
- Evolution, Molecular
- Gene Duplication
- Genes/genetics
- Humans
- Molecular Sequence Data
- Phylogeny
- Poly A/genetics
- Promoter Regions, Genetic/genetics
- Receptors, Virus/genetics
- Sequence Analysis, DNA
- Time Factors
- Transcription Initiation Site
Collapse
Affiliation(s)
- Leonard Lipovich
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA 98195-7710, USA
| | | | | | | | | |
Collapse
|
106
|
Abstract
Procedures for performing cladistic analyses can provide powerful tools for understanding the evolution of neuropeptide and polypeptide hormone coding genes. These analyses can be done on either amino acid data sets or nucleotide data sets and can utilize several different algorithms that are dependent on distinct sets of operating assumptions and constraints. In some cases, the results of these analyses can be used to gauge phylogenetic relationships between taxa. Selecting the proper cladistic analysis strategy is dependent on the taxonomic level of analysis and the rate of evolution within the orthologous genes being evaluated. For example, previous studies have shown that the amino acid sequence of proopiomelanocortin (POMC), the common precursor for the melanocortins and beta-endorphin, can be used to resolve phylogenetic relationships at the class and order level. This study tested the hypothesis that POMC sequences could be used to resolve phylogenetic relationships at the family taxonomic level. Cladistic analyses were performed on amphibian POMC sequences characterized from the marine toad, Bufo marinus (family Bufonidae; this study), the spadefoot toad, Spea multiplicatus (family Pelobatidae), the African clawed frog, Xenopus laevis (family Pipidae) and the laughing frog, Rana ridibunda (family Ranidae). In these analyses the sequence of Australian lungfish POMC was used as the outgroup. The analyses were done at the amino acid level using the maximum parsimony algorithm and at the nucleotide level using the maximum likelihood algorithm. For the anuran POMC genes, analysis at the nucleotide level using the maximum likelihood algorithm generated a cladogram with higher bootstrap values than the maximum parsimony analysis of the POMC amino acid data set. For anuran POMC sequences, analysis of nucleotide sequences using the maximum likelihood algorithm would appear to be the preferred strategy for resolving phylogenetic relationships at the family taxonomic level.
Collapse
Affiliation(s)
- Jasem Alrubaian
- University of Kuwait, Department of Biological Sciences, 13060, Kuwait City, Kuwait
| | | | | | | |
Collapse
|
107
|
Abstract
Several molecular forms of DNA polymerases have been identified in eukaryotic cells. Although three DNA polymerases alpha, delta, and epsilon, have been well studied and indicated to be involved in nuclear DNA replication process, it remains unclear how this hetero-polymerase system might have arisen. Here I wish to consider its past and future, viewed in the context of molecular evolution. Comparative analysis has revealed some nucleotides and/or amino acids to be conserved in DNA polymerase delta, in polymerase domains III and IV, which have disappeared in DNA polymerase alpha. Furthermore, the codon usage for serine residues in conserved domains of DNA polymerase alpha varies and is not as conservative as for DNA polymerase delta. Recently and in the present study, I have reported that DNA polymerase delta could substitute for the function of DNA polymerase alpha in vitro, and proposed the hypothesis that eukaryotic DNA polymerase alpha arose due to symbiotic contacts. This 'exogenous' polymerase would be expected to be excluded from the eukaryotic DNA replication system, and my analysis in the present study suggests it is about to degenerate.
Collapse
Affiliation(s)
- Masaharu Takemura
- Laboratory of Cancer Cell Biology, Research Institute for Disease Mechanism and Control, Nagoya University Graduate School of Medicine, Tsurumai-cho 65, Showa-ku, Nagoya, Aichi 466-8550, Japan.
| |
Collapse
|
108
|
Abstract
The advent of whole-genome data resources--not only sequence but also other genome-scale data collections such as gene expression, protein interaction, and genetic variation--is having two marked, complementary effects on the relatively new discipline of bioinformatics. First, the veritable flood of data is creating a need and demand for new tools for dealing adequately with the deluge, and, second, the unprecedented extent, diversity, and impending completeness of the data sets are creating opportunities for new approaches to discovery based on computational methods.
Collapse
Affiliation(s)
- D B Searls
- Bioinformatics Department, SmithKline Beecham Pharmaceuticals, King of Prussia, Pennsylvania 19406, USA.
| |
Collapse
|
109
|
Abstract
In this article, we present some simple yet effective statistical techniques for analysing and comparing large DNA sequences. These techniques are based on frequency distributions of DNA words in a large sequence, and have been packaged into a software called SWORDS. Using sequences available in public domain databases housed in the Internet, we demonstrate how SWORDS can be conveniently used by molecular biologists and geneticists to unmask biologically important features hidden in large sequences and assess their statistical significance.
Collapse
Affiliation(s)
- Probal Chaudhuri
- Theoretical Statistics and Mathematics Unit, Indian Statistical Institute, 203 BT Road, Kolkata 700 108, India.
| | | |
Collapse
|
110
|
Rosenberg MS, Kumar S. Incomplete taxon sampling is not a problem for phylogenetic inference. Proc Natl Acad Sci U S A 2001; 98:10751-6. [PMID: 11526218 PMCID: PMC58547 DOI: 10.1073/pnas.191248498] [Citation(s) in RCA: 214] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2001] [Indexed: 11/18/2022] Open
Abstract
A major issue in all data collection for molecular phylogenetics is taxon sampling, which refers to the use of data from only a small representative set of species for inferring higher-level evolutionary history. Insufficient taxon sampling is often cited as a significant source of error in phylogenetic studies, and consequently, acquisition of large data sets is advocated. To test this assertion, we have conducted computer simulation studies by using natural collections of evolutionary parameters--rates of evolution, species sampling, and gene lengths--determined from data available in genomic databases. A comparison of the true tree with trees constructed by using taxa subsamples and trees constructed by using all taxa shows that the amount of phylogenetic error per internal branch is similar; a result that holds true for the neighbor-joining, minimum evolution, maximum parsimony, and maximum likelihood methods. Furthermore, our results show that even though trees inferred by using progressively larger taxa subsamples of a real data set become increasingly similar to trees inferred by using the full sample, all inferred trees are equidistant from the true tree in terms of phylogenetic error per internal branch. Our results suggest that longer sequences, rather than extensive sampling, will better improve the accuracy of phylogenetic inference.
Collapse
Affiliation(s)
- M S Rosenberg
- Department of Biology, Arizona State University, Tempe, AZ 85287-1501, USA
| | | |
Collapse
|
111
|
Rooney AP, Honeycutt RL, Derr JN. Historical population size change of bowhead whales inferred from DNA sequence polymorphism data. Evolution 2001; 55:1678-85. [PMID: 11580027 DOI: 10.1111/j.0014-3820.2001.tb00687.x] [Citation(s) in RCA: 56] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
Nucleotide sequence data from the mitochondrial control region were used from a phylogenetic context to investigate the long-term history of a population of bowhead whales (Balaena mysticetus). In addition, the coalescence time of these sequences was used to estimate the age of the inferred patterns of population size change. The results indicate that mitochondrial genetic polymorphism was not affected by a recent bottleneck that occurred near the turn of the 20th century, thereby preserving the signature of historical population size change in the mitochondrial genome. Further analysis showed that this population underwent an expansion initiated in the Middle to Late Pleistocene. As such, early Holocene changes in Arctic sea ice distribution appear to have had little influence on patterns of genetic variability in this population.
Collapse
Affiliation(s)
- A P Rooney
- Mississippi State University, Department of Biological Sciences, Mississippi 39762, USA.
| | | | | |
Collapse
|
112
|
Sola C, Filliol I, Gutierrez MC, Mokrousov I, Vincent V, Rastogi N. Spoligotype Database ofMycobacterium tuberculosis: Biogeographic Distribution of Shared Types and Epidemiologic and Phylogenetic Perspectives. Emerg Infect Dis 2001. [DOI: 10.3201/10.3201/eid0703.0107304] [Citation(s) in RCA: 95] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Affiliation(s)
| | - Ingrid Filliol
- Institut Pasteur de Guadeloupe, Pointe à Pitre, Guadeloupe
| | | | - Igor Mokrousov
- Institut Pasteur de Guadeloupe, Pointe à Pitre, Guadeloupe
| | - Véronique Vincent
- Centre National de Référence des Mycobactéries, Institut Pasteur, Paris, France
| | - Nalin Rastogi
- Institut Pasteur de Guadeloupe, Pointe à Pitre, Guadeloupe
| |
Collapse
|
113
|
Guryev V, Makarevitch I, Blinov A, Martin J. Phylogeny of the genus Chironomus (Diptera) inferred from DNA sequences of mitochondrial cytochrome b and cytochrome oxidase I. Mol Phylogenet Evol 2001; 19:9-21. [PMID: 11286487 DOI: 10.1006/mpev.2001.0898] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Two mitochondrial genes, Cytochrome b (Cytb) and Cytochrome c oxidase subunit I (COI), have been used as phylogenetic markers in Chironomids. The nucleotide sequences of 685 bp from Cytb and 596 bp from COI have been determined for 36 Chironomus species from the Palearctic, or Holarctic, and Australasia. The concatenated sequence of 1281 bp from both genes was used to investigate the phylogenetic relationships among these species. The nucleotide sequence alignments were used for construction of phylogenetic trees based on maximum-parsimony and neighbor-joining methods. Both techniques produced similar phylogenies. Monophyly of the genus Chironomus is supported by a bootstrap value of 100% at the basal branch. Six clusters of species have been revealed with high bootstrap values supporting both monophyly of each cluster and the validity of the branching order within each cluster. Four species, C. circumdatus, C. nepeanensis, C. dorsalis, and C. crassiforceps, cannot be placed into any cluster. Cytological phylogenies were constructed using the same set of species, except for C. biwaprimus. These trees showed many similarities to that obtained from the mitochondrial (mt) sequence analysis, but also a number of significant differences. When compared with the tree constructed from the sequence of 23 species available for one of the globin genes, globin 2b (gb2b), there was better support for the mt tree than for the cytological trees. An intron, which varies in its occurrence and position in gb2b, was also investigated and the distribution of the introns supports the phylogenetic history of the genus Chironomus obtained with mt data. The differences observed in the cytological trees seem to be attributable more to the retention of the same chromosome banding sequence across several species, rather than convergent evolutionary events. An important question is the determination of the position of the subgenus Camptochironomus in relation to the representatives of the nominal subgenus Chironomus, since it has been suggested that this is a separate genus. The Camptochironomus species are internal to the trees and have arisen more recently than some of the species of the subgenus Chironomus, indicating that they are not sufficiently differentiated to be considered more than a subgenus.
Collapse
Affiliation(s)
- V Guryev
- Laboratory of Cell Biology, Institute of Cytology and Genetics, Novosibirsk 630090, Russia
| | | | | | | |
Collapse
|
114
|
Rujan T, Martin W. How many genes in Arabidopsis come from cyanobacteria? An estimate from 386 protein phylogenies. Trends Genet 2001; 17:113-20. [PMID: 11226586 DOI: 10.1016/s0168-9525(00)02209-5] [Citation(s) in RCA: 97] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
It is well known that chloroplasts and mitochondria donated many genes to nuclear chromosomes during evolution - but how many is "many"? A sample of 3961 Arabidopsis nuclear protein-coding genes was compared with the complete set of proteins from yeast and 17 reference prokaryotic genomes, including one cyanobacterium (the lineage from which plastids arose). The analysis of 386 phylogenetic trees distilled from these data suggests that between approximately 400 (1.6%) and approximately 2200 (9.2%) of Arabidopsis nuclear genes stem from cyanobacteria. The degree of conservation preserved in protein sequences in addition to lateral gene transfer between free-living prokaryotes pose substantial challenges to genome phylogenetics.
Collapse
Affiliation(s)
- T Rujan
- Institute of Botany III, Universität Düsseldorf, Universitätsstr. 1, D-40225 Düsseldorf, Germany
| | | |
Collapse
|
115
|
Nei M, Xu P, Glazko G. Estimation of divergence times from multiprotein sequences for a few mammalian species and several distantly related organisms. Proc Natl Acad Sci U S A 2001; 98:2497-502. [PMID: 11226267 PMCID: PMC30166 DOI: 10.1073/pnas.051611498] [Citation(s) in RCA: 220] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/22/2000] [Indexed: 11/18/2022] Open
Abstract
When many protein sequences are available for estimating the time of divergence between two species, it is customary to estimate the time for each protein separately and then use the average for all proteins as the final estimate. However, it can be shown that this estimate generally has an upward bias, and that an unbiased estimate is obtained by using distances based on concatenated sequences. We have shown that two concatenation-based distances, i.e., average gamma distance weighted with sequence length (d(2)) and multiprotein gamma distance (d(3)), generally give more satisfactory results than other concatenation-based distances. Using these two distance measures for 104 protein sequences, we estimated the time of divergence between mice and rats to be approximately 33 million years ago. Similarly, the time of divergence between humans and rodents was estimated to be approximately 96 million years ago. We also investigated the dependency of time estimates on statistical methods and various assumptions made by using sequence data from eubacteria, protists, plants, fungi, and animals. Our best estimates of the times of divergence between eubacteria and eukaryotes, between protists and other eukaryotes, and between plants, fungi, and animals were 3, 1.7, and 1.3 billion years ago, respectively. However, estimates of ancient divergence times are subject to a substantial amount of error caused by uncertainty of the molecular clock, horizontal gene transfer, errors in sequence alignments, etc.
Collapse
Affiliation(s)
- M Nei
- Institute of Molecular Evolutionary Genetics and Department of Biology, 328 Mueller Laboratory, Pennsylvania State University, University Park, PA 16802, USA.
| | | | | |
Collapse
|
116
|
Abstract
Conflicting results often accompany phylogenetic analyses of RNA, DNA, or protein sequences across diverse species. Causes contributing to these conflicts relate to ambiguities in identifying homologous characters of alignments, sensitivity of tree-making methods to unequal evolutionary rates, biases in species sampling, unrecognized paralogy, functional differentiation, loss of phylogenetic informational content due to long branches or fast evolution, and difficulties with the assumptions and approximations used to infer phylogenetic relationships. Attempts to surmount these conflicts by averaging over many proteins are problematic due to inherent biases of selected families, lack of signal in others, and events of lateral transfer, fusion, and/or chimerism. The process of assessing reliability of the results using the bootstrap method is strewn with obstacles because of lack of independence and inhomogeneity in the molecular data. Problems inherent to the three major procedures for developing phylogenetic trees--parsimony, likelihood, distance--are reviewed. Special attention is given to the problem of inferring evolutionary distances from patterns of similarity among sequences. The difficulties encountered by methods of phylogenetic reconstructions based on the analysis of divergent sequence families make new methods based on the analysis of complete genomes reasonable alternatives. Several of these are considered, including the signature sequences of Gupta and associates, the study of genome profiles, and the genomic signature set forth by Karlin and colleagues.
Collapse
Affiliation(s)
- L Brocchieri
- Department of Mathematics, Stanford University, Stanford, California 94305-2125, USA
| |
Collapse
|
117
|
Rooney AP, Honeycutt RL, Derr JN. HISTORICAL POPULATION SIZE CHANGE OF BOWHEAD WHALES INFERRED FROM DNA SEQUENCE POLYMORPHISM DATA. Evolution 2001. [DOI: 10.1554/0014-3820(2001)055[1678:hpscob]2.0.co;2] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
118
|
Sola C, Filliol I, Gutierrez MC, Mokrousov I, Vincent V, Rastogi N. Spoligotype database of Mycobacterium tuberculosis: biogeographic distribution of shared types and epidemiologic and phylogenetic perspectives. Emerg Infect Dis 2001; 7:390-6. [PMID: 11384514 PMCID: PMC2631784 DOI: 10.3201/eid0703.010304] [Citation(s) in RCA: 83] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
We give an update on the worldwide spoligotype database, which now contains 3,319 spoligotype patterns of Mycobacterium tuberculosis in 47 countries, with 259 shared types, i.e., identical spoligotypes shared by two or more patient isolates. The 259 shared types contained a total of 2,779 (84%) of all the isolates. Seven major genetic groups represented 37% of all clustered isolates. Two types (119 and 137) were found almost exclusively in the USA and accounted for 9% of clustered isolates. The remaining 1,517 isolates were scattered into 252 different spoligotypes. This database constitutes a tool for pattern comparison of M. tuberculosis clinical isolates for global epidemiologic studies and phylogenetic purposes.
Collapse
Affiliation(s)
- C Sola
- Institut Pasteur de Guadeloupe, Pointe poundà Pitre, Guadeloupe.
| | | | | | | | | | | |
Collapse
|
119
|
Barkman TJ, Chenery G, McNeal JR, Lyons-Weiler J, Ellisens WJ, Moore G, Wolfe AD, dePamphilis CW. Independent and combined analyses of sequences from all three genomic compartments converge on the root of flowering plant phylogeny. Proc Natl Acad Sci U S A 2000; 97:13166-71. [PMID: 11069280 PMCID: PMC27196 DOI: 10.1073/pnas.220427497] [Citation(s) in RCA: 160] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2000] [Accepted: 09/06/2000] [Indexed: 11/18/2022] Open
Abstract
Plant phylogenetic estimates are most likely to be reliable when congruent evidence is obtained independently from the mitochondrial, plastid, and nuclear genomes with all methods of analysis. Here, results are presented from separate and combined genomic analyses of new and previously published data, including six and nine genes (8, 911 bp and 12,010 bp, respectively) for different subsets of taxa that suggest Amborella + Nymphaeales (water lilies) are the first-branching angiosperm lineage. Before and after tree-independent noise reduction, most individual genomic compartments and methods of analysis estimated the Amborella + Nymphaeales basal topology with high support. Previous phylogenetic estimates placing Amborella alone as the first extant angiosperm branch may have been misled because of a series of specific problems with paralogy, suboptimal outgroups, long-branch taxa, and method dependence. Ancestral character state reconstructions differ between the two topologies and affect inferences about the features of early angiosperms.
Collapse
Affiliation(s)
- T J Barkman
- Department of Biology, Institute of Molecular Evolutionary Genetics, and Life Sciences Consortium, Pennsylvania State University, University Park, PA 16802, USA
| | | | | | | | | | | | | | | |
Collapse
|
120
|
Rogers SA, Pauls KP. Ty1-copia-like retrotransposons of tomato (Lycopersicon esculentum Mill.). Genome 2000; 43:887-94. [PMID: 11081980 DOI: 10.1139/g00-056] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
We have used a PCR and cloning strategy to identify Ty1-copia-like retrotransposons in tomato, Lycopersicon esculentum Mill. Using degenerate oligonucleotide primers corresponding to conserved domains of the Ty1-copia retrotransposon reverse transcriptase (RT), fragments of about 260 bp were obtained by PCR amplification. Sequences of 20 cloned amplification fragments showed similarity to retrotransposon sequences. The copy number for total tomato Ty1-copia-like RT population was estimated to be approximately 2500 and may account for about 1.5% of the tomato genome. Copy numbers for four of the individual RT clones ranged from 20 to 1400 copies. A comparison of the conceptual translations of the RT sequences identified four clusters as well as three sequences which were ungrouped. When compared to RT sequences reported from several other sources, the tomato RT population was found to be widely dispersed with the majority of the RT sequences from Lycopersicon species delineated by the four tomato cluster groups. The gag region of a tomato retrotransposon was cloned from PCRs with primers based on the Tnt1 retrotransposon of tobacco. The tomato clone (pTom1.1) had 81% sequence similarity to the Tntl gag region. Several pTom1.1 sequences are present in other solanaceous species as indicated by Southern hybridization.
Collapse
Affiliation(s)
- S A Rogers
- Plant Agriculture Department, University of Guelph, ON, Canada
| | | |
Collapse
|
121
|
Tanabe Y, O'Donnell K, Saikawa M, Sugiyama J. Molecular phylogeny of parasitic zygomycota (Dimargaritales, zoopagales) based on nuclear small subunit ribosomal DNA sequences. Mol Phylogenet Evol 2000; 16:253-62. [PMID: 10942611 DOI: 10.1006/mpev.2000.0775] [Citation(s) in RCA: 46] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
We analyzed sequence data of the 18S rDNA gene from representatives of nine mycoparasitic or zooparasitic genera to infer the phylogenetic relationships of these fungi within the Zygomycota. Phylogenetic analyses identified a novel monophyletic clade consisting of the Zoopagales, Kickxellales, Spiromyces, and Harpellales. Analyses also identified a monophyletic mycoparasitic-zooparasitic Zoopagales clade in which Syncephalis, Thamnocephalis, and Rhopalomyces form a sister group to a Piptocephalis-Kuzuhaea clade. Although monophyly of the mycoparasitic Dimargaritales received strong bootstrap and decay index support, phylogenetic relationships of this order could not be resolved because of the unusually high rate of base substitutions within the 18S rDNA gene. Overall, the 18S gene tree topology is weak, as reflected by low bootstrap and decay index support for virtually all internal nodes uniting ordinal and superordinal taxa. Nevertheless, the 18S rDNA phylogeny is mostly consistent with traditional phenotypic-based classification schemes of the Fungi.
Collapse
Affiliation(s)
- Y Tanabe
- Institute of Molecular and Cellular Biosciences, The University of Tokyo, Tokyo, 113, Japan
| | | | | | | |
Collapse
|
122
|
Abstract
The minimum sum of branch lengths (S), or the minimum evolution (ME) principle, has been shown to be a good optimization criterion in phylogenetic inference. Unfortunately, the number of topologies to be analyzed is computationally prohibitive when a large number of taxa are involved. Therefore, simplified, heuristic methods, such as the neighbor-joining (NJ) method, are usually employed instead. The NJ method analyzes only a small number of trees (compared with the size of the entire search space); so, the tree obtained may not be the ME tree (for which the S value is minimum over the entire search space). Different compromises between very restrictive and exhaustive search spaces have been proposed recently. In particular, the "stepwise algorithm" (SA) utilizes what is known in computer science as the "beam search," whereas the NJ method employs a "greedy search." SA is virtually guaranteed to find the ME trees while being much faster than exhaustive search algorithms. In this study we propose an even faster method for finding the ME tree. The new algorithm adjusts its search exhaustiveness (from greedy to complete) according to the statistical reliability of the tree node being reconstructed. It is also virtually guaranteed to find the ME tree. The performances and computational efficiencies of ME, SA, NJ, and our new method were compared in extensive simulation studies. The new algorithm was found to perform practically as well as the SA (and, therefore, ME) methods and slightly better than the NJ method. For searching for the globally optimal ME tree, the new algorithm is significantly faster than existing ones, thus making it relatively practical for obtaining all trees with an S value equal to or smaller than that of the NJ tree, even when a large number of taxa is involved.
Collapse
Affiliation(s)
- A Rodin
- Human Genetics Center, School of Public Health, University of Texas Health Science Center, Texas, Houston 77225, USA
| | | |
Collapse
|
123
|
Takahashi K, Nei M. Efficiencies of fast algorithms of phylogenetic inference under the criteria of maximum parsimony, minimum evolution, and maximum likelihood when a large number of sequences are used. Mol Biol Evol 2000; 17:1251-8. [PMID: 10908645 DOI: 10.1093/oxfordjournals.molbev.a026408] [Citation(s) in RCA: 197] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
In phylogenetic inference by maximum-parsimony (MP), minimum-evolution (ME), and maximum-likelihood (ML) methods, it is customary to conduct extensive heuristic searches of MP, ME, and ML trees, examining a large number of different topologies. However, these extensive searches tend to give incorrect tree topologies. Here we show by extensive computer simulation that when the number of nucleotide sequences (m) is large and the number of nucleotides used (n) is relatively small, the simple MP or ML tree search algorithms such as the stepwise addition (SA) plus nearest neighbor interchange (NNI) search and the SA plus subtree pruning regrafting (SPR) search are as efficient as the extensive search algorithms such as the SA plus tree bisection-reconnection (TBR) search in inferring the true tree. In the case of ME methods, the simple neighbor-joining (NJ) algorithm is as efficient as or more efficient than the extensive NJ+TBR search. We show that when ME methods are used, the simple p distance generally gives better results in phylogenetic inference than more complicated distance measures such as the Hasegawa-Kishino-Yano (HKY) distance, even when nucleotide substitution follows the HKY model. When ML methods are used, the simple Jukes-Cantor (JC) model of phylogenetic inference generally shows a better performance than the HKY model even if the likelihood value for the HKY model is much higher than that for the JC model. This indicates that at least in the present case, selecting of a substitution model by using the likelihood ratio test or the AIC index is not appropriate. When n is small relative to m and the extent of sequence divergence is high, the NJ method with p distance often shows a better performance than ML methods with the JC model. However, when the level of sequence divergence is low, this is not the case.
Collapse
Affiliation(s)
- K Takahashi
- Institute of Molecular Evolutionary Genetics and Department of Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | | |
Collapse
|
124
|
|
125
|
Jacoby CP, Fonseca CG. Linguistic analysis in phylogeny estimation: a case study of mtDNAs of Bovidae. J Biomol Struct Dyn 2000; 17:1047-55. [PMID: 10949171 DOI: 10.1080/07391102.2000.10506592] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
Molecular phylogenetic studies are executed by the alignment of protein or nucleotide sequences, followed by the construction of trees according either to distance, parsimony or maximum likelihood methods. Linguistic analysis was investigated here as an alternative method to aligning sequences. In an empirical study, we inferred trees for a variable number of Bovidae and sister taxa based on three different mitochondrial orthologous sequences. Comparison of our results with existing phylogenies indicated that the method, except for some still disputable points, was able to establish sensible systematic relationships, similar to patterns of radiation of the family found in recent studies.
Collapse
Affiliation(s)
- C P Jacoby
- Departamento de Zootecnia, Escola de Veterinária, Universidade Federal de Minas Gerais, Belo Horizonte, Brasil.
| | | |
Collapse
|
126
|
Nilsen F. Small subunit ribosomal DNA phylogeny of microsporidia with particular reference to genera that infect fish. J Parasitol 2000; 86:128-33. [PMID: 10701575 DOI: 10.1645/0022-3395(2000)086[0128:ssrdpo]2.0.co;2] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Molecular data have proved useful in the study of microsporidia phylogeny. Previous studies have shown that there are several important differences between phylogenies based on rRNA and morphological data. In the present study, small subunit (SSU) rDNA sequences were obtained from 7 different fish-infecting microsporidia from 4 different genera (Glugea Thélohan, 1891, Loma Morrison and Sprague, 1981, Pleistophora Gurley, 1893, and Spraguea Weissenberg, 1976). The lengths of the SSU rDNA genes in these species were between 1,332 and 1,343 base pairs. Phylogenetic analysis was performed using parsimony, maximum likelihood, and Kimura 2-parameter with neighbor joining. The analyses revealed that the microsporidia could be divided into 3 major groups. With the exception of Nucleospora salmonis Hedrick, Groff, and Baxa, 1991, all the microsporidia infecting fishes occurred in the same group. The analysis showed that Pleistophora mirandellae Vaney and Conte, 1901 and Pleistophora aguillarum Hoshina, 1951 are not species of Pleistophora. Furthermore, the analysis showed that Loma is not a member of Glugeidae Thélohan, 1892.
Collapse
Affiliation(s)
- F Nilsen
- Department of Fisheries and Marine Biology, University of Bergen, Bergen High Technology Centre, Norway
| |
Collapse
|
127
|
Bujnicki JM, Radlinska M. Molecular evolution of DNA-(cytosine-N4) methyltransferases: evidence for their polyphyletic origin. Nucleic Acids Res 1999; 27:4501-9. [PMID: 10536161 PMCID: PMC148735 DOI: 10.1093/nar/27.22.4501] [Citation(s) in RCA: 43] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
DNA N4-cytosine methyltransferases (N4mC MTases) are a family of S-adenosyl-L-methionine (AdoMet)-dependent MTases. Members of this family were previously found to share nine conserved sequence motifs, but the evolutionary basis of these similarities has never been studied in detail. We performed phylogenetic analysis of 37 known and potential new family members from the multiple sequence alignment using distance matrix, parsimony and maximum likelihood approaches to infer the evolutionary relationship among the N4mC MTases and classify them into groups of orthologs. All the treeing algorithms employed as well as results of exhaustive sequence database searching support a scenario, in which the majority of N4mC MTases, except for M. Bal I and M. Bam HI, arose by divergence from a common ancestor. Interestingly, MTases M. Bal I and M. Bam HI apparently originated from N6-adenine MTases and represent the most recent addendum to the N4mC MTase family. In addition to the previously reported nine sequence motifs, two more conserved sequence patches were detected. Phylogenetic analysis also provided the evidence for massive horizontal transfer of MTase genes, presumably with the whole restriction-modification systems, between Bacteria and Archaea.
Collapse
Affiliation(s)
- J M Bujnicki
- Molecular Biology Research Program, Henry Ford Health System, One Ford Place Suite 5D, Detroit, MI 48202, USA.
| | | |
Collapse
|
128
|
Abstract
The literature on sequencing as a tool for yeast molecular taxonomy is reviewed. Ribosomal DNA has been preferred for sequencing over other molecules such as mitochondrial DNA, and a large database is now available. rDNA consists of regions that evolve at different rates, allowing comparison of different levels of relationship among yeasts. Sequences of the 18S rDNA and the 25S rDNA have been largely used for yeast systematics and phylogeny, but the search for regions with increased resolving power has led to the study of the spacer regions of the rDNA. Few studies are concerned with signature sequences.
Collapse
Affiliation(s)
- P Valente
- Departamento de Genética, Universidade Federal do Rio de Janeiro, Ilha do Fundão, Brasil.
| | | | | |
Collapse
|
129
|
Meyers BC, Dickerman AW, Michelmore RW, Sivaramakrishnan S, Sobral BW, Young ND. Plant disease resistance genes encode members of an ancient and diverse protein family within the nucleotide-binding superfamily. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 1999; 20:317-32. [PMID: 10571892 DOI: 10.1046/j.1365-313x.1999.t01-1-00606.x] [Citation(s) in RCA: 439] [Impact Index Per Article: 16.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
The nucleotide binding site (NBS) is a characteristic domain of many plant resistance gene products. An increasing number of NBS-encoding sequences are being identified through gene cloning, PCR amplification with degenerate primers, and genome sequencing projects. The NBS domain was analyzed from 14 known plant resistance genes and more than 400 homologs, representing 26 genera of monocotyledonous, dicotyle-donous and one coniferous species. Two distinct groups of diverse sequences were identified, indicating divergence during evolution and an ancient origin for these sequences. One group was comprised of sequences encoding an N-terminal domain with Toll/Interleukin-1 receptor homology (TIR), including the known resistance genes, N, M, L6, RPP1 and RPP5. Surprisingly, this group was entirely absent from monocot species in searches of both random genomic sequences and large collections of ESTs. A second group contained monocot and dicot sequences, including the known resistance genes, RPS2, RPM1, I2, Mi, Dm3, Pi-B, Xa1, RPP8, RPS5 and Prf. Amino acid signatures in the conserved motifs comprising the NBS domain clearly distinguished these two groups. The Arabidopsis genome is estimated to contain approximately 200 genes that encode related NBS motifs; TIR sequences were more abundant and outnumber non-TIR sequences threefold. The Arabidopsis NBS sequences currently in the databases are located in approximately 21 genomic clusters and 14 isolated loci. NBS-encoding sequences may be more prevalent in rice. The wide distribution of these sequences in the plant kingdom and their prevalence in the Arabidopsis and rice genomes indicate that they are ancient, diverse and common in plants. Sequence inferences suggest that these genes encode a novel class of nucleotide-binding proteins.
Collapse
Affiliation(s)
- B C Meyers
- Department of Vegetable Crops, University of California, Davis 95616, USA.
| | | | | | | | | | | |
Collapse
|
130
|
Wang N, Chen R. Comparison between phylogeny of introns and exons in primate. ACTA ACUST UNITED AC 1999. [DOI: 10.1007/bf02887113] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
131
|
Nikaido M, Rooney AP, Okada N. Phylogenetic relationships among cetartiodactyls based on insertions of short and long interpersed elements: hippopotamuses are the closest extant relatives of whales. Proc Natl Acad Sci U S A 1999; 96:10261-6. [PMID: 10468596 PMCID: PMC17876 DOI: 10.1073/pnas.96.18.10261] [Citation(s) in RCA: 243] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/1999] [Accepted: 06/16/1999] [Indexed: 11/18/2022] Open
Abstract
Insertion analysis of short and long interspersed elements is a powerful method for phylogenetic inference. In a previous study of short interspersed element data, it was found that cetaceans, hippopotamuses, and ruminants form a monophyletic group. To further resolve the relationships among these taxa, we now have isolated and characterized 10 additional loci. A phylogenetic analysis of these data was able to resolve relationships among the major cetartiodactyl groups, thereby shedding light on the origin of whales. The results indicated (i) that cetaceans are deeply nested within Artiodactyla, (ii) that cetaceans and hippopotamuses form a monophyletic group, (iii) that pigs and peccaries form a monophyletic group to the exclusion of hippopotamuses, (iv) that chevrotains diverged first among ruminants, and (v) that camels diverged first among cetartiodactyls. These findings lead us to conclude that cetaceans evolved from an immediate artiodactyl, not mesonychian, ancestor.
Collapse
Affiliation(s)
- M Nikaido
- Faculty of Bioscience and Biotechnology, Tokyo Institute of Technology, 4259 Nagatsuta-cho, Yokohama, Midori-ku, Kanagawa 226-8501, Japan
| | | | | |
Collapse
|
132
|
Abstract
Elucidation of interrelationships among sequence, structure, function, and evolution (FESS relationships) of a family of genes or gene products is a central theme of modern molecular biology. Multiple sequence alignment has been proven to be a powerful tool for many fields of studies such as phylogenetic reconstruction, illumination of functionally important regions, and prediction of higher order structures of proteins and RNAs. However, it is far too trivial to automatically construct a multiple alignment from a set of related sequences. A variety of methods for solving this computationally difficult problem are reviewed. Several important applications of multiple alignment for elucidation of the FESS relationships are also discussed. For a long period, progressive methods have been the only practical means to solve a multiple alignment problem of appreciable size. This situation is now changing with the development of new techniques including several classes of iterative methods. Today's progress in multiple sequence alignment methods has been made by the multidisciplinary endeavors of mathematicians, computer scientists, and biologists in various fields including biophysicists in particular. The ideas are also originated from various backgrounds, pure algorithmics, statistics, thermodynamics, and others. The outcomes are now enjoyed by researchers in many fields of biological sciences. In the near future, generalized multiple alignment may play a central role in studies of FESS relationships. The organized mixture of knowledge from multiple fields will ferment to develop fruitful results which would be hard to obtain within each area. I hope this review provides a useful information resource for future development of theory and practice in this rapidly expanding area of bioinformatics.
Collapse
Affiliation(s)
- O Gotoh
- Saitama Cancer Center Research Institute, Japan
| |
Collapse
|
133
|
Frothingham R, Strickland PL, Bretzel G, Ramaswamy S, Musser JM, Williams DL. Phenotypic and genotypic characterization of Mycobacterium africanum isolates from West Africa. J Clin Microbiol 1999; 37:1921-6. [PMID: 10325347 PMCID: PMC84985 DOI: 10.1128/jcm.37.6.1921-1926.1999] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The Mycobacterium tuberculosis complex includes M. tuberculosis, M. bovis, M. africanum, and M. microti. Most clinical isolates are M. tuberculosis or M. bovis. These species can be distinguished by phenotypes and genotypes. However, there is no simple definition of M. africanum, and some authors question the validity of this species. We analyzed 17 human isolates from Sierra Leone, identified as M. africanum by biochemical and growth characteristics. We sequenced polymorphic genes and intergenic regions. We amplified DNA from six loci with variable numbers of tandem repeats (VNTRs) and determined the exact number of repeats at each locus in each strain. All M. africanum isolates had the ancestral CTG Leu at katG codon 463. Drug-resistant M. africanum isolates had katG and rpoB mutations similar to those found in drug-resistant M. bovis and M. tuberculosis. Fourteen Sierra Leone M. africanum isolates (designated group A) had katG codon 203 ACC Thr, also found in M. africanumT (the T indicates type strain) from Senegal. Group A isolates clustered with M. africanumT by VNTR analysis. Three M. africanum isolates (group B) had katG codon 203 ACT Thr, found in M. tuberculosisT, and clustered with M. tuberculosisT by VNTR analysis. Phenotypic identification of M. africanum yielded a heterogeneous collection of strains. Genotypic analyses identified a cluster (M. africanum group A) which included M. africanumT and was distinct from the rest of the M. tuberculosis complex. Future studies of M. africanum should include both phenotypic and genotypic analyses.
Collapse
Affiliation(s)
- R Frothingham
- Infectious Disease Section, Veterans Affairs Medical Center, Durham, North Carolina 27705, USA.
| | | | | | | | | | | |
Collapse
|
134
|
RAPD-PCR, isozyme, rDNA RFLP and rDNA sequence analyses in identification of Finnish Fusarium oxysporum isolates. ACTA ACUST UNITED AC 1999. [DOI: 10.1017/s0953756298007485] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
135
|
Krossøy B, Hordvik I, Nilsen F, Nylund A, Endresen C. The putative polymerase sequence of infectious salmon anemia virus suggests a new genus within the Orthomyxoviridae. J Virol 1999; 73:2136-42. [PMID: 9971796 PMCID: PMC104458 DOI: 10.1128/jvi.73.3.2136-2142.1999] [Citation(s) in RCA: 88] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The infectious salmon anemia virus (ISAV) is an orthomyxovirus-like virus infecting teleosts. The disease caused by this virus has had major economic consequences for the Atlantic salmon farming industry in Norway, Canada, and Scotland. In this work, we report the cloning and sequencing of an ISAV-specific cDNA comprising 2,245 bp with an open reading frame coding for a predicted protein with a calculated molecular weight of 80.5 kDa. The putative protein sequence shows the core polymerase motifs characteristic of all viral RNA-dependent RNA polymerases. Comparison of the conserved motifs with the corresponding regions of other segmented negative-stranded RNA viruses shows a closer relationship with members of the Orthomyxoviridae than with viruses in other families. The putative ISAV polymerase protein (PB1) has a length of 708 amino acids, a charge of +22 at neutral pH, and a pI of 9.9, which are consistent with the properties of the PB1 proteins of other members of the family. Calculations of the distances between the different PB1 proteins indicate that the ISAV is distantly related to the other members of the family but more closely related to the influenza viruses than to the Thogoto viruses. Based on these and previously published results, we propose that the ISAV comprises a new, fifth genus in the Orthomyxoviridae.
Collapse
Affiliation(s)
- B Krossøy
- Department of Fisheries and Marine Biology, University of Bergen, Bergen, Norway.
| | | | | | | | | |
Collapse
|
136
|
Reconstruction of phylogenetic relationships. J Biosci 1999. [DOI: 10.1007/bf02941116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
137
|
Young GB, Jack DL, Smith DW, Saier MH. The amino acid/auxin:proton symport permease family. BIOCHIMICA ET BIOPHYSICA ACTA 1999; 1415:306-22. [PMID: 9889387 DOI: 10.1016/s0005-2736(98)00196-5] [Citation(s) in RCA: 95] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Amino acids and their derivatives are transported into and out of cells by a variety of permease types which comprise several distinct protein families. We here present a systematic analysis of a group of homologous transport proteins which together comprise the eukaryotic-specific amino acid/auxin permease (AAAP) family (TC #2. 18). In characterizing this family, we have (1) identified all sequenced members of the family, (2) aligned their sequences, (3) identified regions of striking conservation, (4) derived a family-specific signature sequence, and (5) proposed a topological model that appears to be applicable to all members of the family. We have also constructed AAAP family phylogenetic trees and dendrograms using six different programs that allow us to trace the evolutionary history of the family, estimate the relatedness of proteins from dissimilar organismal phyla, and evaluate the reliability of the different programs available for phylogenetic studies. The TREE and neighbor-joining programs gave fully consistent results while CLUSTAL W gave similar but non-identical results. Other programs gave less consistent results. The phylogenetic analyses reveal (1) that many plant AAAP family proteins arose recently by multiple gene duplication events that occurred within a single organism, (2) that some plant members of the family with strikingly different specificities diverged early in evolutionary history, and (3) that AAAP family proteins from fungi and animals diverged from the plant proteins long ago, possibly when animals, plants and fungi diverged from each other. The Neurospora protein nevertheless exhibits overlapping specificity with those found in plants. Preliminary evidence is presented suggesting that proteins of the AAAP family are distantly related to proteins of the large ubiquitous amino acid/polyamine/choline family (TC #2.3) as well as to those of two small bacterial amino acid transporter families, the ArAAP family (TC #2.42) and the STP family (TC #2.43).
Collapse
Affiliation(s)
- G B Young
- Department of Biology, University of California at San Diego, La Jolla, CA 92093-0116, USA
| | | | | | | |
Collapse
|
138
|
Bulach DM, Kumar CA, Zaia A, Liang B, Tribe DE. Group II nucleopolyhedrovirus subgroups revealed by phylogenetic analysis of polyhedrin and DNA polymerase gene sequences. J Invertebr Pathol 1999; 73:59-73. [PMID: 9878291 DOI: 10.1006/jipa.1998.4797] [Citation(s) in RCA: 100] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Two major clades, designated Groups I and II, of nucleopolyhedroviruses (NPVs) from lepidopteran hosts have been previously identified. To reveal more detailed relationships, a series of DNA polymerase nucleotide sequences from the taxa MbMNPV, SeMNPV, HzSNPV, HearNPV, SpltNPV, BusuNPV, and OranNPV have been determined using a polymerase chain reaction (PCR)-based approach. This technique enabled gene sequence determination using microliter samples of NPV-infected insect cadavers. Polyhedrin genes from HearNPV, OranNPV, SeMNPV, and SpltNPV were also isolated and sequenced using a similar approach. These sequences, together with other database entries, were aligned for positional homology of peptide sequences. Phylogenetic analysis of DNA polymerase molecular sequence alignments supports LdMNPV as a taxon of Group II and three Group II subclades, designated A, B, and C. Comparison of DNA polymerase trees with those estimated from occlusion protein molecular sequences enabled identification of three subclades of Group II. These are Subgroup II-A [MbMNPV, LeseNPV, MacoNPV, PaflNPV, SeMNPV, SpltNPV (India isolate), SfMNPV]; Subgroup II-B [SpliNPV, SpltNPV (Japan isolate), SpltNPV (Queensland isolate), and possibly HzSNPV, HearNPV, and ManeNPV], and Subgroup II-C [OpSNPV, OranNPV (S-type), BusuNPV (S-type), and possibly EcobNPV (S-type)]. Notably, all Subgroup II-A taxa are from noctuid hosts. Correlations of virus and host evolution within Group II taxa are discussed. The methods and data developed in this study will allow rapid sequencing of NPV DNA polymerase genes.
Collapse
Affiliation(s)
- D M Bulach
- Department of Microbiology and Immunology, The University of Melbourne, Parkville, Victoria, 3052, Australia
| | | | | | | | | |
Collapse
|
139
|
Castresana J, Feldmaier-Fuchs G, Yokobori S, Satoh N, Pääbo S. The mitochondrial genome of the hemichordate Balanoglossus carnosus and the evolution of deuterostome mitochondria. Genetics 1998; 150:1115-23. [PMID: 9799263 PMCID: PMC1460392 DOI: 10.1093/genetics/150.3.1115] [Citation(s) in RCA: 71] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The complete nucleotide sequence of the mitochondrial genome of the hemichordate Balanoglossus carnosus (acorn worm) was determined. The arrangement of the genes encoding 13 protein, 22 tRNA, and 2 rRNA genes is essentially the same as in vertebrates, indicating that the vertebrate and hemichordate mitochondrial gene arrangement is close to that of their common ancestor, and, thus, that it has been conserved for more than 600 million years, whereas that of echinoderms has been rearranged extensively. The genetic code of hemichordate mitochondria is similar to that of echinoderms in that ATA encodes isoleucine and AGA serine, whereas the codons AAA and AGG, whose amino acid assignments also differ between echinoderms and vertebrates, are absent from the B. carnosus mitochondrial genome. There are three noncoding regions of length 277, 41, and 32 bp: the larger one is likely to be equivalent to the control region of other deuterostomes, while the two others may contain transcriptional promoters for genes encoded on the minor coding strand. Phylogenetic trees estimated from the inferred protein sequences indicate that hemichordates are a sister group of echinoderms.
Collapse
Affiliation(s)
- J Castresana
- Institute of Zoology, University of Munich, D-80333 Munich, Germany.
| | | | | | | | | |
Collapse
|
140
|
Nei M, Kumar S, Takahashi K. The optimization principle in phylogenetic analysis tends to give incorrect topologies when the number of nucleotides or amino acids used is small. Proc Natl Acad Sci U S A 1998; 95:12390-7. [PMID: 9770497 PMCID: PMC22842 DOI: 10.1073/pnas.95.21.12390] [Citation(s) in RCA: 108] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
In the maximum parsimony (MP) and minimum evolution (ME) methods of phylogenetic inference, evolutionary trees are constructed by searching for the topology that shows the minimum number of mutational changes required (M) and the smallest sum of branch lengths (S), respectively, whereas in the maximum likelihood (ML) method the topology showing the highest maximum likelihood (A) of observing a given data set is chosen. However, the theoretical basis of the optimization principle remains unclear. We therefore examined the relationships of M, S, and A for the MP, ME, and ML trees with those for the true tree by using computer simulation. The results show that M and S are generally greater for the true tree than for the MP and ME trees when the number of nucleotides examined (n) is relatively small, whereas A is generally lower for the true tree than for the ML tree. This finding indicates that the optimization principle tends to give incorrect topologies when n is small. To deal with this disturbing property of the optimization principle, we suggest that more attention should be given to testing the statistical reliability of an estimated tree rather than to finding the optimal tree with excessive efforts. When a reliability test is conducted, simplified MP, ME, and ML algorithms such as the neighbor-joining method generally give conclusions about phylogenetic inference very similar to those obtained by the more extensive tree search algorithms.
Collapse
Affiliation(s)
- M Nei
- Institute of Molecular Evolutionary Genetics and Department of Biology, Pennsylvania State University, University Park, PA 16802-5301, USA.
| | | | | |
Collapse
|
141
|
Buettner JA, Glusman G, Ben-Arie N, Ramos P, Lancet D, Evans GA. Organization and evolution of olfactory receptor genes on human chromosome 11. Genomics 1998; 53:56-68. [PMID: 9787077 DOI: 10.1006/geno.1998.5422] [Citation(s) in RCA: 42] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Olfactory receptors (OR) are encoded by a large multigene family including hundreds of members dispersed throughout the human genome. Cloning and mapping studies have determined that a large proportion of the olfactory receptor genes are located on human chromosomes 6, 11, and 17, as well as distributed on other chromosomes. In this paper, we describe and characterize the organization of olfactory receptor genes on human chromosome 11 by using degenerate PCR-based probes to screen chromosome 11-specific and whole genome clone libraries for members of the OR gene family. OR genes were identified by DNA sequencing and then localized to regions of chromosome 11. Physical maps of several gene clusters were constructed to determine the chromosomal relationships between various members of the family. This work identified 25 new OR genes located on chromosome 11 in at least seven distinct regions. Three of these regions contain gene clusters that include additional members of this gene family not yet identified by sequencing. Phylogenetic analysis of the newly described OR genes suggests a mechanism for the generation of genetic diversity.
Collapse
Affiliation(s)
- J A Buettner
- McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center at Dallas, Dallas, Texas, 75235-8591, USA
| | | | | | | | | | | |
Collapse
|
142
|
Abstract
Morphological differences between species, from simple single-character differences to large-scale variation in body plans, can be traced to changes in the timing and location of developmental events. This has led to a growing interest in understanding the genetic basis behind the evolution of developmental systems. Molecular evolutionary genetics provides one of several approaches to dissecting the evolution of developmental systems, by allowing us to reconstruct the history of developmental genetic pathways, infer the origin and diversification of developmental gene functions, and assess the relative contributions of various evolutionary forces in shaping regulatory gene evolution.
Collapse
Affiliation(s)
- M D Purugganan
- Department of Genetics, North Carolina State University, Raleigh 27695, USA. michaelp@unity,ncsu.edu
| |
Collapse
|
143
|
Nilsen F, Endresen C, Hordvik I. Molecular phylogeny of microsporidians with particular reference to species that infect the muscles of fish. J Eukaryot Microbiol 1998; 45:535-43. [PMID: 9783455 DOI: 10.1111/j.1550-7408.1998.tb05113.x] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Ribosomal DNA from eight species of microsporidians infecting fish have been sequenced. Seven of these species infect the skeletal muscle of fish (Pleistophora spp.) and one species infects migratory mesenchyma cells (Glugea anomala). These sequences, in addition to other available microsporidian rDNA sequences from a broad range of host taxa, have been used in phylogenetic analysis. This analysis revealed that muscle-infecting microsporidians from fish are a polyphyletic group, indicating that characters supposed to be important in the classification of the genus Pleistophora have to be re-evaluated. One character that probably has a polyphyletic origin is the amorphous coat, which has been extensively used in the definition of this genus. Furthermore, our results showed that the insect parasitizing Pleistophora spp. are not related to the true pleistophorans parasitic in skeletal muscle of fish. Phylogenetic analysis of small subunit rDNA sequences revealed disagreements between the molecular phylogeny and classifications based upon ultrastructure. Many of the morphological characters claimed to be important in microsporidian classifications appeared to have arisen several times during evolution: for example, the diplokaryon and sporophorous vesicles.
Collapse
Affiliation(s)
- F Nilsen
- Department of Fisheries and Marine Biology, University of Bergen, Bergen High Technology Centre, Norway.
| | | | | |
Collapse
|
144
|
Gu X, Li WH. Estimation of evolutionary distances under stationary and nonstationary models of nucleotide substitution. Proc Natl Acad Sci U S A 1998; 95:5899-905. [PMID: 9600890 PMCID: PMC34493 DOI: 10.1073/pnas.95.11.5899] [Citation(s) in RCA: 61] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Estimation of evolutionary distances has always been a major issue in the study of molecular evolution because evolutionary distances are required for estimating the rate of evolution in a gene, the divergence dates between genes or organisms, and the relationships among genes or organisms. Other closely related issues are the estimation of the pattern of nucleotide substitution, the estimation of the degree of rate variation among sites in a DNA sequence, and statistical testing of the molecular clock hypothesis. Mathematical treatments of these problems are considerably simplified by the assumption of a stationary process in which the nucleotide compositions of the sequences under study have remained approximately constant over time, and there now exist fairly extensive studies of stationary models of nucleotide substitution, although some problems remain to be solved. Nonstationary models are much more complex, but significant progress has been recently made by the development of the paralinear and LogDet distances. This paper reviews recent studies on the above issues and reports results on correcting the estimation bias of evolutionary distances, the estimation of the pattern of nucleotide substitution, and the estimation of rate variation among the sites in a sequence.
Collapse
Affiliation(s)
- X Gu
- Institute of Molecular Evolutionary Genetics, 328 Mueller Laboratory, Pennsylvania State University, University Park, PA 16802, USA
| | | |
Collapse
|
145
|
Cohen BL, Stark S, Gawthrop AB, Burke ME, Thayer CW. Comparison of articulate brachiopod nuclear and mitochondrial gene trees leads to a clade-based redefinition of protostomes (Protostomozoa) and deuterostomes (Deuterostomozoa). Proc Biol Sci 1998; 265:475-82. [PMID: 9569666 PMCID: PMC1688916 DOI: 10.1098/rspb.1998.0319] [Citation(s) in RCA: 41] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Nuclear and mtDNA sequences from selected short-looped terebratuloid (terebratulacean) articulate brachiopods yield congruent and genetically independent phylogenetic reconstructions by parsimony, neighbour-joining and maximum likelihood methods, suggesting that both sources of data are reliable guides to brachiopod species phylogeny. The present-day genealogical relationships and geographical distributions of the tested terebratuloid brachiopods are consistent with a tethyan dispersal and subsequent radiation. Concordance of nuclear and mitochondrial gene phylogenies reinforces previous indications that articulate brachiopods, inarticulate brachiopods, phoronids and ectoprocts cluster with other organisms generally regarded as protostomes. Since ontogeny and morphology in brachiopods, ectoprocts and phoronids depart in important respects from those features supposedly diagnostic of protostomes, this demonstrates that the operational definition of protostomy by the usual ontological characters must be misleading or unreliable. New, molecular, operational definitions are proposed to replace the traditional criteria for the recognition of protostomes and deuterostomes, and the clade-based terms 'Protostomoza' and 'Deuterostomozoa' are proposed to replace the existing term 'Protostomia' and 'Deuterostomia'.
Collapse
Affiliation(s)
- B L Cohen
- University of Glasgow, Division of Molecular Genetics, UK.
| | | | | | | | | |
Collapse
|
146
|
Heidrich P, Amengual J, Wink M. Phylogenetic relationships in Mediterranean and North Atlantic shearwaters (Aves: Procellariidae) based on nucleotide sequences of mtDNA. BIOCHEM SYST ECOL 1998. [DOI: 10.1016/s0305-1978(97)00085-9] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
147
|
Searls DB. Grand challenges in computational biology. COMPUTATIONAL METHODS IN MOLECULAR BIOLOGY 1998. [DOI: 10.1016/s0167-7306(08)60458-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/10/2023]
|
148
|
Barta JR. Investigating phylogenetic relationships within the Apicomplexa using sequence data: the search for homology. Methods 1997; 13:81-8. [PMID: 9405192 DOI: 10.1006/meth.1997.0501] [Citation(s) in RCA: 42] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
Whether stated explicitly or not, all molecular studies that seek to infer "homologies" among sequences or that attempt to determine the "relatedness" of taxa based on sequence comparisons are evolutionary studies. The generation of a reliable evolutionary hypothesis based on molecular sequences is dependent almost exclusively on the ability to align sequences such that bases or amino acids in the same position of two sequences are positionally homologous (i.e., they share the same position in the gene under study). The selection of suitable gene targets (commonly 18S small subunit rRNA gene sequences in the Apicomplexa) and appropriate ingroup and outgroup taxa will affect the ability to align sequences unambiguously. Mathematically derived alignments based on local sequence similarity have been shown to be less reliable than alignments based on conserved secondary structures coupled with an analysis of compensatory base changes. Use of staggered sequence alignments through hypervariable regions of 18S small subunit rRNA gene sequences in which subsets of taxa are aligned independently may permit inclusion of more of the primary sequences with an associated increase in information content in the data set. The use of these highly variable regions is critical for determining the branching order of closely related terminal taxa in the phylum Apicomplexa.
Collapse
MESH Headings
- Animals
- Apicomplexa/classification
- Apicomplexa/genetics
- Base Sequence
- Conserved Sequence
- DNA, Protozoan/chemistry
- DNA, Protozoan/genetics
- DNA, Ribosomal/chemistry
- DNA, Ribosomal/genetics
- Genetic Techniques
- Molecular Sequence Data
- Nucleic Acid Conformation
- Phylogeny
- RNA, Protozoan/chemistry
- RNA, Protozoan/genetics
- RNA, Ribosomal, 18S/chemistry
- RNA, Ribosomal, 18S/genetics
- Sequence Alignment
- Sequence Homology, Nucleic Acid
Collapse
Affiliation(s)
- J R Barta
- Department of Pathobiology, University of Guelph, Guelph, Ontario, N1G 2W1, Canada.
| |
Collapse
|
149
|
Affiliation(s)
- S T Sherry
- Department of Pathology, Stanley S. Scott Cancer Center, Neuroscience Center of Excellence, Louisiana State University Medical Center, New Orleans, Louisiana 70112 USA
| | | |
Collapse
|
150
|
Affiliation(s)
- G A Hawkins
- Epicentre Technologies, Madison, WI 53713, USA
| | | |
Collapse
|