1
|
Brito AF, Pinney JW. The evolution of protein domain repertoires: Shedding light on the origins of the Herpesviridae family. Virus Evol 2020; 6:veaa001. [PMID: 32042448 PMCID: PMC7000910 DOI: 10.1093/ve/veaa001] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Herpesviruses (HVs, Family: Herpesviridae) have large genomes that encode hundreds of proteins. Apart from amino acid mutations, protein domain acquisitions, duplications and losses are also common modes of evolution. HV domain repertoires differ across species, and only a core set is shared among all species, aspect that raises a question: How have HV domain repertoires diverged while keeping some similarities? To answer such question, we used profile Hidden Markov Models (HMMs) to search for domains in all possible translated open reading frames (ORFs) of fully sequenced HV genomes. With at least 274 domains being identified, we built a matrix of domain counts per species, and applied a parsimony method to reconstruct the ancestral states of these domains along the HV phylogeny. It revealed events of domain gain, duplication, and loss over more than 400 millions of years, where Alpha-, Beta-, and GammaHVs expanded and condensed their domain repertoires at distinct rates. Most of the acquired domains perform ‘Modulation and Control’, ‘Envelope’, or ‘Auxiliary’ functions, categories that showed high flexibility (number of domains) and redundancy (number of copies). Conversely, few gains and duplications were observed for domains involved in ‘Capsid assembly and structure’, and ‘DNA Replication, recombination and metabolism’. Among the forty-one primordial domains encoded by Herpesviridae ancestors, twenty-eight are still found in all present-day HVs. Because of their distinct evolutionary strategies, HV domain repertoires are very specific at the subfamily, genus and species levels. Differences in domain composition may not only explain HV host range and tissue tropism, but also provide hints to the origins of HVs.
Collapse
Affiliation(s)
- Anderson F Brito
- Centre for Integrative Systems Biology and Bioinformatics, Department of Life Sciences, Imperial College London, London SW7 2AZ, UK
| | - John W Pinney
- Centre for Integrative Systems Biology and Bioinformatics, Department of Life Sciences, Imperial College London, London SW7 2AZ, UK
| |
Collapse
|
2
|
Sasani TA, Cone KR, Quinlan AR, Elde NC. Long read sequencing reveals poxvirus evolution through rapid homogenization of gene arrays. eLife 2018; 7:35453. [PMID: 30156554 PMCID: PMC6115191 DOI: 10.7554/elife.35453] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2018] [Accepted: 08/12/2018] [Indexed: 12/21/2022] Open
Abstract
Poxvirus adaptation can involve combinations of recombination-driven gene copy number variation and beneficial single nucleotide variants (SNVs) at the same loci. How these distinct mechanisms of genetic diversification might simultaneously facilitate adaptation to host immune defenses is unknown. We performed experimental evolution with vaccinia virus populations harboring a SNV in a gene actively undergoing copy number amplification. Using long sequencing reads from the Oxford Nanopore Technologies platform, we phased SNVs within large gene copy arrays for the first time. Our analysis uncovered a mechanism of adaptive SNV homogenization reminiscent of gene conversion, which is actively driven by selection. This study reveals a new mechanism for the fluid gain of beneficial mutations in genetic regions undergoing active recombination in viruses and illustrates the value of long read sequencing technologies for investigating complex genome dynamics in diverse biological systems.
Collapse
Affiliation(s)
- Thomas A Sasani
- Department of Human Genetics, University of Utah, Salt Lake, United States
| | - Kelsey R Cone
- Department of Human Genetics, University of Utah, Salt Lake, United States
| | - Aaron R Quinlan
- Department of Human Genetics, University of Utah, Salt Lake, United States
| | - Nels C Elde
- Department of Human Genetics, University of Utah, Salt Lake, United States
| |
Collapse
|
3
|
Alves JMP, de Oliveira AL, Sandberg TOM, Moreno-Gallego JL, de Toledo MAF, de Moura EMM, Oliveira LS, Durham AM, Mehnert DU, Zanotto PMDA, Reyes A, Gruber A. GenSeed-HMM: A Tool for Progressive Assembly Using Profile HMMs as Seeds and its Application in Alpavirinae Viral Discovery from Metagenomic Data. Front Microbiol 2016; 7:269. [PMID: 26973638 PMCID: PMC4777721 DOI: 10.3389/fmicb.2016.00269] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2015] [Accepted: 02/19/2016] [Indexed: 01/01/2023] Open
Abstract
This work reports the development of GenSeed-HMM, a program that implements seed-driven progressive assembly, an approach to reconstruct specific sequences from unassembled data, starting from short nucleotide or protein seed sequences or profile Hidden Markov Models (HMM). The program can use any one of a number of sequence assemblers. Assembly is performed in multiple steps and relatively few reads are used in each cycle, consequently the program demands low computational resources. As a proof-of-concept and to demonstrate the power of HMM-driven progressive assemblies, GenSeed-HMM was applied to metagenomic datasets in the search for diverse ssDNA bacteriophages from the recently described Alpavirinae subfamily. Profile HMMs were built using Alpavirinae-specific regions from multiple sequence alignments (MSA) using either the viral protein 1 (VP1; major capsid protein) or VP4 (genome replication initiation protein). These profile HMMs were used by GenSeed-HMM (running Newbler assembler) as seeds to reconstruct viral genomes from sequencing datasets of human fecal samples. All contigs obtained were annotated and taxonomically classified using similarity searches and phylogenetic analyses. The most specific profile HMM seed enabled the reconstruction of 45 partial or complete Alpavirinae genomic sequences. A comparison with conventional (global) assembly of the same original dataset, using Newbler in a standalone execution, revealed that GenSeed-HMM outperformed global genomic assembly in several metrics employed. This approach is capable of detecting organisms that have not been used in the construction of the profile HMM, which opens up the possibility of diagnosing novel viruses, without previous specific information, constituting a de novo diagnosis. Additional applications include, but are not limited to, the specific assembly of extrachromosomal elements such as plastid and mitochondrial genomes from metagenomic data. Profile HMM seeds can also be used to reconstruct specific protein coding genes for gene diversity studies, and to determine all possible gene variants present in a metagenomic sample. Such surveys could be useful to detect the emergence of drug-resistance variants in sensitive environments such as hospitals and animal production facilities, where antibiotics are regularly used. Finally, GenSeed-HMM can be used as an adjunct for gap closure on assembly finishing projects, by using multiple contig ends as anchored seeds.
Collapse
Affiliation(s)
- João M P Alves
- Department of Parasitology, Institute of Biomedical Sciences, University of São Paulo São Paulo, Brazil
| | - André L de Oliveira
- Department of Parasitology, Institute of Biomedical Sciences, University of São Paulo São Paulo, Brazil
| | - Tatiana O M Sandberg
- Department of Parasitology, Institute of Biomedical Sciences, University of São Paulo São Paulo, Brazil
| | | | - Marcelo A F de Toledo
- Department of Parasitology, Institute of Biomedical Sciences, University of São Paulo São Paulo, Brazil
| | - Elisabeth M M de Moura
- Department of Microbiology, Institute of Biomedical Sciences, University of São Paulo São Paulo, Brazil
| | - Liliane S Oliveira
- Department of Parasitology, Institute of Biomedical Sciences, University of São PauloSão Paulo, Brazil; Department of Computer Science, Institute of Mathematics and Statistics, University of São PauloSão Paulo, Brazil
| | - Alan M Durham
- Department of Computer Science, Institute of Mathematics and Statistics, University of São Paulo São Paulo, Brazil
| | - Dolores U Mehnert
- Department of Microbiology, Institute of Biomedical Sciences, University of São Paulo São Paulo, Brazil
| | - Paolo M de A Zanotto
- Department of Microbiology, Institute of Biomedical Sciences, University of São Paulo São Paulo, Brazil
| | - Alejandro Reyes
- Department of Biological Sciences, Universidad de los AndesBogotá, Colombia; Center for Genome Sciences and Systems Biology, Department of Pathology and Immunology, Washington University in Saint LouisMO, USA
| | - Arthur Gruber
- Department of Parasitology, Institute of Biomedical Sciences, University of São Paulo São Paulo, Brazil
| |
Collapse
|
4
|
Brito AFD, Braconi CT, Weidmann M, Dilcher M, Alves JMP, Gruber A, Zanotto PMDA. The Pangenome of the Anticarsia gemmatalis Multiple Nucleopolyhedrovirus (AgMNPV). Genome Biol Evol 2015; 8:94-108. [PMID: 26615220 PMCID: PMC4758234 DOI: 10.1093/gbe/evv231] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
The alphabaculovirus Anticarsia gemmatalis multiple nucleopolyhedrovirus (AgMNPV) is the world's most successful viral bioinsecticide. Through the 1980s and 1990s, this virus was extensively used for biological control of populations of Anticarsia gemmatalis (Velvetbean caterpillar) in soybean crops. During this period, genetic studies identified several variable loci in the AgMNPV; however, most of them were not characterized at the sequence level. In this study we report a full genome comparison among 17 wild-type isolates of AgMNPV. We found the pangenome of this virus to contain at least 167 hypothetical genes, 151 of which are shared by all genomes. The gene bro-a that might be involved in host specificity and carrying transporter is absent in some genomes, and new hypothetical genes were observed. Among these genes there is a unique rnf12-like gene, probably implicated in ubiquitination. Events of gene fission and fusion are common, as four genes have been observed as single or split open reading frames. Gains and losses of genomic fragments (from 20 to 900 bp) are observed within tandem repeats, such as in eight direct repeats and four homologous regions. Most AgMNPV genes present low nucleotide diversity, and variable genes are mainly located in a locus known to evolve through homologous recombination. The evolution of AgMNPV is mainly driven by small indels, substitutions, gain and loss of nucleotide stretches or entire coding sequences. These variations may cause relevant phenotypic alterations, which probably affect the infectivity of AgMNPV. This work provides novel information on genomic evolution of the AgMNPV in particular and of baculoviruses in general.
Collapse
Affiliation(s)
- Anderson Fernandes de Brito
- Department of Microbiology, Institute of Biomedical Sciences-ICB II, Laboratory of Molecular Evolution and Bioinformatics, University of São Paulo-USP, São Paulo, SP, Brazil
| | - Carla Torres Braconi
- Department of Microbiology, Institute of Biomedical Sciences-ICB II, Laboratory of Molecular Evolution and Bioinformatics, University of São Paulo-USP, São Paulo, SP, Brazil
| | - Manfred Weidmann
- Department of Virology, University Medical Center, Göttingen, Germany
| | - Meik Dilcher
- Department of Virology, University Medical Center, Göttingen, Germany
| | - João Marcelo Pereira Alves
- Department of Parasitology, Institute of Biomedical Sciences-ICB II, University of São Paulo-USP, São Paulo, SP, Brazil
| | - Arthur Gruber
- Department of Parasitology, Institute of Biomedical Sciences-ICB II, University of São Paulo-USP, São Paulo, SP, Brazil
| | - Paolo Marinho de Andrade Zanotto
- Department of Microbiology, Institute of Biomedical Sciences-ICB II, Laboratory of Molecular Evolution and Bioinformatics, University of São Paulo-USP, São Paulo, SP, Brazil
| |
Collapse
|
5
|
Černý J, Černá Bolfíková B, de A Zanotto PM, Grubhoffer L, Růžek D. A deep phylogeny of viral and cellular right-hand polymerases. INFECTION GENETICS AND EVOLUTION 2015; 36:275-286. [PMID: 26431690 DOI: 10.1016/j.meegid.2015.09.026] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/12/2015] [Revised: 09/22/2015] [Accepted: 09/28/2015] [Indexed: 12/27/2022]
Abstract
Right-hand polymerases are important players in genome replication and repair in cellular organisms as well as in viruses. All right-hand polymerases are grouped into seven related protein families: viral RNA-dependent RNA polymerases, reverse transcriptases, single-subunit RNA polymerases, and DNA polymerase families A, B, D, and Y. Although the evolutionary relationships of right-hand polymerases within each family have been proposed, evolutionary relationships between families remain elusive because their sequence similarity is too low to allow classical phylogenetic analyses. The structure of viral RNA-dependent RNA polymerases recently was shown to be useful in inferring their evolution. Here, we address evolutionary relationships between right-hand polymerase families by combining sequence and structure information. We used a set of 22 viral and cellular polymerases representing all right-hand polymerase families with known protein structure. In contrast to previous studies, which focused only on the evolution of particular families, the current approach allowed us to present the first robust phylogenetic analysis unifying evolution of all right-hand polymerase families. All polymerase families branched into discrete lineages, following a fairly robust adjacency pattern. Only single-subunit RNA polymerases formed an inner group within DNA polymerase family A. RNA-dependent RNA polymerases of RNA viruses and reverse transcriptases of retroviruses formed two sister groups and were distinguishable from all other polymerases. DNA polymerases of DNA bacteriophages did not form a monophyletic group and are phylogenetically mixed with cellular DNA polymerase families A and B. Based on the highest genetic variability and structural simplicity, we assume that RNA-dependent RNA polymerases are the most ancient group of right-hand polymerases, in agreement with the RNA World hypothesis, because RNA-dependent RNA polymerases are enzymes that could serve in replication of RNA genomes. Moreover, our results show that protein structure can be used in phylogenetic analyses of distantly related proteins that share only limited sequence similarity.
Collapse
Affiliation(s)
- Jiří Černý
- Institute of Parasitology, Biology Centre of the Czech Academy of Sciences, Branišovská 31, 370 05 České Budějovice, Czech Republic; Faculty of Science, University of South Bohemia in České Budějovice, Branišovská 31, 370 05 České Budějovice, Czech Republic.
| | - Barbora Černá Bolfíková
- Faculty of Tropical AgriSciences, Czech University of Life Sciences Prague, Kamýcká 126, Suchdol, 165 21 Prague 6, Czech Republic
| | - Paolo M de A Zanotto
- Department of Microbiology, Biomedical Sciences Institute, ICB II University of Sao Paulo, 05508-000 Sao Paulo, Brazil
| | - Libor Grubhoffer
- Institute of Parasitology, Biology Centre of the Czech Academy of Sciences, Branišovská 31, 370 05 České Budějovice, Czech Republic; Faculty of Science, University of South Bohemia in České Budějovice, Branišovská 31, 370 05 České Budějovice, Czech Republic
| | - Daniel Růžek
- Institute of Parasitology, Biology Centre of the Czech Academy of Sciences, Branišovská 31, 370 05 České Budějovice, Czech Republic; Veterinary Research Institute, Hudcova 296/70, 621 00 Brno, Czech Republic
| |
Collapse
|
6
|
Gene acquisition convergence between entomopoxviruses and baculoviruses. Viruses 2015; 7:1960-74. [PMID: 25871928 PMCID: PMC4411684 DOI: 10.3390/v7041960] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2014] [Revised: 03/25/2015] [Accepted: 04/08/2015] [Indexed: 12/30/2022] Open
Abstract
Organisms from diverse phylogenetic origins can thrive within the same ecological niches. They might be induced to evolve convergent adaptations in response to a similar landscape of selective pressures. Their genomes should bear the signature of this process. The study of unrelated virus lineages infecting the same host panels guarantees a clear identification of phyletically independent convergent adaptation. Here, we investigate the evolutionary history of genes in the accessory genome shared by unrelated insect large dsDNA viruses: the entomopoxviruses (EPVs, Poxviridae) and the baculoviruses (BVs). EPVs and BVs have overlapping ecological niches and have independently evolved similar infection processes. They are, in theory, subjected to the same selective pressures from their host’s immune responses. Their accessory genomes might, therefore, bear analogous genomic signatures of convergent adaption and could point out key genomic mechanisms of adaptation hitherto undetected in viruses. We uncovered 32 homologous, yet independent acquisitions of genes originating from insect hosts, different eukaryotes, bacteria and viruses. We showed different evolutionary levels of gene acquisition convergence in these viruses, underlining a continuous evolutionary process. We found both recent and ancient gene acquisitions possibly involved to the adaptation to both specific and distantly related hosts. Multidirectional and multipartite gene exchange networks appear to constantly drive exogenous gene assimilations, bringing key adaptive innovations and shaping the life histories of large DNA viruses. This evolutionary process might lead to genome level adaptive convergence.
Collapse
|
7
|
Oliveira JV, de Brito AF, Braconi CT, de Melo Freire CC, Iamarino A, de Andrade Zanotto PM. Modularity and evolutionary constraints in a baculovirus gene regulatory network. BMC SYSTEMS BIOLOGY 2013; 7:87. [PMID: 24006890 PMCID: PMC3879405 DOI: 10.1186/1752-0509-7-87] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/18/2013] [Accepted: 08/15/2013] [Indexed: 12/25/2022]
Abstract
BACKGROUND The structure of regulatory networks remains an open question in our understanding of complex biological systems. Interactions during complete viral life cycles present unique opportunities to understand how host-parasite network take shape and behave. The Anticarsia gemmatalis multiple nucleopolyhedrovirus (AgMNPV) is a large double-stranded DNA virus, whose genome may encode for 152 open reading frames (ORFs). Here we present the analysis of the ordered cascade of the AgMNPV gene expression. RESULTS We observed an earlier onset of the expression than previously reported for other baculoviruses, especially for genes involved in DNA replication. Most ORFs were expressed at higher levels in a more permissive host cell line. Genes with more than one copy in the genome had distinct expression profiles, which could indicate the acquisition of new functionalities. The transcription gene regulatory network (GRN) for 149 ORFs had a modular topology comprising five communities of highly interconnected nodes that separated key genes that are functionally related on different communities, possibly maximizing redundancy and GRN robustness by compartmentalization of important functions. Core conserved functions showed expression synchronicity, distinct GRN features and significantly less genetic diversity, consistent with evolutionary constraints imposed in key elements of biological systems. This reduced genetic diversity also had a positive correlation with the importance of the gene in our estimated GRN, supporting a relationship between phylogenetic data of baculovirus genes and network features inferred from expression data. We also observed that gene arrangement in overlapping transcripts was conserved among related baculoviruses, suggesting a principle of genome organization. CONCLUSIONS Albeit with a reduced number of nodes (149), the AgMNPV GRN had a topology and key characteristics similar to those observed in complex cellular organisms, which indicates that modularity may be a general feature of biological gene regulatory networks.
Collapse
Affiliation(s)
- Juliana Velasco Oliveira
- Department of Microbiology, Institute of Biomedical Sciences – ICB II, Laboratory of Molecular Evolution and Bioinformatics, University of São Paulo – USP, São Paulo, SP, Brazil
- Laboratório Nacional de Ciência e Tecnologia do Bioetanol (CTBE), Centro Nacional de Pesquisa em Energia e Materiais (CNPEM), Campinas Caixa Postal 6170, 13083-970, Brazil
| | - Anderson Fernandes de Brito
- Department of Microbiology, Institute of Biomedical Sciences – ICB II, Laboratory of Molecular Evolution and Bioinformatics, University of São Paulo – USP, São Paulo, SP, Brazil
| | - Carla Torres Braconi
- Department of Microbiology, Institute of Biomedical Sciences – ICB II, Laboratory of Molecular Evolution and Bioinformatics, University of São Paulo – USP, São Paulo, SP, Brazil
| | - Caio César de Melo Freire
- Department of Microbiology, Institute of Biomedical Sciences – ICB II, Laboratory of Molecular Evolution and Bioinformatics, University of São Paulo – USP, São Paulo, SP, Brazil
| | - Atila Iamarino
- Department of Microbiology, Institute of Biomedical Sciences – ICB II, Laboratory of Molecular Evolution and Bioinformatics, University of São Paulo – USP, São Paulo, SP, Brazil
| | - Paolo Marinho de Andrade Zanotto
- Department of Microbiology, Institute of Biomedical Sciences – ICB II, Laboratory of Molecular Evolution and Bioinformatics, University of São Paulo – USP, São Paulo, SP, Brazil
- Departamento de Microbiologia, Instituto de Ciências Biomédicas - ICB II, Laboratório de Evolução Molecular e Bioinformática, Universidade de São Paulo - USP, Av. Prof. Lineu Prestes, 1374, São Paulo CEP: 05508-900, Brasil
| |
Collapse
|
8
|
Leigh JW, Lapointe FJ, Lopez P, Bapteste E. Evaluating phylogenetic congruence in the post-genomic era. Genome Biol Evol 2011; 3:571-87. [PMID: 21712432 PMCID: PMC3156567 DOI: 10.1093/gbe/evr050] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/27/2011] [Indexed: 12/04/2022] Open
Abstract
Congruence is a broadly applied notion in evolutionary biology used to justify multigene phylogeny or phylogenomics, as well as in studies of coevolution, lateral gene transfer, and as evidence for common descent. Existing methods for identifying incongruence or heterogeneity using character data were designed for data sets that are both small and expected to be rarely incongruent. At the same time, methods that assess incongruence using comparison of trees test a null hypothesis of uncorrelated tree structures, which may be inappropriate for phylogenomic studies. As such, they are ill-suited for the growing number of available genome sequences, most of which are from prokaryotes and viruses, either for phylogenomic analysis or for studies of the evolutionary forces and events that have shaped these genomes. Specifically, many existing methods scale poorly with large numbers of genes, cannot accommodate high levels of incongruence, and do not adequately model patterns of missing taxa for different markers. We propose the development of novel incongruence assessment methods suitable for the analysis of the molecular evolution of the vast majority of life and support the investigation of homogeneity of evolutionary process in cases where markers do not share identical tree structures.
Collapse
Affiliation(s)
- Jessica W Leigh
- Department of Mathematics and Statistics, University of Otago, Dunedin, New Zealand.
| | | | | | | |
Collapse
|
9
|
Miele SAB, Garavaglia MJ, Belaich MN, Ghiringhelli PD. Baculovirus: molecular insights on their diversity and conservation. INTERNATIONAL JOURNAL OF EVOLUTIONARY BIOLOGY 2011; 2011:379424. [PMID: 21716740 PMCID: PMC3119482 DOI: 10.4061/2011/379424] [Citation(s) in RCA: 72] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/15/2010] [Revised: 01/04/2011] [Accepted: 02/14/2011] [Indexed: 12/11/2022]
Abstract
The Baculoviridae is a large group of insect viruses containing circular double-stranded DNA genomes of 80 to 180 kbp. In this study, genome sequences from 57 baculoviruses were analyzed to reevaluate the number and identity of core genes and to understand the distribution of the remaining coding sequences. Thirty one core genes with orthologs in all genomes were identified along with other 895 genes differing in their degrees of representation among reported genomes. Many of these latter genes are common to well-defined lineages, whereas others are unique to one or a few of the viruses. Phylogenetic analyses based on core gene sequences and the gene composition of the genomes supported the current division of the Baculoviridae into 4 genera: Alphabaculovirus, Betabaculovirus, Gammabaculovirus, and Deltabaculovirus.
Collapse
Affiliation(s)
- Solange Ana Belen Miele
- LIGBCM (Laboratorio de Ingeniería Genética y Biología Celular y Molecular), Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Roque Saenz Peña 352, Bernal, Argentina
| | | | | | | |
Collapse
|
10
|
Yu ZG, Chu KH, Li CP, Anh V, Zhou LQ, Wang RW. Whole-proteome phylogeny of large dsDNA viruses and parvoviruses through a composition vector method related to dynamical language model. BMC Evol Biol 2010; 10:192. [PMID: 20565983 PMCID: PMC2898692 DOI: 10.1186/1471-2148-10-192] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2009] [Accepted: 06/22/2010] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND The vast sequence divergence among different virus groups has presented a great challenge to alignment-based analysis of virus phylogeny. Due to the problems caused by the uncertainty in alignment, existing tools for phylogenetic analysis based on multiple alignment could not be directly applied to the whole-genome comparison and phylogenomic studies of viruses. There has been a growing interest in alignment-free methods for phylogenetic analysis using complete genome data. Among the alignment-free methods, a dynamical language (DL) method proposed by our group has successfully been applied to the phylogenetic analysis of bacteria and chloroplast genomes. RESULTS In this paper, the DL method is used to analyze the whole-proteome phylogeny of 124 large dsDNA viruses and 30 parvoviruses, two data sets with large difference in genome size. The trees from our analyses are in good agreement to the latest classification of large dsDNA viruses and parvoviruses by the International Committee on Taxonomy of Viruses (ICTV). CONCLUSIONS The present method provides a new way for recovering the phylogeny of large dsDNA viruses and parvoviruses, and also some insights on the affiliation of a number of unclassified viruses. In comparison, some alignment-free methods such as the CV Tree method can be used for recovering the phylogeny of large dsDNA viruses, but they are not suitable for resolving the phylogeny of parvoviruses with a much smaller genome size.
Collapse
Affiliation(s)
- Zu-Guo Yu
- School of Mathematical Sciences, Queensland University of Technology, GPO Box 2434, Brisbane, Q 4001, Australia
- School of Mathematics and Computational Science, Xiangtan University, Hunan 411105, China
| | - Ka Hou Chu
- Department of Biology, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong, China
| | - Chi Pang Li
- Department of Biology, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong, China
| | - Vo Anh
- School of Mathematical Sciences, Queensland University of Technology, GPO Box 2434, Brisbane, Q 4001, Australia
| | - Li-Qian Zhou
- School of Mathematics and Computational Science, Xiangtan University, Hunan 411105, China
| | - Roger Wei Wang
- Department of Mathematics, The Chinese University of Hong Kong, Shatin, N.T., Hong Kong, China
| |
Collapse
|
11
|
Whole-proteome phylogeny of large dsDNA virus families by an alignment-free method. Proc Natl Acad Sci U S A 2009; 106:12826-31. [PMID: 19553209 DOI: 10.1073/pnas.0905115106] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The vast sequence divergence among different virus groups has presented a great challenge to alignment-based sequence comparison among different virus families. Using an alignment-free comparison method, we construct the whole-proteome phylogeny for a population of viruses from 11 viral families comprising 142 large dsDNA eukaryote viruses. The method is based on the feature frequency profiles (FFP), where the length of the feature (l-mer) is selected to be optimal for phylogenomic inference. We observe that (i) the FFP phylogeny segregates the population into clades, the membership of each has remarkable agreement with current classification by the International Committee on the Taxonomy of Viruses, with one exception that the mimivirus joins the phycodnavirus family; (ii) the FFP tree detects potential evolutionary relationships among some viral families; (iii) the relative position of the 3 herpesvirus subfamilies in the FFP tree differs from gene alignment-based analysis; (iv) the FFP tree suggests the taxonomic positions of certain "unclassified" viruses; and (v) the FFP method identifies candidates for horizontal gene transfer between virus families.
Collapse
|