1
|
Feng A, Bevins S, Chandler J, DeLiberto TJ, Ghai R, Lantz K, Lenoch J, Retchless A, Shriner S, Tang CY, Tong SS, Torchetti M, Uehara A, Wan XF. Transmission of SARS-CoV-2 in free-ranging white-tailed deer in the United States. Nat Commun 2023; 14:4078. [PMID: 37429851 DOI: 10.1038/s41467-023-39782-x] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Accepted: 06/29/2023] [Indexed: 07/12/2023] Open
Abstract
SARS-CoV-2 is a zoonotic virus with documented bi-directional transmission between people and animals. Transmission of SARS-CoV-2 from humans to free-ranging white-tailed deer (Odocoileus virginianus) poses a unique public health risk due to the potential for reservoir establishment where variants may persist and evolve. We collected 8,830 respiratory samples from free-ranging white-tailed deer across Washington, D.C. and 26 states in the United States between November 2021 and April 2022. We obtained 391 sequences and identified 34 Pango lineages including the Alpha, Gamma, Delta, and Omicron variants. Evolutionary analyses showed these white-tailed deer viruses originated from at least 109 independent spillovers from humans, which resulted in 39 cases of subsequent local deer-to-deer transmission and three cases of potential spillover from white-tailed deer back to humans. Viruses repeatedly adapted to white-tailed deer with recurring amino acid substitutions across spike and other proteins. Overall, our findings suggest that multiple SARS-CoV-2 lineages were introduced, became enzootic, and co-circulated in white-tailed deer.
Collapse
Affiliation(s)
- Aijing Feng
- Center for Influenza and Emerging Infectious Diseases, University of Missouri, Columbia, MO, USA
- Department of Molecular Microbiology and Immunology, School of Medicine, University of Missouri, Columbia, MO, USA
- Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
| | - Sarah Bevins
- USDA APHIS Wildlife Services National Wildlife Disease Program, Fort Collins, CO, USA
| | - Jeff Chandler
- National Wildlife Research Center, Wildlife Services, Animal and Plant Health Inspection Service, US Department of Agriculture, Fort Collins, CO, USA
| | | | - Ria Ghai
- One Health Office, National Center for Emerging and Zoonotic Infectious Diseases, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Kristina Lantz
- National Veterinary Services Laboratories, Animal and Plant Health Inspection Service, United States Department of Agriculture, Ames, IA, USA
| | - Julianna Lenoch
- USDA APHIS Wildlife Services National Wildlife Disease Program, Fort Collins, CO, USA
| | - Adam Retchless
- National Center for Immunization and Respiratory Diseases, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Susan Shriner
- National Wildlife Research Center, Wildlife Services, Animal and Plant Health Inspection Service, US Department of Agriculture, Fort Collins, CO, USA
| | - Cynthia Y Tang
- Center for Influenza and Emerging Infectious Diseases, University of Missouri, Columbia, MO, USA
- Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
- MU Institute for Data Science and Informatics, University of Missouri, Columbia, MO, USA
| | - Suxiang Sue Tong
- National Center for Immunization and Respiratory Diseases, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Mia Torchetti
- National Veterinary Services Laboratories, Animal and Plant Health Inspection Service, United States Department of Agriculture, Ames, IA, USA
| | - Anna Uehara
- National Center for Immunization and Respiratory Diseases, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Xiu-Feng Wan
- Center for Influenza and Emerging Infectious Diseases, University of Missouri, Columbia, MO, USA.
- Department of Molecular Microbiology and Immunology, School of Medicine, University of Missouri, Columbia, MO, USA.
- Bond Life Sciences Center, University of Missouri, Columbia, MO, USA.
- MU Institute for Data Science and Informatics, University of Missouri, Columbia, MO, USA.
- Department of Electrical Engineering & Computer Science, College of Engineering, University of Missouri, Columbia, MO, USA.
| |
Collapse
|
2
|
Ding X, Qin L, Meng J, Peng Y, Wu A, Jiang T. Progress and Challenge in Computational Identification of Influenza Virus Reassortment. Virol Sin 2021; 36:1273-1283. [PMID: 34037948 DOI: 10.1007/s12250-021-00392-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2020] [Accepted: 03/29/2021] [Indexed: 12/22/2022] Open
Abstract
Genomic reassortment is an important evolutionary mechanism for influenza viruses. In this process, the novel viruses acquire new characteristics by the exchange of the intact gene segments among multiple influenza virus genomes, which may cause flu endemics and epidemics within or even across hosts. Due to the safety and ethical limitations of the experimental studies on influenza virus reassortment, numerous computational researches on the influenza virus reassortment have been done with the explosion of the influenza virus genomic data. A great amount of computational methods and bioinformatics databases were developed to facilitate the identification of influenza virus reassortments. In this review, we summarized the progress and challenge of the bioinformatics research on influenza virus reassortment, which can guide the researchers to investigate the influenza virus reassortment events reasonably and provide valuable insight to develop the related computational identification tools.
Collapse
Affiliation(s)
- Xiao Ding
- Center for Systems Medicine, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100005, China.,Suzhou Institute of Systems Medicine, Suzhou, Jiangsu, 215123, China
| | - Luyao Qin
- Center for Systems Medicine, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100005, China.,Suzhou Institute of Systems Medicine, Suzhou, Jiangsu, 215123, China
| | - Jing Meng
- Center for Systems Medicine, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100005, China.,Suzhou Institute of Systems Medicine, Suzhou, Jiangsu, 215123, China
| | - Yousong Peng
- College of Biology, Hunan Provincial Key Laboratory of Medical Virology, Hunan University, Changsha, 410082, China
| | - Aiping Wu
- Center for Systems Medicine, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100005, China.,Suzhou Institute of Systems Medicine, Suzhou, Jiangsu, 215123, China
| | - Taijiao Jiang
- Center for Systems Medicine, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100005, China. .,Bioland Laboratory (Guangzhou Regenerative Medicine and Health Guangdong Laboratory), Guangzhou, 510005, China. .,Suzhou Institute of Systems Medicine, Suzhou, Jiangsu, 215123, China.
| |
Collapse
|
3
|
Pinsent A, Fraser C, Ferguson NM, Riley S. A systematic review of reported reassortant viral lineages of influenza A. BMC Infect Dis 2016; 16:3. [PMID: 26732146 PMCID: PMC4702296 DOI: 10.1186/s12879-015-1298-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2015] [Accepted: 11/27/2015] [Indexed: 11/29/2022] Open
Abstract
Background Most previous evolutionary studies of influenza A have focussed on genetic drift, or reassortment of specific gene segments, hosts or subtypes. We conducted a systematic literature review to identify reported claimed reassortant influenza A lineages with genomic data available in GenBank, to obtain 646 unique first-report isolates out of a possible 20,781 open-access genomes. Results After adjusting for correlations, only: swine as host, China, Europe, Japan and years between 1997 and 2002; remained as significant risk factors for the reporting of reassortant viral lineages. For swine H1, more reassortants were observed in the North American H1 clade compared with the Eurasian avian-like H1N1 clade. Conversely, for avian H5 isolates, a higher number of reported reassortants were observed in the European H5N2/H3N2 clade compared with the H5N2 North American clade. Conclusions Despite unavoidable biases (publication, database choice and upload propensity) these results synthesize a large majority of the current literature on novel reported influenza A reassortants and are a potentially useful prerequisite to inform further algorithmic studies. Electronic supplementary material The online version of this article (doi:10.1186/s12879-015-1298-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Amy Pinsent
- Department of Infectious Disease Epidemiology, MRC Centre for Outbreak Analyses and Modelling, School of Public Health, Imperial College London, London, UK.
| | - Christophe Fraser
- Department of Infectious Disease Epidemiology, MRC Centre for Outbreak Analyses and Modelling, School of Public Health, Imperial College London, London, UK.
| | - Neil M Ferguson
- Department of Infectious Disease Epidemiology, MRC Centre for Outbreak Analyses and Modelling, School of Public Health, Imperial College London, London, UK.
| | - Steven Riley
- Department of Infectious Disease Epidemiology, MRC Centre for Outbreak Analyses and Modelling, School of Public Health, Imperial College London, London, UK.
| |
Collapse
|
4
|
Mummert A, Weiss H, Long LP, Amigó JM, Wan XF. A perspective on multiple waves of influenza pandemics. PLoS One 2013; 8:e60343. [PMID: 23637746 PMCID: PMC3634039 DOI: 10.1371/journal.pone.0060343] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2012] [Accepted: 02/25/2013] [Indexed: 12/03/2022] Open
Abstract
BACKGROUND A striking characteristic of the past four influenza pandemic outbreaks in the United States has been the multiple waves of infections. However, the mechanisms responsible for the multiple waves of influenza or other acute infectious diseases are uncertain. Understanding these mechanisms could provide knowledge for health authorities to develop and implement prevention and control strategies. MATERIALS AND METHODS We exhibit five distinct mechanisms, each of which can generate two waves of infections for an acute infectious disease. The first two mechanisms capture changes in virus transmissibility and behavioral changes. The third mechanism involves population heterogeneity (e.g., demography, geography), where each wave spreads through one sub-population. The fourth mechanism is virus mutation which causes delayed susceptibility of individuals. The fifth mechanism is waning immunity. Each mechanism is incorporated into separate mathematical models, and outbreaks are then simulated. We use the models to examine the effects of the initial number of infected individuals (e.g., border control at the beginning of the outbreak) and the timing of and amount of available vaccinations. RESULTS Four models, individually or in any combination, reproduce the two waves of the 2009 H1N1 pandemic in the United States, both qualitatively and quantitatively. One model reproduces the two waves only qualitatively. All models indicate that significantly reducing or delaying the initial numbers of infected individuals would have little impact on the attack rate. Instead, this reduction or delay results in a single wave as opposed to two waves. Furthermore, four of these models also indicate that a vaccination program started earlier than October 2009 (when the H1N1 vaccine was initially distributed) could have eliminated the second wave of infection, while more vaccine available starting in October would not have eliminated the second wave.
Collapse
Affiliation(s)
- Anna Mummert
- Department of Mathematics, Marshall University, Huntington, West Virginia, United States of America
| | - Howard Weiss
- School of Mathematics, Georgia Institute of Technology, Atlanta, Georgia, United States of America
| | - Li-Ping Long
- Department of Basic Sciences, Mississippi State University, Mississippi State, Mississippi, United States of America
| | - José M. Amigó
- Centro de Investigación Operativa, Universidad Miguel Hernández, Elche, Spain
| | - Xiu-Feng Wan
- Department of Basic Sciences, Mississippi State University, Mississippi State, Mississippi, United States of America
| |
Collapse
|
5
|
Lun AT, Wong JW, Downard KM. FluShuffle and FluResort: new algorithms to identify reassorted strains of the influenza virus by mass spectrometry. BMC Bioinformatics 2012; 13:208. [PMID: 22906155 PMCID: PMC3505172 DOI: 10.1186/1471-2105-13-208] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2012] [Accepted: 08/10/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Influenza is one of the oldest and deadliest infectious diseases known to man. Reassorted strains of the virus pose the greatest risk to both human and animal health and have been associated with all pandemics of the past century, with the possible exception of the 1918 pandemic, resulting in tens of millions of deaths. We have developed and tested new computer algorithms, FluShuffle and FluResort, which enable reassorted viruses to be identified by the most rapid and direct means possible. These algorithms enable reassorted influenza, and other, viruses to be rapidly identified to allow prevention strategies and treatments to be more efficiently implemented. RESULTS The FluShuffle and FluResort algorithms were tested with both experimental and simulated mass spectra of whole virus digests. FluShuffle considers different combinations of viral protein identities that match the mass spectral data using a Gibbs sampling algorithm employing a mixed protein Markov chain Monte Carlo (MCMC) method. FluResort utilizes those identities to calculate the weighted distance of each across two or more different phylogenetic trees constructed through viral protein sequence alignments. Each weighted mean distance value is normalized by conversion to a Z-score to establish a reassorted strain. CONCLUSIONS The new FluShuffle and FluResort algorithms can correctly identify the origins of influenza viral proteins and the number of reassortment events required to produce the strains from the high resolution mass spectral data of whole virus proteolytic digestions. This has been demonstrated in the case of constructed vaccine strains as well as common human seasonal strains of the virus. The algorithms significantly improve the capability of the proteotyping approach to identify reassorted viruses that pose the greatest pandemic risk.
Collapse
Affiliation(s)
- Aaron Tl Lun
- School of Molecular Bioscience G-08, The University of Sydney, Sydney, NSW, 2006, Australia
| | | | | |
Collapse
|
6
|
Codon optimization of the major antigen encoding genes of diverse strains of influenza a virus. Interdiscip Sci 2011; 3:36-42. [DOI: 10.1007/s12539-011-0055-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2010] [Revised: 05/06/2010] [Accepted: 05/14/2010] [Indexed: 10/18/2022]
|
7
|
Wan XF, Ozden M, Lin G. Ubiquitous reassortments in influenza A viruses. J Bioinform Comput Biol 2009; 6:981-99. [PMID: 18942162 DOI: 10.1142/s0219720008003813] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2007] [Revised: 01/11/2008] [Accepted: 03/14/2008] [Indexed: 11/18/2022]
Abstract
The influenza A virus is a negative-stranded RNA virus composed of eight segmented RNA molecules, including polymerases (PB2, PB1, PA), hemagglutinin (HA), nucleoprotein (NP), neuraminidase (NA), matrix protein (MP), and nonstructure gene (NS). The influenza A viruses are notorious for rapid mutations, frequent reassortments, and possible recombinations. Among these evolutionary events, reassortments refer to exchanges of discrete RNA segments between co-infected influenza viruses, and they have facilitated the generation of pandemic and epidemic strains. Thus, identification of reassortments will be critical for pandemic and epidemic prevention and control. This paper presents a reassortment identification method based on distance measurement using complete composition vector (CCV) and segment clustering using a minimum spanning tree (MST) algorithm. By applying this method, we identified 34 potential reassortment clusters among 2,641 PB2 segments of influenza A viruses. Among the 83 serotypes tested, at least 56 (67.46%) exchanged their fragments with another serotype of influenza A viruses. These identified reassortments involve 1,957 H2N1 and 1,968 H3N2 influenza pandemic strains as well as H5N1 avian influenza virus isolates, which have generated the potential for a future pandemic threat. More frequent reassortments were found to occur in wild birds, especially migratory birds. This MST clustering program is written in Java and will be available upon request.
Collapse
Affiliation(s)
- Xiu-Feng Wan
- Systems Biology Laboratory, Department of Microbiology, Miami University, Oxford, OH 45056, USA.
| | | | | |
Collapse
|
8
|
Wan XF, Nguyen T, Davis CT, Smith CB, Zhao ZM, Carrel M, Inui K, Do HT, Mai DT, Jadhao S, Balish A, Shu B, Luo F, Emch M, Matsuoka Y, Lindstrom SE, Cox NJ, Nguyen CV, Klimov A, Donis RO. Evolution of highly pathogenic H5N1 avian influenza viruses in Vietnam between 2001 and 2007. PLoS One 2008; 3:e3462. [PMID: 18941631 PMCID: PMC2565130 DOI: 10.1371/journal.pone.0003462] [Citation(s) in RCA: 95] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2008] [Accepted: 09/22/2008] [Indexed: 12/05/2022] Open
Abstract
Highly pathogenic avian influenza (HPAI) H5N1 viruses have caused dramatic economic losses to the poultry industry of Vietnam and continue to pose a serious threat to public health. As of June 2008, Vietnam had reported nearly one third of worldwide laboratory confirmed human H5N1 infections. To better understand the emergence, spread and evolution of H5N1 in Vietnam we studied over 300 H5N1 avian influenza viruses isolated from Vietnam since their first detection in 2001. Our phylogenetic analyses indicated that six genetically distinct H5N1 viruses were introduced into Vietnam during the past seven years. The H5N1 lineage that evolved following the introduction in 2003 of the A/duck/Hong Kong/821/2002-like viruses, with clade 1 hemagglutinin (HA), continued to predominate in southern Vietnam as of May 2007. A virus with a clade 2.3.4 HA newly introduced into northern Vietnam in 2007, reassorted with pre-existing clade 1 viruses, resulting in the emergence of novel genotypes with neuraminidase (NA) and/or internal gene segments from clade 1 viruses. A total of nine distinct genotypes have been present in Vietnam since 2001, including five that were circulating in 2007. At least four of these genotypes appear to have originated in Vietnam and represent novel H5N1 viruses not reported elsewhere. Geographic and temporal analyses of H5N1 infection dynamics in poultry suggest that the majority of viruses containing new genes were first detected in northern Vietnam and subsequently spread to southern Vietnam after reassorting with pre-existing local viruses in northern Vietnam. Although the routes of entry and spread of H5N1 in Vietnam remain speculative, enhanced poultry import controls and virologic surveillance efforts may help curb the entry and spread of new HPAI viral genes.
Collapse
Affiliation(s)
- Xiu-Feng Wan
- Influenza Division, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
- School of Biology, Georgia Institute of Technology, Atlanta, Georgia, United States of America
| | - Tung Nguyen
- Department of Animal Health, National Centre for Veterinary Diagnostics, Hanoi, Vietnam
| | - C. Todd Davis
- Influenza Division, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| | - Catherine B. Smith
- Influenza Division, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| | - Zi-Ming Zhao
- Influenza Division, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
- School of Biology, Georgia Institute of Technology, Atlanta, Georgia, United States of America
| | - Margaret Carrel
- Department of Geography, University of North Carolina, Chapel Hill, North Carolina, United States of America
| | - Kenjiro Inui
- Department of Animal Health, National Centre for Veterinary Diagnostics, Hanoi, Vietnam
- Food and Agriculture Organization of Vietnam, Hanoi, Vietnam
| | - Hoa T. Do
- Department of Animal Health, National Centre for Veterinary Diagnostics, Hanoi, Vietnam
| | - Duong T. Mai
- Department of Animal Health, National Centre for Veterinary Diagnostics, Hanoi, Vietnam
| | - Samadhan Jadhao
- Influenza Division, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| | - Amanda Balish
- Influenza Division, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| | - Bo Shu
- Influenza Division, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| | - Feng Luo
- School of Computing, Clemson University, Clemson, South Carolina, United States of America
| | - Michael Emch
- Department of Geography, University of North Carolina, Chapel Hill, North Carolina, United States of America
| | - Yumiko Matsuoka
- Influenza Division, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| | - Stephen E. Lindstrom
- Influenza Division, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| | - Nancy J. Cox
- Influenza Division, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| | - Cam V. Nguyen
- Department of Animal Health, National Centre for Veterinary Diagnostics, Hanoi, Vietnam
| | - Alexander Klimov
- Influenza Division, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| | - Ruben O. Donis
- Influenza Division, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| |
Collapse
|
9
|
Zhao ZM, Shortridge KF, Garcia M, Guan Y, Wan XF. Genotypic diversity of H5N1 highly pathogenic avian influenza viruses. J Gen Virol 2008; 89:2182-2193. [PMID: 18753228 DOI: 10.1099/vir.0.2008/001875-0] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Besides enormous economic losses to the poultry industry, recent H5N1 highly pathogenic avian influenza viruses (HPAIVs) originating in eastern Asia have posed serious threats to public health. Up to April 17, 2008, 381 human cases had been confirmed with a mortality of more than 60 %. Here, we attempt to identify potential progenitor genes for H5N1 HPAIVs since their first recognition in 1996; most were detected in the Eurasian landmass before 1996. Combinations among these progenitor genes generated at least 21 reassortants (named H5N1 progenitor reassortant, H5N1-PR1-21). H5N1-PR1 includes A/Goose/Guangdong/1/1996(H5N1). Only reassortants H5N1-PR2 and H5N1-PR7 were associated with confirmed human cases: H5N1-PR2 in the Hong Kong H5N1 outbreak in 1997 and H5N1-PR7 in laboratory confirmed human cases since 2003. H5N1-PR7 also contains a majority of the H5N1 viruses causing avian influenza outbreaks in birds, including the first wave of genotype Z, Qinghai-like and Fujian-like virus lineages. Among the 21 reassortants identified, 13 are first reported here. This study illustrates evolutionary patterns of H5N1 HPAIVs, which may be useful toward pandemic preparedness as well as avian influenza prevention and control.
Collapse
Affiliation(s)
- Zi-Ming Zhao
- School of Biology, Georgia Institute of Technology, Atlanta, GA 30332, USA
- Systems Biology Laboratory, Department of Microbiology, Miami University, Oxford, OH 45056, USA
| | - Kennedy F Shortridge
- State Key Laboratory of Emerging Infectious Diseases, The University of Hong Kong, Hong Kong SAR
| | - Maricarmen Garcia
- Department of Avian Medicine, College of Veterinary Medicine, University of Georgia, Athens, GA 30602, USA
| | - Yi Guan
- State Key Laboratory of Emerging Infectious Diseases, The University of Hong Kong, Hong Kong SAR
| | - Xiu-Feng Wan
- School of Biology, Georgia Institute of Technology, Atlanta, GA 30332, USA
- Systems Biology Laboratory, Department of Microbiology, Miami University, Oxford, OH 45056, USA
| |
Collapse
|
10
|
Abstract
BACKGROUND Historically, two categories of computational algorithms (alignment-based and alignment-free) have been applied to sequence comparison-one of the most fundamental issues in bioinformatics. Multiple sequence alignment, although dominantly used by biologists, possesses both fundamental as well as computational limitations. Consequently, alignment-free methods have been explored as important alternatives in estimating sequence similarity. Of the alignment-free methods, the string composition vector (CV) methods, which use the frequencies of nucleotide or amino acid strings to represent sequence information, show promising results in genome sequence comparison of prokaryotes. The existing CV-based methods, however, suffer certain statistical problems, thereby underestimating the amount of evolutionary information in genetic sequences. RESULTS We show that the existing string composition based methods have two problems, one related to the Markov model assumption and the other associated with the denominator of the frequency normalization equation. We propose an improved complete composition vector method under the assumption of a uniform and independent model to estimate sequence information contributing to selection for sequence comparison. Phylogenetic analyses using both simulated and experimental data sets demonstrate that our new method is more robust compared with existing counterparts and comparable in robustness with alignment-based methods. CONCLUSION We observed two problems existing in the currently used string composition methods and proposed a new robust method for the estimation of evolutionary information of genetic sequences. In addition, we discussed that it might not be necessary to use relatively long strings to build a complete composition vector (CCV), due to the overlapping nature of vector strings with a variable length. We suggested a practical approach for the choice of an optimal string length to construct the CCV.
Collapse
Affiliation(s)
- Guoqing Lu
- Department of Biology, University of Nebraska, Omaha, NE 68182, USA.
| | | | | |
Collapse
|
11
|
Wan XF, Chen G, Luo F, Emch M, Donis R. A quantitative genotype algorithm reflecting H5N1 Avian influenza niches. Bioinformatics 2007; 23:2368-75. [PMID: 17623701 DOI: 10.1093/bioinformatics/btm354] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Computational genotyping analyses are critical for characterizing molecular evolutionary footprints, thus providing important information for designing the strategies of influenza prevention and control. Most of the current methods that are available are based on multiple sequence alignment and phylogenetic tree construction, which are time consuming and limited by the number of taxa. Arbitrarily defining genotypes further complicates the interpretation of genotyping results. METHODS In this study, we describe a quantitative influenza genotyping algorithm based on the theory of quasispecies. First, the complete composition vector (CCV) was utilized to calculate the pairwise evolutionary distance between genotypes. Next, Hierarchical Bayesian Modeling using the Gibbs Sampling algorithm was applied to identify the segment genotype threshold, which is used to identify influenza segment genotype through a modularity calculation. The viral genotype was defined by combining eight segment genotypes based on the genetic reassortment feature of influenza A viruses. RESULTS We applied this method for H5N1 avian influenza viruses and identified 107 niches among 283 viruses with a complete genome set. The diversity of viral genotypes, and their correlation with geographic locations suggests that these viruses form local niches after being introduced to a new ecological environment through poultry trade or bird migration. This novel method allows us to define genotypes in a robust, quantitative as well as hierarchical manner. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Xiu-Feng Wan
- Department of Microbiology, Miami University, Oxford, OH 45056, USA.
| | | | | | | | | |
Collapse
|