1
|
Pekar JE, Lytras S, Ghafari M, Magee AF, Parker E, Wang Y, Ji X, Havens JL, Katzourakis A, Vasylyeva TI, Suchard MA, Hughes AC, Hughes J, Rambaut A, Robertson DL, Dellicour S, Worobey M, Wertheim JO, Lemey P. The recency and geographical origins of the bat viruses ancestral to SARS-CoV and SARS-CoV-2. Cell 2025:S0092-8674(25)00353-8. [PMID: 40339581 DOI: 10.1016/j.cell.2025.03.035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Revised: 11/21/2024] [Accepted: 03/19/2025] [Indexed: 05/10/2025]
Abstract
The emergence of SARS-CoV in 2002 and SARS-CoV-2 in 2019 led to increased sampling of sarbecoviruses circulating in horseshoe bats. Employing phylogenetic inference while accounting for recombination of bat sarbecoviruses, we find that the closest-inferred bat virus ancestors of SARS-CoV and SARS-CoV-2 existed less than a decade prior to their emergence in humans. Phylogeographic analyses show bat sarbecoviruses traveled at rates approximating their horseshoe bat hosts and circulated in Asia for millennia. We find that the direct ancestors of SARS-CoV and SARS-CoV-2 are unlikely to have reached their respective sites of emergence via dispersal in the bat reservoir alone, supporting interactions with intermediate hosts through wildlife trade playing a role in zoonotic spillover. These results can guide future sampling efforts and demonstrate that viral genomic regions extremely closely related to SARS-CoV and SARS-CoV-2 were circulating in horseshoe bats, confirming their importance as the reservoir species for SARS viruses.
Collapse
Affiliation(s)
- Jonathan E Pekar
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh, UK; Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA 92093, USA; Department of Biomedical Informatics, University of California, San Diego, La Jolla, CA 92093, USA; Department of Medicine, University of California, San Diego, La Jolla, CA 92093, USA.
| | - Spyros Lytras
- Division of Systems Virology, Department of Microbiology and Immunology, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan; Medical Research Council, University of Glasgow Centre for Virus Research, Glasgow, UK.
| | - Mahan Ghafari
- Department of Biology, University of Oxford, Oxford, UK
| | - Andrew F Magee
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Edyth Parker
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA; Institute of Genomics and Global Health, Redeemer's University, Ede, Osun State, Nigeria
| | - Yu Wang
- Department of Computer Science and Engineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Xiang Ji
- Department of Mathematics, School of Science and Engineering, Tulane University, New Orleans, LA, USA
| | - Jennifer L Havens
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA 92093, USA
| | | | - Tetyana I Vasylyeva
- Department of Medicine, University of California, San Diego, La Jolla, CA 92093, USA; Department of Population Health and Disease Prevention, University of California, Irvine, Irvine, CA 92617, USA
| | - Marc A Suchard
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA; Department of Biostatistics, Fielding School of Public Health, University of California, Los Angeles, Los Angeles, CA 90095, USA; Department of Computational Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Alice C Hughes
- School of Biological Sciences, University of Hong Kong, Hong Kong, Hong Kong; China Biodiversity Green Development Foundation, Beijing, China
| | - Joseph Hughes
- Medical Research Council, University of Glasgow Centre for Virus Research, Glasgow, UK
| | - Andrew Rambaut
- Institute of Ecology and Evolution, University of Edinburgh, Edinburgh, UK
| | - David L Robertson
- Medical Research Council, University of Glasgow Centre for Virus Research, Glasgow, UK
| | - Simon Dellicour
- Spatial Epidemiology Lab (SpELL), Université Libre de Bruxelles, CP160/12, 50 av. FD Roosevelt, 1050 Bruxelles, Belgium; Department of Microbiology, Immunology and Transplantation, Rega Institute, Laboratory for Clinical and Epidemiological Virology, KU Leuven, Leuven, Belgium.
| | - Michael Worobey
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA.
| | - Joel O Wertheim
- Department of Medicine, University of California, San Diego, La Jolla, CA 92093, USA.
| | - Philippe Lemey
- Department of Microbiology, Immunology and Transplantation, Rega Institute, Laboratory for Clinical and Epidemiological Virology, KU Leuven, Leuven, Belgium.
| |
Collapse
|
2
|
Hoscheit P, Desbiez C. Phylodynamics and phylogeography of watermelon mosaic virus: Multiple local invasion routes in southern France and recombination-driven limits to global analysis. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2025; 129:105732. [PMID: 40020892 DOI: 10.1016/j.meegid.2025.105732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/10/2024] [Revised: 02/22/2025] [Accepted: 02/25/2025] [Indexed: 03/03/2025]
Abstract
Watermelon mosaic virus (WMV) is a major plant pathogen, infecting over 170 plant species, including cucurbits and legumes. Though mostly propagated locally by aphids in a non-persistent manner, long-range dispersal can occur through human-induced plant or vector movements. Understanding patterns of local and global spread of WMV is crucial to help formulate adequate control strategies. We used phylodynamic methods based on partial and whole-genome sequences collected in France between 2000 and 2017 to reconstruct the introduction of new lineages in the past 30 years and their subsequent diffusion in the country. We identified at least 11 different introduction events, hailing from different parts of the global diversity of WMV, highlighting the critical role international exchanges play in the spread of plant pathogens. For three of these lineages, we estimated the time and location of their introduction in the mid-1990s in the south of France and the speed at which they spread in this specific landscape. We also showed that the highly recombinogenic nature of WMV, as with most potyviruses, makes the use of whole genomes necessary to classify these viruses on a global scale and must be taken into consideration to reconstruct viral evolutionary history. Our results demonstrate how genomic sequencing of plant viruses can help reconstruct specific viral outbreaks and understand global circulation patterns of plant pathogens.
Collapse
Affiliation(s)
- Patrick Hoscheit
- Université Paris-Saclay, INRAE, MaIAGE, 78350 Jouy-en-Josas, France.
| | | |
Collapse
|
3
|
Case JB, Jain S, Suthar MS, Diamond MS. SARS-CoV-2: The Interplay Between Evolution and Host Immunity. Annu Rev Immunol 2025; 43:29-55. [PMID: 39705164 DOI: 10.1146/annurev-immunol-083122-043054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2024]
Abstract
The persistence of SARS-CoV-2 infections at a global level reflects the repeated emergence of variant strains encoding unique constellations of mutations. These variants have been generated principally because of a dynamic host immune landscape, the countermeasures deployed to combat disease, and selection for enhanced infection of the upper airway and respiratory transmission. The resulting viral diversity creates a challenge for vaccination efforts to maintain efficacy, especially regarding humoral aspects of protection. Here, we review our understanding of how SARS-CoV-2 has evolved during the pandemic, the immune mechanisms that confer protection, and the impact viral evolution has had on transmissibility and adaptive immunity elicited by natural infection and/or vaccination. Evidence suggests that SARS-CoV-2 evolution initially selected variants with increased transmissibility but currently is driven by immune escape. The virus likely will continue to drift to maintain fitness until countermeasures capable of disrupting transmission cycles become widely available.
Collapse
Affiliation(s)
- James Brett Case
- Department of Medicine, Washington University School of Medicine, St. Louis, Missouri, USA;
| | - Shilpi Jain
- Emory Vaccine Center, Emory National Primate Research Center, Atlanta, Georgia, USA
- Center for Childhood Infections and Vaccines of Children's Healthcare of Atlanta, Department of Pediatrics, Emory University School of Medicine, Atlanta, Georgia, USA
| | - Mehul S Suthar
- Emory Vaccine Center, Emory National Primate Research Center, Atlanta, Georgia, USA
- Center for Childhood Infections and Vaccines of Children's Healthcare of Atlanta, Department of Pediatrics, Emory University School of Medicine, Atlanta, Georgia, USA
| | - Michael S Diamond
- Department of Pathology & Immunology; Department of Molecular Microbiology; and Andrew M. and Jane M. Bursky Center for Human Immunology and Immunotherapy Programs, Washington University School of Medicine, St. Louis, Missouri, USA
- Department of Medicine, Washington University School of Medicine, St. Louis, Missouri, USA;
| |
Collapse
|
4
|
Teo B, Bastide P, Ané C. Leveraging graphical model techniques to study evolution on phylogenetic networks. Philos Trans R Soc Lond B Biol Sci 2025; 380:20230310. [PMID: 39976402 PMCID: PMC11867149 DOI: 10.1098/rstb.2023.0310] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Revised: 08/27/2024] [Accepted: 09/16/2024] [Indexed: 02/21/2025] Open
Abstract
The evolution of molecular and phenotypic traits is commonly modelled using Markov processes along a phylogeny. This phylogeny can be a tree, or a network if it includes reticulations, representing events such as hybridization or admixture. Computing the likelihood of data observed at the leaves is costly as the size and complexity of the phylogeny grows. Efficient algorithms exist for trees, but cannot be applied to networks. We show that a vast array of models for trait evolution along phylogenetic networks can be reformulated as graphical models, for which efficient belief propagation algorithms exist. We provide a brief review of belief propagation on general graphical models, then focus on linear Gaussian models for continuous traits. We show how belief propagation techniques can be applied for exact or approximate (but more scalable) likelihood and gradient calculations, and prove novel results for efficient parameter inference of some models. We highlight the possible fruitful interactions between graphical models and phylogenetic methods. For example, approximate likelihood approaches have the potential to greatly reduce computational costs for phylogenies with reticulations.This article is part of the theme issue '"A mathematical theory of evolution": phylogenetic models dating back 100 years'.
Collapse
Affiliation(s)
- Benjamin Teo
- Department of Statistics, University of Wisconsin-Madison, Madison, WI, USA
| | - Paul Bastide
- IMAG, Université de Montpellier, CNRS, Montpellier, France
| | - Cécile Ané
- Department of Statistics, University of Wisconsin-Madison, Madison, WI, USA
- Department of Botany, University of Wisconsin-Madison, Madison, WI, USA
| |
Collapse
|
5
|
Hole K, Vernygora O, Handel K, Nebroski M, Lung O, Nfon C, Babiuk S. Phylogeographic Characterizations of Recent (2015-2023) Senecavirus A Isolates from Canada. Viruses 2025; 17:141. [PMID: 40006896 PMCID: PMC11860792 DOI: 10.3390/v17020141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2024] [Revised: 01/14/2025] [Accepted: 01/20/2025] [Indexed: 02/27/2025] Open
Abstract
Senecavirus A (SVA) continues to cause vesicular lesions in swine in Canada and many regions worldwide. Since the vesicular lesions caused by SVA are similar to those caused by foot and mouth disease virus, swine vesicular disease virus and vesicular stomatitis virus, a foreign animal disease investigation must be initiated to rule out these diseases. SVA isolates from pigs displaying vesicular lesions in Canada from 2015 to 2023 were sequenced, and phylogeographic analysis was performed using the complete genome sequences. The results infer that SVA has spread between the United States and Canada several times. In addition, the results suggest that SVA spreads from different regions. SVA spread was inferred from Canada into Thailand, India and Mexico and inferred from the United States to Brazil, Columbia, Chile and China with ten separate introductions. Furthermore, recombination was observed in SVA genomes from Canada, the United States and China.
Collapse
Affiliation(s)
- Kate Hole
- National Centre for Foreign Animal Disease, Winnipeg, MB R3E 3M4, Canada; (O.V.); (K.H.); (M.N.); (O.L.); (C.N.)
| | - Oksana Vernygora
- National Centre for Foreign Animal Disease, Winnipeg, MB R3E 3M4, Canada; (O.V.); (K.H.); (M.N.); (O.L.); (C.N.)
| | - Katherine Handel
- National Centre for Foreign Animal Disease, Winnipeg, MB R3E 3M4, Canada; (O.V.); (K.H.); (M.N.); (O.L.); (C.N.)
| | - Michelle Nebroski
- National Centre for Foreign Animal Disease, Winnipeg, MB R3E 3M4, Canada; (O.V.); (K.H.); (M.N.); (O.L.); (C.N.)
| | - Oliver Lung
- National Centre for Foreign Animal Disease, Winnipeg, MB R3E 3M4, Canada; (O.V.); (K.H.); (M.N.); (O.L.); (C.N.)
- Department of Biological Sciences, University of Manitoba, Winnipeg, MB R3T 2N2, Canada
| | - Charles Nfon
- National Centre for Foreign Animal Disease, Winnipeg, MB R3E 3M4, Canada; (O.V.); (K.H.); (M.N.); (O.L.); (C.N.)
| | - Shawn Babiuk
- National Centre for Foreign Animal Disease, Winnipeg, MB R3E 3M4, Canada; (O.V.); (K.H.); (M.N.); (O.L.); (C.N.)
- Department of Immunology, University of Manitoba, Winnipeg, MB R3E 0T5, Canada
| |
Collapse
|
6
|
Nanduri S, Black A, Bedford T, Huddleston J. Dimensionality reduction distills complex evolutionary relationships in seasonal influenza and SARS-CoV-2. Virus Evol 2024; 10:veae087. [PMID: 39610652 PMCID: PMC11604119 DOI: 10.1093/ve/veae087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2024] [Revised: 09/30/2024] [Accepted: 10/11/2024] [Indexed: 11/30/2024] Open
Abstract
Public health researchers and practitioners commonly infer phylogenies from viral genome sequences to understand transmission dynamics and identify clusters of genetically-related samples. However, viruses that reassort or recombine violate phylogenetic assumptions and require more sophisticated methods. Even when phylogenies are appropriate, they can be unnecessary or difficult to interpret without specialty knowledge. For example, pairwise distances between sequences can be enough to identify clusters of related samples or assign new samples to existing phylogenetic clusters. In this work, we tested whether dimensionality reduction methods could capture known genetic groups within two human pathogenic viruses that cause substantial human morbidity and mortality and frequently reassort or recombine, respectively: seasonal influenza A/H3N2 and SARS-CoV-2. We applied principal component analysis, multidimensional scaling (MDS), t-distributed stochastic neighbor embedding (t-SNE), and uniform manifold approximation and projection to sequences with well-defined phylogenetic clades and either reassortment (H3N2) or recombination (SARS-CoV-2). For each low-dimensional embedding of sequences, we calculated the correlation between pairwise genetic and Euclidean distances in the embedding and applied a hierarchical clustering method to identify clusters in the embedding. We measured the accuracy of clusters compared to previously defined phylogenetic clades, reassortment clusters, or recombinant lineages. We found that MDS embeddings accurately represented pairwise genetic distances including the intermediate placement of recombinant SARS-CoV-2 lineages between parental lineages. Clusters from t-SNE embeddings accurately recapitulated known phylogenetic clades, H3N2 reassortment groups, and SARS-CoV-2 recombinant lineages. We show that simple statistical methods without a biological model can accurately represent known genetic relationships for relevant human pathogenic viruses. Our open source implementation of these methods for analysis of viral genome sequences can be easily applied when phylogenetic methods are either unnecessary or inappropriate.
Collapse
Affiliation(s)
- Sravani Nanduri
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA, United States
| | - Allison Black
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, Seattle, WA, United States
| | - Trevor Bedford
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, Seattle, WA, United States
- Howard Hughes Medical Institute, Seattle, WA, United States
| | - John Huddleston
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, Seattle, WA, United States
| |
Collapse
|
7
|
Mah MG, Zeller MA, Zhang R, Zhuang Y, Maro VP, Crump JA, Rubach MP, Ooi EE, Low JG, Wang DY, Smith GJD, Su YCF. Discordant phylodynamic and spatiotemporal transmission patterns driving the long-term persistence and evolution of human coronaviruses. NPJ VIRUSES 2024; 2:49. [PMID: 40295720 PMCID: PMC11721344 DOI: 10.1038/s44298-024-00058-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Accepted: 09/10/2024] [Indexed: 04/30/2025]
Abstract
Four distinct species of human coronaviruses (HCoVs) circulate in humans. Despite the recent attention due to SARS-CoV-2, a comprehensive understanding of the molecular epidemiology and genomic evolution of HCoVs remains unclear. Here, we employed primary differentiated human nasal epithelial cells for the successful isolation and genome sequencing of HCoVs derived from two retrospective cohorts in Singapore and Tanzania. Phylodynamic inference shows that HCoV-229E and HCoV-OC43 were subject to stronger genetic drift and reduced purifying selection from the early 2000s onwards, primarily targeting spike Domain A and B. This resulted in increased lineage diversification, coinciding with a higher effective reproductive number (Re>1.0). However, HCoV-NL63 and HCoV-HKU1 experienced weaker genetic drift and selective pressure with prolonged regional persistence. Our findings suggest that HCoV-229E and HCoV-OC43 viruses are adept at generating new variants and achieving widespread intercontinental dissemination driven by continuous genetic drift, recombination, and complex migration patterns.
Collapse
Affiliation(s)
- Marcus G Mah
- Programme in Emerging Infectious Diseases, Duke-NUS Medical School, 8 College Road, Singapore, Singapore
| | - Michael A Zeller
- Programme in Emerging Infectious Diseases, Duke-NUS Medical School, 8 College Road, Singapore, Singapore
- Veterinary Diagnostic and Production Animal Medicine, Iowa State University, Ames, IA, USA
| | - Rong Zhang
- Programme in Emerging Infectious Diseases, Duke-NUS Medical School, 8 College Road, Singapore, Singapore
| | - Yan Zhuang
- Programme in Emerging Infectious Diseases, Duke-NUS Medical School, 8 College Road, Singapore, Singapore
| | - Venance P Maro
- Kilimanjaro Christian Medical Centre, Moshi, Tanzania
- Kilimanjaro Christian Medical University College, Moshi, Tanzania
| | - John A Crump
- Kilimanjaro Christian Medical Centre, Moshi, Tanzania
- Kilimanjaro Christian Medical University College, Moshi, Tanzania
- Centre for International Health, University of Otago, Dunedin, New Zealand
- Division of Infectious Diseases and International Health, Department of Medicine, Duke University, Durham, NC, USA
- Duke Global Health Institute, Duke University, Durham, NC, USA
| | - Matthew P Rubach
- Programme in Emerging Infectious Diseases, Duke-NUS Medical School, 8 College Road, Singapore, Singapore
- Centre for International Health, University of Otago, Dunedin, New Zealand
- Department of Infectious Diseases, Singapore General Hospital, Singapore, Singapore
- Department of Otolaryngology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
| | - Eng Eong Ooi
- Programme in Emerging Infectious Diseases, Duke-NUS Medical School, 8 College Road, Singapore, Singapore
| | - Jenny G Low
- Programme in Emerging Infectious Diseases, Duke-NUS Medical School, 8 College Road, Singapore, Singapore
- Department of Infectious Diseases, Singapore General Hospital, Singapore, Singapore
| | - De Yun Wang
- Department of Otolaryngology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore
| | - Gavin J D Smith
- Programme in Emerging Infectious Diseases, Duke-NUS Medical School, 8 College Road, Singapore, Singapore.
- Centre for Outbreak Preparedness, Duke-NUS Medical School, Singapore, Singapore.
| | - Yvonne C F Su
- Programme in Emerging Infectious Diseases, Duke-NUS Medical School, 8 College Road, Singapore, Singapore.
| |
Collapse
|
8
|
Li X, Trovão NS. The evolutionary and transmission dynamics of HIV-1 CRF08_BC. PLoS One 2024; 19:e0310027. [PMID: 39241052 PMCID: PMC11379155 DOI: 10.1371/journal.pone.0310027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Accepted: 08/22/2024] [Indexed: 09/08/2024] Open
Abstract
HIV-1 CRF08_BC is a significant subtype in China, though its origin and spread remain incompletely understood. Previous studies using partial genomic data have provided insights but lack comprehensive analysis. Here, we investigate the early evolutionary and spatiotemporal dynamics of HIV-1 CRF08_BC in China and Myanmar using near-complete genome sequences. We analyzed 28 near-complete HIV-1 CRF08_BC genomes from China and Myanmar (1997-2013). Phylogenetic, molecular clock, and Bayesian discrete trait analyses were performed to infer the virus's origin, spread, and associated risk groups. Based on Bayesian time-scaled inference with the best-fitting combination of models determined by marginal likelihood estimation (MLE), we inferred the time to the most recent common ancestor (TMRCA) and evolutionary rate of HIV-1 CRF08_BC to be at 3 October 1991 (95% HPD: 22 February1989-27 November 1993) and 2.30 × 10-3 substitutions per site per year (95% HPD: 1.96 × 10-3-2.63 × 10-3), respectively. Our analysis suggests that HIV-1 CRF08_BC originated in Yunnan Province, China, among injecting drug users, and subsequently spread to other regions. This study provides valuable insights into the early dynamics of HIV-1 CRF08_BC through combined genomic and epidemiological data, which may inform effective prevention and mitigation efforts. However, the limited genomic data influenced the extent of our findings, and challenges in collecting accurate risk group information during surveillance were evident.
Collapse
Affiliation(s)
- Xingguang Li
- Ningbo No.2 Hospital, Ningbo, China
- Guoke Ningbo Life Science and Health Industry Research Institute, Ningbo, China
| | - Nídia S Trovão
- National Institutes of Health, Fogarty International Center, Bethesda, Maryland, United States of America
| |
Collapse
|
9
|
Nanduri S, Black A, Bedford T, Huddleston J. Dimensionality reduction distills complex evolutionary relationships in seasonal influenza and SARS-CoV-2. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.07.579374. [PMID: 39253501 PMCID: PMC11383015 DOI: 10.1101/2024.02.07.579374] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/11/2024]
Abstract
Public health researchers and practitioners commonly infer phylogenies from viral genome sequences to understand transmission dynamics and identify clusters of genetically-related samples. However, viruses that reassort or recombine violate phylogenetic assumptions and require more sophisticated methods. Even when phylogenies are appropriate, they can be unnecessary or difficult to interpret without specialty knowledge. For example, pairwise distances between sequences can be enough to identify clusters of related samples or assign new samples to existing phylogenetic clusters. In this work, we tested whether dimensionality reduction methods could capture known genetic groups within two human pathogenic viruses that cause substantial human morbidity and mortality and frequently reassort or recombine, respectively: seasonal influenza A/H3N2 and SARS-CoV-2. We applied principal component analysis (PCA), multidimensional scaling (MDS), t-distributed stochastic neighbor embedding (t-SNE), and uniform manifold approximation and projection (UMAP) to sequences with well-defined phylogenetic clades and either reassortment (H3N2) or recombination (SARS-CoV-2). For each low-dimensional embedding of sequences, we calculated the correlation between pairwise genetic and Euclidean distances in the embedding and applied a hierarchical clustering method to identify clusters in the embedding. We measured the accuracy of clusters compared to previously defined phylogenetic clades, reassortment clusters, or recombinant lineages. We found that MDS embeddings accurately represented pairwise genetic distances including the intermediate placement of recombinant SARS-CoV-2 lineages between parental lineages. Clusters from t-SNE embeddings accurately recapitulated known phylogenetic clades, H3N2 reassortment groups, and SARS-CoV-2 recombinant lineages. We show that simple statistical methods without a biological model can accurately represent known genetic relationships for relevant human pathogenic viruses. Our open source implementation of these methods for analysis of viral genome sequences can be easily applied when phylogenetic methods are either unnecessary or inappropriate.
Collapse
Affiliation(s)
- Sravani Nanduri
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA, USA
| | - Allison Black
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Trevor Bedford
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
- Howard Hughes Medical Institute, Seattle, WA, USA
| | - John Huddleston
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
| |
Collapse
|
10
|
Kim BJ, Choi J, Kim SH. Whole-genome demography of COVID-19 virus during its pandemic period and on "panvalent" vaccine design. Sci Rep 2024; 14:17752. [PMID: 39085292 PMCID: PMC11291670 DOI: 10.1038/s41598-024-68432-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Accepted: 07/23/2024] [Indexed: 08/02/2024] Open
Abstract
With over 16 million submitted genomic sequences, the SARS-CoV-2 (SC2) virus, the cause of the most recent worldwide COVID-19 pandemic, has become the most sequenced genome of all known viruses, revealing, for example, a vast number of expanding viral lineages. Since the pandemic phase appears to be over, we performed a retrospective re-examination of the demographic grouping pattern and their genomic characteristics during the entire pandemic period up to the peak of the last pandemic wave. For our study, we extracted from the NCBI only unique viral sequences and converted each sequence data to a relational vector, indicating the presence/absence of each variational event compared to a "reference" sequence. Our study revealed several genomic features that are unexpected or different from those of previous studies. For example, approximately 44,000 variants with unique sequences emerged during the pandemic period; they group into only four major viral-genomic groups and each has a set of mostly unique highly-conserved variant-genotypes (HCVGs); and a small set from the first ("ancestral") group was inherited by the three ("descendant") groups, suggesting that HCVGs in the next group may be predictable from the current group(s). Such a concept may be potentially important in designing "panvalent" vaccines against the current and future waves of viral infections.
Collapse
Affiliation(s)
- Byung-Ju Kim
- Department of Chemistry and Center for Computational Biology, University of California, Berkeley, CA, 94720, USA
- Convergence Research Center for Insect Vectors, Incheon National University, Incheon, 22012, Republic of Korea
- Department of Dermatology, Chobanian and Avedisian School of Medicine, Boston University, Boston, MA, USA
| | - JaeJin Choi
- Department of Chemistry and Center for Computational Biology, University of California, Berkeley, CA, 94720, USA
- Graduate Program of Comparative Biochemistry, University of California, Berkeley, CA, 94720, USA
- Division of Biological Systems and Engineering, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Sung-Hou Kim
- Department of Chemistry and Center for Computational Biology, University of California, Berkeley, CA, 94720, USA.
- Department of Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA.
| |
Collapse
|
11
|
Siragam V, Maltseva M, Castonguay N, Galipeau Y, Srinivasan MM, Soto JH, Dankar S, Langlois MA. Seasonal human coronaviruses OC43, 229E, and NL63 induce cell surface modulation of entry receptors and display host cell-specific viral replication kinetics. Microbiol Spectr 2024; 12:e0422023. [PMID: 38864599 PMCID: PMC11218498 DOI: 10.1128/spectrum.04220-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Accepted: 04/25/2024] [Indexed: 06/13/2024] Open
Abstract
The emergence of the COVID-19 pandemic prompted an increased interest in seasonal human coronaviruses. OC43, 229E, NL63, and HKU1 are endemic seasonal coronaviruses that cause the common cold and are associated with generally mild respiratory symptoms. In this study, we identified cell lines that exhibited cytopathic effects (CPE) upon infection by three of these coronaviruses and characterized their viral replication kinetics and the effect of infection on host surface receptor expression. We found that NL63 produced CPE in LLC-MK2 cells, while OC43 produced CPE in MRC-5, HCT-8, and WI-38 cell lines, while 229E produced CPE in MRC-5 and WI-38 by day 3 post-infection. We observed a sharp increase in nucleocapsid and spike viral RNA (vRNA) from day 3 to day 5 post-infection for all viruses; however, the abundance and the proportion of vRNA copies measured in the supernatants and cell lysates of infected cells varied considerably depending on the virus-host cell pair. Importantly, we observed modulation of coronavirus entry and attachment receptors upon infection. Infection with 229E and OC43 led to a downregulation of CD13 and GD3, respectively. In contrast, infection with NL63 and OC43 leads to an increase in ACE2 expression. Attempts to block entry of NL63 using either soluble ACE2 or anti-ACE2 monoclonal antibodies demonstrated the potential of these strategies to greatly reduce infection. Overall, our results enable a better understanding of seasonal coronaviruses infection kinetics in permissive cell lines and reveal entry receptor modulation that may have implications in facilitating co-infections with multiple coronaviruses in humans.IMPORTANCESeasonal human coronavirus is an important cause of the common cold associated with generally mild upper respiratory tract infections that can result in respiratory complications for some individuals. There are no vaccines available for these viruses, with only limited antiviral therapeutic options to treat the most severe cases. A better understanding of how these viruses interact with host cells is essential to identify new strategies to prevent infection-related complications. By analyzing viral replication kinetics in different permissive cell lines, we find that cell-dependent host factors influence how viral genes are expressed and virus particles released. We also analyzed entry receptor expression on infected cells and found that these can be up- or down-modulated depending on the infecting coronavirus. Our findings raise concerns over the possibility of infection enhancement upon co-infection by some coronaviruses, which may facilitate genetic recombination and the emergence of new variants and strains.
Collapse
MESH Headings
- Humans
- Virus Replication
- Coronavirus NL63, Human/physiology
- Coronavirus NL63, Human/genetics
- Coronavirus 229E, Human/physiology
- Coronavirus 229E, Human/genetics
- Coronavirus OC43, Human/physiology
- Coronavirus OC43, Human/genetics
- Cell Line
- Virus Internalization
- Seasons
- Kinetics
- Receptors, Virus/metabolism
- Receptors, Virus/genetics
- Common Cold/virology
- Common Cold/metabolism
- SARS-CoV-2/physiology
- SARS-CoV-2/genetics
- SARS-CoV-2/metabolism
- RNA, Viral/metabolism
- RNA, Viral/genetics
- Animals
- COVID-19/virology
- COVID-19/metabolism
- Coronavirus/physiology
- Coronavirus/genetics
Collapse
Affiliation(s)
- Vinayakumar Siragam
- Department of Biochemistry, Microbiology and Immunology, Faculty of Medicine, University of Ottawa, Ottawa, Canada
| | - Mariam Maltseva
- Department of Biochemistry, Microbiology and Immunology, Faculty of Medicine, University of Ottawa, Ottawa, Canada
| | - Nicolas Castonguay
- Department of Biochemistry, Microbiology and Immunology, Faculty of Medicine, University of Ottawa, Ottawa, Canada
| | - Yannick Galipeau
- Department of Biochemistry, Microbiology and Immunology, Faculty of Medicine, University of Ottawa, Ottawa, Canada
| | - Mrudhula Madapuji Srinivasan
- Department of Biochemistry, Microbiology and Immunology, Faculty of Medicine, University of Ottawa, Ottawa, Canada
| | - Justino Hernandez Soto
- Department of Biochemistry, Microbiology and Immunology, Faculty of Medicine, University of Ottawa, Ottawa, Canada
| | - Samar Dankar
- Department of Biochemistry, Microbiology and Immunology, Faculty of Medicine, University of Ottawa, Ottawa, Canada
| | - Marc-André Langlois
- Department of Biochemistry, Microbiology and Immunology, Faculty of Medicine, University of Ottawa, Ottawa, Canada
- The Center for Infection, Immunity, and Inflammation (CI3), University of Ottawa, Ottawa, Canada
| |
Collapse
|
12
|
Alfonsi T, Bernasconi A, Chiara M, Ceri S. Data-driven recombination detection in viral genomes. Nat Commun 2024; 15:3313. [PMID: 38632281 PMCID: PMC11024102 DOI: 10.1038/s41467-024-47464-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 03/25/2024] [Indexed: 04/19/2024] Open
Abstract
Recombination is a key molecular mechanism for the evolution and adaptation of viruses. The first recombinant SARS-CoV-2 genomes were recognized in 2021; as of today, more than ninety SARS-CoV-2 lineages are designated as recombinant. In the wake of the COVID-19 pandemic, several methods for detecting recombination in SARS-CoV-2 have been proposed; however, none could faithfully confirm manual analyses by experts in the field. We hereby present RecombinHunt, an original data-driven method for the identification of recombinant genomes, capable of recognizing recombinant SARS-CoV-2 genomes (or lineages) with one or two breakpoints with high accuracy and within reduced turn-around times. ReconbinHunt shows high specificity and sensitivity, compares favorably with other state-of-the-art methods, and faithfully confirms manual analyses by experts. RecombinHunt identifies recombinant viral genomes from the recent monkeypox epidemic in high concordance with manually curated analyses by experts, suggesting that our approach is robust and can be applied to any epidemic/pandemic virus.
Collapse
Affiliation(s)
- Tommaso Alfonsi
- Department of Electronics, Information, and Bioengineering, Politecnico di Milano, Via Ponzio 34/5, 20133, Milan, Italy
| | - Anna Bernasconi
- Department of Electronics, Information, and Bioengineering, Politecnico di Milano, Via Ponzio 34/5, 20133, Milan, Italy.
| | - Matteo Chiara
- Department of Biosciences, Università degli Studi di Milano, Via Celoria 26, 20133, Milan, Italy
| | - Stefano Ceri
- Department of Electronics, Information, and Bioengineering, Politecnico di Milano, Via Ponzio 34/5, 20133, Milan, Italy
| |
Collapse
|
13
|
Samson S, Lord É, Makarenkov V. Assessing the emergence time of SARS-CoV-2 zoonotic spillover. PLoS One 2024; 19:e0301195. [PMID: 38574109 PMCID: PMC10994396 DOI: 10.1371/journal.pone.0301195] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2023] [Accepted: 03/12/2024] [Indexed: 04/06/2024] Open
Abstract
Understanding the evolution of Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV-2) and its relationship to other coronaviruses in the wild is crucial for preventing future virus outbreaks. While the origin of the SARS-CoV-2 pandemic remains uncertain, mounting evidence suggests the direct involvement of the bat and pangolin coronaviruses in the evolution of the SARS-CoV-2 genome. To unravel the early days of a probable zoonotic spillover event, we analyzed genomic data from various coronavirus strains from both human and wild hosts. Bayesian phylogenetic analysis was performed using multiple datasets, using strict and relaxed clock evolutionary models to estimate the occurrence times of key speciation, gene transfer, and recombination events affecting the evolution of SARS-CoV-2 and its closest relatives. We found strong evidence supporting the presence of temporal structure in datasets containing SARS-CoV-2 variants, enabling us to estimate the time of SARS-CoV-2 zoonotic spillover between August and early October 2019. In contrast, datasets without SARS-CoV-2 variants provided mixed results in terms of temporal structure. However, they allowed us to establish that the presence of a statistically robust clade in the phylogenies of gene S and its receptor-binding (RBD) domain, including two bat (BANAL) and two Guangdong pangolin coronaviruses (CoVs), is due to the horizontal gene transfer of this gene from the bat CoV to the pangolin CoV that occurred in the middle of 2018. Importantly, this clade is closely located to SARS-CoV-2 in both phylogenies. This phylogenetic proximity had been explained by an RBD gene transfer from the Guangdong pangolin CoV to a very recent ancestor of SARS-CoV-2 in some earlier works in the field before the BANAL coronaviruses were discovered. Overall, our study provides valuable insights into the timeline and evolutionary dynamics of the SARS-CoV-2 pandemic.
Collapse
Affiliation(s)
- Stéphane Samson
- Department of Computer Sciences, Université du Québec à Montréal, Montréal, Canada
- Saint-Jean-sur-Richelieu Research and Development Centre, Agriculture and Agri-Food Canada, Saint-Jean-sur-Richelieu, Québec, Canada
| | - Étienne Lord
- Saint-Jean-sur-Richelieu Research and Development Centre, Agriculture and Agri-Food Canada, Saint-Jean-sur-Richelieu, Québec, Canada
| | - Vladimir Makarenkov
- Department of Computer Sciences, Université du Québec à Montréal, Montréal, Canada
- Mila—Quebec AI Institute, Montreal, QC, Canada
| |
Collapse
|
14
|
Steiner S, Kratzel A, Barut GT, Lang RM, Aguiar Moreira E, Thomann L, Kelly JN, Thiel V. SARS-CoV-2 biology and host interactions. Nat Rev Microbiol 2024; 22:206-225. [PMID: 38225365 DOI: 10.1038/s41579-023-01003-z] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/01/2023] [Indexed: 01/17/2024]
Abstract
The zoonotic emergence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and the ensuing coronavirus disease 2019 (COVID-19) pandemic have profoundly affected our society. The rapid spread and continuous evolution of new SARS-CoV-2 variants continue to threaten global public health. Recent scientific advances have dissected many of the molecular and cellular mechanisms involved in coronavirus infections, and large-scale screens have uncovered novel host-cell factors that are vitally important for the virus life cycle. In this Review, we provide an updated summary of the SARS-CoV-2 life cycle, gene function and virus-host interactions, including recent landmark findings on general aspects of coronavirus biology and newly discovered host factors necessary for virus replication.
Collapse
Affiliation(s)
- Silvio Steiner
- Institute of Virology and Immunology, Bern and Mittelhäusern, Bern, Switzerland
- Department of Infectious Diseases and Pathobiology, Vetsuisse Faculty, University of Bern, Bern, Switzerland
| | - Annika Kratzel
- Institute of Virology and Immunology, Bern and Mittelhäusern, Bern, Switzerland
- Department of Infectious Diseases and Pathobiology, Vetsuisse Faculty, University of Bern, Bern, Switzerland
| | - G Tuba Barut
- Institute of Virology and Immunology, Bern and Mittelhäusern, Bern, Switzerland
- Department of Infectious Diseases and Pathobiology, Vetsuisse Faculty, University of Bern, Bern, Switzerland
| | - Reto M Lang
- Institute of Virology and Immunology, Bern and Mittelhäusern, Bern, Switzerland
- Department of Infectious Diseases and Pathobiology, Vetsuisse Faculty, University of Bern, Bern, Switzerland
- Graduate School for Cellular and Biomedical Sciences, University of Bern, Bern, Switzerland
| | - Etori Aguiar Moreira
- Institute of Virology and Immunology, Bern and Mittelhäusern, Bern, Switzerland
- Department of Infectious Diseases and Pathobiology, Vetsuisse Faculty, University of Bern, Bern, Switzerland
| | - Lisa Thomann
- Institute of Virology and Immunology, Bern and Mittelhäusern, Bern, Switzerland
- Department of Infectious Diseases and Pathobiology, Vetsuisse Faculty, University of Bern, Bern, Switzerland
| | - Jenna N Kelly
- Institute of Virology and Immunology, Bern and Mittelhäusern, Bern, Switzerland
- Department of Infectious Diseases and Pathobiology, Vetsuisse Faculty, University of Bern, Bern, Switzerland
- Multidisciplinary Center for Infectious Diseases, University of Bern, Bern, Switzerland
- European Virus Bioinformatics Center, Jena, Germany
| | - Volker Thiel
- Institute of Virology and Immunology, Bern and Mittelhäusern, Bern, Switzerland.
- Department of Infectious Diseases and Pathobiology, Vetsuisse Faculty, University of Bern, Bern, Switzerland.
- Multidisciplinary Center for Infectious Diseases, University of Bern, Bern, Switzerland.
- European Virus Bioinformatics Center, Jena, Germany.
| |
Collapse
|
15
|
Do HQ, Yeom M, Moon S, Lee H, Chung CU, Chung HC, Park JW, Na W, Song D. Genetic characterization and pathogenicity in a mouse model of newly isolated bat-originated mammalian orthoreovirus in South Korea. Microbiol Spectr 2024; 12:e0176223. [PMID: 38289932 PMCID: PMC10913406 DOI: 10.1128/spectrum.01762-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Accepted: 12/29/2023] [Indexed: 02/01/2024] Open
Abstract
Mammalian orthoreoviruses (MRVs) infect a wide range of hosts, including humans, livestock, and wildlife. In the present study, we isolated a novel Mammalian orthoreovirus from the intestine of a microbat (Myotis aurascens) and investigated its biological and pathological characteristics. Phylogenetic analysis indicated that the new isolate was serotype 2, sharing the segments with those from different hosts. Our results showed that it can infect a wide range of cell lines from different mammalian species, including human, swine, and non-human primate cell lines. Additionally, media containing trypsin, yeast extract, and tryptose phosphate broth promoted virus propagation in primate cell lines and most human cell lines, but not in A549 and porcine cell lines. Mice infected with this strain via the intranasal route, but not via the oral route, exhibited weight loss and respiratory distress. The virus is distributed in a broad range of organs and causes lung damage. In vitro and in vivo experiments also suggested that the new virus could be a neurotropic infectious strain that can infect a neuroblastoma cell line and replicate in the brains of infected mice. Additionally, it caused a delayed immune response, as indicated by the high expression levels of cytokines and chemokines only at 14 days post-infection (dpi). These data provide an important understanding of the genetics and pathogenicity of mammalian orthoreoviruses in bats at risk of spillover infections.IMPORTANCEMammalian orthoreoviruses (MRVs) have a broad range of hosts and can cause serious respiratory and gastroenteritis diseases in humans and livestock. Some strains infect the central nervous system, causing severe encephalitis. In this study, we identified BatMRV2/SNU1/Korea/2021, a reassortment of MRV serotype 2, isolated from bats with broad tissue tropism, including the neurological system. In addition, it has been shown to cause respiratory syndrome in mouse models. The given data will provide more evidence of the risk of mammalian orthoreovirus transmission from wildlife to various animal species and the sources of spillover infections.
Collapse
Affiliation(s)
- Hai Quynh Do
- Department of Virology, College of Veterinary Medicine and Research Institute for Veterinary Science, Seoul National University, Seoul, South Korea
| | - Minjoo Yeom
- Department of Virology, College of Veterinary Medicine and Research Institute for Veterinary Science, Seoul National University, Seoul, South Korea
| | - Suyun Moon
- College of Veterinary Medicine, Chonnam National University, Gwangju, South Korea
| | - Hanbyeul Lee
- Department of Virology, College of Veterinary Medicine and Research Institute for Veterinary Science, Seoul National University, Seoul, South Korea
| | - Chul-un Chung
- Department of Life Science, Dongguk University, Gyeongju, South Korea
| | - Hee-chun Chung
- Department of Microbiology and Immunology, Institute for Immunology and Immunological Diseases, Yonsei University College of Medicine, Seoul, South Korea
| | - Jun Won Park
- Division of Biomedical Convergence, College of Biomedical Science, Kangwon National University, Chuncheon, South Korea
| | - Woonsung Na
- College of Veterinary Medicine, Chonnam National University, Gwangju, South Korea
| | - Daesub Song
- Department of Virology, College of Veterinary Medicine and Research Institute for Veterinary Science, Seoul National University, Seoul, South Korea
| |
Collapse
|
16
|
Rasmussen DA, Guo F. Espalier: Efficient Tree Reconciliation and Ancestral Recombination Graphs Reconstruction Using Maximum Agreement Forests. Syst Biol 2023; 72:1154-1170. [PMID: 37458753 PMCID: PMC10627558 DOI: 10.1093/sysbio/syad040] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2022] [Revised: 06/26/2023] [Accepted: 06/30/2023] [Indexed: 11/08/2023] Open
Abstract
In the presence of recombination individuals may inherit different regions of their genome from different ancestors, resulting in a mosaic of phylogenetic histories across their genome. Ancestral recombination graphs (ARGs) can capture how phylogenetic relationships vary across the genome due to recombination, but reconstructing ARGs from genomic sequence data is notoriously difficult. Here, we present a method for reconciling discordant phylogenetic trees and reconstructing ARGs using maximum agreement forests (MAFs). Given two discordant trees, a MAF identifies the smallest possible set of topologically concordant subtrees present in both trees. We show how discordant trees can be reconciled through their MAF in a way that retains discordances strongly supported by sequence data while eliminating conflicts likely attributable to phylogenetic noise. We further show how MAFs and our reconciliation approach can be combined to select a path of local trees across the genome that maximizes the likelihood of the genomic sequence data, minimizes discordance between neighboring local trees, and identifies the recombination events necessary to explain remaining discordances to obtain a fully connected ARG. While heuristic, our ARG reconstruction approach is often as accurate as more exact methods while being much more computationally efficient. Moreover, important demographic parameters such as recombination rates can be accurately estimated from reconstructed ARGs. Finally, we apply our approach to plant infecting RNA viruses in the genus Potyvirus to demonstrate how true recombination events can be disentangled from phylogenetic noise using our ARG reconstruction methods.
Collapse
Affiliation(s)
- David A Rasmussen
- Department of Entomology and Plant Pathology, North Carolina State University, Campus Box 7613, Raleigh, NC 27695, USA
- Bioinformatics Research Center, North Carolina State University, Campus Box 7566, Raleigh, NC 27695, USA
| | - Fangfang Guo
- Department of Entomology and Plant Pathology, North Carolina State University, Campus Box 7613, Raleigh, NC 27695, USA
| |
Collapse
|
17
|
Fang L, Xu J, Zhao Y, Fan J, Shen J, Liu W, Cao G. The effects of amino acid substitution of spike protein and genomic recombination on the evolution of SARS-CoV-2. Front Microbiol 2023; 14:1228128. [PMID: 37560529 PMCID: PMC10409611 DOI: 10.3389/fmicb.2023.1228128] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Accepted: 07/03/2023] [Indexed: 08/11/2023] Open
Abstract
Over three years' pandemic of 2019 novel coronavirus disease (COVID-19), multiple variants and novel subvariants have emerged successively, outcompeted earlier variants and become predominant. The sequential emergence of variants reflects the evolutionary process of mutation-selection-adaption of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Amino acid substitution/insertion/deletion in the spike protein causes altered viral antigenicity, transmissibility, and pathogenicity of SARS-CoV-2. Early in the pandemic, D614G mutation conferred virus with advantages over previous variants and increased transmissibility, and it also laid a conservative background for subsequent substantial mutations. The role of genomic recombination in the evolution of SARS-CoV-2 raised increasing concern with the occurrence of novel recombinants such as Deltacron, XBB.1.5, XBB.1.9.1, and XBB.1.16 in the late phase of pandemic. Co-circulation of different variants and co-infection in immunocompromised patients accelerate the emergence of recombinants. Surveillance for SARS-CoV-2 genomic variations, particularly spike protein mutation and recombination, is essential to identify ongoing changes in the viral genome and antigenic epitopes and thus leads to the development of new vaccine strategies and interventions.
Collapse
Affiliation(s)
- Letian Fang
- Key Laboratory of Biological Defense, Ministry of Education, Shanghai, China
- Shanghai Key Laboratory of Medical Bioprotection, Shanghai, China
- Department of Epidemiology, Second Military Medical University, Shanghai, China
| | - Jie Xu
- Department of Foreign Languages, International Exchange Center for Military Medicine, Second Military Medical University, Shanghai, China
| | - Yue Zhao
- Key Laboratory of Biological Defense, Ministry of Education, Shanghai, China
- Shanghai Key Laboratory of Medical Bioprotection, Shanghai, China
- Department of Epidemiology, Second Military Medical University, Shanghai, China
| | - Junyan Fan
- Key Laboratory of Biological Defense, Ministry of Education, Shanghai, China
- Shanghai Key Laboratory of Medical Bioprotection, Shanghai, China
- Department of Epidemiology, Second Military Medical University, Shanghai, China
| | - Jiaying Shen
- School of Medicine, Tongji University, Shanghai, China
| | - Wenbin Liu
- Key Laboratory of Biological Defense, Ministry of Education, Shanghai, China
- Shanghai Key Laboratory of Medical Bioprotection, Shanghai, China
- Department of Epidemiology, Second Military Medical University, Shanghai, China
| | - Guangwen Cao
- Key Laboratory of Biological Defense, Ministry of Education, Shanghai, China
- Shanghai Key Laboratory of Medical Bioprotection, Shanghai, China
- Department of Epidemiology, Second Military Medical University, Shanghai, China
| |
Collapse
|
18
|
Pekar JE, Lytras S, Ghafari M, Magee AF, Parker E, Havens JL, Katzourakis A, Vasylyeva TI, Suchard MA, Hughes AC, Hughes J, Robertson DL, Dellicour S, Worobey M, Wertheim JO, Lemey P. The recency and geographical origins of the bat viruses ancestral to SARS-CoV and SARS-CoV-2. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.12.548617. [PMID: 37502985 PMCID: PMC10369958 DOI: 10.1101/2023.07.12.548617] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
The emergence of SARS-CoV in 2002 and SARS-CoV-2 in 2019 has led to increased sampling of related sarbecoviruses circulating primarily in horseshoe bats. These viruses undergo frequent recombination and exhibit spatial structuring across Asia. Employing recombination-aware phylogenetic inference on bat sarbecoviruses, we find that the closest-inferred bat virus ancestors of SARS-CoV and SARS-CoV-2 existed just ~1-3 years prior to their emergence in humans. Phylogeographic analyses examining the movement of related sarbecoviruses demonstrate that they traveled at similar rates to their horseshoe bat hosts and have been circulating for thousands of years in Asia. The closest-inferred bat virus ancestor of SARS-CoV likely circulated in western China, and that of SARS-CoV-2 likely circulated in a region comprising southwest China and northern Laos, both a substantial distance from where they emerged. This distance and recency indicate that the direct ancestors of SARS-CoV and SARS-CoV-2 could not have reached their respective sites of emergence via the bat reservoir alone. Our recombination-aware dating and phylogeographic analyses reveal a more accurate inference of evolutionary history than performing only whole-genome or single gene analyses. These results can guide future sampling efforts and demonstrate that viral genomic fragments extremely closely related to SARS-CoV and SARS-CoV-2 were circulating in horseshoe bats, confirming their importance as the reservoir species for SARS viruses.
Collapse
Affiliation(s)
- Jonathan E Pekar
- Bioinformatics and Systems Biology Graduate Program, University of California San Diego, La Jolla, CA 92093, USA
- Department of Biomedical Informatics, University of California San Diego, La Jolla, CA 92093, USA
- These authors contributed equally
| | - Spyros Lytras
- Medical Research Council-University of Glasgow Centre for Virus Research, Glasgow, UK
- These authors contributed equally
| | - Mahan Ghafari
- Department of Biology, University of Oxford, Oxford, UK
| | - Andrew F Magee
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Edyth Parker
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Jennifer L Havens
- Bioinformatics and Systems Biology Graduate Program, University of California San Diego, La Jolla, CA 92093, USA
| | | | - Tetyana I Vasylyeva
- Department of Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | - Marc A Suchard
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Biostatistics, Fielding School of Public Health, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Computational Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Alice C Hughes
- School of Biological Sciences, University of Hong Kong, Hong Kong
- China Biodiversity Green Development Foundation, Beijing, China
| | - Joseph Hughes
- Medical Research Council-University of Glasgow Centre for Virus Research, Glasgow, UK
| | - David L Robertson
- Medical Research Council-University of Glasgow Centre for Virus Research, Glasgow, UK
- These authors jointly supervised the work
| | - Simon Dellicour
- Spatial Epidemiology Lab (SpELL), Université Libre de Bruxelles, CP160/12, 50 av. FD Roosevelt, 1050, Bruxelles, Belgium
- Department of Microbiology, Immunology and Transplantation, Rega Institute, Laboratory for Clinical and Epidemiological Virology, KU Leuven, Leuven, Belgium
- These authors jointly supervised the work
| | - Michael Worobey
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA
- These authors jointly supervised the work
| | - Joel O Wertheim
- Department of Medicine, University of California San Diego, La Jolla, CA 92093, USA
- These authors jointly supervised the work
| | - Philippe Lemey
- Department of Microbiology, Immunology and Transplantation, Rega Institute, Laboratory for Clinical and Epidemiological Virology, KU Leuven, Leuven, Belgium
- These authors jointly supervised the work
| |
Collapse
|
19
|
Zhang L, Abhari N, Colijn C, Wu Y. A fast and scalable method for inferring phylogenetic networks from trees by aligning lineage taxon strings. Genome Res 2023; 33:1053-1060. [PMID: 37217252 PMCID: PMC10538497 DOI: 10.1101/gr.277669.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Accepted: 05/16/2023] [Indexed: 05/24/2023]
Abstract
The reconstruction of phylogenetic networks is an important but challenging problem in phylogenetics and genome evolution, as the space of phylogenetic networks is vast and cannot be sampled well. One approach to the problem is to solve the minimum phylogenetic network problem, in which phylogenetic trees are first inferred, and then the smallest phylogenetic network that displays all the trees is computed. The approach takes advantage of the fact that the theory of phylogenetic trees is mature, and there are excellent tools available for inferring phylogenetic trees from a large number of biomolecular sequences. A tree-child network is a phylogenetic network satisfying the condition that every nonleaf node has at least one child that is of indegree one. Here, we develop a new method that infers the minimum tree-child network by aligning lineage taxon strings in the phylogenetic trees. This algorithmic innovation enables us to get around the limitations of the existing programs for phylogenetic network inference. Our new program, named ALTS, is fast enough to infer a tree-child network with a large number of reticulations for a set of up to 50 phylogenetic trees with 50 taxa that have only trivial common clusters in about a quarter of an hour on average.
Collapse
Affiliation(s)
- Louxin Zhang
- Department of Mathematics and Centre for Data Science and Machine Learning, National University of Singapore, Singapore 119076, Singapore;
| | - Niloufar Abhari
- Department of Mathematics, Simon Fraser University, Burnaby, British Columbia V5A 1S6, Canada
| | - Caroline Colijn
- Department of Mathematics, Simon Fraser University, Burnaby, British Columbia V5A 1S6, Canada
| | - Yufeng Wu
- Department of Computer Science and Engineering and Institute for Systems Genomics, University of Connecticut, Storrs, Connecticut 06269, USA
| |
Collapse
|
20
|
Wu Z, Han Y, Wang Y, Liu B, Zhao L, Zhang J, Su H, Zhao W, Liu L, Bai S, Dong J, Sun L, Zhu Y, Zhou S, Song Y, Sui H, Yang J, Wang J, Zhang S, Qian Z, Jin Q. A comprehensive survey of bat sarbecoviruses across China in relation to the origins of SARS-CoV and SARS-CoV-2. Natl Sci Rev 2023; 10:nwac213. [PMID: 37425654 PMCID: PMC10325003 DOI: 10.1093/nsr/nwac213] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2022] [Revised: 09/08/2022] [Accepted: 09/13/2022] [Indexed: 09/10/2023] Open
Abstract
SARS-CoV and SARS-CoV-2 have been thought to originate from bats. In this study, we screened pharyngeal and anal swabs from 13 064 bats collected between 2016 and 2021 at 703 locations across China for sarbecoviruses, covering almost all known southern hotspots, and found 146 new bat sarbecoviruses. Phylogenetic analyses of all available sarbecoviruses show that there are three different lineages-L1 as SARS-CoV-related CoVs (SARSr-CoVs), L2 as SARS-CoV-2-related CoVs (SC2r-CoVs) and novel L-R (recombinants of L1 and L2)-present in Rhinolophus pusillus bats, in the mainland of China. Among the 146 sequences, only four are L-Rs. Importantly, none belong in the L2 lineage, indicating that circulation of SC2r-CoVs in China might be very limited. All remaining 142 sequences belong in the L1 lineage, of which YN2020B-G shares the highest overall sequence identity with SARS-CoV (95.8%). The observation suggests endemic circulations of SARSr-CoVs, but not SC2r-CoVs, in bats in China. Geographic analysis of the collection sites in this study, together with all published reports, indicates that SC2r-CoVs may be mainly present in bats of Southeast Asia, including the southern border of Yunnan province, but absent in all other regions within China. In contrast, SARSr-CoVs appear to have broader geographic distribution, with the highest genetic diversity and sequence identity to human sarbecoviruses along the southwest border of China. Our data provide the rationale for further extensive surveys in broader geographical regions within, and beyond, Southeast Asia in order to find the most recent ancestors of human sarbecoviruses.
Collapse
Affiliation(s)
- Zhiqiang Wu
- NHC Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 110730, China
- Key Laboratory of Respiratory Disease Pathogenomics, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 110730, China
| | - Yelin Han
- NHC Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 110730, China
| | - Yuyang Wang
- NHC Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 110730, China
| | - Bo Liu
- NHC Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 110730, China
| | - Lamei Zhao
- NHC Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 110730, China
| | - Junpeng Zhang
- College of Animal Science and Veterinary Medicine, Shenyang Agricultural University, Shenyang 110866, China
| | - Haoxiang Su
- NHC Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 110730, China
| | - Wenliang Zhao
- NHC Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 110730, China
| | - Liguo Liu
- NHC Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 110730, China
| | - Shibin Bai
- College of Animal Science and Veterinary Medicine, Shenyang Agricultural University, Shenyang 110866, China
| | - Jie Dong
- NHC Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 110730, China
| | - Lilian Sun
- NHC Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 110730, China
| | - Yafang Zhu
- NHC Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 110730, China
| | - Siyu Zhou
- NHC Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 110730, China
| | - Yiping Song
- NHC Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 110730, China
| | - Hongtao Sui
- NHC Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 110730, China
| | - Jian Yang
- NHC Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 110730, China
| | - Jianwei Wang
- NHC Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 110730, China
| | - Shuyi Zhang
- College of Animal Science and Veterinary Medicine, Shenyang Agricultural University, Shenyang 110866, China
| | - Zhaohui Qian
- NHC Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 110730, China
| | - Qi Jin
- NHC Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 110730, China
| |
Collapse
|
21
|
Chattopadhyay S, Gisselsson D. Modelling evolution at the boundaries of solid tumours. Nat Ecol Evol 2023; 7:497-498. [PMID: 36894663 DOI: 10.1038/s41559-023-01996-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/11/2023]
Affiliation(s)
- Subhayan Chattopadhyay
- Division of Clinical Genetics, Department of Laboratory Medicine, Lund University, Lund, Sweden.
| | - David Gisselsson
- Division of Clinical Genetics, Department of Laboratory Medicine, Lund University, Lund, Sweden
- Division of Oncology and Pathology, Department of Clinical Sciences, Lund University, Lund, Sweden
| |
Collapse
|
22
|
Zhang Z, Zhou J, Ni P, Hu B, Jolicoeur N, Deng S, Xiao Q, He Q, Li G, Xia Y, Liu M, Wang C, Fang Z, Xia N, Zhang ZR, Zhang B, Cai K, Xu Y, Liu B. PF-D-Trimer, a protective SARS-CoV-2 subunit vaccine: immunogenicity and application. NPJ Vaccines 2023; 8:38. [PMID: 36922524 PMCID: PMC10015519 DOI: 10.1038/s41541-023-00636-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2022] [Accepted: 02/27/2023] [Indexed: 03/18/2023] Open
Abstract
The COVID-19 pandemic, caused by the SARS-CoV-2 virus, has had and continues to have a significant impact on global public health. One of the characteristics of SARS-CoV-2 is a surface homotrimeric spike protein, which is primarily responsible for the host immune response upon infection. Here we present the preclinical studies of a broadly protective SARS-CoV-2 subunit vaccine developed from our trimer domain platform using the Delta spike protein, from antigen design through purification, vaccine evaluation and manufacturability. The pre-fusion trimerized Delta spike protein, PF-D-Trimer, was highly expressed in Chinese hamster ovary (CHO) cells, purified by a rapid one-step anti-Trimer Domain monoclonal antibody immunoaffinity process and prepared as a vaccine formulation with an adjuvant. Immunogenicity studies have shown that this vaccine candidate induces robust immune responses in mouse, rat and Syrian hamster models. It also protects K18-hACE2 transgenic mice in a homologous viral challenge. Neutralizing antibodies induced by this vaccine show cross-reactivity against the ancestral WA1, Delta and several Omicrons, including BA.5.2. The formulated PF-D Trimer is stable for up to six months without refrigeration. The Trimer Domain platform was proven to be a key technology in the rapid production of PF-D-Trimer vaccine and may be crucial to accelerate the development and accessibility of updated versions of SARS-CoV-2 vaccines.
Collapse
Affiliation(s)
- Zhihao Zhang
- National "111" Center for Cellular Regulation and Molecular Pharmaceutics, Key Laboratory of Fermentation Engineering (Ministry of Education), Hubei Provincial Cooperative Innovation Center of Industrial Fermentation, Hubei Key Laboratory of Industrial Microbiology, Hubei University of Technology, Wuhan, Hubei, China.,Wuhan Binhui Biopharmaceutical Co., Ltd, Wuhan, China
| | - Jinhu Zhou
- Wuhan Binhui Biopharmaceutical Co., Ltd, Wuhan, China
| | - Peng Ni
- Wuhan Binhui Biopharmaceutical Co., Ltd, Wuhan, China
| | - Bing Hu
- Institute of Health Inspection and Testing, Hubei Provincial Centre for Disease Control and Prevention (Hubei CDC), Wuhan, Hubei, China
| | | | - Shuang Deng
- Nova Biologiques Inc. Montréal, Québec, Canada
| | - Qian Xiao
- Wuhan Binhui Biopharmaceutical Co., Ltd, Wuhan, China
| | - Qian He
- Wuhan Binhui Biopharmaceutical Co., Ltd, Wuhan, China.,School of Pharmacy, Hubei University of Science and Technology, Xian Ning, China
| | - Gai Li
- Nova Biologiques Inc. Montréal, Québec, Canada
| | - Yan Xia
- Nova Biologiques Inc. Montréal, Québec, Canada
| | - Mei Liu
- National "111" Center for Cellular Regulation and Molecular Pharmaceutics, Key Laboratory of Fermentation Engineering (Ministry of Education), Hubei Provincial Cooperative Innovation Center of Industrial Fermentation, Hubei Key Laboratory of Industrial Microbiology, Hubei University of Technology, Wuhan, Hubei, China.,Wuhan Binhui Biopharmaceutical Co., Ltd, Wuhan, China
| | - Cong Wang
- Wuhan Binhui Biopharmaceutical Co., Ltd, Wuhan, China
| | - Zhizheng Fang
- Wuhan Binhui Biopharmaceutical Co., Ltd, Wuhan, China
| | - Nan Xia
- Wuhan Binhui Biopharmaceutical Co., Ltd, Wuhan, China
| | - Zhe-Rui Zhang
- Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, China
| | - Bo Zhang
- Wuhan Binhui Biopharmaceutical Co., Ltd, Wuhan, China. .,Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, China.
| | - Kun Cai
- Institute of Health Inspection and Testing, Hubei Provincial Centre for Disease Control and Prevention (Hubei CDC), Wuhan, Hubei, China.
| | - Yan Xu
- Wuhan Binhui Biopharmaceutical Co., Ltd, Wuhan, China. .,Nova Biologiques Inc. Montréal, Québec, Canada.
| | - Binlei Liu
- National "111" Center for Cellular Regulation and Molecular Pharmaceutics, Key Laboratory of Fermentation Engineering (Ministry of Education), Hubei Provincial Cooperative Innovation Center of Industrial Fermentation, Hubei Key Laboratory of Industrial Microbiology, Hubei University of Technology, Wuhan, Hubei, China. .,Wuhan Binhui Biopharmaceutical Co., Ltd, Wuhan, China. .,School of Pharmacy, Hubei University of Science and Technology, Xian Ning, China.
| |
Collapse
|
23
|
Zhao LP, Cohen S, Zhao M, Madeleine M, Payne TH, Lybrand TP, Geraghty DE, Jerome KR, Corey L. Using Haplotype-Based Artificial Intelligence to Evaluate SARS-CoV-2 Novel Variants and Mutations. JAMA Netw Open 2023; 6:e230191. [PMID: 36809468 PMCID: PMC9945077 DOI: 10.1001/jamanetworkopen.2023.0191] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Accepted: 01/05/2023] [Indexed: 02/23/2023] Open
Abstract
Importance Earlier detection of emerging novel SARS-COV-2 variants is important for public health surveillance of potential viral threats and for earlier prevention research. Artificial intelligence may facilitate early detection of SARS-CoV2 emerging novel variants based on variant-specific mutation haplotypes and, in turn, be associated with enhanced implementation of risk-stratified public health prevention strategies. Objective To develop a haplotype-based artificial intelligence (HAI) model for identifying novel variants, including mixture variants (MVs) of known variants and new variants with novel mutations. Design, Setting, and Participants This cross-sectional study used serially observed viral genomic sequences globally (prior to March 14, 2022) to train and validate the HAI model and used it to identify variants arising from a prospective set of viruses from March 15 to May 18, 2022. Main Outcomes and Measures Viral sequences, collection dates, and locations were subjected to statistical learning analysis to estimate variant-specific core mutations and haplotype frequencies, which were then used to construct an HAI model to identify novel variants. Results Through training on more than 5 million viral sequences, an HAI model was built, and its identification performance was validated on an independent validation set of more than 5 million viruses. Its identification performance was assessed on a prospective set of 344 901 viruses. In addition to achieving an accuracy of 92.8% (95% CI within 0.1%), the HAI model identified 4 Omicron MVs (Omicron-Alpha, Omicron-Delta, Omicron-Epsilon, and Omicron-Zeta), 2 Delta MVs (Delta-Kappa and Delta-Zeta), and 1 Alpha-Epsilon MV, among which Omicron-Epsilon MVs were most frequent (609/657 MVs [92.7%]). Furthermore, the HAI model found that 1699 Omicron viruses had unidentifiable variants given that these variants acquired novel mutations. Lastly, 524 variant-unassigned and variant-unidentifiable viruses carried 16 novel mutations, 8 of which were increasing in prevalence percentages as of May 2022. Conclusions and Relevance In this cross-sectional study, an HAI model found SARS-COV-2 viruses with MV or novel mutations in the global population, which may require closer examination and monitoring. These results suggest that HAI may complement phylogenic variant assignment, providing additional insights into emerging novel variants in the population.
Collapse
Affiliation(s)
- Lue Ping Zhao
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington
| | - Seth Cohen
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington
- Department of Medicine, University of Washington School of Medicine, Seattle
| | | | - Margaret Madeleine
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington
| | - Thomas H. Payne
- Department of Medicine, University of Washington School of Medicine, Seattle
| | - Terry P. Lybrand
- Quintepa Computing LLC, Nashville, Tennessee
- Department of Chemistry, Vanderbilt University; Nashville, Tennessee
| | - Daniel E. Geraghty
- Clinical Research Division, Fred Hutchinson Cancer Research Center, Seattle, Washington
| | - Keith R. Jerome
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington
- Department of Medicine, University of Washington School of Medicine, Seattle
| | - Lawrence Corey
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington
- Department of Medicine, University of Washington School of Medicine, Seattle
| |
Collapse
|
24
|
Correlated substitutions reveal SARS-like coronaviruses recombine frequently with a diverse set of structured gene pools. Proc Natl Acad Sci U S A 2023; 120:e2206945119. [PMID: 36693089 PMCID: PMC9945976 DOI: 10.1073/pnas.2206945119] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
Quantifying SARS-like coronavirus (SL-CoV) evolution is critical to understanding the origins of SARS-CoV-2 and the molecular processes that could underlie future epidemic viruses. While genomic analyses suggest recombination was a factor in the emergence of SARS-CoV-2, few studies have quantified recombination rates among SL-CoVs. Here, we infer recombination rates of SL-CoVs from correlated substitutions in sequencing data using a coalescent model with recombination. Our computationally-efficient, non-phylogenetic method infers recombination parameters of both sampled sequences and the unsampled gene pools with which they recombine. We apply this approach to infer recombination parameters for a range of positive-sense RNA viruses. We then analyze a set of 191 SL-CoV sequences (including SARS-CoV-2) and find that ORF1ab and S genes frequently undergo recombination. We identify which SL-CoV sequence clusters have recombined with shared gene pools, and show that these pools have distinct structures and high recombination rates, with multiple recombination events occurring per synonymous substitution. We find that individual genes have recombined with different viral reservoirs. By decoupling contributions from mutation and recombination, we recover the phylogeny of non-recombined portions for many of these SL-CoVs, including the position of SARS-CoV-2 in this clonal phylogeny. Lastly, by analyzing >400,000 SARS-CoV-2 whole genome sequences, we show current diversity levels are insufficient to infer the within-population recombination rate of the virus since the pandemic began. Our work offers new methods for inferring recombination rates in RNA viruses with implications for understanding recombination in SARS-CoV-2 evolution and the structure of clonal relationships and gene pools shaping its origins.
Collapse
|
25
|
It takes a village to build a virus. Proc Natl Acad Sci U S A 2023; 120:e2219052120. [PMID: 36701364 PMCID: PMC9945952 DOI: 10.1073/pnas.2219052120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
|
26
|
Zhou ZJ, Yang CH, Ye SB, Yu XW, Qiu Y, Ge XY. VirusRecom: an information-theory-based method for recombination detection of viral lineages and its application on SARS-CoV-2. Brief Bioinform 2023; 24:6886420. [PMID: 36567622 DOI: 10.1093/bib/bbac513] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Revised: 10/08/2022] [Accepted: 10/27/2022] [Indexed: 12/27/2022] Open
Abstract
Genomic recombination is an important driving force for viral evolution, and recombination events have been reported for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) during the Coronavirus Disease 2019 pandemic, which significantly alter viral infectivity and transmissibility. However, it is difficult to identify viral recombination, especially for low-divergence viruses such as SARS-CoV-2, since it is hard to distinguish recombination from in situ mutation. Herein, we applied information theory to viral recombination analysis and developed VirusRecom, a program for efficiently screening recombination events on viral genome. In principle, we considered a recombination event as a transmission process of ``information'' and introduced weighted information content (WIC) to quantify the contribution of recombination to a certain region on viral genome; then, we identified the recombination regions by comparing WICs of different regions. In the benchmark using simulated data, VirusRecom showed a good balance between precision and recall compared to two competing tools, RDP5 and 3SEQ. In the detection of SARS-CoV-2 XE, XD and XF recombinants, VirusRecom providing more accurate positions of recombination regions than RDP5 and 3SEQ. In addition, we encapsulated the VirusRecom program into a command-line-interface software for convenient operation by users. In summary, we developed a novel approach based on information theory to identify viral recombination within highly similar sequences, providing a useful tool for monitoring viral evolution and epidemic control.
Collapse
Affiliation(s)
- Zhi-Jian Zhou
- Hunan Provincial Key Laboratory of Medical Virology, Institute of Pathogen Biology and Immunology, College of Biology, Hunan University, 27 Tianma Rd., Changsha, Hunan, 410012, China
| | - Chen-Hui Yang
- Hunan Provincial Key Laboratory of Medical Virology, Institute of Pathogen Biology and Immunology, College of Biology, Hunan University, 27 Tianma Rd., Changsha, Hunan, 410012, China
| | - Sheng-Bao Ye
- Hunan Provincial Key Laboratory of Medical Virology, Institute of Pathogen Biology and Immunology, College of Biology, Hunan University, 27 Tianma Rd., Changsha, Hunan, 410012, China
| | - Xiao-Wei Yu
- Hunan Provincial Key Laboratory of Medical Virology, Institute of Pathogen Biology and Immunology, College of Biology, Hunan University, 27 Tianma Rd., Changsha, Hunan, 410012, China.,Hunan Prevention and Treatment Institute for Occupational Diseases, 162 Xinjian W. Rd., Changsha, Hunan, 410000, China
| | - Ye Qiu
- Hunan Provincial Key Laboratory of Medical Virology, Institute of Pathogen Biology and Immunology, College of Biology, Hunan University, 27 Tianma Rd., Changsha, Hunan, 410012, China
| | - Xing-Yi Ge
- Hunan Provincial Key Laboratory of Medical Virology, Institute of Pathogen Biology and Immunology, College of Biology, Hunan University, 27 Tianma Rd., Changsha, Hunan, 410012, China
| |
Collapse
|