1
|
Celentano M, DeWitt WS, Prillo S, Song YS. Exact and efficient phylodynamic simulation from arbitrarily large populations. Proc Natl Acad Sci U S A 2025; 122:e2412978122. [PMID: 40366686 DOI: 10.1073/pnas.2412978122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2024] [Accepted: 04/15/2025] [Indexed: 05/15/2025] Open
Abstract
Many biological studies involve inferring the evolutionary history of a sample of individuals from a large population and interpreting the reconstructed tree. Such an ascertained tree typically represents only a small part of a comprehensive population tree and is distorted by survivorship and sampling biases. Inferring evolutionary parameters from ascertained trees requires modeling both the underlying population dynamics and the ascertainment process. A crucial component of this phylodynamic modeling involves tree simulation, which is used to benchmark probabilistic inference methods. To simulate an ascertained tree, one must first simulate the full population tree and then prune unobserved lineages. Consequently, the computational cost is determined not by the size of the final simulated tree, but by the size of the population tree in which it is embedded. In most biological scenarios, simulations of the entire population are prohibitively expensive due to computational demands placed on lineages without sampled descendants. Here, we address this challenge by proving that, for any partially ascertained process from a general multitype birth-death-mutation-sampling model, there exists an equivalent process with complete sampling and no death, a property which we leverage to develop a highly efficient algorithm for simulating trees. Our algorithm scales linearly with the size of the final simulated tree and is independent of the population size, enabling simulations from extremely large populations beyond the reach of current methods but essential for various biological applications. We anticipate that this massive speedup will significantly advance the development of novel inference methods that require extensive training data.
Collapse
Affiliation(s)
- Michael Celentano
- Department of Statistics, University of California, Berkeley, CA 94720
| | - William S DeWitt
- Computer Science Division, University of California, Berkeley, CA 94720
| | - Sebastian Prillo
- Computer Science Division, University of California, Berkeley, CA 94720
| | - Yun S Song
- Department of Statistics, University of California, Berkeley, CA 94720
- Computer Science Division, University of California, Berkeley, CA 94720
| |
Collapse
|
2
|
Rouzine IM. Evolutionary Mechanisms of the Emergence of the Variants of Concern of SARS-CoV-2. Viruses 2025; 17:197. [PMID: 40006952 PMCID: PMC11861269 DOI: 10.3390/v17020197] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2025] [Revised: 01/21/2025] [Accepted: 01/29/2025] [Indexed: 02/27/2025] Open
Abstract
The evolutionary origin of the variants of concern (VOCs) of SARS-CoV-2, characterized by a large number of new substitutions and strong changes in virulence and transmission rate, is intensely debated. The leading explanation in the literature is a chronic infection in immunocompromised individuals, where the virus evolves before returning into the main population. The present article reviews less-investigated hypotheses of VOC emergence with transmission between acutely infected hosts, with a focus on the mathematical models of stochastic evolution that have proved to be useful for other viruses, such as HIV and influenza virus. The central message is that understanding the acting factors of VOC evolution requires the framework of stochastic multi-locus evolution models, and that alternative hypotheses can be effectively verified by fitting results of computer simulation to empirical data.
Collapse
Affiliation(s)
- Igor M Rouzine
- Sechenov Institute of Evolutionary Physiology and Biochemistry, Russian Academy of Sciences, St. Petersburg 194223, Russia
| |
Collapse
|
3
|
Lefrancq N, Duret L, Bouchez V, Brisse S, Parkhill J, Salje H. Learning the fitness dynamics of pathogens from phylogenies. Nature 2025; 637:683-690. [PMID: 39743587 PMCID: PMC11735385 DOI: 10.1038/s41586-024-08309-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Accepted: 10/30/2024] [Indexed: 01/04/2025]
Abstract
The dynamics of the genetic diversity of pathogens, including the emergence of lineages with increased fitness, is a foundational concept of disease ecology with key public-health implications. However, the identification of such lineages and estimation of associated fitness remain challenging, and is rarely done outside densely sampled systems1,2. Here we present phylowave, a scalable approach that summarizes changes in population composition in phylogenetic trees, enabling the automatic detection of lineages based on shared fitness and evolutionary relationships. We use our approach on a broad set of viruses and bacteria (SARS-CoV-2, influenza A subtype H3N2, Bordetella pertussis and Mycobacterium tuberculosis), which include both well-studied and understudied threats to human health. We show that phylowave recovers the main known circulating lineages for each pathogen and that it can detect specific amino acid changes linked to fitness changes. Furthermore, phylowave identifies previously undetected lineages with increased fitness, including three co-circulating B. pertussis lineages. Inference using phylowave is robust to uneven and limited observations. This widely applicable approach provides an avenue to monitor evolution in real time to support public-health action and explore fundamental drivers of pathogen fitness.
Collapse
Affiliation(s)
- Noémie Lefrancq
- Department of Genetics, University of Cambridge, Cambridge, UK.
- Department of Veterinary Medicine, University of Cambridge, Cambridge, UK.
- Department of Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland.
| | - Loréna Duret
- Department of Genetics, University of Cambridge, Cambridge, UK
| | - Valérie Bouchez
- Biodiversity and Epidemiology of Bacterial Pathogens, Institut Pasteur, Université de Paris, Paris, France
- National Reference Center for Whooping Cough and Other Bordetella Infections, Paris, France
| | - Sylvain Brisse
- Biodiversity and Epidemiology of Bacterial Pathogens, Institut Pasteur, Université de Paris, Paris, France
- National Reference Center for Whooping Cough and Other Bordetella Infections, Paris, France
| | - Julian Parkhill
- Department of Veterinary Medicine, University of Cambridge, Cambridge, UK
| | - Henrik Salje
- Department of Genetics, University of Cambridge, Cambridge, UK
| |
Collapse
|
4
|
Meijers M, Ruchnewitz D, Eberhardt J, Karmakar M, Łuksza M, Lässig M. Concepts and Methods for Predicting Viral Evolution. Methods Mol Biol 2025; 2890:253-290. [PMID: 39890732 DOI: 10.1007/978-1-0716-4326-6_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2025]
Abstract
The seasonal human influenza virus undergoes rapid evolution, leading to significant changes in circulating viral strains from year to year. These changes are typically driven by adaptive mutations, particularly in the antigenic epitopes, the regions of the viral surface protein hemagglutinin targeted by human antibodies. Here, we describe a consistent set of methods for data-driven predictive analysis of viral evolution. Our pipeline integrates four types of data: (1) sequence data of viral isolates collected on a worldwide scale, (2) epidemiological data on incidences, (3) antigenic characterization of circulating viruses, and (4) intrinsic viral phenotypes. From the combined analysis of these data, we obtain estimates of relative fitness for circulating strains and predictions of clade frequencies for periods of up to 1 year. Furthermore, we obtain comparative estimates of protection against future viral populations for candidate vaccine strains, providing a basis for pre-emptive vaccine strain selection. Continuously updated predictions obtained from the prediction pipeline for influenza and SARS-CoV-2 are available at https://previr.app .
Collapse
Affiliation(s)
- Matthijs Meijers
- Institute for Biological Physics, University of Cologne, Köln, Germany
| | - Denis Ruchnewitz
- Institute for Biological Physics, University of Cologne, Köln, Germany
| | - Jan Eberhardt
- Institute for Biological Physics, University of Cologne, Köln, Germany
| | - Malancha Karmakar
- Institute for Biological Physics, University of Cologne, Köln, Germany
| | - Marta Łuksza
- Departments of Oncological Sciences and Genetics and Genomic Sciences, Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
| | - Michael Lässig
- Institute for Biological Physics, University of Cologne, Köln, Germany.
| |
Collapse
|
5
|
Barrat-Charlaix P, Neher RA. Eco-evolutionary dynamics of adapting pathogens and host immunity. eLife 2024; 13:RP97350. [PMID: 39728926 DOI: 10.7554/elife.97350] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2024] Open
Abstract
As pathogens spread in a population of hosts, immunity is built up, and the pool of susceptible individuals are depleted. This generates selective pressure, to which many human RNA viruses, such as influenza virus or SARS-CoV-2, respond with rapid antigenic evolution and frequent emergence of immune evasive variants. However, the host's immune systems adapt, and older immune responses wane, such that escape variants only enjoy a growth advantage for a limited time. If variant growth dynamics and reshaping of host-immunity operate on comparable time scales, viral adaptation is determined by eco-evolutionary interactions that are not captured by models of rapid evolution in a fixed environment. Here, we use a Susceptible/Infected model to describe the interaction between an evolving viral population in a dynamic but immunologically diverse host population. We show that depending on strain cross-immunity, heterogeneity of the host population, and durability of immune responses, escape variants initially grow exponentially, but lose their growth advantage before reaching high frequencies. Their subsequent dynamics follows an anomalous random walk determined by future escape variants and results in variant trajectories that are unpredictable. This model can explain the apparent contradiction between the clearly adaptive nature of antigenic evolution and the quasi-neutral dynamics of high-frequency variants observed for influenza viruses.
Collapse
Affiliation(s)
- Pierre Barrat-Charlaix
- Biozentrum, Universität Basel, Basel, Switzerland
- Swiss Institute of Bioinformatics, Basel, Switzerland
- DISAT, Politecnico di Torino, Torino, Italy
| | - Richard A Neher
- Biozentrum, Universität Basel, Basel, Switzerland
- Swiss Institute of Bioinformatics, Basel, Switzerland
| |
Collapse
|
6
|
He D, Wang X, Wu H, Cai K, Song X, Wang X, Hu J, Hu S, Liu X, Ding C, Peng D, Su S, Gu M, Liu X. Characterization of Conserved Evolution in H7N9 Avian Influenza Virus Prior Mass Vaccination. Virulence 2024; 15:2395837. [PMID: 39240070 PMCID: PMC11382709 DOI: 10.1080/21505594.2024.2395837] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2024] [Revised: 07/09/2024] [Accepted: 07/30/2024] [Indexed: 09/07/2024] Open
Abstract
Vaccination is crucial for the prevention and mitigation of avian influenza infections in China. The inactivated H7N9 vaccine, when administered to poultry, significantly lowers the risk of infection among both poultry and humans, while also markedly decreasing the prevalence of H7N9 detections. Highly pathogenic (HP) H7N9 viruses occasionally appear, whereas their low pathogenicity (LP) counterparts have been scarcely detected since 2018. However, these contributing factors remain poorly understood. We conducted an exploratory investigation of the mechanics via the application of comprehensive bioinformatic approaches. We delineated the Yangtze River Delta (YRD) H7N9 lineage into 5 clades (YRD-A to E). Our findings highlight the emergence and peak occurrence of the LP H7N9-containing YRD-E clade during the 5th epidemic wave in China's primary poultry farming areas. A more effective control of LP H7N9 through vaccination was observed compared to that of its HP H7N9 counterpart. YRD-E exhibited a tardy evolutionary trajectory, denoted by the conservation of its genetic and antigenic variation. Our analysis of YRD-E revealed only minimal amino acid substitutions along its phylogenetic tree and a few selective sweep mutations since 2016. In terms of epidemic fitness, the YRD-E was measured to be lower than that of the HP variants. Collectively, these findings underscore the conserved evolutionary patterns distinguishing the YRD-E. Given the conservation presented in its evolutionary patterns, the YRD-E LP H7N9 is hypothesized to be associated with a reduction following the mass vaccination in a relatively short period owing to its lower probability of antigenic variation that might affect vaccine efficiency.
Collapse
Affiliation(s)
- Dongchang He
- Animal Infectious Disease Laboratory, College of Veterinary Medicine, Yangzhou University, Yangzhou, Jiangsu, China
- College of Veterinary Medicine, Jiangsu Agri-animal Husbandry Vocational College, Taizhou, China
| | - Xiyue Wang
- Animal Infectious Disease Laboratory, College of Veterinary Medicine, Yangzhou University, Yangzhou, Jiangsu, China
| | - Huiguang Wu
- Animal Infectious Disease Laboratory, College of Veterinary Medicine, Yangzhou University, Yangzhou, Jiangsu, China
| | - Kairui Cai
- Animal Infectious Disease Laboratory, College of Veterinary Medicine, Yangzhou University, Yangzhou, Jiangsu, China
| | - Xiaoli Song
- Animal Epidemic Prevention Office, Jiangsu Provincial Animal Disease Control Center, Nanjing, Jiangsu, China
| | - Xiaoquan Wang
- Animal Infectious Disease Laboratory, College of Veterinary Medicine, Yangzhou University, Yangzhou, Jiangsu, China
- Jiangsu Co-innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonosis, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Zoonosis, Yangzhou University, Yangzhou, China
| | - Jiao Hu
- Animal Infectious Disease Laboratory, College of Veterinary Medicine, Yangzhou University, Yangzhou, Jiangsu, China
- Jiangsu Co-innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonosis, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Zoonosis, Yangzhou University, Yangzhou, China
| | - Shunlin Hu
- Animal Infectious Disease Laboratory, College of Veterinary Medicine, Yangzhou University, Yangzhou, Jiangsu, China
- Jiangsu Co-innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonosis, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Zoonosis, Yangzhou University, Yangzhou, China
| | - Xiaowen Liu
- Animal Infectious Disease Laboratory, College of Veterinary Medicine, Yangzhou University, Yangzhou, Jiangsu, China
- Jiangsu Co-innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonosis, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Zoonosis, Yangzhou University, Yangzhou, China
| | - Chan Ding
- Jiangsu Co-innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonosis, Yangzhou University, Yangzhou, China
- Department of Avian Diseases, Shanghai Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Shanghai, China
| | - Daxin Peng
- Animal Infectious Disease Laboratory, College of Veterinary Medicine, Yangzhou University, Yangzhou, Jiangsu, China
- Jiangsu Co-innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonosis, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Zoonosis, Yangzhou University, Yangzhou, China
| | - Shuo Su
- College of Veterinary Medicine, Nanjing Agricultural University, Nanjing, China
| | - Min Gu
- Animal Infectious Disease Laboratory, College of Veterinary Medicine, Yangzhou University, Yangzhou, Jiangsu, China
- Jiangsu Co-innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonosis, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Zoonosis, Yangzhou University, Yangzhou, China
| | - Xiufan Liu
- Animal Infectious Disease Laboratory, College of Veterinary Medicine, Yangzhou University, Yangzhou, Jiangsu, China
- Jiangsu Co-innovation Center for Prevention and Control of Important Animal Infectious Diseases and Zoonosis, Yangzhou University, Yangzhou, China
- Jiangsu Key Laboratory of Zoonosis, Yangzhou University, Yangzhou, China
| |
Collapse
|
7
|
Meijers M, Ruchnewitz D, Eberhardt J, Karmakar M, Łuksza M, Lässig M. Concepts and methods for predicting viral evolution. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.19.585703. [PMID: 38746108 PMCID: PMC11092427 DOI: 10.1101/2024.03.19.585703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
The seasonal human influenza virus undergoes rapid evolution, leading to significant changes in circulating viral strains from year to year. These changes are typically driven by adaptive mutations, particularly in the antigenic epitopes, the regions of the viral surface protein haemagglutinin targeted by human antibodies. Here we describe a consistent set of methods for data-driven predictive analysis of viral evolution. Our pipeline integrates four types of data: (1) sequence data of viral isolates collected on a worldwide scale, (2) epidemiological data on incidences, (3) antigenic characterization of circulating viruses, and (4) intrinsic viral phenotypes. From the combined analysis of these data, we obtain estimates of relative fitness for circulating strains and predictions of clade frequencies for periods of up to one year. Furthermore, we obtain comparative estimates of protection against future viral populations for candidate vaccine strains, providing a basis for pre-emptive vaccine strain selection. Continuously updated predictions obtained from the prediction pipeline for influenza and SARS-CoV-2 are available on the website previr.app.
Collapse
Affiliation(s)
- Matthijs Meijers
- Institute for Biological Physics, University of Cologne, Zülpicherstr. 77, 50937, Köln, Germany
| | - Denis Ruchnewitz
- Institute for Biological Physics, University of Cologne, Zülpicherstr. 77, 50937, Köln, Germany
| | - Jan Eberhardt
- Institute for Biological Physics, University of Cologne, Zülpicherstr. 77, 50937, Köln, Germany
| | - Malancha Karmakar
- Institute for Biological Physics, University of Cologne, Zülpicherstr. 77, 50937, Köln, Germany
| | - Marta Łuksza
- Tisch Cancer Institute, Departments of Oncological Sciences and Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Michael Lässig
- Institute for Biological Physics, University of Cologne, Zülpicherstr. 77, 50937, Köln, Germany
| |
Collapse
|
8
|
Meijers M, Ruchnewitz D, Eberhardt J, Karmakar M, Luksza M, Lässig M. Concepts and methods for predicting viral evolution. ARXIV 2024:arXiv:2403.12684v3. [PMID: 38745695 PMCID: PMC11092678] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
The seasonal human influenza virus undergoes rapid evolution, leading to significant changes in circulating viral strains from year to year. These changes are typically driven by adaptive mutations, particularly in the antigenic epitopes, the regions of the viral surface protein haemagglutinin targeted by human antibodies. Here we describe a consistent set of methods for data-driven predictive analysis of viral evolution. Our pipeline integrates four types of data: (1) sequence data of viral isolates collected on a worldwide scale, (2) epidemiological data on incidences, (3) antigenic characterization of circulating viruses, and (4) intrinsic viral phenotypes. From the combined analysis of these data, we obtain estimates of relative fitness for circulating strains and predictions of clade frequencies for periods of up to one year. Furthermore, we obtain comparative estimates of protection against future viral populations for candidate vaccine strains, providing a basis for pre-emptive vaccine strain selection. Continuously updated predictions obtained from the prediction pipeline for influenza and SARS-CoV-2 are available on the website previr.app.
Collapse
Affiliation(s)
- Matthijs Meijers
- Institute for Biological Physics, University of Cologne, Zülpicherstr. 77, 50937, Köln, Germany
| | - Denis Ruchnewitz
- Institute for Biological Physics, University of Cologne, Zülpicherstr. 77, 50937, Köln, Germany
| | - Jan Eberhardt
- Institute for Biological Physics, University of Cologne, Zülpicherstr. 77, 50937, Köln, Germany
| | - Malancha Karmakar
- Institute for Biological Physics, University of Cologne, Zülpicherstr. 77, 50937, Köln, Germany
| | - Marta Luksza
- Tisch Cancer Institute, Departments of Oncological Sciences and Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Michael Lässig
- Institute for Biological Physics, University of Cologne, Zülpicherstr. 77, 50937, Köln, Germany
| |
Collapse
|
9
|
Zeng HL, Yang CL, Jing B, Barton J, Aurell E. Two fitness inference schemes compared using allele frequencies from 1068 391 sequences sampled in the UK during the COVID-19 pandemic. Phys Biol 2024; 22:016003. [PMID: 39536448 DOI: 10.1088/1478-3975/ad9213] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2024] [Accepted: 11/13/2024] [Indexed: 11/16/2024]
Abstract
Throughout the course of the SARS-CoV-2 pandemic, genetic variation has contributed to the spread and persistence of the virus. For example, various mutations have allowed SARS-CoV-2 to escape antibody neutralization or to bind more strongly to the receptors that it uses to enter human cells. Here, we compared two methods that estimate the fitness effects of viral mutations using the abundant sequence data gathered over the course of the pandemic. Both approaches are grounded in population genetics theory but with different assumptions. One approach, tQLE, features an epistatic fitness landscape and assumes that alleles are nearly in linkage equilibrium. Another approach, MPL, assumes a simple, additive fitness landscape, but allows for any level of correlation between alleles. We characterized differences in the distributions of fitness values inferred by each approach and in the ranks of fitness values that they assign to sequences across time. We find that in a large fraction of weeks the two methods are in good agreement as to their top-ranked sequences, i.e. as to which sequences observed that week are most fit. We also find that agreement between the ranking of sequences varies with genetic unimodality in the population in a given week.
Collapse
Affiliation(s)
- Hong-Li Zeng
- School of Science, Nanjing University of Posts and Telecommunications, Key Laboratory of Radio and Micro-Nano Electronics of Jiangsu Province, Nanjing 210023, People's Republic of China
| | - Cheng-Long Yang
- School of Science, Nanjing University of Posts and Telecommunications, Key Laboratory of Radio and Micro-Nano Electronics of Jiangsu Province, Nanjing 210023, People's Republic of China
| | - Bo Jing
- School of Science, Nanjing University of Posts and Telecommunications, Key Laboratory of Radio and Micro-Nano Electronics of Jiangsu Province, Nanjing 210023, People's Republic of China
| | - John Barton
- Department of Computational & Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15260, United States of America
| | - Erik Aurell
- Department of Computational Science and Technology, AlbaNova University Center, SE-106 91 Stockholm, Sweden
| |
Collapse
|
10
|
Jones MG, Sun D, Min KH(J, Colgan WN, Tian L, Weir JA, Chen VZ, Koblan LW, Yost KE, Mathey-Andrews N, Russell AJ, Stickels RR, Balderrama KS, Rideout WM, Chang HY, Jacks T, Chen F, Weissman JS, Yosef N, Yang D. Spatiotemporal lineage tracing reveals the dynamic spatial architecture of tumor growth and metastasis. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.21.619529. [PMID: 39484491 PMCID: PMC11526908 DOI: 10.1101/2024.10.21.619529] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/03/2024]
Abstract
Tumor progression is driven by dynamic interactions between cancer cells and their surrounding microenvironment. Investigating the spatiotemporal evolution of tumors can provide crucial insights into how intrinsic changes within cancer cells and extrinsic alterations in the microenvironment cooperate to drive different stages of tumor progression. Here, we integrate high-resolution spatial transcriptomics and evolving lineage tracing technologies to elucidate how tumor expansion, plasticity, and metastasis co-evolve with microenvironmental remodeling in a Kras;p53-driven mouse model of lung adenocarcinoma. We find that rapid tumor expansion contributes to a hypoxic, immunosuppressive, and fibrotic microenvironment that is associated with the emergence of pro-metastatic cancer cell states. Furthermore, metastases arise from spatially-confined subclones of primary tumors and remodel the distant metastatic niche into a fibrotic, collagen-rich microenvironment. Together, we present a comprehensive dataset integrating spatial assays and lineage tracing to elucidate how sequential changes in cancer cell state and microenvironmental structures cooperate to promote tumor progression.
Collapse
Affiliation(s)
- Matthew G. Jones
- Center for Personal Dynamic Regulomes, Stanford University, Stanford, CA, USA
- These authors contributed equally
| | - Dawei Sun
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA
- These authors contributed equally
| | - Kyung Hoi (Joseph) Min
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - William N. Colgan
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Luyi Tian
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Jackson A. Weir
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Biological and Biomedical Sciences Program, Harvard University, Cambridge, MA, USA
| | - Victor Z. Chen
- Department of Molecular Pharmacology and Therapeutics, Columbia University, New York City, NY, USA
- Department of Systems Biology, Columbia University, New York City, NY, USA
| | - Luke W. Koblan
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Kathryn E. Yost
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Nicolas Mathey-Andrews
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Andrew J.C. Russell
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA
| | | | | | - William M. Rideout
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Howard Y. Chang
- Center for Personal Dynamic Regulomes, Stanford University, Stanford, CA, USA
- Department of Genetics, Stanford University, Stanford, CA, USA
- Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA, USA
| | - Tyler Jacks
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Fei Chen
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA
| | - Jonathan S. Weissman
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA, USA
- Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Nir Yosef
- Department of Systems Immunology, Weizmann Institute of Science, 234 Herzl Street, Rehovot 7610001, Israel
| | - Dian Yang
- Department of Molecular Pharmacology and Therapeutics, Columbia University, New York City, NY, USA
- Department of Systems Biology, Columbia University, New York City, NY, USA
- Herbert Irving Comprehensive Cancer Center, Columbia University, New York City, NY, USA
- Lead Contact
| |
Collapse
|
11
|
Visani GM, Pun MN, Galvin W, Daniel E, Borisiak K, Wagura U, Nourmohammad A. HERMES: Holographic Equivariant neuRal network model for Mutational Effect and Stability prediction. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.09.602403. [PMID: 39026838 PMCID: PMC11257601 DOI: 10.1101/2024.07.09.602403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/20/2024]
Abstract
Predicting the stability and fitness effects of amino-acid mutations in proteins is a cornerstone of biological discovery and engineering. Various experimental techniques have been developed to measure mutational effects, providing us with extensive datasets across a diverse range of proteins. By training on these data, machine learning approaches have advanced significantly in predicting mutational effects. Here, we introduce HERMES, a 3D rotationally equivariant structure-based neural network model for mutation effect prediction. Pre-trained to predict amino-acid propensities from their surrounding 3D structure atomic environments, HERMES can be efficiently fine-tuned to predict mutational effects, thanks to its symmetry-aware parameterization of the output space. Benchmarking against other models demonstrates that HERMES often outperforms or matches their performance in predicting mutation effects on stability, binding, and fitness, using either computationally or experimentally resolved protein structures. HERMES offers a versatile suit of tools for evaluating mutation effects and can be easily fine-tuned for specific predictive objectives using our open-source code.
Collapse
Affiliation(s)
- Gian Marco Visani
- Paul G. Allen School of Computer Science and Engineering, University of Washington
| | | | - William Galvin
- Paul G. Allen School of Computer Science and Engineering, University of Washington
| | - Eric Daniel
- Paul G. Allen School of Computer Science and Engineering, University of Washington
| | | | | | - Armita Nourmohammad
- Department of Physics, Applied Math, and CSE, University of Washington, Fred Hutch Cancer Research Center, Seattle, WA
| |
Collapse
|
12
|
DeWitt WS, Evans SN, Hiesmayr E, Hummel S. Mean-field interacting multi-type birth-death processes with a view to applications in phylodynamics. Theor Popul Biol 2024; 159:1-12. [PMID: 39019333 DOI: 10.1016/j.tpb.2024.07.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Revised: 07/09/2024] [Accepted: 07/11/2024] [Indexed: 07/19/2024]
Abstract
Multi-type birth-death processes underlie approaches for inferring evolutionary dynamics from phylogenetic trees across biological scales, ranging from deep-time species macroevolution to rapid viral evolution and somatic cellular proliferation. A limitation of current phylogenetic birth-death models is that they require restrictive linearity assumptions that yield tractable message-passing likelihoods, but that also preclude interactions between individuals. Many fundamental evolutionary processes - such as environmental carrying capacity or frequency-dependent selection - entail interactions, and may strongly influence the dynamics in some systems. Here, we introduce a multi-type birth-death process in mean-field interaction with an ensemble of replicas of the focal process. We prove that, under quite general conditions, the ensemble's stochastically evolving interaction field converges to a deterministic trajectory in the limit of an infinite ensemble. In this limit, the replicas effectively decouple, and self-consistent interactions appear as nonlinearities in the infinitesimal generator of the focal process. We investigate a special case that is rich enough to model both carrying capacity and frequency-dependent selection while yielding tractable message-passing likelihoods in the context of a phylogenetic birth-death model.
Collapse
Affiliation(s)
- William S DeWitt
- Department of Electrical Engineering & Computer Sciences, University of California, Berkeley, United States of America.
| | - Steven N Evans
- Department of Statistics, University of California, Berkeley, United States of America.
| | - Ella Hiesmayr
- Department of Statistics, University of California, Berkeley, United States of America.
| | - Sebastian Hummel
- Department of Statistics, University of California, Berkeley, United States of America.
| |
Collapse
|
13
|
Perofsky AC, Huddleston J, Hansen CL, Barnes JR, Rowe T, Xu X, Kondor R, Wentworth DE, Lewis N, Whittaker L, Ermetal B, Harvey R, Galiano M, Daniels RS, McCauley JW, Fujisaki S, Nakamura K, Kishida N, Watanabe S, Hasegawa H, Sullivan SG, Barr IG, Subbarao K, Krammer F, Bedford T, Viboud C. Antigenic drift and subtype interference shape A(H3N2) epidemic dynamics in the United States. eLife 2024; 13:RP91849. [PMID: 39319780 PMCID: PMC11424097 DOI: 10.7554/elife.91849] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/26/2024] Open
Abstract
Influenza viruses continually evolve new antigenic variants, through mutations in epitopes of their major surface proteins, hemagglutinin (HA) and neuraminidase (NA). Antigenic drift potentiates the reinfection of previously infected individuals, but the contribution of this process to variability in annual epidemics is not well understood. Here, we link influenza A(H3N2) virus evolution to regional epidemic dynamics in the United States during 1997-2019. We integrate phenotypic measures of HA antigenic drift and sequence-based measures of HA and NA fitness to infer antigenic and genetic distances between viruses circulating in successive seasons. We estimate the magnitude, severity, timing, transmission rate, age-specific patterns, and subtype dominance of each regional outbreak and find that genetic distance based on broad sets of epitope sites is the strongest evolutionary predictor of A(H3N2) virus epidemiology. Increased HA and NA epitope distance between seasons correlates with larger, more intense epidemics, higher transmission, greater A(H3N2) subtype dominance, and a greater proportion of cases in adults relative to children, consistent with increased population susceptibility. Based on random forest models, A(H1N1) incidence impacts A(H3N2) epidemics to a greater extent than viral evolution, suggesting that subtype interference is a major driver of influenza A virus infection ynamics, presumably via heterosubtypic cross-immunity.
Collapse
MESH Headings
- Influenza A Virus, H3N2 Subtype/genetics
- Influenza A Virus, H3N2 Subtype/immunology
- United States/epidemiology
- Influenza, Human/epidemiology
- Influenza, Human/virology
- Influenza, Human/immunology
- Humans
- Hemagglutinin Glycoproteins, Influenza Virus/genetics
- Hemagglutinin Glycoproteins, Influenza Virus/immunology
- Epidemics
- Antigenic Drift and Shift/genetics
- Child
- Adult
- Neuraminidase/genetics
- Neuraminidase/immunology
- Adolescent
- Child, Preschool
- Antigens, Viral/immunology
- Antigens, Viral/genetics
- Young Adult
- Evolution, Molecular
- Seasons
- Middle Aged
Collapse
Affiliation(s)
- Amanda C Perofsky
- Fogarty International Center, National Institutes of Health, Bethesda, United States
- Brotman Baty Institute for Precision Medicine, University of Washington, Seattle, United States
| | - John Huddleston
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, Seattle, United States
| | - Chelsea L Hansen
- Fogarty International Center, National Institutes of Health, Bethesda, United States
- Brotman Baty Institute for Precision Medicine, University of Washington, Seattle, United States
| | - John R Barnes
- Virology Surveillance and Diagnosis Branch, Influenza Division, National Center for Immunization and Respiratory Diseases (NCIRD), Centers for Disease Control and Prevention (CDC), Atlanta, United States
| | - Thomas Rowe
- Virology Surveillance and Diagnosis Branch, Influenza Division, National Center for Immunization and Respiratory Diseases (NCIRD), Centers for Disease Control and Prevention (CDC), Atlanta, United States
| | - Xiyan Xu
- Virology Surveillance and Diagnosis Branch, Influenza Division, National Center for Immunization and Respiratory Diseases (NCIRD), Centers for Disease Control and Prevention (CDC), Atlanta, United States
| | - Rebecca Kondor
- Virology Surveillance and Diagnosis Branch, Influenza Division, National Center for Immunization and Respiratory Diseases (NCIRD), Centers for Disease Control and Prevention (CDC), Atlanta, United States
| | - David E Wentworth
- Virology Surveillance and Diagnosis Branch, Influenza Division, National Center for Immunization and Respiratory Diseases (NCIRD), Centers for Disease Control and Prevention (CDC), Atlanta, United States
| | - Nicola Lewis
- WHO Collaborating Centre for Reference and Research on Influenza, Crick Worldwide Influenza Centre, The Francis Crick Institute, London, United Kingdom
| | - Lynne Whittaker
- WHO Collaborating Centre for Reference and Research on Influenza, Crick Worldwide Influenza Centre, The Francis Crick Institute, London, United Kingdom
| | - Burcu Ermetal
- WHO Collaborating Centre for Reference and Research on Influenza, Crick Worldwide Influenza Centre, The Francis Crick Institute, London, United Kingdom
| | - Ruth Harvey
- WHO Collaborating Centre for Reference and Research on Influenza, Crick Worldwide Influenza Centre, The Francis Crick Institute, London, United Kingdom
| | - Monica Galiano
- WHO Collaborating Centre for Reference and Research on Influenza, Crick Worldwide Influenza Centre, The Francis Crick Institute, London, United Kingdom
| | - Rodney Stuart Daniels
- WHO Collaborating Centre for Reference and Research on Influenza, Crick Worldwide Influenza Centre, The Francis Crick Institute, London, United Kingdom
| | - John W McCauley
- WHO Collaborating Centre for Reference and Research on Influenza, Crick Worldwide Influenza Centre, The Francis Crick Institute, London, United Kingdom
| | - Seiichiro Fujisaki
- Influenza Virus Research Center, National Institute of Infectious Diseases, Tokyo, Japan
| | - Kazuya Nakamura
- Influenza Virus Research Center, National Institute of Infectious Diseases, Tokyo, Japan
| | - Noriko Kishida
- Influenza Virus Research Center, National Institute of Infectious Diseases, Tokyo, Japan
| | - Shinji Watanabe
- Influenza Virus Research Center, National Institute of Infectious Diseases, Tokyo, Japan
| | - Hideki Hasegawa
- Influenza Virus Research Center, National Institute of Infectious Diseases, Tokyo, Japan
| | - Sheena G Sullivan
- WHO Collaborating Centre for Reference and Research on Influenza, The Peter Doherty Institute for Infection and Immunity, Department of Microbiology and Immunology, The University of Melbourne, The Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
| | - Ian G Barr
- WHO Collaborating Centre for Reference and Research on Influenza, The Peter Doherty Institute for Infection and Immunity, Department of Microbiology and Immunology, The University of Melbourne, The Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
| | - Kanta Subbarao
- WHO Collaborating Centre for Reference and Research on Influenza, The Peter Doherty Institute for Infection and Immunity, Department of Microbiology and Immunology, The University of Melbourne, The Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
| | - Florian Krammer
- Center for Vaccine Research and Pandemic Preparedness (C-VaRPP), Icahn School of Medicine at Mount Sinai, New York, United States
- Department of Pathology, Molecular and Cell-Based Medicine, Icahn School of Medicine at Mount Sinai, New York, United States
| | - Trevor Bedford
- Brotman Baty Institute for Precision Medicine, University of Washington, Seattle, United States
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, Seattle, United States
- Department of Genome Sciences, University of Washington, Seattle, United States
- Howard Hughes Medical Institute, Seattle, United States
| | - Cécile Viboud
- Fogarty International Center, National Institutes of Health, Bethesda, United States
| |
Collapse
|
14
|
Huddleston J, Bedford T. Timely vaccine strain selection and genomic surveillance improves evolutionary forecast accuracy of seasonal influenza A/H3N2. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.09.11.24313489. [PMID: 39314963 PMCID: PMC11419249 DOI: 10.1101/2024.09.11.24313489] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/25/2024]
Abstract
For the last decade, evolutionary forecasting models have influenced seasonal influenza vaccine design. These models attempt to predict which genetic variants circulating at the time of vaccine strain selection will be dominant 12 months later in the influenza season targeted by vaccination campaign. Forecasting models depend on hemagglutinin (HA) sequences from the WHO's Global Influenza Surveillance and Response System to identify currently circulating groups of related strains (clades) and estimate clade fitness for forecasts. However, the average lag between collection of a clinical sample and the submission of its sequence to the Global Initiative on Sharing All Influenza Data (GISAID) EpiFlu database is ~3 months. Submission lags complicate the already difficult 12-month forecasting problem by reducing understanding of current clade frequencies at the time of forecasting. These constraints of a 12-month forecast horizon and 3-month average submission lags create an upper bound on the accuracy of any long-term forecasting model. The global response to the SARS-CoV-2 pandemic revealed that modern vaccine technology like mRNA vaccines can reduce how far we need to forecast into the future to 6 months or less and that expanded support for sequencing can reduce submission lags to GISAID to 1 month on average. To determine whether these recent advances could also improve long-term forecasts for seasonal influenza, we quantified the effects of reducing forecast horizons and submission lags on the accuracy of forecasts for A/H3N2 populations. We found that reducing forecast horizons from 12 months to 6 or 3 months reduced average absolute forecasting errors to 25% and 50% of the 12-month average, respectively. Reducing submission lags provided little improvement to forecasting accuracy but decreased the uncertainty in current clade frequencies by 50%. These results show the potential to substantially improve the accuracy of existing influenza forecasting models by modernizing influenza vaccine development and increasing global sequencing capacity.
Collapse
Affiliation(s)
- John Huddleston
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
| | - Trevor Bedford
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, Seattle, WA, USA
- Howard Hughes Medical Institute, Seattle, WA, USA
| |
Collapse
|
15
|
Cai C, Li J, Xia Y, Li W. FluPMT: Prediction of Predominant Strains of Influenza A Viruses via Multi-Task Learning. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; 21:1254-1263. [PMID: 38498763 DOI: 10.1109/tcbb.2024.3378468] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/20/2024]
Abstract
Seasonal influenza vaccines play a crucial role in saving numerous lives annually. However, the constant evolution of the influenza A virus necessitates frequent vaccine updates to ensure its ongoing effectiveness. The decision to develop a new vaccine strain is generally based on the assessment of the current predominant strains. Nevertheless, the process of vaccine production and distribution is very time-consuming, leaving a window for the emergence of new variants that could decrease vaccine effectiveness, so predictions of influenza A virus evolution can inform vaccine evaluation and selection. Hence, we present FluPMT, a novel sequence prediction model that applies an encoder-decoder architecture to predict the hemagglutinin (HA) protein sequence of the upcoming season's predominant strain by capturing the patterns of evolution of influenza A viruses. Specifically, we employ time series to model the evolution of influenza A viruses, and utilize attention mechanisms to explore dependencies among residues of sequences. Additionally, antigenic distance prediction based on graph network representation learning is incorporated into the sequence prediction as an auxiliary task through a multi-task learning framework. Experimental results on two influenza datasets highlight the exceptional predictive performance of FluPMT, offering valuable insights into virus evolutionary dynamics, as well as vaccine evaluation and production.
Collapse
|
16
|
Celentano M, DeWitt WS, Prillo S, Song YS. Exact and efficient phylodynamic simulation from arbitrarily large populations. ARXIV 2024:arXiv:2402.17153v2. [PMID: 38463501 PMCID: PMC10925381] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 03/12/2024]
Abstract
Many biological studies involve inferring the evolutionary history of a sample of individuals from a large population and interpreting the reconstructed tree. Such an ascertained tree typically represents only a small part of a comprehensive population tree and is distorted by survivorship and sampling biases. Inferring evolutionary parameters from ascertained trees requires modeling both the underlying population dynamics and the ascertainment process. A crucial component of this phylodynamic modeling involves tree simulation, which is used to benchmark probabilistic inference methods. To simulate an ascertained tree, one must first simulate the full population tree and then prune unobserved lineages. Consequently, the computational cost is determined not by the size of the final simulated tree, but by the size of the population tree in which it is embedded. In most biological scenarios, simulations of the entire population are prohibitively expensive due to computational demands placed on lineages without sampled descendants. Here, we address this challenge by proving that, for any partially ascertained process from a general multi-type birth-death-mutation-sampling model, there exists an equivalent process with complete sampling and no death, a property which we leverage to develop a highly efficient algorithm for simulating trees. Our algorithm scales linearly with the size of the final simulated tree and is independent of the population size, enabling simulations from extremely large populations beyond the reach of current methods but essential for various biological applications. We anticipate that this unprecedented speedup will significantly advance the development of novel inference methods that require extensive training data.
Collapse
Affiliation(s)
| | | | | | - Yun S Song
- Department of Statistics, University of California, Berkeley
- Computer Science Division, University of California, Berkeley
| |
Collapse
|
17
|
Gao Y, Feder AF. Detecting branching rate heterogeneity in multifurcating trees with applications in lineage tracing data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.27.601073. [PMID: 39005367 PMCID: PMC11244928 DOI: 10.1101/2024.06.27.601073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/16/2024]
Abstract
Understanding cellular birth rate differences is crucial for predicting cancer progression and interpreting tumor-derived genetic data. Lineage tracing experiments enable detailed reconstruction of cellular genealogies, offering new opportunities to measure branching rate heterogeneity. However, the lineage tracing process can introduce complex tree features that complicate this effort. Here, we examine tree characteristics in lineage tracing-derived genealogies and find that editing window placement leads to multifurcations at a tree's root or tips. We propose several ways in which existing tree topology-based metrics can be extended to test for rate heterogeneity on trees even in the presence of lineage-tracing associated distortions. Although these methods vary in power and robustness, a test based on theJ 1 statistic effectively detects branching rate heterogeneity in simulated lineage tracing data. Tests based on other common statistics ( s ^ and the Sackin index) show interior performance toJ 1 . We apply our validated methods to xenograft experimental data and find widespread rate heterogeneity across multiple study systems. Our results demonstrate the potential of tree topology statistics in analyzing lineage tracing data, and highlight the challenges associated with adapting phylogenetic methods to these systems.
Collapse
Affiliation(s)
- Yingnan Gao
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Alison F Feder
- Department of Genome Sciences, University of Washington, Seattle, WA
- Herbold Computational Biology Program, Fred Hutchinson Cancer Center, Seattle, WA
| |
Collapse
|
18
|
Bonetti Franceschi V, Volz E. Phylogenetic signatures reveal multilevel selection and fitness costs in SARS-CoV-2. Wellcome Open Res 2024; 9:85. [PMID: 39132669 PMCID: PMC11316176 DOI: 10.12688/wellcomeopenres.20704.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/16/2024] [Indexed: 08/13/2024] Open
Abstract
Background Large-scale sequencing of SARS-CoV-2 has enabled the study of viral evolution during the COVID-19 pandemic. Some viral mutations may be advantageous to viral replication within hosts but detrimental to transmission, thus carrying a transient fitness advantage. By affecting the number of descendants, persistence times and growth rates of associated clades, these mutations generate localised imbalance in phylogenies. Quantifying these features in closely-related clades with and without recurring mutations can elucidate the tradeoffs between within-host replication and between-host transmission. Methods We implemented a novel phylogenetic clustering algorithm ( mlscluster, https://github.com/mrc-ide/mlscluster) to systematically explore time-scaled phylogenies for mutations under transient/multilevel selection. We applied this method to a SARS-CoV-2 time-calibrated phylogeny with >1.2 million sequences from England, and characterised these recurrent mutations that may influence transmission fitness across PANGO-lineages and genomic regions using Poisson regressions and summary statistics. Results We found no major differences across two epidemic stages (before and after Omicron), PANGO-lineages, and genomic regions. However, spike, nucleocapsid, and ORF3a were proportionally more enriched for transmission fitness polymorphisms (TFP)-homoplasies than other proteins. We provide a catalog of SARS-CoV-2 sites under multilevel selection, which can guide experimental investigations within and beyond the spike protein. Conclusions This study provides empirical evidence for the existence of important tradeoffs between within-host replication and between-host transmission shaping the fitness landscape of SARS-CoV-2. This method may be used as a fast and scalable means to shortlist large sequence databases for sites under putative multilevel selection which may warrant subsequent confirmatory analyses and experimental confirmation.
Collapse
Affiliation(s)
- Vinicius Bonetti Franceschi
- Department of Infectious Disease Epidemiology, School of Public Health, Imperial College London, London, England, W2 1PG, UK
| | - Erik Volz
- Department of Infectious Disease Epidemiology, School of Public Health, Imperial College London, London, England, W2 1PG, UK
| |
Collapse
|
19
|
Visani GM, Pun MN, Galvin W, Daniel E, Borisiak K, Wagura U, Nourmohammad A. HERMES: Holographic Equivariant neuRal network model for Mutational Effect and Stability prediction. ARXIV 2024:arXiv:2407.06703v1. [PMID: 39040640 PMCID: PMC11261993] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 07/24/2024]
Abstract
Predicting the stability and fitness effects of amino acid mutations in proteins is a cornerstone of biological discovery and engineering. Various experimental techniques have been developed to measure mutational effects, providing us with extensive datasets across a diverse range of proteins. By training on these data, traditional computational modeling and more recent machine learning approaches have advanced significantly in predicting mutational effects. Here, we introduce HERMES, a 3D rotationally equivariant structure-based neural network model for mutational effect and stability prediction. Pre-trained to predict amino acid propensity from its surrounding 3D structure, HERMES can be fine-tuned for mutational effects using our open-source code. We present a suite of HERMES models, pre-trained with different strategies, and fine-tuned to predict the stability effect of mutations. Benchmarking against other models shows that HERMES often outperforms or matches their performance in predicting mutational effect on stability, binding, and fitness. HERMES offers versatile tools for evaluating mutational effects and can be fine-tuned for specific predictive objectives.
Collapse
Affiliation(s)
- Gian Marco Visani
- Department of Computer Science and Engineering, University of Washington, Seattle, USA
| | - Michael N. Pun
- Department of Physics, University of Washington, 3910 15th Avenue Northeast, Seattle, WA 98195, USA
| | - William Galvin
- Department of Computer Science and Engineering, University of Washington, Seattle, USA
| | - Eric Daniel
- Department of Computer Science and Engineering, University of Washington, Seattle, USA
| | - Kevin Borisiak
- Department of Physics, University of Washington, 3910 15th Avenue Northeast, Seattle, WA 98195, USA
| | - Utheri Wagura
- Department of Physics, University of Washington, 3910 15th Avenue Northeast, Seattle, WA 98195, USA
- Department of Physics, Massachusetts Institute of Technology, 182 Memorial Dr, Cambridge, MA 02139
| | - Armita Nourmohammad
- Department of Computer Science and Engineering, University of Washington, Seattle, USA
- Department of Physics, University of Washington, 3910 15th Avenue Northeast, Seattle, WA 98195, USA
- Department of Applied Mathematics, University of Washington, Seattle, USA
- Fred Hutchinson cancer Research Center, 1100 Fairview ave N, Seattle, WA 98109, USA
| |
Collapse
|
20
|
Perofsky AC, Huddleston J, Hansen C, Barnes JR, Rowe T, Xu X, Kondor R, Wentworth DE, Lewis N, Whittaker L, Ermetal B, Harvey R, Galiano M, Daniels RS, McCauley JW, Fujisaki S, Nakamura K, Kishida N, Watanabe S, Hasegawa H, Sullivan SG, Barr IG, Subbarao K, Krammer F, Bedford T, Viboud C. Antigenic drift and subtype interference shape A(H3N2) epidemic dynamics in the United States. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2023.10.02.23296453. [PMID: 37873362 PMCID: PMC10593063 DOI: 10.1101/2023.10.02.23296453] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
Influenza viruses continually evolve new antigenic variants, through mutations in epitopes of their major surface proteins, hemagglutinin (HA) and neuraminidase (NA). Antigenic drift potentiates the reinfection of previously infected individuals, but the contribution of this process to variability in annual epidemics is not well understood. Here we link influenza A(H3N2) virus evolution to regional epidemic dynamics in the United States during 1997-2019. We integrate phenotypic measures of HA antigenic drift and sequence-based measures of HA and NA fitness to infer antigenic and genetic distances between viruses circulating in successive seasons. We estimate the magnitude, severity, timing, transmission rate, age-specific patterns, and subtype dominance of each regional outbreak and find that genetic distance based on broad sets of epitope sites is the strongest evolutionary predictor of A(H3N2) virus epidemiology. Increased HA and NA epitope distance between seasons correlates with larger, more intense epidemics, higher transmission, greater A(H3N2) subtype dominance, and a greater proportion of cases in adults relative to children, consistent with increased population susceptibility. Based on random forest models, A(H1N1) incidence impacts A(H3N2) epidemics to a greater extent than viral evolution, suggesting that subtype interference is a major driver of influenza A virus infection dynamics, presumably via heterosubtypic cross-immunity.
Collapse
Affiliation(s)
- Amanda C Perofsky
- Fogarty International Center, National Institutes of Health, United States
- Brotman Baty Institute for Precision Medicine, University of Washington, United States
| | - John Huddleston
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, United States
| | - Chelsea Hansen
- Fogarty International Center, National Institutes of Health, United States
- Brotman Baty Institute for Precision Medicine, University of Washington, United States
| | - John R Barnes
- Virology Surveillance and Diagnosis Branch, Influenza Division, National Center for Immunization and Respiratory Diseases (NCIRD), Centers for Disease Control and Prevention (CDC), United States
| | - Thomas Rowe
- Virology Surveillance and Diagnosis Branch, Influenza Division, National Center for Immunization and Respiratory Diseases (NCIRD), Centers for Disease Control and Prevention (CDC), United States
| | - Xiyan Xu
- Virology Surveillance and Diagnosis Branch, Influenza Division, National Center for Immunization and Respiratory Diseases (NCIRD), Centers for Disease Control and Prevention (CDC), United States
| | - Rebecca Kondor
- Virology Surveillance and Diagnosis Branch, Influenza Division, National Center for Immunization and Respiratory Diseases (NCIRD), Centers for Disease Control and Prevention (CDC), United States
| | - David E Wentworth
- Virology Surveillance and Diagnosis Branch, Influenza Division, National Center for Immunization and Respiratory Diseases (NCIRD), Centers for Disease Control and Prevention (CDC), United States
| | - Nicola Lewis
- WHO Collaborating Centre for Reference and Research on Influenza, Crick Worldwide Influenza Centre, The Francis Crick Institute, United Kingdom
| | - Lynne Whittaker
- WHO Collaborating Centre for Reference and Research on Influenza, Crick Worldwide Influenza Centre, The Francis Crick Institute, United Kingdom
| | - Burcu Ermetal
- WHO Collaborating Centre for Reference and Research on Influenza, Crick Worldwide Influenza Centre, The Francis Crick Institute, United Kingdom
| | - Ruth Harvey
- WHO Collaborating Centre for Reference and Research on Influenza, Crick Worldwide Influenza Centre, The Francis Crick Institute, United Kingdom
| | - Monica Galiano
- WHO Collaborating Centre for Reference and Research on Influenza, Crick Worldwide Influenza Centre, The Francis Crick Institute, United Kingdom
| | - Rodney Stuart Daniels
- WHO Collaborating Centre for Reference and Research on Influenza, Crick Worldwide Influenza Centre, The Francis Crick Institute, United Kingdom
| | - John W McCauley
- WHO Collaborating Centre for Reference and Research on Influenza, Crick Worldwide Influenza Centre, The Francis Crick Institute, United Kingdom
| | - Seiichiro Fujisaki
- Influenza Virus Research Center, National Institute of Infectious Diseases, Japan
| | - Kazuya Nakamura
- Influenza Virus Research Center, National Institute of Infectious Diseases, Japan
| | - Noriko Kishida
- Influenza Virus Research Center, National Institute of Infectious Diseases, Japan
| | - Shinji Watanabe
- Influenza Virus Research Center, National Institute of Infectious Diseases, Japan
| | - Hideki Hasegawa
- Influenza Virus Research Center, National Institute of Infectious Diseases, Japan
| | - Sheena G Sullivan
- WHO Collaborating Centre for Reference and Research on Influenza, The Peter Doherty Institute for Infection and Immunity, Department of Microbiology and Immunology, The University of Melbourne, The Peter Doherty Institute for Infection and Immunity, Australia
| | - Ian G Barr
- WHO Collaborating Centre for Reference and Research on Influenza, The Peter Doherty Institute for Infection and Immunity, Department of Microbiology and Immunology, The University of Melbourne, The Peter Doherty Institute for Infection and Immunity, Australia
| | - Kanta Subbarao
- WHO Collaborating Centre for Reference and Research on Influenza, The Peter Doherty Institute for Infection and Immunity, Department of Microbiology and Immunology, The University of Melbourne, The Peter Doherty Institute for Infection and Immunity, Australia
| | - Florian Krammer
- Center for Vaccine Research and Pandemic Preparedness (C-VaRPP), Icahn School of Medicine at Mount Sinai, United States
- Department of Pathology, Molecular and Cell-Based Medicine, Icahn School of Medicine at Mount Sinai, United States
| | - Trevor Bedford
- Brotman Baty Institute for Precision Medicine, University of Washington, United States
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Center, United States
- Department of Genome Sciences, University of Washington, United States
- Howard Hughes Medical Institute, Seattle, United States
| | - Cécile Viboud
- Fogarty International Center, National Institutes of Health, United States
| |
Collapse
|
21
|
Shah SAW, Palomar DP, Barr I, Poon LLM, Quadeer AA, McKay MR. Seasonal antigenic prediction of influenza A H3N2 using machine learning. Nat Commun 2024; 15:3833. [PMID: 38714654 PMCID: PMC11076571 DOI: 10.1038/s41467-024-47862-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Accepted: 04/10/2024] [Indexed: 05/10/2024] Open
Abstract
Antigenic characterization of circulating influenza A virus (IAV) isolates is routinely assessed by using the hemagglutination inhibition (HI) assays for surveillance purposes. It is also used to determine the need for annual influenza vaccine updates as well as for pandemic preparedness. Performing antigenic characterization of IAV on a global scale is confronted with high costs, animal availability, and other practical challenges. Here we present a machine learning model that accurately predicts (normalized) outputs of HI assays involving circulating human IAV H3N2 viruses, using their hemagglutinin subunit 1 (HA1) sequences and associated metadata. Each season, the model learns an updated nonlinear mapping of genetic to antigenic changes using data from past seasons only. The model accurately distinguishes antigenic variants from non-variants and adaptively characterizes seasonal dynamics of HA1 sites having the strongest influence on antigenic change. Antigenic predictions produced by the model can aid influenza surveillance, public health management, and vaccine strain selection activities.
Collapse
Affiliation(s)
- Syed Awais W Shah
- Department of Electronic and Computer Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong SAR, China
| | - Daniel P Palomar
- Department of Electronic and Computer Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong SAR, China
- Department of Industrial Engineering & Decision Analytics, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong SAR, China
| | - Ian Barr
- WHO Collaborating Centre for Reference and Research on Influenza, Melbourne, Victoria, Australia
- Department of Microbiology and Immunology, University of Melbourne, at The Peter Doherty Institute for Infection and Immunity, Melbourne, Victoria, Australia
| | - Leo L M Poon
- School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
- Centre for Immunology & Infection, Hong Kong SAR, China
| | - Ahmed Abdul Quadeer
- Department of Electronic and Computer Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong SAR, China.
- Department of Electrical and Electronic Engineering, University of Melbourne, Melbourne, Victoria, Australia.
| | - Matthew R McKay
- Department of Microbiology and Immunology, University of Melbourne, at The Peter Doherty Institute for Infection and Immunity, Melbourne, Victoria, Australia.
- Department of Electrical and Electronic Engineering, University of Melbourne, Melbourne, Victoria, Australia.
| |
Collapse
|
22
|
Chitwood MH, Colijn C, Yang C, Crudu V, Ciobanu N, Codreanu A, Kim J, Rancu I, Rhee K, Cohen T, Sobkowiak B. The recent rapid expansion of multidrug resistant Ural lineage Mycobacterium tuberculosis in Moldova. Nat Commun 2024; 15:2962. [PMID: 38580642 PMCID: PMC10997638 DOI: 10.1038/s41467-024-47282-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Accepted: 03/26/2024] [Indexed: 04/07/2024] Open
Abstract
The projected trajectory of multidrug resistant tuberculosis (MDR-TB) epidemics depends on the reproductive fitness of circulating strains of MDR M. tuberculosis (Mtb). Previous efforts to characterize the fitness of MDR Mtb have found that Mtb strains of the Beijing sublineage (Lineage 2.2.1) may be more prone to develop resistance and retain fitness in the presence of resistance-conferring mutations than other lineages. Using Mtb genome sequences from all culture-positive cases collected over two years in Moldova, we estimate the fitness of Ural (Lineage 4.2) and Beijing strains, the two lineages in which MDR is concentrated in the country. We estimate that the fitness of MDR Ural strains substantially exceeds that of other susceptible and MDR strains, and we identify several mutations specific to these MDR Ural strains. Our findings suggest that MDR Ural Mtb has been transmitting efficiently in Moldova and poses a substantial risk of spreading further in the region.
Collapse
Affiliation(s)
- Melanie H Chitwood
- Department of Epidemiology of Microbial Disease, Yale School of Public Health, 60 College Street, New Haven, CT, USA.
| | - Caroline Colijn
- Department of Mathematics, Simon Fraser University, 8888 University Drive West, Burnaby, BC, Canada
| | - Chongguang Yang
- School of Public Health (Shenzhen), Shenzhen Campus of Sun Yat-sen University, No. 132 Outer Ring East Road, Guangzhou University Town Guangdong, Guangdong, PR China
| | - Valeriu Crudu
- Phthisiopneumology Institute, Strada Constantin Vârnav 13, Chisinau, Republic of Moldova
| | - Nelly Ciobanu
- Phthisiopneumology Institute, Strada Constantin Vârnav 13, Chisinau, Republic of Moldova
| | - Alexandru Codreanu
- Phthisiopneumology Institute, Strada Constantin Vârnav 13, Chisinau, Republic of Moldova
| | - Jaehee Kim
- Department of Computational Biology, Cornell University, 237 Tower Road, Ithaca, NY, USA
| | - Isabel Rancu
- Department of Epidemiology of Microbial Disease, Yale School of Public Health, 60 College Street, New Haven, CT, USA
| | - Kyu Rhee
- Department of Medicine, Weill Cornell Medicine, 1300 York Ave, New York, NY, USA
| | - Ted Cohen
- Department of Epidemiology of Microbial Disease, Yale School of Public Health, 60 College Street, New Haven, CT, USA.
| | - Benjamin Sobkowiak
- Department of Epidemiology of Microbial Disease, Yale School of Public Health, 60 College Street, New Haven, CT, USA
| |
Collapse
|
23
|
Lou J, Liang W, Cao L, Hu I, Zhao S, Chen Z, Chan RWY, Cheung PPH, Zheng H, Liu C, Li Q, Chong MKC, Zhang Y, Yeoh EK, Chan PKS, Zee BCY, Mok CKP, Wang MH. Predictive evolutionary modelling for influenza virus by site-based dynamics of mutations. Nat Commun 2024; 15:2546. [PMID: 38514647 PMCID: PMC10958014 DOI: 10.1038/s41467-024-46918-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Accepted: 03/12/2024] [Indexed: 03/23/2024] Open
Abstract
Influenza virus continuously evolves to escape human adaptive immunity and generates seasonal epidemics. Therefore, influenza vaccine strains need to be updated annually for the upcoming flu season to ensure vaccine effectiveness. We develop a computational approach, beth-1, to forecast virus evolution and select representative virus for influenza vaccine. The method involves modelling site-wise mutation fitness. Informed by virus genome and population sero-positivity, we calibrate transition time of mutations and project the fitness landscape to future time, based on which beth-1 selects the optimal vaccine strain. In season-to-season prediction in historical data for the influenza A pH1N1 and H3N2 viruses, beth-1 demonstrates superior genetic matching compared to existing approaches. In prospective validations, the model shows superior or non-inferior genetic matching and neutralization against circulating virus in mice immunization experiments compared to the current vaccine. The method offers a promising and ready-to-use tool to facilitate vaccine strain selection for the influenza virus through capturing heterogeneous evolutionary dynamics over genome space-time and linking molecular variants to population immune response.
Collapse
Affiliation(s)
- Jingzhi Lou
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- Beth Bioinformatics Co. Ltd, Hong Kong SAR, China
| | - Weiwen Liang
- HKU-Pasteur Research Pole, School of Public Health, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Lirong Cao
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- CUHK Shenzhen Research Institute, Shenzhen, China
| | - Inchi Hu
- Department of Statistics, George Mason University, Fairfax, VA, USA
| | - Shi Zhao
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- School of Public Health, Tianjin Medical University, Tianjin, China
| | - Zigui Chen
- Department of Microbiology, CUHK, Hong Kong SAR, China
| | - Renee Wan Yi Chan
- Department of Paediatrics, CUHK, Hong Kong SAR, China
- Hong Kong Hub of Paediatric Excellence, CUHK, Hong Kong SAR, China
| | | | - Hong Zheng
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
| | - Caiqi Liu
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
| | - Qi Li
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
| | - Marc Ka Chun Chong
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- CUHK Shenzhen Research Institute, Shenzhen, China
| | - Yexian Zhang
- Beth Bioinformatics Co. Ltd, Hong Kong SAR, China
- CUHK Shenzhen Research Institute, Shenzhen, China
| | - Eng-Kiong Yeoh
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- Centre for Health Systems and Policy Research, CUHK, Hong Kong SAR, China
| | - Paul Kay-Sheung Chan
- Department of Microbiology, CUHK, Hong Kong SAR, China
- Stanley Ho Centre for Emerging Infectious Diseases, CUHK, Hong Kong SAR, China
| | - Benny Chung Ying Zee
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China
- CUHK Shenzhen Research Institute, Shenzhen, China
| | - Chris Ka Pun Mok
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China.
- Li Ka Shing Institute of Health Sciences, Faculty of Medicine, CUHK, Hong Kong SAR, China.
| | - Maggie Haitian Wang
- JC School of Public Health and Primary Care (JCSPHPC), The Chinese University of Hong Kong (CUHK), Hong Kong SAR, China.
- CUHK Shenzhen Research Institute, Shenzhen, China.
| |
Collapse
|
24
|
Weng C, Yu F, Yang D, Poeschla M, Liggett LA, Jones MG, Qiu X, Wahlster L, Caulier A, Hussmann JA, Schnell A, Yost KE, Koblan LW, Martin-Rufino JD, Min J, Hammond A, Ssozi D, Bueno R, Mallidi H, Kreso A, Escabi J, Rideout WM, Jacks T, Hormoz S, van Galen P, Weissman JS, Sankaran VG. Deciphering cell states and genealogies of human haematopoiesis. Nature 2024; 627:389-398. [PMID: 38253266 PMCID: PMC10937407 DOI: 10.1038/s41586-024-07066-z] [Citation(s) in RCA: 47] [Impact Index Per Article: 47.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Accepted: 01/12/2024] [Indexed: 01/24/2024]
Abstract
The human blood system is maintained through the differentiation and massive amplification of a limited number of long-lived haematopoietic stem cells (HSCs)1. Perturbations to this process underlie diverse diseases, but the clonal contributions to human haematopoiesis and how this changes with age remain incompletely understood. Although recent insights have emerged from barcoding studies in model systems2-5, simultaneous detection of cell states and phylogenies from natural barcodes in humans remains challenging. Here we introduce an improved, single-cell lineage-tracing system based on deep detection of naturally occurring mitochondrial DNA mutations with simultaneous readout of transcriptional states and chromatin accessibility. We use this system to define the clonal architecture of HSCs and map the physiological state and output of clones. We uncover functional heterogeneity in HSC clones, which is stable over months and manifests as both differences in total HSC output and biases towards the production of different mature cell types. We also find that the diversity of HSC clones decreases markedly with age, leading to an oligoclonal structure with multiple distinct clonal expansions. Our study thus provides a clonally resolved and cell-state-aware atlas of human haematopoiesis at single-cell resolution, showing an unappreciated functional diversity of human HSC clones and, more broadly, paving the way for refined studies of clonal dynamics across a range of tissues in human health and disease.
Collapse
Affiliation(s)
- Chen Weng
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA
- Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Biology and Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Fulong Yu
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
- Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- State Key Laboratory of Respiratory Disease, Guangzhou Medical University, Guangzhou, P.R. China
| | - Dian Yang
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA
- Department of Biology and Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Molecular Pharmacology and Therapeutics, Department of Systems Biology, Vagelos College of Physicians and Surgeons, Columbia University, New York, NY, USA
| | - Michael Poeschla
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
- Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - L Alexander Liggett
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
- Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Matthew G Jones
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA
- Department of Biology and Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Dermatology, Stanford University, Stanford, CA, USA
- Center for Personal Dynamic Regulomes, Stanford University, Stanford, CA, USA
| | - Xiaojie Qiu
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA
- Department of Biology and Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Genetics and Computer Science, BASE Research Initiative, Betty Irene Moore Children's Heart Center, Stanford University, Stanford, CA, USA
| | - Lara Wahlster
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
- Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Alexis Caulier
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
- Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Jeffrey A Hussmann
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA
- Department of Biology and Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Alexandra Schnell
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA
- Department of Biology and Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Kathryn E Yost
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA
- Department of Biology and Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Luke W Koblan
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA
- Department of Biology and Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Jorge D Martin-Rufino
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
- Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Joseph Min
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA
- Department of Biology and Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Alessandro Hammond
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA
- Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Daniel Ssozi
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Division of Hematology, Brigham and Women's Hospital, Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Raphael Bueno
- Division of Thoracic and Cardiac Surgery, Brigham and Women's Hospital, Boston, MA, USA
| | - Hari Mallidi
- Division of Thoracic and Cardiac Surgery, Brigham and Women's Hospital, Boston, MA, USA
| | - Antonia Kreso
- Division of Cardiac Surgery, Massachusetts General Hospital, Boston, MA, USA
| | - Javier Escabi
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, USA
| | - William M Rideout
- Koch Institute For Integrative Cancer Research at MIT, MIT, Cambridge, MA, USA
| | - Tyler Jacks
- Koch Institute For Integrative Cancer Research at MIT, MIT, Cambridge, MA, USA
| | - Sahand Hormoz
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Peter van Galen
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Division of Hematology, Brigham and Women's Hospital, Department of Medicine, Harvard Medical School, Boston, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
- Ludwig Center at Harvard, Harvard Medical School, Boston, MA, USA
| | - Jonathan S Weissman
- Whitehead Institute for Biomedical Research, Cambridge, MA, USA.
- Department of Biology and Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA, USA.
- Koch Institute For Integrative Cancer Research at MIT, MIT, Cambridge, MA, USA.
| | - Vijay G Sankaran
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA.
- Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA.
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Harvard Stem Cell Institute, Cambridge, MA, USA.
| |
Collapse
|
25
|
Lynch A, Bradford S, Burkard ME. The reckoning of chromosomal instability: past, present, future. Chromosome Res 2024; 32:2. [PMID: 38367036 DOI: 10.1007/s10577-024-09746-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Revised: 01/11/2024] [Accepted: 01/27/2024] [Indexed: 02/19/2024]
Abstract
Quantitative measures of CIN are crucial to our understanding of its role in cancer. Technological advances have changed the way CIN is quantified, offering increased accuracy and insight. Here, we review measures of CIN through its rise as a field, discuss considerations for its measurement, and look forward to future quantification of CIN.
Collapse
Affiliation(s)
- Andrew Lynch
- UW Carbone Cancer Center, University of Wisconsin, Madison, WI, USA
- McArdle Laboratory for Cancer Research, University of Wisconsin, Madison, WI, USA
- Division of Hematology/Oncology, Department of Medicine, School of Medicine and Public Health, University of Wisconsin, Madison, WI, USA
| | - Shermineh Bradford
- UW Carbone Cancer Center, University of Wisconsin, Madison, WI, USA
- McArdle Laboratory for Cancer Research, University of Wisconsin, Madison, WI, USA
- Division of Hematology/Oncology, Department of Medicine, School of Medicine and Public Health, University of Wisconsin, Madison, WI, USA
| | - Mark E Burkard
- UW Carbone Cancer Center, University of Wisconsin, Madison, WI, USA.
- McArdle Laboratory for Cancer Research, University of Wisconsin, Madison, WI, USA.
- Division of Hematology/Oncology, Department of Medicine, School of Medicine and Public Health, University of Wisconsin, Madison, WI, USA.
| |
Collapse
|
26
|
Pamornchainavakul N, Kikuti M, Paploski IAD, Corzo CA, VanderWaal K. Predicting Potential PRRSV-2 Variant Emergence through Phylogenetic Inference. Transbound Emerg Dis 2024; 2024:7945955. [PMID: 40303160 PMCID: PMC12017126 DOI: 10.1155/2024/7945955] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 01/04/2024] [Accepted: 01/23/2024] [Indexed: 05/02/2025]
Abstract
Porcine reproductive and respiratory syndrome (PRRS) is a significant pig disease causing substantial annual losses exceeding half a billion dollars to the United States pork industry. The cocirculation and emergence of genetically distinct PRRSV-2 viruses hinder PRRS control, especially vaccine development. Similar to other viral infections like seasonal flu and SARS-CoV-2, predictive tools for identifying potential emerging viral variants may prospectively aid in preemptive disease mitigation. However, such predictions have not been made for PRRSV-2, despite the abundance of relevant data. In this study, we analyzed a decade's worth of virus ORF5 sequences (n = 20,700) and corresponding metadata to identify phylogenetic-based early indicators for short-term (12 months) and long-term (24 months) variant emergence. Our analysis focuses on PRRSV-2 Lineage 1, which was the predominant lineage within the U.S. during this period. We evaluated population expansion, spatial distribution, and genetic diversity as key success metrics for variant emergence. Our findings indicate that successful variants were best characterized as those that underwent population expansion alongside widespread geographical spread but had limited genetic diversification. Conditional logistic regression revealed the local branching index as the sole informative indicator for predicting population expansion (balanced accuracy (BA) = 0.75), while ancestral branch length was strongly linked to future genetic diversity (BA = 0.79). Predicting spatial dispersion relied on the branch length and putative antigenic difference (BA = 0.67), but their causal relationships remain unclear. Although the predictive models effectively captured most emerging variants (sensitivity = 0.58-0.81), they exhibited relatively low positive predictive value (PPV = 0.09-0.16). This initial step in PRRSV-2 prediction is a crucial step for more precise prevention strategies against PRRS in the future.
Collapse
Affiliation(s)
| | - Mariana Kikuti
- College of Veterinary Medicine, University of Minnesota, St. Paul, MN, USA
| | | | - Cesar A. Corzo
- College of Veterinary Medicine, University of Minnesota, St. Paul, MN, USA
| | | |
Collapse
|
27
|
Eales O, Plank MJ, Cowling BJ, Howden BP, Kucharski AJ, Sullivan SG, Vandemaele K, Viboud C, Riley S, McCaw JM, Shearer FM. Key Challenges for Respiratory Virus Surveillance while Transitioning out of Acute Phase of COVID-19 Pandemic. Emerg Infect Dis 2024; 30:e230768. [PMID: 38190760 PMCID: PMC10826770 DOI: 10.3201/eid3002.230768] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2024] Open
Abstract
To support the ongoing management of viral respiratory diseases while transitioning out of the acute phase of the COVID-19 pandemic, many countries are moving toward an integrated model of surveillance for SARS-CoV-2, influenza virus, and other respiratory pathogens. Although many surveillance approaches catalyzed by the COVID-19 pandemic provide novel epidemiologic insight, continuing them as implemented during the pandemic is unlikely to be feasible for nonemergency surveillance, and many have already been scaled back. Furthermore, given anticipated cocirculation of SARS-CoV-2 and influenza virus, surveillance activities in place before the pandemic require review and adjustment to ensure their ongoing value for public health. In this report, we highlight key challenges for the development of integrated models of surveillance. We discuss the relative strengths and limitations of different surveillance practices and studies as well as their contribution to epidemiologic assessment, forecasting, and public health decision-making.
Collapse
|
28
|
Hoehn KB, Kleinstein SH. B cell phylogenetics in the single cell era. Trends Immunol 2024; 45:62-74. [PMID: 38151443 PMCID: PMC10872299 DOI: 10.1016/j.it.2023.11.004] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Revised: 11/22/2023] [Accepted: 11/24/2023] [Indexed: 12/29/2023]
Abstract
The widespread availability of single-cell RNA sequencing (scRNA-seq) has led to the development of new methods for understanding immune responses. Single-cell transcriptome data can now be paired with B cell receptor (BCR) sequences. However, RNA from BCRs cannot be analyzed like most other genes because BCRs are genetically diverse within individuals. In humans, BCRs are shaped through recombination followed by mutation and selection for antigen binding. As these processes co-occur with cell division, B cells can be studied using phylogenetic trees representing the mutations within a clone. B cell trees can link experimental timepoints, tissues, or cellular subtypes. Here, we review the current state and potential of how B cell phylogenetics can be combined with single-cell data to understand immune responses.
Collapse
Affiliation(s)
- Kenneth B Hoehn
- Department of Biomedical Data Science, Geisel School of Medicine at Dartmouth, Hanover, NH 03755, USA.
| | - Steven H Kleinstein
- Department of Pathology, Yale School of Medicine, New Haven, CT 06520, USA; Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA; Department of Immunobiology, Yale School of Medicine, New Haven, CT 06520, USA
| |
Collapse
|
29
|
Tijhuis AE, Foijer F. Characterizing chromosomal instability-driven cancer evolution and cell fitness at a glance. J Cell Sci 2024; 137:jcs260199. [PMID: 38224461 DOI: 10.1242/jcs.260199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2024] Open
Abstract
Chromosomal instability (CIN), an increased rate of chromosome segregation errors during mitosis, is a hallmark of cancer cells. CIN leads to karyotype differences between cells and thus large-scale heterogeneity among individual cancer cells; therefore, it plays an important role in cancer evolution. Studying CIN and its consequences is technically challenging, but various technologies have been developed to track karyotype dynamics during tumorigenesis, trace clonal lineages and link genomic changes to cancer phenotypes at single-cell resolution. These methods provide valuable insight not only into the role of CIN in cancer progression, but also into cancer cell fitness. In this Cell Science at a Glance article and the accompanying poster, we discuss the relationship between CIN, cancer cell fitness and evolution, and highlight techniques that can be used to study the relationship between these factors. To that end, we explore methods of assessing cancer cell fitness, particularly for chromosomally unstable cancer.
Collapse
Affiliation(s)
- Andréa E Tijhuis
- European Research Institute for the Biology of Ageing , University Medical Center Groningen, University of Groningen,9713 AV Groningen, The Netherlands
| | - Floris Foijer
- European Research Institute for the Biology of Ageing , University Medical Center Groningen, University of Groningen,9713 AV Groningen, The Netherlands
| |
Collapse
|
30
|
Bull JJ, Nuismer SL, Remien CH, Griffiths ME, Antia R. Recombinant transmissible vaccines will be intrinsically contained despite the ability to superinfect. Expert Rev Vaccines 2024; 23:294-302. [PMID: 38372241 PMCID: PMC11003445 DOI: 10.1080/14760584.2024.2320845] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2023] [Accepted: 02/15/2024] [Indexed: 02/20/2024]
Abstract
INTRODUCTION Transmissible vaccines offer a novel approach to suppressing viruses in wildlife populations, with possible applications against viruses that infect humans as zoonoses - Lassa, Ebola, rabies. To ensure safety, current designs propose a recombinant vector platform in which the vector is isolated from the target wildlife population. Because using an endemic vector creates the potential for preexisting immunity to block vaccine transmission, these designs focus on vector viruses capable of superinfection, spreading throughout the host population following vaccination of few individuals. AREAS COVERED We present original theoretical arguments that, regardless of its R0 value, a recombinant vaccine using a superinfecting vector is not expected to expand its active infection coverage when released into a wildlife population that already carries the vector. However, if superinfection occurs at a high rate such that individuals are repeatedly infected throughout their lives, the immunity footprint in the population can be high despite a low incidence of active vaccine infections. Yet we provide reasons that the above expectation is optimistic. EXPERT OPINION High vaccine coverage will typically require repeated releases or release into a population lacking the vector, but careful attention to vector choice and vaccine engineering should also help improve transmissible vaccine utility.
Collapse
Affiliation(s)
- James J Bull
- Department of Biological Sciences, U. Idaho, Moscow, ID 83844 USA
| | - Scott L Nuismer
- Department of Biological Sciences. University of Idaho. Moscow, ID 83844
- Department of Mathematics. University of Idaho. Moscow, ID 83844
| | | | - Megan E Griffiths
- School of Biodiversity, One Health and Veterinary Medicine, University of Glasgow, Glasgow G61 1QH, United Kingdom
| | - Rustom Antia
- Department of Biology, Emory University, Atlanta, Georgia, 30322 USA
| |
Collapse
|
31
|
Prillo S, Ravoor A, Yosef N, Song YS. ConvexML: Scalable and accurate inference of single-cell chronograms from CRISPR/Cas9 lineage tracing data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.03.569785. [PMID: 38076815 PMCID: PMC10705529 DOI: 10.1101/2023.12.03.569785] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
CRISPR/Cas9 gene editing technology has enabled lineage tracing for thousands of cells in vivo. However, most of the analysis of CRISPR/Cas9 lineage tracing data has so far been limited to the reconstruction of single-cell tree topologies, which depict lineage relationships between cells, but not the amount of time that has passed between ancestral cell states and the present. Time-resolved trees, known as chronograms, would allow one to study the evolutionary dynamics of cell populations at an unprecedented level of resolution. Indeed, time-resolved trees would reveal the timing of events on the tree, the relative fitness of subclones, and the dynamics underlying phenotypic changes in the cell population - among other important applications. In this work, we introduce the first scalable and accurate method to refine any given single-cell tree topology into a single-cell chronogram by estimating its branch lengths. To do this, we leverage a statistical model of CRISPR/Cas9 cutting with missing data, paired with a conservative version of maximum parsimony that reconstructs only the ancestral states that we are confident about. As part of our method, we propose a novel approach to represent and handle missing data - specifically, double-resection events - which greatly simplifies and speeds up branch length estimation without compromising quality. All this leads to a convex maximum likelihood estimation (MLE) problem that can be readily solved in seconds with off-the-shelf convex optimization solvers. To stabilize estimates in low-information regimes, we propose a simple penalized version of MLE using a minimum branch length and pseudocounts. We benchmark our method using simulations and show that it performs well on several tasks, outperforming more naive baselines. Our method, which we name 'ConvexML', is available through the cassiopeia open source Python package.
Collapse
Affiliation(s)
| | - Akshay Ravoor
- Computer Science Division, University of California, Berkeley
| | - Nir Yosef
- Computer Science Division, University of California, Berkeley
- Department of Systems Immunology, Weizmann Institute of Science
| | - Yun S. Song
- Computer Science Division, University of California, Berkeley
- Department of Statistics, University of California, Berkeley
| |
Collapse
|
32
|
Hayati M, Sobkowiak B, Stockdale JE, Colijn C. Phylogenetic identification of influenza virus candidates for seasonal vaccines. SCIENCE ADVANCES 2023; 9:eabp9185. [PMID: 37922357 PMCID: PMC10624341 DOI: 10.1126/sciadv.abp9185] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/06/2022] [Accepted: 10/05/2023] [Indexed: 11/05/2023]
Abstract
The seasonal influenza (flu) vaccine is designed to protect against those influenza viruses predicted to circulate during the upcoming flu season, but identifying which viruses are likely to circulate is challenging. We use features from phylogenetic trees reconstructed from hemagglutinin (HA) and neuraminidase (NA) sequences, together with a support vector machine, to predict future circulation. We obtain accuracies of 0.75 to 0.89 (AUC 0.83 to 0.91) over 2016-2020. We explore ways to select potential candidates for a seasonal vaccine and find that the machine learning model has a moderate ability to select strains that are close to future populations. However, consensus sequences among the most recent 3 years also do well at this task. We identify similar candidate strains to those proposed by the World Health Organization, suggesting that this approach can help inform vaccine strain selection.
Collapse
Affiliation(s)
- Maryam Hayati
- School of Computing Science, Simon Fraser University, Burnaby, BC V5A 1S6, Canada
| | - Benjamin Sobkowiak
- Department of Mathematics, Simon Fraser University, Burnaby, BC V5A 1S6, Canada
| | | | - Caroline Colijn
- Department of Mathematics, Simon Fraser University, Burnaby, BC V5A 1S6, Canada
| |
Collapse
|
33
|
Gunnarsson PA, Babu MM. Predicting evolutionary outcomes through the probability of accessing sequence variants. SCIENCE ADVANCES 2023; 9:eade2903. [PMID: 37506212 PMCID: PMC10381947 DOI: 10.1126/sciadv.ade2903] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Accepted: 06/27/2023] [Indexed: 07/30/2023]
Abstract
Natural selection can only operate on available genetic variation. Thus, determining the probability of accessing different sequence variants from a starting sequence can help predict evolutionary trajectories and outcomes. We define the concept of "variant accessibility" as the probability that a set of genotypes encoding a particular protein function will arise through mutations before subject to natural selection. This probability is shaped by the mutational biases of nucleotides and the structure of the genetic code. Using the influenza A virus as a model, we discuss how a more accessible but less fit variant can emerge as an adaptation rather than a more fit variant. We describe a genotype-accessibility landscape, complementary to the genotype-fitness landscape, that informs the likelihood of a starting sequence reaching different parts of genotype space. The proposed framework lays the foundation for predicting the emergence of adaptive genotypes in evolving systems such as viruses and tumors.
Collapse
Affiliation(s)
- P. Alexander Gunnarsson
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, UK
- Department of Structural Biology and Center of Excellence for Data-Driven Discovery, St. Jude Children’s Research Hospital, Memphis, TN 38105, USA
| | - M. Madan Babu
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, UK
- Department of Structural Biology and Center of Excellence for Data-Driven Discovery, St. Jude Children’s Research Hospital, Memphis, TN 38105, USA
| |
Collapse
|
34
|
Thézé J, Ambroset C, Barry S, Masseglia S, Colin A, Tricot A, Tardy F, Bailly X. Genome-wide phylodynamic approach reveals the epidemic dynamics of the main Mycoplasma bovis subtype circulating in France. Microb Genom 2023; 9:mgen001067. [PMID: 37486749 PMCID: PMC10438803 DOI: 10.1099/mgen.0.001067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Accepted: 06/19/2023] [Indexed: 07/25/2023] Open
Abstract
Mycoplasma bovis is a major aetiological agent of bovine respiratory disease worldwide. Genome-based analyses are increasingly being used to monitor the genetic diversity and global distribution of M. bovis, complementing existing subtyping schemes based on locus sequencing. However, these analyses have so far provided limited information on the spatiotemporal and population dynamics of circulating subtypes. Here we applied a genome-wide phylodynamic approach to explore the epidemic dynamics of 88 French M. bovis strains collected between 2000 and 2019 in France and belonging to the currently dominant polC subtype 2 (st2). A strong molecular clock signal detected in the genomic data enabled robust phylodynamic inferences, which estimated that the M. bovis st2 population in France is composed of two lineages that successively emerged from independent introductions of international strains. The first lineage appeared around 2000 and supplanted the previously established antimicrobial-susceptible polC subtype 1. The second lineage, which is likely more transmissible, progressively replaced the first M. bovis st2 lineage population from 2005 onward and became predominant after 2010. Analyses also showed a brief decline in this second M. bovis st2 lineage population in around 2011, possibly due to the challenge from the concurrent emergence of M. bovis polC subtype 3 in France. Finally, we identified non-synonymous mutations in genes associated with lineages, which raises prospects for identifying new surveillance molecular markers. A genome-wide phylodynamic approach provides valuable resources for monitoring the evolution and epidemic dynamics of circulating M. bovis subtypes, and may prove critical for developing more effective surveillance systems and disease control strategies.
Collapse
Affiliation(s)
- Julien Thézé
- Université Clermont Auvergne, INRAE, VetAgro Sup, UMR EPIA, Saint-Genès-Champanelle, France
| | - Chloé Ambroset
- Université de Lyon, ANSES, VetAgro Sup, UMR Mycoplasmoses animales, Lyon, France
| | - Séverine Barry
- Université Clermont Auvergne, INRAE, VetAgro Sup, UMR EPIA, Saint-Genès-Champanelle, France
| | - Sébastien Masseglia
- Université Clermont Auvergne, INRAE, VetAgro Sup, UMR EPIA, Saint-Genès-Champanelle, France
| | - Adélie Colin
- Université de Lyon, ANSES, VetAgro Sup, UMR Mycoplasmoses animales, Lyon, France
| | - Agnès Tricot
- Université de Lyon, ANSES, VetAgro Sup, UMR Mycoplasmoses animales, Lyon, France
| | - Florence Tardy
- Université de Lyon, ANSES, VetAgro Sup, UMR Mycoplasmoses animales, Lyon, France
| | - Xavier Bailly
- Université Clermont Auvergne, INRAE, VetAgro Sup, UMR EPIA, Saint-Genès-Champanelle, France
| |
Collapse
|
35
|
Peng F, Xia Y, Li W. Prediction of Antigenic Distance in Influenza A Using Attribute Network Embedding. Viruses 2023; 15:1478. [PMID: 37515165 PMCID: PMC10385503 DOI: 10.3390/v15071478] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Revised: 06/23/2023] [Accepted: 06/28/2023] [Indexed: 07/30/2023] Open
Abstract
Owing to the rapid changes in the antigenicity of influenza viruses, it is difficult for humans to obtain lasting immunity through antiviral therapy. Hence, tracking the dynamic changes in the antigenicity of influenza viruses can provide a basis for vaccines and drug treatments to cope with the spread of influenza viruses. In this paper, we developed a novel quantitative prediction method to predict the antigenic distance between virus strains using attribute network embedding techniques. An antigenic network is built to model and combine the genetic and antigenic characteristics of the influenza A virus H3N2, using the continuous distributed representation of the virus strain protein sequence (ProtVec) as a node attribute and the antigenic distance between virus strains as an edge weight. The results show a strong positive correlation between supplementing genetic features and antigenic distance prediction accuracy. Further analysis indicates that our prediction model can comprehensively and accurately track the differences in antigenic distances between vaccines and influenza virus strains, and it outperforms existing methods in predicting antigenic distances between strains.
Collapse
Affiliation(s)
- Fujun Peng
- School of Information Science and Engineering, Yunnan University, Kunming 650500, China
| | - Yuanling Xia
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming 650500, China
| | - Weihua Li
- School of Information Science and Engineering, Yunnan University, Kunming 650500, China
| |
Collapse
|
36
|
Rouzine IM, Rozhnova G. Evolutionary implications of SARS-CoV-2 vaccination for the future design of vaccination strategies. COMMUNICATIONS MEDICINE 2023; 3:86. [PMID: 37336956 PMCID: PMC10279745 DOI: 10.1038/s43856-023-00320-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Accepted: 06/07/2023] [Indexed: 06/21/2023] Open
Abstract
Once the first SARS-CoV-2 vaccine became available, mass vaccination was the main pillar of the public health response to the COVID-19 pandemic. It was very effective in reducing hospitalizations and deaths. Here, we discuss the possibility that mass vaccination might accelerate SARS-CoV-2 evolution in antibody-binding regions compared to natural infection at the population level. Using the evidence of strong genetic variation in antibody-binding regions and taking advantage of the similarity between the envelope proteins of SARS-CoV-2 and influenza, we assume that immune selection pressure acting on these regions of the two viruses is similar. We discuss the consequences of this assumption for SARS-CoV-2 evolution in light of mathematical models developed previously for influenza. We further outline the implications of this phenomenon, if our assumptions are confirmed, for the future design of SARS-CoV-2 vaccination strategies.
Collapse
Affiliation(s)
- Igor M Rouzine
- Immunogenetics, Sechenov Institute of Evolutionary Physiology and Biochemistry of Russian Academy of Sciences, Saint-Petersburg, Russia.
| | - Ganna Rozhnova
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands.
- BioISI - Biosystems & Integrative Sciences Institute, Faculdade de Ciências, Universidade de Lisboa, Lisboa, Portugal.
- Center for Complex Systems Studies (CCSS), Utrecht University, Utrecht, The Netherlands.
| |
Collapse
|
37
|
Moulana A, Dupic T, Phillips AM, Desai MM. Genotype-phenotype landscapes for immune-pathogen coevolution. Trends Immunol 2023; 44:384-396. [PMID: 37024340 PMCID: PMC10147585 DOI: 10.1016/j.it.2023.03.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Revised: 03/08/2023] [Accepted: 03/09/2023] [Indexed: 04/07/2023]
Abstract
Our immune systems constantly coevolve with the pathogens that challenge them, as pathogens adapt to evade our defense responses, with our immune repertoires shifting in turn. These coevolutionary dynamics take place across a vast and high-dimensional landscape of potential pathogen and immune receptor sequence variants. Mapping the relationship between these genotypes and the phenotypes that determine immune-pathogen interactions is crucial for understanding, predicting, and controlling disease. Here, we review recent developments applying high-throughput methods to create large libraries of immune receptor and pathogen protein sequence variants and measure relevant phenotypes. We describe several approaches that probe different regions of the high-dimensional sequence space and comment on how combinations of these methods may offer novel insight into immune-pathogen coevolution.
Collapse
Affiliation(s)
- Alief Moulana
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
| | - Thomas Dupic
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
| | - Angela M Phillips
- Department of Microbiology and Immunology, University of California at San Francisco, San Francisco, CA 94143, USA
| | - Michael M Desai
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA; Department of Physics, Harvard University, Cambridge, MA 02138, USA; NSF-Simons Center for Mathematical and Statistical Analysis of Biology, Harvard University, Cambridge, MA 02138, USA; Quantitative Biology Initiative, Harvard University, Cambridge, MA 02138, USA.
| |
Collapse
|
38
|
Berman DS, Howser C, Mehoke T, Ernlund AW, Evans JD. MutaGAN: A sequence-to-sequence GAN framework to predict mutations of evolving protein populations. Virus Evol 2023; 9:vead022. [PMID: 37066021 PMCID: PMC10104372 DOI: 10.1093/ve/vead022] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Revised: 02/03/2023] [Accepted: 03/30/2023] [Indexed: 04/18/2023] Open
Abstract
The ability to predict the evolution of a pathogen would significantly improve the ability to control, prevent, and treat disease. Machine learning, however, is yet to be used to predict the evolutionary progeny of a virus. To address this gap, we developed a novel machine learning framework, named MutaGAN, using generative adversarial networks with sequence-to-sequence, recurrent neural networks generator to accurately predict genetic mutations and evolution of future biological populations. MutaGAN was trained using a generalized time-reversible phylogenetic model of protein evolution with maximum likelihood tree estimation. MutaGAN was applied to influenza virus sequences because influenza evolves quickly and there is a large amount of publicly available data from the National Center for Biotechnology Information's Influenza Virus Resource. MutaGAN generated 'child' sequences from a given 'parent' protein sequence with a median Levenshtein distance of 4.00 amino acids. Additionally, the generator was able to generate sequences that contained at least one known mutation identified within the global influenza virus population for 72.8 per cent of parent sequences. These results demonstrate the power of the MutaGAN framework to aid in pathogen forecasting with implications for broad utility in evolutionary prediction for any protein population.
Collapse
Affiliation(s)
- Daniel S Berman
- Johns Hopkins Applied Physics Laboratory, 11100 Johns Hopkins Rd., Laurel, MD 20723, USA
| | - Craig Howser
- Johns Hopkins Applied Physics Laboratory, 11100 Johns Hopkins Rd., Laurel, MD 20723, USA
| | - Thomas Mehoke
- Johns Hopkins Applied Physics Laboratory, 11100 Johns Hopkins Rd., Laurel, MD 20723, USA
| | - Amanda W Ernlund
- Johns Hopkins Applied Physics Laboratory, 11100 Johns Hopkins Rd., Laurel, MD 20723, USA
| | - Jared D Evans
- Johns Hopkins Applied Physics Laboratory, 11100 Johns Hopkins Rd., Laurel, MD 20723, USA
| |
Collapse
|
39
|
Harvey WT, Davies V, Daniels RS, Whittaker L, Gregory V, Hay AJ, Husmeier D, McCauley JW, Reeve R. A Bayesian approach to incorporate structural data into the mapping of genotype to antigenic phenotype of influenza A(H3N2) viruses. PLoS Comput Biol 2023; 19:e1010885. [PMID: 36972311 PMCID: PMC10079231 DOI: 10.1371/journal.pcbi.1010885] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Revised: 04/06/2023] [Accepted: 01/20/2023] [Indexed: 03/29/2023] Open
Abstract
Surface antigens of pathogens are commonly targeted by vaccine-elicited antibodies but antigenic variability, notably in RNA viruses such as influenza, HIV and SARS-CoV-2, pose challenges for control by vaccination. For example, influenza A(H3N2) entered the human population in 1968 causing a pandemic and has since been monitored, along with other seasonal influenza viruses, for the emergence of antigenic drift variants through intensive global surveillance and laboratory characterisation. Statistical models of the relationship between genetic differences among viruses and their antigenic similarity provide useful information to inform vaccine development, though accurate identification of causative mutations is complicated by highly correlated genetic signals that arise due to the evolutionary process. Here, using a sparse hierarchical Bayesian analogue of an experimentally validated model for integrating genetic and antigenic data, we identify the genetic changes in influenza A(H3N2) virus that underpin antigenic drift. We show that incorporating protein structural data into variable selection helps resolve ambiguities arising due to correlated signals, with the proportion of variables representing haemagglutinin positions decisively included, or excluded, increased from 59.8% to 72.4%. The accuracy of variable selection judged by proximity to experimentally determined antigenic sites was improved simultaneously. Structure-guided variable selection thus improves confidence in the identification of genetic explanations of antigenic variation and we also show that prioritising the identification of causative mutations is not detrimental to the predictive capability of the analysis. Indeed, incorporating structural information into variable selection resulted in a model that could more accurately predict antigenic assay titres for phenotypically-uncharacterised virus from genetic sequence. Combined, these analyses have the potential to inform choices of reference viruses, the targeting of laboratory assays, and predictions of the evolutionary success of different genotypes, and can therefore be used to inform vaccine selection processes.
Collapse
Affiliation(s)
- William T. Harvey
- Boyd Orr Centre for Population and Ecosystem Health, School of Biodiversity, One Health and Veterinary Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
- * E-mail: (WTH); (RR)
| | - Vinny Davies
- School of Computing, College of Science and Engineering, University of Glasgow, Glasgow, United Kingdom
- School of Mathematics and Statistics, College of Science and Engineering, University of Glasgow, Glasgow, United Kingdom
| | - Rodney S. Daniels
- Worldwide Influenza Centre, The Francis Crick Institute, London, United Kingdom
| | - Lynne Whittaker
- Worldwide Influenza Centre, The Francis Crick Institute, London, United Kingdom
| | - Victoria Gregory
- Worldwide Influenza Centre, The Francis Crick Institute, London, United Kingdom
| | - Alan J. Hay
- Worldwide Influenza Centre, The Francis Crick Institute, London, United Kingdom
| | - Dirk Husmeier
- School of Mathematics and Statistics, College of Science and Engineering, University of Glasgow, Glasgow, United Kingdom
| | - John W. McCauley
- Worldwide Influenza Centre, The Francis Crick Institute, London, United Kingdom
| | - Richard Reeve
- Boyd Orr Centre for Population and Ecosystem Health, School of Biodiversity, One Health and Veterinary Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, United Kingdom
- * E-mail: (WTH); (RR)
| |
Collapse
|
40
|
Makau DN, Prieto C, Martínez-Lobo FJ, Paploski IAD, VanderWaal K. Predicting Antigenic Distance from Genetic Data for PRRSV-Type 1: Applications of Machine Learning. Microbiol Spectr 2023; 11:e0408522. [PMID: 36511691 PMCID: PMC9927307 DOI: 10.1128/spectrum.04085-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2022] [Accepted: 11/18/2022] [Indexed: 12/15/2022] Open
Abstract
The control of porcine reproductive and respiratory syndrome (PRRS) remains a significant challenge due to the genetic and antigenic variability of the causative virus (PRRSV). Predominantly, PRRSV management includes using vaccines and live virus inoculations to confer immunity against PRRSV on farms. While understanding cross-protection among strains is crucial for the continued success of these interventions, understanding how genetic diversity translates to antigenic diversity remains elusive. We developed machine learning algorithms to estimate antigenic distance in silico, based on genetic sequence data, and identify differences in specific amino acid sites associated with antigenic differences between viruses. First, we obtained antigenic distance estimates derived from serum neutralization assays cross-reacting PRRSV monospecific antisera with virus isolates from 27 PRRSV1 viruses circulating in Europe. Antigenic distances were weakly to moderately associated with ectodomain amino acid distance for open reading frames (ORFs) 2 to 4 (ρ < 0.2) and ORF5 (ρ = 0.3), respectively. Dividing the antigenic distance values at the median, we then categorized the sera-virus pairs into two levels: low and high antigenic distance (dissimilarity). In the machine learning models, we used amino acid distances in the ectodomains of ORFs 2 to 5 and site-wise amino acid differences between the viruses as potential predictors of antigenic dissimilarity. Using mixed-effect gradient boosting models, we estimated the antigenic distance (high versus low) between serum-virus pairs with an accuracy of 81% (95% confidence interval, 76 to 85%); sensitivity and specificity were 86% and 75%, respectively. We demonstrate that using sequence data we can estimate antigenic distance and potential cross-protection between PRRSV1 strains. IMPORTANCE Understanding cross-protection between cocirculating PRRSV1 strains is crucial to reducing losses associated with PRRS outbreaks on farms. While experimental studies to determine cross-protection are instrumental, these in vivo studies are not always practical or timely for the many cocirculating and emerging PRRSV strains. In this study, we demonstrate the ability to rapidly estimate potential immunologic cross-reaction between different PRRSV1 strains in silico using sequence data routinely collected by production systems. These models can provide fast turn-around information crucial for improving PRRS management decisions such as selecting vaccines/live virus inoculation to be used on farms and assessing the risk of outbreaks by emerging strains on farms previously exposed to certain PRRSV strains and vaccine development among others.
Collapse
Affiliation(s)
- Dennis N. Makau
- Department of Veterinary Population Medicine, College of Veterinary Medicine, University of Minnesota, Minneapolis, USA
| | - Cinta Prieto
- Departamento de Sanidad Animal, Facultad de Veterinaria, Universidad Complutense de Madrid, Madrid, Spain
| | | | - I. A. D. Paploski
- Department of Veterinary Population Medicine, College of Veterinary Medicine, University of Minnesota, Minneapolis, USA
| | - Kimberly VanderWaal
- Department of Veterinary Population Medicine, College of Veterinary Medicine, University of Minnesota, Minneapolis, USA
| |
Collapse
|
41
|
Wortel MT, Agashe D, Bailey SF, Bank C, Bisschop K, Blankers T, Cairns J, Colizzi ES, Cusseddu D, Desai MM, van Dijk B, Egas M, Ellers J, Groot AT, Heckel DG, Johnson ML, Kraaijeveld K, Krug J, Laan L, Lässig M, Lind PA, Meijer J, Noble LM, Okasha S, Rainey PB, Rozen DE, Shitut S, Tans SJ, Tenaillon O, Teotónio H, de Visser JAGM, Visser ME, Vroomans RMA, Werner GDA, Wertheim B, Pennings PS. Towards evolutionary predictions: Current promises and challenges. Evol Appl 2023; 16:3-21. [PMID: 36699126 PMCID: PMC9850016 DOI: 10.1111/eva.13513] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Revised: 11/11/2022] [Accepted: 11/14/2022] [Indexed: 12/14/2022] Open
Abstract
Evolution has traditionally been a historical and descriptive science, and predicting future evolutionary processes has long been considered impossible. However, evolutionary predictions are increasingly being developed and used in medicine, agriculture, biotechnology and conservation biology. Evolutionary predictions may be used for different purposes, such as to prepare for the future, to try and change the course of evolution or to determine how well we understand evolutionary processes. Similarly, the exact aspect of the evolved population that we want to predict may also differ. For example, we could try to predict which genotype will dominate, the fitness of the population or the extinction probability of a population. In addition, there are many uses of evolutionary predictions that may not always be recognized as such. The main goal of this review is to increase awareness of methods and data in different research fields by showing the breadth of situations in which evolutionary predictions are made. We describe how diverse evolutionary predictions share a common structure described by the predictive scope, time scale and precision. Then, by using examples ranging from SARS-CoV2 and influenza to CRISPR-based gene drives and sustainable product formation in biotechnology, we discuss the methods for predicting evolution, the factors that affect predictability and how predictions can be used to prevent evolution in undesirable directions or to promote beneficial evolution (i.e. evolutionary control). We hope that this review will stimulate collaboration between fields by establishing a common language for evolutionary predictions.
Collapse
Affiliation(s)
- Meike T. Wortel
- Swammerdam Institute for Life SciencesUniversity of AmsterdamAmsterdamThe Netherlands
| | - Deepa Agashe
- National Centre for Biological SciencesBangaloreIndia
| | | | - Claudia Bank
- Institute of Ecology and EvolutionUniversity of BernBernSwitzerland
- Swiss Institute of BioinformaticsLausanneSwitzerland
- Gulbenkian Science InstituteOeirasPortugal
| | - Karen Bisschop
- Institute for Biodiversity and Ecosystem DynamicsUniversity of AmsterdamAmsterdamThe Netherlands
- Origins CenterGroningenThe Netherlands
- Laboratory of Aquatic Biology, KU Leuven KulakKortrijkBelgium
| | - Thomas Blankers
- Institute for Biodiversity and Ecosystem DynamicsUniversity of AmsterdamAmsterdamThe Netherlands
- Origins CenterGroningenThe Netherlands
| | | | - Enrico Sandro Colizzi
- Origins CenterGroningenThe Netherlands
- Mathematical InstituteLeiden UniversityLeidenThe Netherlands
| | | | | | - Bram van Dijk
- Max Planck Institute for Evolutionary BiologyPlönGermany
| | - Martijn Egas
- Institute for Biodiversity and Ecosystem DynamicsUniversity of AmsterdamAmsterdamThe Netherlands
| | - Jacintha Ellers
- Department of Ecological ScienceVrije Universiteit AmsterdamAmsterdamThe Netherlands
| | - Astrid T. Groot
- Institute for Biodiversity and Ecosystem DynamicsUniversity of AmsterdamAmsterdamThe Netherlands
| | | | | | - Ken Kraaijeveld
- Leiden Centre for Applied BioscienceUniversity of Applied Sciences LeidenLeidenThe Netherlands
| | - Joachim Krug
- Institute for Biological PhysicsUniversity of CologneCologneGermany
| | - Liedewij Laan
- Department of Bionanoscience, Kavli Institute of NanoscienceTU DelftDelftThe Netherlands
| | - Michael Lässig
- Institute for Biological PhysicsUniversity of CologneCologneGermany
| | - Peter A. Lind
- Department Molecular BiologyUmeå UniversityUmeåSweden
| | - Jeroen Meijer
- Theoretical Biology and Bioinformatics, Department of BiologyUtrecht UniversityUtrechtThe Netherlands
| | - Luke M. Noble
- Institute de Biologie, École Normale Supérieure, CNRS, InsermParisFrance
| | | | - Paul B. Rainey
- Department of Microbial Population BiologyMax Planck Institute for Evolutionary BiologyPlönGermany
- Laboratoire Biophysique et Évolution, CBI, ESPCI Paris, Université PSL, CNRSParisFrance
| | - Daniel E. Rozen
- Institute of Biology, Leiden UniversityLeidenThe Netherlands
| | - Shraddha Shitut
- Origins CenterGroningenThe Netherlands
- Institute of Biology, Leiden UniversityLeidenThe Netherlands
| | | | | | | | | | - Marcel E. Visser
- Department of Animal EcologyNetherlands Institute of Ecology (NIOO‐KNAW)WageningenThe Netherlands
| | - Renske M. A. Vroomans
- Origins CenterGroningenThe Netherlands
- Informatics InstituteUniversity of AmsterdamAmsterdamThe Netherlands
| | | | - Bregje Wertheim
- Groningen Institute for Evolutionary Life SciencesUniversity of GroningenGroningenThe Netherlands
| | | |
Collapse
|
42
|
Makau DN, Lycett S, Michalska-Smith M, Paploski IAD, Cheeran MCJ, Craft ME, Kao RR, Schroeder DC, Doeschl-Wilson A, VanderWaal K. Ecological and evolutionary dynamics of multi-strain RNA viruses. Nat Ecol Evol 2022; 6:1414-1422. [PMID: 36138206 DOI: 10.1038/s41559-022-01860-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Accepted: 07/28/2022] [Indexed: 11/09/2022]
Abstract
Potential interactions among co-circulating viral strains in host populations are often overlooked in the study of virus transmission. However, these interactions probably shape transmission dynamics by influencing host immune responses or altering the relative fitness among co-circulating strains. In this Review, we describe multi-strain dynamics from ecological and evolutionary perspectives, outline scales in which multi-strain dynamics occur and summarize important immunological, phylogenetic and mathematical modelling approaches used to quantify interactions among strains. We also discuss how host-pathogen interactions influence the co-circulation of pathogens. Finally, we highlight outstanding questions and knowledge gaps in the current theory and study of ecological and evolutionary dynamics of multi-strain viruses.
Collapse
Affiliation(s)
- Dennis N Makau
- Department of Veterinary Population Medicine, University of Minnesota, St. Paul, MN, USA
| | | | | | - Igor A D Paploski
- Department of Veterinary Population Medicine, University of Minnesota, St. Paul, MN, USA
| | - Maxim C-J Cheeran
- Department of Veterinary Population Medicine, University of Minnesota, St. Paul, MN, USA
| | - Meggan E Craft
- Department of Ecology, Evolution, and Behavior, University of Minnesota, St. Paul, MN, USA
| | - Rowland R Kao
- Roslin Institute, University of Edinburgh, Edinburgh, UK
| | - Declan C Schroeder
- Department of Veterinary Population Medicine, University of Minnesota, St. Paul, MN, USA
- School of Biological Sciences, University of Reading, Reading, UK
| | | | - Kimberly VanderWaal
- Department of Veterinary Population Medicine, University of Minnesota, St. Paul, MN, USA.
| |
Collapse
|
43
|
Williams BJM, Ogbunugafor CB, Althouse BM, Hébert-Dufresne L. Immunity-induced criticality of the genotype network of influenza A (H3N2) hemagglutinin. PNAS NEXUS 2022; 1:pgac143. [PMID: 36060623 PMCID: PMC9434636 DOI: 10.1093/pnasnexus/pgac143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Accepted: 07/22/2022] [Indexed: 11/17/2022]
Abstract
Seasonal influenza kills hundreds of thousands every year, with multiple constantly changing strains in circulation at any given time. A high mutation rate enables the influenza virus to evade recognition by the human immune system, including immunity acquired through past infection and vaccination. Here, we capture the genetic similarity of influenza strains and their evolutionary dynamics with genotype networks. We show that the genotype networks of influenza A (H3N2) hemagglutinin are characterized by heavy-tailed distributions of module sizes and connectivity indicative of critical behavior. We argue that (i) genotype networks are driven by mutation and host immunity to explore a subspace of networks predictable in structure and (ii) genotype networks provide an underlying structure necessary to capture the rich dynamics of multistrain epidemic models. In particular, inclusion of strain-transcending immunity in epidemic models is dependent upon the structure of an underlying genotype network. This interplay is consistent with self-organized criticality where the epidemic dynamics of influenza locates critical regions of its genotype network. We conclude that this interplay between disease dynamics and network structure might be key for future network analysis of pathogen evolution and realistic multistrain epidemic models.
Collapse
Affiliation(s)
- Blake J M Williams
- Vermont Complex Systems Center, University of Vermont , Burlington, VT 05405, USA
| | - C Brandon Ogbunugafor
- Vermont Complex Systems Center, University of Vermont , Burlington, VT 05405, USA
- Department of Ecology and Evolutionary Biology, Yale University , New Haven, CT 06511, USA
- Santa Fe Institute , Santa Fe, NM 87501, USA
- Public Health Modeling Unit, Yale School of Public Health , New Haven, CT 06510, USA
| | - Benjamin M Althouse
- Institute for Disease Modeling, Global Health, Bill & Melinda Gates Foundation , Seattle, WA 98109, USA
- Information School, University of Washington , Seattle, WA 98195, USA
- Department of Biology, New Mexico State University , Las Cruces, NM 88003, USA
| | - Laurent Hébert-Dufresne
- Vermont Complex Systems Center, University of Vermont , Burlington, VT 05405, USA
- Department of Computer Science, University of Vermont , Burlington VT 05405, USA
| |
Collapse
|
44
|
Yang D, Jones MG, Naranjo S, Rideout WM, Min KHJ, Ho R, Wu W, Replogle JM, Page JL, Quinn JJ, Horns F, Qiu X, Chen MZ, Freed-Pastor WA, McGinnis CS, Patterson DM, Gartner ZJ, Chow ED, Bivona TG, Chan MM, Yosef N, Jacks T, Weissman JS. Lineage tracing reveals the phylodynamics, plasticity, and paths of tumor evolution. Cell 2022; 185:1905-1923.e25. [PMID: 35523183 DOI: 10.1016/j.cell.2022.04.015] [Citation(s) in RCA: 174] [Impact Index Per Article: 58.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Revised: 02/09/2022] [Accepted: 04/08/2022] [Indexed: 12/19/2022]
Abstract
Tumor evolution is driven by the progressive acquisition of genetic and epigenetic alterations that enable uncontrolled growth and expansion to neighboring and distal tissues. The study of phylogenetic relationships between cancer cells provides key insights into these processes. Here, we introduced an evolving lineage-tracing system with a single-cell RNA-seq readout into a mouse model of Kras;Trp53(KP)-driven lung adenocarcinoma and tracked tumor evolution from single-transformed cells to metastatic tumors at unprecedented resolution. We found that the loss of the initial, stable alveolar-type2-like state was accompanied by a transient increase in plasticity. This was followed by the adoption of distinct transcriptional programs that enable rapid expansion and, ultimately, clonal sweep of stable subclones capable of metastasizing. Finally, tumors develop through stereotypical evolutionary trajectories, and perturbing additional tumor suppressors accelerates progression by creating novel trajectories. Our study elucidates the hierarchical nature of tumor evolution and, more broadly, enables in-depth studies of tumor progression.
Collapse
Affiliation(s)
- Dian Yang
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA; Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Matthew G Jones
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA; Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Biological and Medical Informatics Graduate Program, University of California, San Francisco, San Francisco, CA 94158, USA; Integrative Program in Quantitative Biology, University of California, San Francisco, San Francisco, CA 94158, USA; Center for Computational Biology, University of California, Berkeley, Berkeley, CA 94720, USA; David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Santiago Naranjo
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - William M Rideout
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Kyung Hoi Joseph Min
- Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Raymond Ho
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA; Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Wei Wu
- Department of Medicine, University of California, San Francisco, San Francisco, CA 94158, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Joseph M Replogle
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA; Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Medical Scientist Training Program, University of California, San Francisco, San Francisco, CA 94158, USA; Tetrad Graduate Program, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Jennifer L Page
- Cell and Genome Engineering Core, University of California San Francisco, San Francisco, CA 94158, USA
| | - Jeffrey J Quinn
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA; Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Felix Horns
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA
| | - Xiaojie Qiu
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA; Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Michael Z Chen
- Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Medical Scientist Training Program, Harvard Medical School, Boston, MA 02115, USA
| | - William A Freed-Pastor
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA 02215, USA
| | - Christopher S McGinnis
- Tetrad Graduate Program, University of California, San Francisco, San Francisco, CA 94158, USA; Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA 94158, USA
| | - David M Patterson
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Zev J Gartner
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA 94158, USA; Chan Zuckerberg BioHub Investigator, University of California, San Francisco, San Francisco, CA 94158, USA; Center for Cellular Construction, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Eric D Chow
- Department of Biochemistry and Biophysics, University of California, San Francisco, San Francisco, CA 94158, USA; Center for Advanced Technology, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Trever G Bivona
- Department of Medicine, University of California, San Francisco, San Francisco, CA 94158, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Michelle M Chan
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA; Department of Molecular Biology, Princeton University, Princeton, NJ 08544, USA
| | - Nir Yosef
- Center for Computational Biology, University of California, Berkeley, Berkeley, CA 94720, USA; Chan Zuckerberg BioHub Investigator, University of California, San Francisco, San Francisco, CA 94158, USA; Department of Electrical Engineering and Computer Science, University of California Berkeley, Berkeley, CA 94720, USA; Ragon Institute of Massachusetts General Hospital, MIT and Harvard University, Cambridge, MA, USA.
| | - Tyler Jacks
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02142, USA.
| | - Jonathan S Weissman
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA; Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02142, USA.
| |
Collapse
|
45
|
Deviatkin AA, Simonov RA, Trutneva KA, Maznina AA, Khavina EM, Volchkov PY. Universal Flu mRNA Vaccine: Promises, Prospects, and Problems. Vaccines (Basel) 2022; 10:vaccines10050709. [PMID: 35632465 PMCID: PMC9145388 DOI: 10.3390/vaccines10050709] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 04/26/2022] [Accepted: 04/28/2022] [Indexed: 02/05/2023] Open
Abstract
The seasonal flu vaccine is, essentially, the only known way to prevent influenza epidemics. However, this approach has limited efficacy due to the high diversity of influenza viruses. Several techniques could potentially overcome this obstacle. A recent first-in-human study of a chimeric hemagglutinin-based universal influenza virus vaccine demonstrated promising results. The coronavirus pandemic triggered the development of fundamentally new vaccine platforms that have demonstrated their effectiveness in humans. Currently, there are around a dozen messenger RNA and self-amplifying RNA flu vaccines in clinical or preclinical trials. However, the applicability of novel approaches for a universal influenza vaccine creation remains unclear. The current review aims to cover the current state of this problem and to suggest future directions for RNA-based flu vaccine development.
Collapse
Affiliation(s)
- Andrei A. Deviatkin
- The National Medical Research Center for Endocrinology, 117036 Moscow, Russia; (A.A.D.); (K.A.T.)
- Genome Engineering Lab, Life Sciences Research Center, Moscow Institute of Physics and Technology (National Research University), 141700 Dolgoprudniy, Russia; (R.A.S.); (A.A.M.); (E.M.K.)
| | - Ruslan A. Simonov
- Genome Engineering Lab, Life Sciences Research Center, Moscow Institute of Physics and Technology (National Research University), 141700 Dolgoprudniy, Russia; (R.A.S.); (A.A.M.); (E.M.K.)
| | - Kseniya A. Trutneva
- The National Medical Research Center for Endocrinology, 117036 Moscow, Russia; (A.A.D.); (K.A.T.)
- Genome Engineering Lab, Life Sciences Research Center, Moscow Institute of Physics and Technology (National Research University), 141700 Dolgoprudniy, Russia; (R.A.S.); (A.A.M.); (E.M.K.)
| | - Anna A. Maznina
- Genome Engineering Lab, Life Sciences Research Center, Moscow Institute of Physics and Technology (National Research University), 141700 Dolgoprudniy, Russia; (R.A.S.); (A.A.M.); (E.M.K.)
| | - Elena M. Khavina
- Genome Engineering Lab, Life Sciences Research Center, Moscow Institute of Physics and Technology (National Research University), 141700 Dolgoprudniy, Russia; (R.A.S.); (A.A.M.); (E.M.K.)
| | - Pavel Y. Volchkov
- The National Medical Research Center for Endocrinology, 117036 Moscow, Russia; (A.A.D.); (K.A.T.)
- Genome Engineering Lab, Life Sciences Research Center, Moscow Institute of Physics and Technology (National Research University), 141700 Dolgoprudniy, Russia; (R.A.S.); (A.A.M.); (E.M.K.)
- Correspondence:
| |
Collapse
|
46
|
Jones MG, Rosen Y, Yosef N. Interactive, integrated analysis of single-cell transcriptomic and phylogenetic data with PhyloVision. CELL REPORTS METHODS 2022; 2:100200. [PMID: 35497495 PMCID: PMC9046453 DOI: 10.1016/j.crmeth.2022.100200] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/20/2021] [Revised: 02/19/2022] [Accepted: 03/28/2022] [Indexed: 02/07/2023]
Abstract
Recent advances in CRISPR-Cas9 engineering and single-cell assays have enabled the simultaneous measurement of single-cell transcriptomic and phylogenetic profiles. However, there are few computational tools enabling users to integrate and derive insight from a joint analysis of these two modalities. Here, we describe "PhyloVision": an open-source software for interactively exploring data from both modalities and for identifying and interpreting heritable gene modules whose concerted expression are associated with phylogenetic relationships. PhyloVision provides a feature-rich, interactive, and shareable web-based report for investigating these modules while also supporting several other data and meta-data exploration capabilities. We demonstrate the utility of PhyloVision using a published dataset of metastatic lung adenocarcinoma cells, whose phylogeny was resolved using a CRISPR-Cas9-based lineage-tracing system. Together, we anticipate that PhyloVision and the methods it implements will be a useful resource for scalable and intuitive data exploration for any assay that simultaneously measures cell state and lineage.
Collapse
Affiliation(s)
- Matthew G. Jones
- Department of Electrical Engineering and Computer Science, University of California, Berkeley, Berkeley, CA 94720 USA
- Center for Computational Biology, University of California, Berkeley, Berkeley, CA 94720 USA
- Integrative Program in Quantitative Biology, University of California, San Francisco, San Francisco, CA 94143, USA
- Whitehead Institute, Cambridge, MA 02142 USA
| | - Yanay Rosen
- Department of Electrical Engineering and Computer Science, University of California, Berkeley, Berkeley, CA 94720 USA
| | - Nir Yosef
- Department of Electrical Engineering and Computer Science, University of California, Berkeley, Berkeley, CA 94720 USA
- Center for Computational Biology, University of California, Berkeley, Berkeley, CA 94720 USA
- Chan Zuckerberg Biohub Investigator, San Francisco, CA 94158 USA
- Ragon Institute of Massachusetts General Hospital, MIT and Harvard University, Cambridge, MA 02114 USA
| |
Collapse
|
47
|
Melissa MJ, Good BH, Fisher DS, Desai MM. Population genetics of polymorphism and divergence in rapidly evolving populations. Genetics 2022; 221:6564664. [PMID: 35389471 PMCID: PMC9339298 DOI: 10.1093/genetics/iyac053] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Accepted: 03/19/2022] [Indexed: 11/14/2022] Open
Abstract
In rapidly evolving populations, numerous beneficial and deleterious mutations can arise and segregate within a population at the same time. In this regime, evolutionary dynamics cannot be analyzed using traditional population genetic approaches that assume that sites evolve independently. Instead, the dynamics of many loci must be analyzed simultaneously. Recent work has made progress by first analyzing the fitness variation within a population, and then studying how individual lineages interact with this traveling fitness wave. However, these "traveling wave" models have previously been restricted to extreme cases where selection on individual mutations is either much faster or much slower than the typical coalescent timescale Tc. In this work, we show how the traveling wave framework can be extended to intermediate regimes in which the scaled fitness effects of mutations (Tcs) are neither large nor small compared to one. This enables us to describe the dynamics of populations subject to a wide range of fitness effects, and in particular, in cases where it is not immediately clear which mutations are most important in shaping the dynamics and statistics of genetic diversity. We use this approach to derive new expressions for the fixation probabilities and site frequency spectra of mutations as a function of their scaled fitness effects, along with related results for the coalescent timescale Tc and the rate of adaptation or Muller's ratchet. We find that competition between linked mutations can have a dramatic impact on the proportions of neutral and selected polymorphisms, which is not simply summarized by the scaled selection coefficient Tcs. We conclude by discussing the implications of these results for population genetic inferences.
Collapse
Affiliation(s)
- Matthew J Melissa
- Department of Organismic and Evolutionary Biology, Department of Physics, Quantitative Biology Initiative, and NSF-Simons Center for Mathematical and Statistical Analysis of Biology, Harvard University, Cambridge MA 02138, USA
| | - Benjamin H Good
- Department of Applied Physics and Department of Bioengineering, Stanford University, Stanford CA 94305, USA
| | - Daniel S Fisher
- Department of Applied Physics and Department of Bioengineering, Stanford University, Stanford CA 94305, USA
| | - Michael M Desai
- Department of Organismic and Evolutionary Biology, Department of Physics, Quantitative Biology Initiative, and NSF-Simons Center for Mathematical and Statistical Analysis of Biology, Harvard University, Cambridge MA 02138, USA
| |
Collapse
|
48
|
Lynch AR, Arp NL, Zhou AS, Weaver BA, Burkard ME. Quantifying chromosomal instability from intratumoral karyotype diversity using agent-based modeling and Bayesian inference. eLife 2022; 11:e69799. [PMID: 35380536 PMCID: PMC9054132 DOI: 10.7554/elife.69799] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Accepted: 04/01/2022] [Indexed: 12/03/2022] Open
Abstract
Chromosomal instability (CIN)-persistent chromosome gain or loss through abnormal mitotic segregation-is a hallmark of cancer that drives aneuploidy. Intrinsic chromosome mis-segregation rate, a measure of CIN, can inform prognosis and is a promising biomarker for response to anti-microtubule agents. However, existing methodologies to measure this rate are labor intensive, indirect, and confounded by selection against aneuploid cells, which reduces observable diversity. We developed a framework to measure CIN, accounting for karyotype selection, using simulations with various levels of CIN and models of selection. To identify the model parameters that best fit karyotype data from single-cell sequencing, we used approximate Bayesian computation to infer mis-segregation rates and karyotype selection. Experimental validation confirmed the extensive chromosome mis-segregation rates caused by the chemotherapy paclitaxel (18.5 ± 0.5/division). Extending this approach to clinical samples revealed that inferred rates fell within direct observations of cancer cell lines. This work provides the necessary framework to quantify CIN in human tumors and develop it as a predictive biomarker.
Collapse
Affiliation(s)
- Andrew R Lynch
- Carbone Cancer Center, University of Wisconsin-MadisonMadisonUnited States
- McArdle Laboratory for Cancer Research, University of Wisconsin-MadisonMadisonUnited States
| | - Nicholas L Arp
- Carbone Cancer Center, University of Wisconsin-MadisonMadisonUnited States
| | - Amber S Zhou
- Carbone Cancer Center, University of Wisconsin-MadisonMadisonUnited States
- McArdle Laboratory for Cancer Research, University of Wisconsin-MadisonMadisonUnited States
| | - Beth A Weaver
- Carbone Cancer Center, University of Wisconsin-MadisonMadisonUnited States
- McArdle Laboratory for Cancer Research, University of Wisconsin-MadisonMadisonUnited States
- Department of Cell and Regenerative Biology, University of WisconsinMadisonUnited States
| | - Mark E Burkard
- Carbone Cancer Center, University of Wisconsin-MadisonMadisonUnited States
- McArdle Laboratory for Cancer Research, University of Wisconsin-MadisonMadisonUnited States
- Division of Hematology Medical Oncology and Palliative Care, Department of Medicine University of WisconsinMadisonUnited States
| |
Collapse
|
49
|
Wang Y, Tang CY, Wan XF. Antigenic characterization of influenza and SARS-CoV-2 viruses. Anal Bioanal Chem 2022; 414:2841-2881. [PMID: 34905077 PMCID: PMC8669429 DOI: 10.1007/s00216-021-03806-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2021] [Revised: 11/21/2021] [Accepted: 11/24/2021] [Indexed: 12/24/2022]
Abstract
Antigenic characterization of emerging and re-emerging viruses is necessary for the prevention of and response to outbreaks, evaluation of infection mechanisms, understanding of virus evolution, and selection of strains for vaccine development. Primary analytic methods, including enzyme-linked immunosorbent/lectin assays, hemagglutination inhibition, neuraminidase inhibition, micro-neutralization assays, and antigenic cartography, have been widely used in the field of influenza research. These techniques have been improved upon over time for increased analytical capacity, and some have been mobilized for the rapid characterization of the SARS-CoV-2 virus as well as its variants, facilitating the development of highly effective vaccines within 1 year of the initially reported outbreak. While great strides have been made for evaluating the antigenic properties of these viruses, multiple challenges prevent efficient vaccine strain selection and accurate assessment. For influenza, these barriers include the requirement for a large virus quantity to perform the assays, more than what can typically be provided by the clinical samples alone, cell- or egg-adapted mutations that can cause antigenic mismatch between the vaccine strain and circulating viruses, and up to a 6-month duration of vaccine development after vaccine strain selection, which allows viruses to continue evolving with potential for antigenic drift and, thus, antigenic mismatch between the vaccine strain and the emerging epidemic strain. SARS-CoV-2 characterization has faced similar challenges with the additional barrier of the need for facilities with high biosafety levels due to its infectious nature. In this study, we review the primary analytic methods used for antigenic characterization of influenza and SARS-CoV-2 and discuss the barriers of these methods and current developments for addressing these challenges.
Collapse
Affiliation(s)
- Yang Wang
- MU Center for Influenza and Emerging Infectious Diseases (CIEID), University of Missouri, Columbia, MO, USA
- Department of Molecular Microbiology and Immunology, School of Medicine, University of Missouri, Columbia, MO, USA
- Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
| | - Cynthia Y Tang
- MU Center for Influenza and Emerging Infectious Diseases (CIEID), University of Missouri, Columbia, MO, USA
- Department of Molecular Microbiology and Immunology, School of Medicine, University of Missouri, Columbia, MO, USA
- Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
- Institute for Data Science and Informatics, University of Missouri, Columbia, MO, USA
| | - Xiu-Feng Wan
- MU Center for Influenza and Emerging Infectious Diseases (CIEID), University of Missouri, Columbia, MO, USA.
- Department of Molecular Microbiology and Immunology, School of Medicine, University of Missouri, Columbia, MO, USA.
- Bond Life Sciences Center, University of Missouri, Columbia, MO, USA.
- Institute for Data Science and Informatics, University of Missouri, Columbia, MO, USA.
- Department of Electrical Engineering & Computer Science, College of Engineering, University of Missouri, Columbia, MO, USA.
| |
Collapse
|
50
|
Zhao L, Wymant C, Blanquart F, Golubchik T, Gall A, Bakker M, Bezemer D, Hall M, Ong SH, Albert J, Bannert N, Fellay J, Grabowski MK, Gunsenheimer-Bartmeyer B, Günthard HF, Kivelä P, Kouyos RD, Laeyendecker O, Meyer L, Porter K, van Sighem A, van der Valk M, Berkhout B, Kellam P, Cornelissen M, Reiss P, Fraser C, Ferretti L. Phylogenetic estimation of the viral fitness landscape of HIV-1 set-point viral load. Virus Evol 2022; 8:veac022. [PMID: 35402002 PMCID: PMC8986633 DOI: 10.1093/ve/veac022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2021] [Revised: 02/25/2022] [Accepted: 03/15/2022] [Indexed: 11/16/2022] Open
Abstract
Set-point viral load (SPVL), a common measure of human immunodeficiency virus (HIV)-1 virulence, is partially determined by viral genotype. Epidemiological evidence suggests that this viral property has been under stabilising selection, with a typical optimum for the virus between 104 and 105 copies of viral RNA per ml. Here we aimed to detect transmission fitness differences between viruses from individuals with different SPVLs directly from phylogenetic trees inferred from whole-genome sequences. We used the local branching index (LBI) as a proxy for transmission fitness. We found that LBI is more sensitive to differences in infectiousness than to differences in the duration of the infectious state. By analysing subtype-B samples from the Bridging the Evolution and Epidemiology of HIV in Europe project, we inferred a significant positive relationship between SPVL and LBI up to approximately 105 copies/ml, with some evidence for a peak around this value of SPVL. This is evidence of selection against low values of SPVL in HIV-1 subtype-B strains, likely related to lower infectiousness, and perhaps a peak in the transmission fitness in the expected range of SPVL. The less prominent signatures of selection against higher SPVL could be explained by an inherent limit of the method or the deployment of antiretroviral therapy.
Collapse
Affiliation(s)
- Lele Zhao
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Medicine, University of Oxford, Old Road Campus, Headington, Oxford OX3 7LF, UK
| | - Chris Wymant
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Medicine, University of Oxford, Old Road Campus, Headington, Oxford OX3 7LF, UK
| | - François Blanquart
- Centre for Interdisciplinary Research in Biology (CIRB), Collège de France, CNRS, INSERM, PSL Research University, Cedex 05, Paris 75231, France
| | - Tanya Golubchik
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Medicine, University of Oxford, Old Road Campus, Headington, Oxford OX3 7LF, UK
| | - Astrid Gall
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Cambridge CB10 1SD, UK
| | - Margreet Bakker
- Laboratory of Experimental Virology, Department of Medical Microbiology and Infection Prevention, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, MB 1007, Netherlands
| | - Daniela Bezemer
- Stichting HIV Monitoring, Amsterdam, Amsterdam, AZ 1105, Netherlands
| | - Matthew Hall
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Medicine, University of Oxford, Old Road Campus, Headington, Oxford OX3 7LF, UK
| | - Swee Hoe Ong
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Jan Albert
- Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, Solna, Stockholm 171 77, Sweden
- Department of Clinical Microbiology, Karolinska University Hospital, Solna, Stockholm S-171 76, Sweden
| | - Norbert Bannert
- Division for HIV and Other Retroviruses, Department of Infectious Diseases, Robert Koch Institute, Berlin 13353, Germany
| | - Jacques Fellay
- School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, Lausanne CH-1015, Switzerland
- Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
- Precision Medicine Unit, Lausanne University Hospital and University of Lausanne, Lausanne CH-1015, Switzerland
| | - M Kate Grabowski
- Department of Pathology, John Hopkins University, Baltimore, MD 21287, USA
| | | | - Huldrych F Günthard
- Division of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich CH-8091, Switzerland
- Institute of Medical Virology, University of Zurich, Zurich 8057, Switzerland
| | - Pia Kivelä
- Department of Infectious Diseases, Helsinki University Hospital, Helsinki FI-00029, Finland
| | - Roger D Kouyos
- Division of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich CH-8091, Switzerland
- Institute of Medical Virology, University of Zurich, Zurich 8057, Switzerland
| | | | - Laurence Meyer
- INSERM CESP U1018, Université Paris Saclay, APHP, Service de Santé Publique, Hôpital de Bicêtre, Le Kremlin-Bicêtre 94270, France
| | - Kholoud Porter
- Institute for Global Health, University College London, London WC1N 1EH, UK
| | - Ard van Sighem
- Stichting HIV Monitoring, Amsterdam, Amsterdam, AZ 1105, Netherlands
| | - Marc van der Valk
- Stichting HIV Monitoring, Amsterdam, Amsterdam, AZ 1105, Netherlands
| | - Ben Berkhout
- Laboratory of Experimental Virology, Department of Medical Microbiology and Infection Prevention, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, MB 1007, Netherlands
| | - Paul Kellam
- Kymab Ltd, Babraham Research Campus, Cambridge CB22 3AT, UK
- Department of Infectious Diseases, Faculty of Medicine, Imperial College London, South Kensington Campus, London SW7 2AZ, UK
| | - Marion Cornelissen
- Laboratory of Experimental Virology, Department of Medical Microbiology and Infection Prevention, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, MB 1007, Netherlands
- Molecular Diagnostic Unit, Department of Medical Microbiology and Infection Prevention, Amsterdam University Medical Centers, University of Amsterdam, Amsterdam, MB 1007, Netherlands
| | - Peter Reiss
- Stichting HIV Monitoring, Amsterdam, Amsterdam, AZ 1105, Netherlands
- Department of Global Health, Amsterdam University Medical Centers, University of Amsterdam and Amsterdam Institute for Global Health and Development, Amsterdam, DE 1100, Netherlands
| | - Christophe Fraser
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Medicine, University of Oxford, Old Road Campus, Headington, Oxford OX3 7LF, UK
| | - Luca Ferretti
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Medicine, University of Oxford, Old Road Campus, Headington, Oxford OX3 7LF, UK
| |
Collapse
|