1
|
Chowell G, Skums P. Investigating and forecasting infectious disease dynamics using epidemiological and molecular surveillance data. Phys Life Rev 2024; 51:294-327. [PMID: 39488136 DOI: 10.1016/j.plrev.2024.10.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2024] [Accepted: 10/23/2024] [Indexed: 11/04/2024]
Abstract
The integration of viral genomic data into public health surveillance has revolutionized our ability to track and forecast infectious disease dynamics. This review addresses two critical aspects of infectious disease forecasting and monitoring: the methodological workflow for epidemic forecasting and the transformative role of molecular surveillance. We first present a detailed approach for validating epidemic models, emphasizing an iterative workflow that utilizes ordinary differential equation (ODE)-based models to investigate and forecast disease dynamics. We recommend a more structured approach to model validation, systematically addressing key stages such as model calibration, assessment of structural and practical parameter identifiability, and effective uncertainty propagation in forecasts. Furthermore, we underscore the importance of incorporating multiple data streams by applying both simulated and real epidemiological data from the COVID-19 pandemic to produce more reliable forecasts with quantified uncertainty. Additionally, we emphasize the pivotal role of viral genomic data in tracking transmission dynamics and pathogen evolution. By leveraging advanced computational tools such as Bayesian phylogenetics and phylodynamics, researchers can more accurately estimate transmission clusters and reconstruct outbreak histories, thereby improving data-driven modeling and forecasting and informing targeted public health interventions. Finally, we discuss the transformative potential of integrating molecular epidemiology with mathematical modeling to complement and enhance epidemic forecasting and optimize public health strategies.
Collapse
Affiliation(s)
- Gerardo Chowell
- Department of Population Health Sciences, School of Public Health, Georgia State University, Atlanta, GA, USA; Department of Applied Mathematics, Kyung Hee University, Yongin 17104, Korea.
| | - Pavel Skums
- School of Computing, University of Connecticut, Storrs, CT, USA
| |
Collapse
|
2
|
Mohebbi F, Zelikovsky A, Mangul S, Chowell G, Skums P. Early detection of emerging viral variants through analysis of community structure of coordinated substitution networks. Nat Commun 2024; 15:2838. [PMID: 38565543 PMCID: PMC10987511 DOI: 10.1038/s41467-024-47304-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Accepted: 03/20/2024] [Indexed: 04/04/2024] Open
Abstract
The emergence of viral variants with altered phenotypes is a public health challenge underscoring the need for advanced evolutionary forecasting methods. Given extensive epistatic interactions within viral genomes and known viral evolutionary history, efficient genomic surveillance necessitates early detection of emerging viral haplotypes rather than commonly targeted single mutations. Haplotype inference, however, is a significantly more challenging problem precluding the use of traditional approaches. Here, using SARS-CoV-2 evolutionary dynamics as a case study, we show that emerging haplotypes with altered transmissibility can be linked to dense communities in coordinated substitution networks, which become discernible significantly earlier than the haplotypes become prevalent. From these insights, we develop a computational framework for inference of viral variants and validate it by successful early detection of known SARS-CoV-2 strains. Our methodology offers greater scalability than phylogenetic lineage tracing and can be applied to any rapidly evolving pathogen with adequate genomic surveillance data.
Collapse
Affiliation(s)
- Fatemeh Mohebbi
- Department of Computer Science, Georgia State University, Atlanta, GA, USA
- Titus Family Department of Clinical Pharmacy, USC Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, University of Southern California, Los Angeles, CA, USA
| | - Alex Zelikovsky
- Department of Computer Science, Georgia State University, Atlanta, GA, USA
| | - Serghei Mangul
- Titus Family Department of Clinical Pharmacy, USC Alfred E. Mann School of Pharmacy and Pharmaceutical Sciences, University of Southern California, Los Angeles, CA, USA
- Department of Quantitative and Computational Biology, USC Dornsife College of Letters, Arts and Sciences, University of Southern California, Los Angeles, CA, USA
| | - Gerardo Chowell
- School of Public Health, Georgia State University, Atlanta, GA, USA
| | - Pavel Skums
- Department of Computer Science, Georgia State University, Atlanta, GA, USA.
- School of Computing, College of Engineering, University of Connecticut, Storrs, CT, USA.
| |
Collapse
|
3
|
Bunimovich L, Skums P. Fractal networks: Topology, dimension, and complexity. CHAOS (WOODBURY, N.Y.) 2024; 34:042101. [PMID: 38598678 DOI: 10.1063/5.0200632] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/27/2024] [Accepted: 03/24/2024] [Indexed: 04/12/2024]
Abstract
Over the past two decades, the study of self-similarity and fractality in discrete structures, particularly complex networks, has gained momentum. This surge of interest is fueled by the theoretical developments within the theory of complex networks and the practical demands of real-world applications. Nonetheless, translating the principles of fractal geometry from the domain of general topology, dealing with continuous or infinite objects, to finite structures in a mathematically rigorous way poses a formidable challenge. In this paper, we overview such a theory that allows to identify and analyze fractal networks through the innate methodologies of graph theory and combinatorics. It establishes the direct graph-theoretical analogs of topological (Lebesgue) and fractal (Hausdorff) dimensions in a way that naturally links them to combinatorial parameters that have been studied within the realm of graph theory for decades. This allows to demonstrate that the self-similarity in networks is defined by the patterns of intersection among densely connected network communities. Moreover, the theory bridges discrete and continuous definitions by demonstrating how the combinatorial characterization of Lebesgue dimension via graph representation by its subsets (subgraphs/communities) extends to general topological spaces. Using this framework, we rigorously define fractal networks and connect their properties with established combinatorial concepts, such as graph colorings and descriptive complexity. The theoretical framework surveyed here sets a foundation for applications to real-life networks and future studies of fractal characteristics of complex networks using combinatorial methods and algorithms.
Collapse
Affiliation(s)
- L Bunimovich
- School of Mathematics, Georgia Institute of Technology, 686 Cherry St NW, Atlanta, Georgia 30332, USA
| | - P Skums
- School of Computing, University of Connecticut, 371 Fairfield Way, Storrs, Connecticut 06269, USA
| |
Collapse
|
4
|
Bunimovich L, Ram A, Skums P. Antigenic cooperation in viral populations: Transformation of functions of intra-host viral variants. J Theor Biol 2024; 580:111719. [PMID: 38158118 DOI: 10.1016/j.jtbi.2023.111719] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Revised: 09/10/2023] [Accepted: 12/20/2023] [Indexed: 01/03/2024]
Abstract
In this paper, we study intra-host viral adaptation by antigenic cooperation - a mechanism of immune escape that serves as an alternative to the standard mechanism of escape by continuous genomic diversification and allows to explain a number of experimental observations associated with the establishment of chronic infections by highly mutable viruses. Within this mechanism, the topology of a cross-immunoreactivity network forces intra-host viral variants to specialize for complementary roles and adapt to the host's immune response as a quasi-social ecosystem. Here we study dynamical changes in immune adaptation caused by evolutionary and epidemiological events. First, we show that the emergence of a viral variant with altered antigenic features may result in a rapid re-arrangement of the viral ecosystem and a change in the roles played by existing viral variants. In particular, it may push the population under immune escape by genomic diversification towards the stable state of adaptation by antigenic cooperation. Next, we study the effect of a viral transmission between two chronically infected hosts, which results in the merging of two intra-host viral populations in the state of stable immune-adapted equilibrium. In this case, we also describe how the newly formed viral population adapts to the host's environment by changing the functions of its members. The results are obtained analytically for minimal cross-immunoreactivity networks and numerically for larger populations.
Collapse
Affiliation(s)
- Leonid Bunimovich
- School of Mathematics, Georgia Institute of Technology, Atlanta, 30332, GA, USA.
| | - Athulya Ram
- School of Mathematics, Georgia Institute of Technology, Atlanta, 30332, GA, USA; Interdisciplinary Graduate Program in Quantitative Biosciences, Georgia Institute of Technology, Atlanta, 30332, GA, USA.
| | - Pavel Skums
- Department of Computer Science and Engineering, University of Connecticut, Storrs, 06269, CT, USA.
| |
Collapse
|
5
|
Mosa A, Campo D, Khudyakov Y, AbouHaidar M, Gehring A, Zahoor A, Ball J, Urbanowicz R, Feld J. Polyvalent immunization elicits a synergistic broadly neutralizing immune response to hypervariable region 1 variants of hepatitis C virus. Proc Natl Acad Sci U S A 2023; 120:e2220294120. [PMID: 37276424 PMCID: PMC10268328 DOI: 10.1073/pnas.2220294120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Accepted: 04/29/2023] [Indexed: 06/07/2023] Open
Abstract
A hepatitis C virus (HCV) vaccine is urgently needed. Vaccine development has been hindered by HCV's genetic diversity, particularly within the immunodominant hypervariable region 1 (HVR1). Here, we developed a strategy to elicit broadly neutralizing antibodies to HVR1, which had previously been considered infeasible. We first applied a unique information theory-based measure of genetic distance to evaluate phenotypic relatedness between HVR1 variants. These distances were used to model the structure of HVR1's sequence space, which was found to have five major clusters. Variants from each cluster were used to immunize mice individually, and as a pentavalent mixture. Sera obtained following immunization neutralized every variant in a diverse HCVpp panel (n = 10), including those resistant to monovalent immunization, and at higher mean titers (1/ID50 = 435) than a glycoprotein E2 (1/ID50 = 205) vaccine. This synergistic immune response offers a unique approach to overcoming antigenic variability and may be applicable to other highly mutable viruses.
Collapse
Affiliation(s)
- Alexander I. Mosa
- Toronto Centre for Liver Disease, Toronto General Hospital, Toronto, M5G 2C4ON, Canada
| | - David S. Campo
- Molecular Epidemiology and Bioinformatics, Centers for Disease Control and Prevention, Atlanta30333, Georgia
| | - Yury Khudyakov
- Molecular Epidemiology and Bioinformatics, Centers for Disease Control and Prevention, Atlanta30333, Georgia
| | - Mounir G. AbouHaidar
- Department of Cell and Systems Biology, University of Toronto, Toronto, M5S 3G5ON, Canada
| | - Adam J. Gehring
- Department of Immunology, University of Toronto, Toronto, M5S 1A8ON, Canada
| | - Atif Zahoor
- Toronto Centre for Liver Disease, Toronto General Hospital, Toronto, M5G 2C4ON, Canada
| | - Jonathan K. Ball
- Wolfson Centre for Global Virus Infections, University of Nottingham, NottinghamNG8 1BB, United Kingdom
| | - Richard A. Urbanowicz
- Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, LiverpoolCH64 7TE, United Kingdom
| | - Jordan J. Feld
- Toronto Centre for Liver Disease, Toronto General Hospital, Toronto, M5G 2C4ON, Canada
| |
Collapse
|
6
|
Skums P, Mohebbi F, Tsyvina V, Baykal PI, Nemira A, Ramachandran S, Khudyakov Y. SOPHIE: Viral outbreak investigation and transmission history reconstruction in a joint phylogenetic and network theory framework. Cell Syst 2022; 13:844-856.e4. [PMID: 36265470 PMCID: PMC9590096 DOI: 10.1016/j.cels.2022.07.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2022] [Revised: 07/05/2022] [Accepted: 07/19/2022] [Indexed: 01/26/2023]
Abstract
Genomic epidemiology is now widely used for viral outbreak investigations. Still, this methodology faces many challenges. First, few methods account for intra-host viral diversity. Second, maximum parsimony principle continues to be employed for phylogenetic inference of transmission histories, even though maximum likelihood or Bayesian models are usually more consistent. Third, many methods utilize case-specific data, such as sampling times or infection exposure intervals. This impedes study of persistent infections in vulnerable groups, where such information has a limited use. Finally, most methods implicitly assume that transmission events are independent, although common source outbreaks violate this assumption. We propose a maximum likelihood framework, SOPHIE, based on the integration of phylogenetic and random graph models. It infers transmission networks from viral phylogenies and expected properties of inter-host social networks modeled as random graphs with given expected degree distributions. SOPHIE is scalable, accounts for intra-host diversity, and accurately infers transmissions without case-specific epidemiological data.
Collapse
Affiliation(s)
- Pavel Skums
- Department of Computer Science, Georgia State University, Atlanta, GA, USA.
| | - Fatemeh Mohebbi
- Department of Computer Science, Georgia State University, Atlanta, GA, USA
| | - Vyacheslav Tsyvina
- Department of Computer Science, Georgia State University, Atlanta, GA, USA
| | - Pelin Icer Baykal
- Department of Biosystems Science & Engineering, ETH Zurich, Basel, Switzerland
| | - Alina Nemira
- Department of Computer Science, Georgia State University, Atlanta, GA, USA
| | - Sumathi Ramachandran
- Division of Viral Hepatitis, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Yury Khudyakov
- Division of Viral Hepatitis, Centers for Disease Control and Prevention, Atlanta, GA, USA
| |
Collapse
|
7
|
Orlovich Y, Kukharenko K, Kaibel V, Skums P. Scale-Free Spanning Trees and Their Application in Genomic Epidemiology. J Comput Biol 2021; 28:945-960. [PMID: 34491104 PMCID: PMC8670573 DOI: 10.1089/cmb.2020.0500] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
We study the algorithmic problem of finding the most “scale-free-like” spanning tree of a connected graph. This problem is motivated by the fundamental problem of genomic epidemiology: given viral genomes sampled from infected individuals, reconstruct the transmission network (“who infected whom”). We use two possible objective functions for this problem and introduce the corresponding algorithmic problems termedm-SF (-scale free) ands-SF Spanning Tree problems. We prove that those problems are APX- and NP-hard, respectively, even in the classes of cubic and bipartite graphs. We propose two integer linear programming (ILP) formulations for thes-SF Spanning Tree problem, and experimentally assess its performance using simulated and experimental data. In particular, we demonstrate that the ILP-based approach allows for accurate reconstruction of transmission histories of several hepatitis C outbreaks.
Collapse
Affiliation(s)
- Yury Orlovich
- Faculty of Applied Mathematics and Computer Science, Belarusian State University, Minsk, Belarus
| | - Kirill Kukharenko
- Institute for Mathematical Optimization, Otto von Guericke University Magdeburg, Magdeburg, Germany
| | - Volker Kaibel
- Institute for Mathematical Optimization, Otto von Guericke University Magdeburg, Magdeburg, Germany
| | - Pavel Skums
- Department of Computer Science, Georgia State University, Atlanta, Georgia, USA
| |
Collapse
|
8
|
Knyazev S, Tsyvina V, Shankar A, Melnyk A, Artyomenko A, Malygina T, Porozov YB, Campbell EM, Switzer WM, Skums P, Mangul S, Zelikovsky A. Accurate assembly of minority viral haplotypes from next-generation sequencing through efficient noise reduction. Nucleic Acids Res 2021; 49:e102. [PMID: 34214168 PMCID: PMC8464054 DOI: 10.1093/nar/gkab576] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Revised: 05/25/2021] [Accepted: 06/18/2021] [Indexed: 12/21/2022] Open
Abstract
Rapidly evolving RNA viruses continuously produce minority haplotypes that can become dominant if they are drug-resistant or can better evade the immune system. Therefore, early detection and identification of minority viral haplotypes may help to promptly adjust the patient’s treatment plan preventing potential disease complications. Minority haplotypes can be identified using next-generation sequencing, but sequencing noise hinders accurate identification. The elimination of sequencing noise is a non-trivial task that still remains open. Here we propose CliqueSNV based on extracting pairs of statistically linked mutations from noisy reads. This effectively reduces sequencing noise and enables identifying minority haplotypes with the frequency below the sequencing error rate. We comparatively assess the performance of CliqueSNV using an in vitro mixture of nine haplotypes that were derived from the mutation profile of an existing HIV patient. We show that CliqueSNV can accurately assemble viral haplotypes with frequencies as low as 0.1% and maintains consistent performance across short and long bases sequencing platforms.
Collapse
Affiliation(s)
- Sergey Knyazev
- Department of Computer Science, Georgia State University, Atlanta, GA 30302, USA.,Division of HIV Prevention, Centers for Disease Control and Prevention, Atlanta, GA 30333, USA.,Oak Ridge Institute for Science and Education, Oak Ridge, TN 37830, USA
| | - Viachaslau Tsyvina
- Department of Computer Science, Georgia State University, Atlanta, GA 30302, USA
| | - Anupama Shankar
- Division of HIV Prevention, Centers for Disease Control and Prevention, Atlanta, GA 30333, USA
| | - Andrew Melnyk
- Department of Computer Science, Georgia State University, Atlanta, GA 30302, USA
| | | | - Tatiana Malygina
- International Scientific and Research Institute of Bioengineering, ITMO University, St. Petersburg 197101, Russia
| | - Yuri B Porozov
- World-Class Research Center "Digital biodesign and personalized healthcare", I.M. Sechenov First Moscow State Medical University, Moscow 119991, Russia.,Department of Computational Biology, Sirius University of Science and Technology, Sochi 354340, Russia
| | - Ellsworth M Campbell
- Division of HIV Prevention, Centers for Disease Control and Prevention, Atlanta, GA 30333, USA
| | - William M Switzer
- Division of HIV Prevention, Centers for Disease Control and Prevention, Atlanta, GA 30333, USA
| | - Pavel Skums
- Department of Computer Science, Georgia State University, Atlanta, GA 30302, USA
| | - Serghei Mangul
- Department of Clinical Pharmacy, School of Pharmacy, University of Southern California, Los Angeles, CA 90089, USA
| | - Alex Zelikovsky
- Department of Computer Science, Georgia State University, Atlanta, GA 30302, USA.,World-Class Research Center "Digital biodesign and personalized healthcare", I.M. Sechenov First Moscow State Medical University, Moscow 119991, Russia
| |
Collapse
|
9
|
Fuhrmann L, Jablonski KP, Beerenwinkel N. Quantitative measures of within-host viral genetic diversity. Curr Opin Virol 2021; 49:157-163. [PMID: 34153841 DOI: 10.1016/j.coviro.2021.06.002] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Revised: 06/03/2021] [Accepted: 06/07/2021] [Indexed: 12/22/2022]
Abstract
The genetic diversity of virus populations within their hosts is known to influence disease progression, treatment outcome, drug resistance, cell tropism, and transmission risk, and the study of dynamic changes of genetic heterogeneity can provide insights into the evolution of viruses. Several measures to quantify within-host genetic diversity capturing different aspects of diversity patterns in a sample or population are used, based on incidence, relative frequencies, pairwise distances, or phylogenetic trees. Here, we review and compare several of these measures.
Collapse
Affiliation(s)
- Lara Fuhrmann
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, 4058, Switzerland; SIB Swiss Institute of Bioinformatics, Basel, 4058, Switzerland
| | - Kim Philipp Jablonski
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, 4058, Switzerland; SIB Swiss Institute of Bioinformatics, Basel, 4058, Switzerland
| | - Niko Beerenwinkel
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, 4058, Switzerland; SIB Swiss Institute of Bioinformatics, Basel, 4058, Switzerland.
| |
Collapse
|
10
|
Intra-host evolutionary dynamics of the hepatitis C virus among people who inject drugs. Sci Rep 2021; 11:9986. [PMID: 33976241 PMCID: PMC8113533 DOI: 10.1038/s41598-021-88132-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Accepted: 03/31/2021] [Indexed: 02/03/2023] Open
Abstract
Most individuals chronically infected with hepatitis C virus (HCV) are asymptomatic during the initial stages of infection and therefore the precise timing of infection is often unknown. Retrospective estimation of infection duration would improve existing surveillance data and help guide treatment. While intra-host viral diversity quantifications such as Shannon entropy have previously been utilized for estimating duration of infection, these studies characterize the viral population from only a relatively short segment of the HCV genome. In this study intra-host diversities were examined across the HCV genome in order to identify the region most reflective of time and the degree to which these estimates are influenced by high-risk activities including those associated with HCV acquisition. Shannon diversities were calculated for all regions of HCV from 78 longitudinally sampled individuals with known seroconversion timeframes. While the region of the HCV genome most accurately reflecting time resided within the NS3 gene, the gene region with the highest capacity to differentiate acute from chronic infections was identified within the NS5b region. Multivariate models predicting duration of infection from viral diversity significantly improved upon incorporation of variables associated with recent public, unsupervised drug use. These results could assist the development of strategic population treatment guidelines for high-risk individuals infected with HCV and offer insights into variables associated with a likelihood of transmission.
Collapse
|