1
|
Zeeb M, Frischknecht P, Balakrishna S, Jörimann L, Tschumi J, Zsichla L, Chaudron SE, Jaha B, Neumann K, Leemann C, Huber M, Leuzinger K, Günthard HF, Metzner KJ, Kouyos RD, The Zurich HIV Primary Infection Cohort Study, and the Swiss HIV Cohort Study. Addressing data management and analysis challenges in viral genomics: The Swiss HIV cohort study viral next generation sequencing database. PLOS DIGITAL HEALTH 2025; 4:e0000825. [PMID: 40257980 PMCID: PMC12011223 DOI: 10.1371/journal.pdig.0000825] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/21/2024] [Accepted: 03/16/2025] [Indexed: 04/23/2025]
Abstract
Numerous HIV related outcomes can be determined on the viral genome, for example, resistance associated mutations, population transmission dynamics, viral heritability traits, or time since infection. Viral sequences of people with HIV (PWH) are therefore essential for therapeutic and research purposes. While in the first three decades of the HIV pandemic viral genomes were mainly sequenced using Sanger sequencing, the last decade has seen a shift towards next-generation sequencing (NGS) as the preferred method. NGS can achieve near full length genome sequence coverage and simultaneously, it accurately encapsulates the within-host diversity by characterizing HIV subpopulations. NGS opens new avenues for HIV research, but it also presents challenges concerning data management and analysis. We therefore set up the Swiss HIV Cohort Study Viral NGS Database (SHCND) to address key issues in the handling of NGS data including high loads of raw- and processed NGS data, data storage solutions, downstream application of sophisticated bioinformatic tools, high-performance computing resources, and reproducibility. The database is nested within the Swiss HIV Cohort Study (SHCS) and the Zurich Primary HIV Infection Cohort Study (ZPHI), which together enrolled 21,876 PWH since 1988 and include a biobank dating back to the early nineties. Since its initiation in 2018, the SHCND accumulated NGS sequences (plasma and proviral origin) of 5,178 unique PWH. We here describe the design, set-up, and use of this NGS database. Overall, the SHCND has contributed to several research projects on HIV pathogenesis, treatment, drug resistance, and molecular epidemiology, and has thereby become a central part of HIV-genomics research in Switzerland.
Collapse
Affiliation(s)
- Marius Zeeb
- Department of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich, Switzerland
- Institute of Medical Virology, University of Zurich, Zurich, Switzerland
| | - Paul Frischknecht
- Department of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich, Switzerland
| | - Suraj Balakrishna
- Department of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich, Switzerland
- Institute of Medical Virology, University of Zurich, Zurich, Switzerland
| | - Lisa Jörimann
- Department of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich, Switzerland
- Institute of Medical Virology, University of Zurich, Zurich, Switzerland
| | - Jasmin Tschumi
- Department of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich, Switzerland
- Institute of Medical Virology, University of Zurich, Zurich, Switzerland
| | - Levente Zsichla
- Institute of Biology, ELTE Eötvös Loránd University, Budapest, Hungary
- National Laboratory for Health Security, ELTE Eötvös Loránd University, Budapest, Hungary
| | - Sandra E. Chaudron
- Department of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich, Switzerland
- Institute of Medical Virology, University of Zurich, Zurich, Switzerland
| | - Bashkim Jaha
- Department of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich, Switzerland
| | - Kathrin Neumann
- Department of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich, Switzerland
| | - Christine Leemann
- Department of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich, Switzerland
| | - Michael Huber
- Institute of Medical Virology, University of Zurich, Zurich, Switzerland
| | | | - Huldrych F. Günthard
- Department of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich, Switzerland
- Institute of Medical Virology, University of Zurich, Zurich, Switzerland
| | - Karin J. Metzner
- Department of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich, Switzerland
- Institute of Medical Virology, University of Zurich, Zurich, Switzerland
| | - Roger D. Kouyos
- Department of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, Zurich, Switzerland
- Institute of Medical Virology, University of Zurich, Zurich, Switzerland
| | | |
Collapse
|
2
|
Blenkinsop A, Sofocleous L, Di Lauro F, Kostaki EG, van Sighem A, Bezemer D, van de Laar T, Reiss P, de Bree G, Pantazis N, Ratmann O, on behalf of the HIV Transmission Elimination Amsterdam (H-TEAM) Consortium. Bayesian mixture models for phylogenetic source attribution from consensus sequences and time since infection estimates. Stat Methods Med Res 2025; 34:523-544. [PMID: 39936344 PMCID: PMC11951470 DOI: 10.1177/09622802241309750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/13/2025]
Abstract
In stopping the spread of infectious diseases, pathogen genomic data can be used to reconstruct transmission events and characterize population-level sources of infection. Most approaches for identifying transmission pairs do not account for the time passing since the divergence of pathogen variants in individuals, which is problematic in viruses with high within-host evolutionary rates. This prompted us to consider possible transmission pairs in terms of phylogenetic data and additional estimates of time since infection derived from clinical biomarkers. We develop Bayesian mixture models with an evolutionary clock as a signal component and additional mixed effects or covariate random functions describing the mixing weights to classify potential pairs into likely and unlikely transmission pairs. We demonstrate that although sources cannot be identified at the individual level with certainty, even with the additional data on time elapsed, inferences into the population-level sources of transmission are possible, and more accurate than using only phylogenetic data without time since infection estimates. We apply the proposed approach to estimate age-specific sources of HIV infection in Amsterdam tranamission networks among men who have sex with men between 2010 and 2021. This study demonstrates that infection time estimates provide informative data to characterize transmission sources, and shows how phylogenetic source attribution can then be done with multi-dimensional mixture models.
Collapse
Affiliation(s)
| | | | - Francesco Di Lauro
- Big Data Institute, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Evangelia Georgia Kostaki
- Department of Hygiene, Epidemiology and Medical Statistics, Medical School, National and Kapodistrian University of Athens, Athens, Greece
| | | | | | | | - Peter Reiss
- Amsterdam Institute for Global Health and Development, Amsterdam, the Netherlands
- Department of Global Health, Amsterdam University Medical Centers, Amsterdam, the Netherlands
| | - Godelieve de Bree
- Amsterdam Institute for Global Health and Development, Amsterdam, the Netherlands
- Division of Infectious Diseases, Department of Internal Medicine, Amsterdam Infection and Immunity Institute, Amsterdam, the Netherlands
| | - Nikos Pantazis
- Department of Hygiene, Epidemiology and Medical Statistics, Medical School, National and Kapodistrian University of Athens, Athens, Greece
| | - Oliver Ratmann
- Department of Mathematics, Imperial College London, London, UK
| | | |
Collapse
|
3
|
Obeng BM, Kelleher AD, Di Giallonardo F. Molecular epidemiology to aid virtual elimination of HIV transmission in Australia. Virus Res 2024; 341:199310. [PMID: 38185332 PMCID: PMC10825322 DOI: 10.1016/j.virusres.2024.199310] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2023] [Revised: 01/03/2024] [Accepted: 01/04/2024] [Indexed: 01/09/2024]
Abstract
The Global UNAIDS 95/95/95 targets aim to increase the percentage of persons who know their HIV status, receive antiretroviral therapy, and have achieved viral suppression. Achieving these targets requires efforts to improve the public health response to increase access to care for those living with HIV, identify those yet undiagnosed with HIV early, and increase access to prevention for those most at risk of HIV acquisition. HIV infections in Australia are among the lowest globally having recorded significant declines in new diagnoses in the last decade. However, the HIV epidemic has changed with an increasing proportion of newly diagnosed infections among those born outside Australia observed in the last five years. Thus, the current prevention efforts are not enough to achieve the UNAIDS targets and virtual elimination across all population groups. We believe both are possible by including molecular epidemiology in the public health response. Molecular epidemiology methods have been crucial in the field of HIV prevention, particularly in demonstrating the efficacy of treatment as prevention. Cluster detection using molecular epidemiology can provide opportunities for the real-time detection of new outbreaks before they grow, and cluster detection programs are now part of the public health response in the USA and Canada. Here, we review what molecular epidemiology has taught us about HIV evolution and spread. We summarize how we can use this knowledge to improve public health measures by presenting case studies from the USA and Canada. We discuss the successes and challenges of current public health programs in Australia, and how we could use cluster detection as an add-on to identify gaps in current prevention measures easier and respond quicker to growing clusters. Lastly, we raise important ethical and legal challenges that need to be addressed when HIV genotypic data is used in combination with personal data.
Collapse
Affiliation(s)
- Billal M Obeng
- The Kirby Institute, University of New South Wales, Sydney, Australia
| | | | | |
Collapse
|