1
|
Zsichla L, Zeeb M, Fazekas D, Áy É, Müller D, Metzner KJ, Kouyos RD, Müller V. Comparative Evaluation of Open-Source Bioinformatics Pipelines for Full-Length Viral Genome Assembly. Viruses 2024; 16:1824. [PMID: 39772134 PMCID: PMC11680378 DOI: 10.3390/v16121824] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2024] [Revised: 11/19/2024] [Accepted: 11/22/2024] [Indexed: 01/11/2025] Open
Abstract
The increasingly widespread application of next-generation sequencing (NGS) in clinical diagnostics and epidemiological research has generated a demand for robust, fast, automated, and user-friendly bioinformatics workflows. To guide the choice of tools for the assembly of full-length viral genomes from NGS datasets, we assessed the performance and applicability of four open-source bioinformatics pipelines (shiver-for which we created a user-friendly Dockerized version, referred to as dshiver; SmaltAlign; viral-ngs; and V-pipe) using both simulated and real-world HIV-1 paired-end short-read datasets and default settings. All four pipelines produced consensus genome assemblies with high quality metrics (genome fraction recovery, mismatch and indel rates, variant calling F1 scores) when the reference sequence used for assembly had high similarity to the analyzed sample. The shiver and SmaltAlign pipelines (but not viral-ngs and V-Pipe) also showed robust performance with more divergent samples (non-matching subtypes). With empirical datasets, SmaltAlign and viral-ngs exhibited an order of magnitude shorter runtime compared to V-Pipe and shiver. In terms of applicability, V-Pipe provides the broadest functionalities, SmaltAlign and dshiver combine user-friendliness with robustness, while the use of viral-ngs requires less computational resources compared to other pipelines. In conclusion, if a closely matched reference sequence is available, all pipelines can reliably reconstruct viral consensus genomes; therefore, differences in user-friendliness and runtime may guide the choice of the pipeline in a particular setting. If a matched reference sequence cannot be selected, we recommend shiver or SmaltAlign for robust performance. The new Dockerized version of shiver offers ease of use in addition to the accuracy and robustness of the original pipeline.
Collapse
Affiliation(s)
- Levente Zsichla
- Institute of Biology, ELTE Eötvös Loránd University, 1117 Budapest, Hungary; (L.Z.); (D.F.); (D.M.)
- National Laboratory for Health Security, ELTE Eötvös Loránd University, 1117 Budapest, Hungary;
| | - Marius Zeeb
- Department of Infectious Diseases and Hospital Epidemiology, University Hospital of Zurich, University of Zurich, 8091 Zurich, Switzerland; (M.Z.); (K.J.M.); (R.D.K.)
- Institute of Medical Virology, University of Zurich, 8057 Zurich, Switzerland
| | - Dávid Fazekas
- Institute of Biology, ELTE Eötvös Loránd University, 1117 Budapest, Hungary; (L.Z.); (D.F.); (D.M.)
- Earlham Institute, Norwich NR4 7UZ, UK
| | - Éva Áy
- National Laboratory for Health Security, ELTE Eötvös Loránd University, 1117 Budapest, Hungary;
- National Reference Laboratory for Retroviruses, Department of Virology, National Center for Public Health and Pharmacy, 1097 Budapest, Hungary
| | - Dalma Müller
- Institute of Biology, ELTE Eötvös Loránd University, 1117 Budapest, Hungary; (L.Z.); (D.F.); (D.M.)
- National Laboratory for Health Security, ELTE Eötvös Loránd University, 1117 Budapest, Hungary;
- Department of Bioinformatics, Semmelweis University, 1094 Budapest, Hungary
| | - Karin J. Metzner
- Department of Infectious Diseases and Hospital Epidemiology, University Hospital of Zurich, University of Zurich, 8091 Zurich, Switzerland; (M.Z.); (K.J.M.); (R.D.K.)
- Institute of Medical Virology, University of Zurich, 8057 Zurich, Switzerland
| | - Roger D. Kouyos
- Department of Infectious Diseases and Hospital Epidemiology, University Hospital of Zurich, University of Zurich, 8091 Zurich, Switzerland; (M.Z.); (K.J.M.); (R.D.K.)
- Institute of Medical Virology, University of Zurich, 8057 Zurich, Switzerland
| | - Viktor Müller
- Institute of Biology, ELTE Eötvös Loránd University, 1117 Budapest, Hungary; (L.Z.); (D.F.); (D.M.)
- National Laboratory for Health Security, ELTE Eötvös Loránd University, 1117 Budapest, Hungary;
| |
Collapse
|
2
|
Russell ML, Fish CS, Drescher S, Cassidy NAJ, Chanana P, Benki-Nugent S, Slyker J, Mbori-Ngacha D, Bosire R, Richardson B, Wamalwa D, Maleche-Obimbo E, Overbaugh J, John-Stewart G, Matsen FA, Lehman DA. Using viral sequence diversity to estimate time of HIV infection in infants. PLoS Pathog 2023; 19:e1011861. [PMID: 38117834 PMCID: PMC10732395 DOI: 10.1371/journal.ppat.1011861] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Accepted: 11/27/2023] [Indexed: 12/22/2023] Open
Abstract
Age at HIV acquisition may influence viral pathogenesis in infants, and yet infection timing (i.e. date of infection) is not always known. Adult studies have estimated infection timing using rates of HIV RNA diversification, however, it is unknown whether adult-trained models can provide accurate predictions when used for infants due to possible differences in viral dynamics. While rates of viral diversification have been well defined for adults, there are limited data characterizing these dynamics for infants. Here, we performed Illumina sequencing of gag and pol using longitudinal plasma samples from 22 Kenyan infants with well-characterized infection timing. We used these data to characterize viral diversity changes over time by designing an infant-trained Bayesian hierarchical regression model that predicts time since infection using viral diversity. We show that diversity accumulates with time for most infants (median rate within pol = 0.00079 diversity/month), and diversity accumulates much faster than in adults (compare previously-reported adult rate within pol = 0.00024 diversity/month [1]). We find that the infant rate of viral diversification varies by individual, gene region, and relative timing of infection, but not by set-point viral load or rate of CD4+ T cell decline. We compare the predictive performance of this infant-trained Bayesian hierarchical regression model with simple linear regression models trained using the same infant data, as well as existing adult-trained models [1]. Using an independent dataset from an additional 15 infants with frequent HIV testing to define infection timing, we demonstrate that infant-trained models more accurately estimate time since infection than existing adult-trained models. This work will be useful for timing HIV acquisition for infants with unknown infection timing and for refining our understanding of how viral diversity accumulates in infants, both of which may have broad implications for the future development of infant-specific therapeutic and preventive interventions.
Collapse
Affiliation(s)
- Magdalena L. Russell
- Computational Biology Program, Fred Hutch Cancer Center, Seattle, Washington, United States of America
- Molecular and Cellular Biology Program, University of Washington, Seattle, Washington, United States of America
| | - Carolyn S. Fish
- Division of Human Biology, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
| | - Sara Drescher
- University of Washington Medical Center, Seattle, Washington, United States of America
- Howard Hughes Medical Institute, Seattle, Washington, United States of America
| | - Noah A. J. Cassidy
- Division of Human Biology, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
| | - Pritha Chanana
- Bioinformatics Shared Resource, Fred Hutch Cancer Center, Seattle, Washington, United States of America
| | - Sarah Benki-Nugent
- Department of Global Health, University of Washington, Seattle, Washington, United States of America
| | - Jennifer Slyker
- Department of Global Health, University of Washington, Seattle, Washington, United States of America
- Department of Epidemiology, University of Washington, Seattle, Washington, United States of America
| | - Dorothy Mbori-Ngacha
- Department of Pediatrics and Child Health, University of Nairobi, Nairobi, Kenya
| | - Rose Bosire
- Centre for Clinical Research, Kenya Medical Research Institute, Nairobi, Kenya
| | - Barbra Richardson
- Department of Global Health, University of Washington, Seattle, Washington, United States of America
- Department of Biostatistics, University of Washington, Seattle, Washington, United States of America
- Vaccine and Infectious Disease Division, Fred Hutch Cancer Center, Seattle, Washington, United States of America
| | - Dalton Wamalwa
- Department of Global Health, University of Washington, Seattle, Washington, United States of America
- Department of Pediatrics and Child Health, University of Nairobi, Nairobi, Kenya
| | | | - Julie Overbaugh
- Division of Human Biology, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
| | - Grace John-Stewart
- Department of Global Health, University of Washington, Seattle, Washington, United States of America
- Department of Epidemiology, University of Washington, Seattle, Washington, United States of America
- Department of Pediatrics, University of Washington, Seattle, Washington, United States of America
- Department of Medicine, University of Washington, Seattle, Washington, United States of America
| | - Frederick A. Matsen
- Computational Biology Program, Fred Hutch Cancer Center, Seattle, Washington, United States of America
- Howard Hughes Medical Institute, Seattle, Washington, United States of America
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
- Department of Statistics, University of Washington, Seattle, Washington, United States of America
| | - Dara A. Lehman
- Division of Human Biology, Fred Hutchinson Cancer Center, Seattle, Washington, United States of America
- Department of Global Health, University of Washington, Seattle, Washington, United States of America
| |
Collapse
|
3
|
Novitsky V, Nyandiko W, Vreeman R, DeLong AK, Howison M, Manne A, Aluoch J, Chory A, Sang F, Ashimosi C, Jepkemboi E, Orido M, Hogan JW, Kantor R. Added Value of Next Generation Sequencing in Characterizing the Evolution of HIV-1 Drug Resistance in Kenyan Youth. Viruses 2023; 15:1416. [PMID: 37515104 PMCID: PMC10383797 DOI: 10.3390/v15071416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 06/14/2023] [Accepted: 06/20/2023] [Indexed: 07/30/2023] Open
Abstract
Drug resistance remains a global challenge in children and adolescents living with HIV (CALWH). Characterizing resistance evolution, specifically using next generation sequencing (NGS) can potentially inform care, but remains understudied, particularly in antiretroviral therapy (ART)-experienced CALWH in resource-limited settings. We conducted reverse-transcriptase NGS and investigated short-and long-term resistance evolution and its predicted impact in a well-characterized cohort of Kenyan CALWH failing 1st-line ART and followed for up to ~8 years. Drug resistance mutation (DRM) evolution types were determined by NGS frequency changes over time, defined as evolving (up-trending and crossing the 20% NGS threshold), reverting (down-trending and crossing the 20% threshold) or other. Exploratory analyses assessed potential impacts of minority resistance variants on evolution. Evolution was detected in 93% of 42 participants, including 91% of 22 with short-term follow-up, 100% of 7 with long-term follow-up without regimen change, and 95% of 19 with long-term follow-up with regimen change. Evolving DRMs were identified in 60% and minority resistance variants evolved in 17%, with exploratory analysis suggesting greater rate of evolution of minority resistance variants under drug selection pressure and higher predicted drug resistance scores in the presence of minority DRMs. Despite high-level pre-existing resistance, NGS-based longitudinal follow-up of this small but unique cohort of Kenyan CALWH demonstrated continued DRM evolution, at times including low-level DRMs detected only by NGS, with predicted impact on care. NGS can inform better understanding of DRM evolution and dynamics and possibly improve care. The clinical significance of these findings should be further evaluated.
Collapse
Affiliation(s)
- Vlad Novitsky
- Alpert Medical School, Brown University, Providence, RI 02912, USA
| | - Winstone Nyandiko
- Academic Model Providing Access to Healthcare (AMPATH), Eldoret 30100, Kenya
- College of Health Sciences, Moi University, Eldoret 30100, Kenya
| | - Rachel Vreeman
- Academic Model Providing Access to Healthcare (AMPATH), Eldoret 30100, Kenya
- Department of Global Health and Health System Design, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Arnhold Institute for Global Health, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Allison K DeLong
- School of Public Health, Brown University, Providence, RI 02912, USA
| | - Mark Howison
- Research Improving People's Lives, Providence, RI 02903, USA
| | - Akarsh Manne
- Alpert Medical School, Brown University, Providence, RI 02912, USA
| | - Josephine Aluoch
- Academic Model Providing Access to Healthcare (AMPATH), Eldoret 30100, Kenya
| | - Ashley Chory
- Department of Global Health and Health System Design, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Festus Sang
- Academic Model Providing Access to Healthcare (AMPATH), Eldoret 30100, Kenya
| | - Celestine Ashimosi
- Academic Model Providing Access to Healthcare (AMPATH), Eldoret 30100, Kenya
| | - Eslyne Jepkemboi
- Academic Model Providing Access to Healthcare (AMPATH), Eldoret 30100, Kenya
| | - Millicent Orido
- Academic Model Providing Access to Healthcare (AMPATH), Eldoret 30100, Kenya
| | - Joseph W Hogan
- Academic Model Providing Access to Healthcare (AMPATH), Eldoret 30100, Kenya
- School of Public Health, Brown University, Providence, RI 02912, USA
| | - Rami Kantor
- Alpert Medical School, Brown University, Providence, RI 02912, USA
| |
Collapse
|
4
|
Novitsky V, Nyandiko W, Vreeman R, DeLong AK, Manne A, Scanlon M, Ngeresa A, Aluoch J, Sang F, Ashimosi C, Jepkemboi E, Orido M, Hogan JW, Kantor R. Added Value of Next Generation over Sanger Sequencing in Kenyan Youth with Extensive HIV-1 Drug Resistance. Microbiol Spectr 2022; 10:e0345422. [PMID: 36445146 PMCID: PMC9769539 DOI: 10.1128/spectrum.03454-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Accepted: 11/16/2022] [Indexed: 12/03/2022] Open
Abstract
HIV-1 drug resistance testing in children and adolescents in low-resource settings is both important and challenging. New (more sensitive) drug resistance testing technologies may improve clinical care, but evaluation of their added value is limited. We assessed the potential added value of using next-generation sequencing (NGS) over Sanger sequencing for detecting nucleoside reverse transcriptase inhibitor (NRTI) and nonnucleoside reverse transcriptase inhibitor (NNRTI) drug resistance mutations (DRMs). Participants included 132 treatment-experienced Kenyan children and adolescents with diverse HIV-1 subtypes and with already high levels of drug resistance detected by Sanger sequencing. We examined overall and DRM-specific resistance and its predicted impact on antiretroviral therapy and evaluated the discrepancy between Sanger sequencing and six NGS thresholds (1%, 2%, 5%, 10%, 15%, and 20%). Depending on the NGS threshold, agreement between the two technologies was 62% to 88% for any DRM, 83% to 92% for NRTI DRMs, and 73% to 94% for NNRTI DRMs, with more DRMs detected at low NGS thresholds. NGS identified 96% to 100% of DRMs detected by Sanger sequencing, while Sanger identified 83% to 99% of DRMs detected by NGS. Higher discrepancy between technologies was associated with higher DRM prevalence. Even in this resistance-saturated cohort, 12% of participants had higher, potentially clinically relevant predicted resistance detected only by NGS. These findings, in a young, vulnerable Kenyan population with diverse HIV-1 subtypes and already high resistance levels, suggest potential benefits of more sensitive NGS over existing technology. Good agreement between technologies at high NGS thresholds supports their interchangeable use; however, the significance of DRMs identified at lower thresholds to patient care should be explored further. IMPORTANCE HIV-1 drug resistance in children and adolescents remains a significant problem in countries facing the highest burden of the HIV epidemic. Surveillance of HIV-1 drug resistance in children and adolescents is an important public health strategy, particularly in resource-limited settings, and yet, it is limited due mostly to cost and infrastructure constraints. Whether newer and more sensitive next-generation sequencing (NGS) adds substantial value beyond traditional Sanger sequencing in detecting HIV-1 drug resistance in real life settings remains an open and debatable question. In this paper, we attempt to address this issue by performing a comprehensive comparison of drug resistance identified by Sanger sequencing and six NGS thresholds. We conducted this study in a well-characterized, vulnerable cohort of children and adolescents living with diverse HIV-1 subtypes in Kenya and, importantly, failing antiretroviral therapy (ART) with already extensive drug resistance. Our findings suggest a potential added value of NGS over Sanger even in this unique cohort.
Collapse
Affiliation(s)
- V. Novitsky
- Brown University, Providence, Rhode Island, USA
| | - W. Nyandiko
- Academic Model Providing Access to Healthcare (AMPATH), Eldoret, Kenya
- Moi University, Eldoret, Kenya
| | - R. Vreeman
- Academic Model Providing Access to Healthcare (AMPATH), Eldoret, Kenya
- Icahn School of Medicine at Mount Sinai, New York, New York, USA
- Arnhold Institute for Global Health, New York, New York, USA
| | | | - A. Manne
- Brown University, Providence, Rhode Island, USA
| | - M. Scanlon
- Icahn School of Medicine at Mount Sinai, New York, New York, USA
- Arnhold Institute for Global Health, New York, New York, USA
| | - A. Ngeresa
- Academic Model Providing Access to Healthcare (AMPATH), Eldoret, Kenya
| | - J. Aluoch
- Academic Model Providing Access to Healthcare (AMPATH), Eldoret, Kenya
| | - F. Sang
- Academic Model Providing Access to Healthcare (AMPATH), Eldoret, Kenya
| | - C. Ashimosi
- Academic Model Providing Access to Healthcare (AMPATH), Eldoret, Kenya
| | - E. Jepkemboi
- Academic Model Providing Access to Healthcare (AMPATH), Eldoret, Kenya
| | - M. Orido
- Academic Model Providing Access to Healthcare (AMPATH), Eldoret, Kenya
| | - J. W. Hogan
- Brown University, Providence, Rhode Island, USA
| | - R. Kantor
- Brown University, Providence, Rhode Island, USA
| | - for the RESistance in a PEdiatric CohorT (RESPECT) Study
- Brown University, Providence, Rhode Island, USA
- Academic Model Providing Access to Healthcare (AMPATH), Eldoret, Kenya
- Moi University, Eldoret, Kenya
- Icahn School of Medicine at Mount Sinai, New York, New York, USA
- Arnhold Institute for Global Health, New York, New York, USA
| |
Collapse
|
5
|
Abstract
PURPOSE OF REVIEW HIV-1 drug resistance (HIV DR) testing is routinely performed by genotyping plasma viruses using Sanger population sequencing. Next-generation sequencing (NGS) is increasingly replacing standardized Sanger sequencing. This opens up new opportunities, but also brings challenges. RECENT FINDINGS The number of NGS applications and protocols for HIV DR testing is increasing. All of them are noninferior to Sanger sequencing when comparing NGS-derived consensus sequences to Sanger sequencing-derived sequences. In addition, NGS enables high-throughput sequencing of near full-length HIV-1 genomes and detection of low-abundance drug-resistant HIV-1 variants, although their clinical implications need further investigation. Several groups have defined remaining challenges in implementing NGS protocols for HIV-1 resistance testing. Some of them are already being addressed. One of the most important needs is quality management and consequently, if possible, standardization. SUMMARY The use of NGS technologies on HIV DR testing will allow unprecedented insights into genomic structures of virus populations that may be of immediate relevance to both clinical and research areas such as personalized antiretroviral treatment. Efforts continue to tackle the remaining challenges in NGS-based HIV DR testing.
Collapse
|
6
|
Climaco-Arvizu S, Flores-López V, González-Torres C, Gaytán-Cervantes FJ, Hernández-García MC, Zárate-Segura PB, Chávez-Torres M, Tesoro-Cruz E, Pinto-Cardoso SM, Bekker-Méndez VC. Protease and gag diversity and drug resistance mutations among treatment-naive Mexican people living with HIV. BMC Infect Dis 2022; 22:447. [PMID: 35538426 PMCID: PMC9088029 DOI: 10.1186/s12879-022-07446-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Accepted: 04/29/2022] [Indexed: 09/17/2024] Open
Abstract
Introduction In Mexico, HIV genotyping is performed in people living with HIV (PLWH) failing their first-line antiretroviral (ARV) regimen; it is not routinely done for all treatment-naive PLWH before ARV initiation. The first nationally representative survey published in 2016 reported that the prevalence of pretreatment drug mutations in treatment-naive Mexican PLWH was 15.5% to any antiretroviral drug and 10.6% to non-nucleoside reverse transcriptase inhibitors (NNRTIs) using conventional Sanger sequencing. Most reports in Mexico focus on HIV pol gene and nucleoside and non-nucleoside reverse transcriptase inhibitor (NRTI and NNRTI) drug resistance mutations (DRMs) prevalence, using Sanger sequencing, next-generation sequencing (NGS) or both. To our knowledge, NGS has not be used to detect pretreatment drug resistance mutations (DRMs) in the HIV protease (PR) gene and its substrate the Gag polyprotein. Methods Treatment-naive adult Mexican PLWH were recruited between 2016 and 2019. HIV Gag and protease sequences were obtained by NGS and DRMs were identified using the WHO surveillance drug resistance mutation (SDRM) list. Results One hundred PLWH attending a public national reference hospital were included. The median age was 28 years-old, and most were male. The median HIV viral load was 4.99 [4.39–5.40] log copies/mL and median CD4 cell count was 150 [68.0–355.78] cells/mm3. As expected, most sequences clustered with HIV-1 subtype B (97.9%). Major PI resistance mutations were detected: 8 (8.3%) of 96 patients at a detection threshold of 1% and 3 (3.1%) at a detection threshold of 20%. A total of 1184 mutations in Gag were detected, of which 51 have been associated with resistance to PI, most of them were detected at a threshold of 20%. Follow-up clinical data was available for 79 PLWH at 6 months post-ART initiation, seven PLWH failed their first ART regimen; however no major PI mutations were identified in these individuals at baseline. Conclusions The frequency of DRM in the HIV protease was 7.3% at a detection threshold of 1% and 3.1% at a detection threshold of 20%. NGS-based HIV drug resistance genotyping provide improved detection of DRMs. Viral load was used to monitor ARV response and treatment failure was 8.9%. Supplementary Information The online version contains supplementary material available at 10.1186/s12879-022-07446-8.
Collapse
Affiliation(s)
- Samantha Climaco-Arvizu
- Unidad de Investigación Médica en Inmunología e Infectología, Hospital de Infectología "Dr Daniel Méndez Hernández", Centro Médico Nacional "La Raza", Instituto Mexicano del Seguro Social (IMSS), Ciudad de México, C.P. 02990, México.,Laboratorio de Medicina Traslacional, Instituto Politécnico Nacional, Ciudad de México, México
| | | | - Carolina González-Torres
- División de Desarrollo de La Investigación, Instituto Mexicano del Seguro Social, Ciudad de México, México
| | | | - María Concepción Hernández-García
- Instituto Mexicano del Seguro Social (IMSS), Hospital de Infectología "Dr Daniel Méndez Hernández", Centro Médico Nacional (CMN), La Raza", Ciudad de México, México
| | | | - Monserrat Chávez-Torres
- Centro de Investigación en Enfermedades Infecciosas, Instituto Nacional de Enfermedades Respiratorias Ismael Cosío Villegas, Ciudad de México, C.P. 14080, México
| | - Emiliano Tesoro-Cruz
- Unidad de Investigación Médica en Inmunología e Infectología, Hospital de Infectología "Dr Daniel Méndez Hernández", Centro Médico Nacional "La Raza", Instituto Mexicano del Seguro Social (IMSS), Ciudad de México, C.P. 02990, México
| | - Sandra María Pinto-Cardoso
- Centro de Investigación en Enfermedades Infecciosas, Instituto Nacional de Enfermedades Respiratorias Ismael Cosío Villegas, Ciudad de México, C.P. 14080, México.
| | - Vilma Carolina Bekker-Méndez
- Unidad de Investigación Médica en Inmunología e Infectología, Hospital de Infectología "Dr Daniel Méndez Hernández", Centro Médico Nacional "La Raza", Instituto Mexicano del Seguro Social (IMSS), Ciudad de México, C.P. 02990, México.
| |
Collapse
|
7
|
Guang A, Howison M, Ledingham L, D’Antuono M, Chan PA, Lawrence C, Dunn CW, Kantor R. Incorporating Within-Host Diversity in Phylogenetic Analyses for Detecting Clusters of New HIV Diagnoses. Front Microbiol 2022; 12:803190. [PMID: 35250908 PMCID: PMC8891961 DOI: 10.3389/fmicb.2021.803190] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Accepted: 12/22/2021] [Indexed: 11/29/2022] Open
Abstract
Background Phylogenetic analyses of HIV sequences are used to detect clusters and inform public health interventions. Conventional approaches summarize within-host HIV diversity with a single consensus sequence per host of the pol gene, obtained from Sanger or next-generation sequencing (NGS). There is growing recognition that this approach discards potentially important information about within-host sequence variation, which can impact phylogenetic inference. However, whether alternative summary methods that incorporate intra-host variation impact phylogenetic inference of transmission network features is unknown. Methods We introduce profile sampling, a method to incorporate within-host NGS sequence diversity into phylogenetic HIV cluster inference. We compare this approach to Sanger- and NGS-derived pol and near-whole-genome consensus sequences and evaluate its potential benefits in identifying molecular clusters among all newly-HIV-diagnosed individuals over six months at the largest HIV center in Rhode Island. Results Profile sampling cluster inference demonstrated that within-host viral diversity impacts phylogenetic inference across individuals, and that consensus sequence approaches can obscure both magnitude and effect of these impacts. Clustering differed between Sanger- and NGS-derived consensus and profile sampling sequences, and across gene regions. Discussion Profile sampling can incorporate within-host HIV diversity captured by NGS into phylogenetic analyses. This additional information can improve robustness of cluster detection.
Collapse
Affiliation(s)
- August Guang
- Center for Computational Biology of Human Disease, Brown University, Providence, RI, United States
- Center for Computation and Visualization, Brown University, Providence, RI, United States
- *Correspondence: August Guang,
| | - Mark Howison
- Research Improving People’s Lives, Providence, RI, United States
| | - Lauren Ledingham
- Division of Infectious Diseases, The Alpert Medical School, Brown University, Providence, RI, United States
| | - Matthew D’Antuono
- Division of Infectious Diseases, The Alpert Medical School, Brown University, Providence, RI, United States
| | - Philip A. Chan
- Division of Infectious Diseases, The Alpert Medical School, Brown University, Providence, RI, United States
| | - Charles Lawrence
- Division of Applied Mathematics, Brown University, Providence, RI, United States
| | - Casey W. Dunn
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, United States
| | - Rami Kantor
- Division of Infectious Diseases, The Alpert Medical School, Brown University, Providence, RI, United States
| |
Collapse
|
8
|
Beckwith CG, Min S, Manne A, Novitsky V, Howison M, Liu T, Kuo I, Kurth A, Bazerman L, Agopian A, Kantor R. HIV Drug Resistance and Transmission Networks Among a Justice-Involved Population at the Time of Community Reentry in Washington, D.C. AIDS Res Hum Retroviruses 2021; 37:903-912. [PMID: 33896212 PMCID: PMC8716515 DOI: 10.1089/aid.2020.0267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Justice-involved (JI) populations bear a disproportionate burden of HIV infection and are at risk of poor treatment outcomes. Drug resistance prevalence and emergence, and phylogenetic inference of transmission networks, understudied in vulnerable JI populations, can inform care and prevention interventions, particularly around the critical community reentry period. We analyzed banked blood specimens from CARE+ Corrections study participants in Washington, D.C. (DC) across three time points and conducted HIV drug resistance testing using next-generation sequencing (NGS) at 20% and 5% thresholds to identify prevalent and evolving resistance during community reentry. Phylogenetic analysis was used to identify molecular clusters within participants, and in an extended analysis between participants and publicly available DC sequences. HIV sequence data from 54 participants (99 specimens) were analyzed. The prevalence of transmitted drug resistance was 14% at both thresholds, and acquired drug resistance was 47% at 20%, and 57% at 5% NGS thresholds, respectively. The overall prevalence of drug resistance was 43% at 20%, and 52% at 5% NGS thresholds, respectively. Among 34 participants sampled longitudinally, 21%–35% accumulated 10–17 new resistance mutations during a mean 4.3 months. In phylogenetic analysis within the JI population, 11% were found in three molecular clusters. The extended phylogenetic analysis identified 46% of participants in 22 clusters, of which 21 also included publicly-available DC sequences, and one JI-only unique dyad. This is the first study to identify a high prevalence of HIV drug resistance and its accumulation in a JI population during community reentry and suggests phylogenetic integration of this population into the non-JI DC HIV community. These data support the need for new, effective, and timely interventions to improve HIV treatment during this vulnerable period, and for JI populations to be included in broader surveillance and prevention efforts.
Collapse
Affiliation(s)
- Curt G. Beckwith
- Division of Infectious Diseases, The Miriam Hospital, Providence, Rhode Island, USA
- The Warren Alpert Medical School of Brown University, Providence, Rhode Island, USA
| | - Sugi Min
- The Warren Alpert Medical School of Brown University, Providence, Rhode Island, USA
| | - Akarsh Manne
- Division of Infectious Diseases, The Miriam Hospital, Providence, Rhode Island, USA
- The Warren Alpert Medical School of Brown University, Providence, Rhode Island, USA
| | - Vladimir Novitsky
- Division of Infectious Diseases, The Miriam Hospital, Providence, Rhode Island, USA
- The Warren Alpert Medical School of Brown University, Providence, Rhode Island, USA
| | - Mark Howison
- Research Improving People's Lives, Providence, Rhode Island, USA
| | - Tao Liu
- Department of Biostatistics, Brown University School of Public Health, Providence, Rhode Island, USA
| | - Irene Kuo
- George Washington University Milken Institute School of Public Health, Washington, District of Columbia, USA
| | - Ann Kurth
- Yale University School of Nursing, Orange, Connecticut, USA
| | - Lauri Bazerman
- Division of Infectious Diseases, The Miriam Hospital, Providence, Rhode Island, USA
| | - Anya Agopian
- George Washington University Milken Institute School of Public Health, Washington, District of Columbia, USA
| | - Rami Kantor
- Division of Infectious Diseases, The Miriam Hospital, Providence, Rhode Island, USA
- The Warren Alpert Medical School of Brown University, Providence, Rhode Island, USA
| |
Collapse
|
9
|
Blassel L, Zhukova A, Villabona-Arenas CJ, Atkins KE, Hué S, Gascuel O. Drug resistance mutations in HIV: new bioinformatics approaches and challenges. Curr Opin Virol 2021; 51:56-64. [PMID: 34597873 DOI: 10.1016/j.coviro.2021.09.009] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2021] [Revised: 08/31/2021] [Accepted: 09/13/2021] [Indexed: 12/11/2022]
Abstract
Drug resistance mutations appear in HIV under treatment pressure. Resistant variants can be transmitted to treatment-naive individuals, which can lead to rapid virological failure and can limit treatment options. Consequently, quantifying the prevalence, emergence and transmission of drug resistance is critical to effectively treating patients and to shape health policies. We review recent bioinformatics developments and in particular describe: (1) the machine learning approaches intended to predict and explain the level of resistance of HIV variants from their sequence data; (2) the phylogenetic methods used to survey the emergence and dynamics of resistant HIV transmission clusters; (3) the impact of deep sequencing in studying within-host and between-host genetic diversity of HIV variants, notably regarding minority resistant variants.
Collapse
Affiliation(s)
- Luc Blassel
- Unité Bioinformatique Evolutive, Institut Pasteur, Paris, France; Sorbonne Université, Collège Doctoral, Paris, France
| | - Anna Zhukova
- Unité Bioinformatique Evolutive, Institut Pasteur, Paris, France; Hub de Bioinformatique et Biostatistique, Institut Pasteur, Paris, France
| | - Christian J Villabona-Arenas
- Centre for the Mathematical Modelling of Infectious Diseases (CMMID), London School of Hygiene & Tropical Medicine, London, UK; Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK
| | - Katherine E Atkins
- Centre for the Mathematical Modelling of Infectious Diseases (CMMID), London School of Hygiene & Tropical Medicine, London, UK; Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK; Usher Institute, University of Edinburgh, Edinburgh, UK
| | - Stéphane Hué
- Centre for the Mathematical Modelling of Infectious Diseases (CMMID), London School of Hygiene & Tropical Medicine, London, UK; Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK
| | - Olivier Gascuel
- Institut de Systématique, Evolution, Biodiversité (ISYEB, UMR 7205 - CNRS, Muséum National d'Histoire Naturelle, EPHE, SU, UA), Paris, France.
| |
Collapse
|
10
|
Posada-Céspedes S, Seifert D, Topolsky I, Jablonski KP, Metzner KJ, Beerenwinkel N. V-pipe: a computational pipeline for assessing viral genetic diversity from high-throughput data. Bioinformatics 2021; 37:1673-1680. [PMID: 33471068 PMCID: PMC8289377 DOI: 10.1093/bioinformatics/btab015] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2020] [Revised: 12/09/2020] [Accepted: 01/08/2021] [Indexed: 12/30/2022] Open
Abstract
Motivation High-throughput sequencing technologies are used increasingly not only in viral genomics research but also in clinical surveillance and diagnostics. These technologies facilitate the assessment of the genetic diversity in intra-host virus populations, which affects transmission, virulence and pathogenesis of viral infections. However, there are two major challenges in analysing viral diversity. First, amplification and sequencing errors confound the identification of true biological variants, and second, the large data volumes represent computational limitations. Results To support viral high-throughput sequencing studies, we developed V-pipe, a bioinformatics pipeline combining various state-of-the-art statistical models and computational tools for automated end-to-end analyses of raw sequencing reads. V-pipe supports quality control, read mapping and alignment, low-frequency mutation calling, and inference of viral haplotypes. For generating high-quality read alignments, we developed a novel method, called ngshmmalign, based on profile hidden Markov models and tailored to small and highly diverse viral genomes. V-pipe also includes benchmarking functionality providing a standardized environment for comparative evaluations of different pipeline configurations. We demonstrate this capability by assessing the impact of three different read aligners (Bowtie 2, BWA MEM, ngshmmalign) and two different variant callers (LoFreq, ShoRAH) on the performance of calling single-nucleotide variants in intra-host virus populations. V-pipe supports various pipeline configurations and is implemented in a modular fashion to facilitate adaptations to the continuously changing technology landscape. Availabilityand implementation V-pipe is freely available at https://github.com/cbg-ethz/V-pipe. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Susana Posada-Céspedes
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, 4058, Switzerland.,SIB Swiss Institute of Bioinformatics, Basel, 4058, Switzerland
| | - David Seifert
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, 4058, Switzerland.,SIB Swiss Institute of Bioinformatics, Basel, 4058, Switzerland
| | - Ivan Topolsky
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, 4058, Switzerland.,SIB Swiss Institute of Bioinformatics, Basel, 4058, Switzerland
| | - Kim Philipp Jablonski
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, 4058, Switzerland.,SIB Swiss Institute of Bioinformatics, Basel, 4058, Switzerland
| | - Karin J Metzner
- Division of Infectious Diseases and Hospital Epidemiology, University Hospital Zurich, University of Zurich, Zurich, 8091, Switzerland.,4 Institute of Medical Virology, University of Zurich, Zurich, 8091, Switzerland
| | - Niko Beerenwinkel
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, 4058, Switzerland.,SIB Swiss Institute of Bioinformatics, Basel, 4058, Switzerland
| |
Collapse
|
11
|
Cevallos C, Culasso ACA, Urquiza J, Ojeda D, Sued O, Figueroa MI, Avila MM, Delpino MV, Quarleri JF. In vivo drug resistance mutation dynamics from the early to chronic stage of infection in antiretroviral-therapy-naïve HIV-infected men who have sex with men. Arch Virol 2020; 165:2915-2919. [PMID: 32978684 DOI: 10.1007/s00705-020-04823-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Accepted: 08/25/2020] [Indexed: 01/18/2023]
Abstract
Human immunodeficiency virus type 1 (HIV) primary drug resistance mutations (DRMs) influence the long-term therapeutic effects of antiretroviral treatment (ART). Drug-resistance genotyping based on polymerase gene sequences obtained by next-generation sequencing (NGS) was performed using samples from 10 ART-naïve HIV-infected men who have sex with men (MSM; P1-P10) from the acute/early to chronic stage of infection. Three of the 10 subjects exhibited the presence of major (abundance, ≥ 20%) viral populations carrying DRM at early/acute stage that later, at the chronic stage, dropped drastically (V106M) or remained highly abundant (E138A). Four individuals exhibited additional DRMs (M46I/L; I47A; I54M, L100V) as HIV minority populations (abundance, 2-20%) that emerged during the chronic stage but ephemerally.
Collapse
Affiliation(s)
- Cintia Cevallos
- Instituto de Investigaciones Biomédicas en Retrovirus Y Sida (INBIRS), Facultad de Medicina, Universidad de Buenos Aires (UBA), Paraguay 2155-Piso 11 (1121), Buenos Aires, Argentina
| | - Andrés C A Culasso
- Consejo Nacional de Investigaciones Científicas Y Técnicas (CONICET) Instituto de Bacteriología Y Virología Molecular (IBaViM) Facultad de Farmacia Y Bioquímica, Universidad de Buenos Aires, Buenos Aires, Argentina
- Consejo Nacional de Investigaciones Científicas Y Técnicas (CONICET), Buenos Aires, Argentina
| | - Javier Urquiza
- Instituto de Investigaciones Biomédicas en Retrovirus Y Sida (INBIRS), Facultad de Medicina, Universidad de Buenos Aires (UBA), Paraguay 2155-Piso 11 (1121), Buenos Aires, Argentina
- Consejo Nacional de Investigaciones Científicas Y Técnicas (CONICET), Buenos Aires, Argentina
| | - Diego Ojeda
- Instituto de Investigaciones Biomédicas en Retrovirus Y Sida (INBIRS), Facultad de Medicina, Universidad de Buenos Aires (UBA), Paraguay 2155-Piso 11 (1121), Buenos Aires, Argentina
- Consejo Nacional de Investigaciones Científicas Y Técnicas (CONICET), Buenos Aires, Argentina
| | - Omar Sued
- Fundación Huésped, Pasaje Angel Peluffo 3932 (C1202ABB), Ciudad Autónoma de Buenos Aires, Argentina
| | - María I Figueroa
- Fundación Huésped, Pasaje Angel Peluffo 3932 (C1202ABB), Ciudad Autónoma de Buenos Aires, Argentina
| | - María M Avila
- Instituto de Investigaciones Biomédicas en Retrovirus Y Sida (INBIRS), Facultad de Medicina, Universidad de Buenos Aires (UBA), Paraguay 2155-Piso 11 (1121), Buenos Aires, Argentina
- Consejo Nacional de Investigaciones Científicas Y Técnicas (CONICET), Buenos Aires, Argentina
| | - M Victoria Delpino
- Consejo Nacional de Investigaciones Científicas Y Técnicas (CONICET), Buenos Aires, Argentina
- Instituto de Inmunología, Genética Y Metabolismo (INIGEM), Universidad de Buenos Aires. CONICET, Buenos Aires, Argentina
| | - Jorge F Quarleri
- Instituto de Investigaciones Biomédicas en Retrovirus Y Sida (INBIRS), Facultad de Medicina, Universidad de Buenos Aires (UBA), Paraguay 2155-Piso 11 (1121), Buenos Aires, Argentina.
- Consejo Nacional de Investigaciones Científicas Y Técnicas (CONICET), Buenos Aires, Argentina.
| |
Collapse
|
12
|
Caputo V, Diotti RA, Boeri E, Hasson H, Sampaolo M, Criscuolo E, Bagaglio S, Messina E, Uberti-Foppa C, Castelli M, Burioni R, Mancini N, Clementi M, Clementi N. Detection of low-level HCV variants in DAA treated patients: comparison amongst three different NGS data analysis protocols. Virol J 2020; 17:103. [PMID: 32660499 PMCID: PMC7359454 DOI: 10.1186/s12985-020-01381-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2020] [Accepted: 07/03/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Notwithstanding the efforts of direct-acting antivirals (DAAs) for the treatment of chronically infected hepatitis C virus (HCV) patients, concerns exist regarding the emergence of resistance-associated substitutions (RAS) related to therapy failure. Sanger sequencing is still the reference technique used for the detection of RAS and it detects viral variants present up to 15%, meaning that minority variants are undetectable, using this technique. To date, many studies are focused on the analysis of the impact of HCV low variants using next-generation sequencing (NGS) techniques, but the importance of these minority variants is still debated, and importantly, a common data analysis method is still not defined. METHODS Serum samples from four patients failing DAAs therapy were collected at baseline and failure, and amplification of NS3, NS5A and NS5B genes was performed on each sample. The genes amplified were sequenced using Sanger and NGS Illumina sequencing and the data generated were analyzed with different approaches. Three different NGS data analysis methods, two homemade in silico pipeline and one commercially available certified user-friendly software, were used to detect low-level variants. RESULTS The NGS approach allowed to infer also very-low level virus variants. Moreover, data processing allowed to generate high accuracy data which results in reduction in the error rates for each single sequence polymorphism. The results improved the detection of low-level viral variants in the HCV quasispecies of the analyzed patients, and in one patient a low-level RAS related to treatment failure was identified. Importantly, the results obtained from only two out of the three data analysis strategies were in complete agreement in terms of both detection and frequency of RAS. CONCLUSIONS These results highlight the need to find a robust NGS data analysis method to standardize NGS results for a better comprehension of the clinical role of low-level HCV variants. Based on the extreme importance of data analysis approaches for wet-data interpretation, a detailed description of the used pipelines and further standardization of the in silico analysis could allow increasing diagnostic laboratory networking to unleash true potentials of NGS.
Collapse
Affiliation(s)
- Valeria Caputo
- Laboratory of Medical Microbiology and Virology at "Vita-Salute" San Raffaele University, 20132, Milan, Italy
| | - Roberta Antonia Diotti
- Laboratory of Medical Microbiology and Virology at "Vita-Salute" San Raffaele University, 20132, Milan, Italy
| | - Enzo Boeri
- Laboratory of Medical Microbiology and Virology, IRCCS San Raffaele Hospital, 20132, Milan, Italy
| | - Hamid Hasson
- Department of Infectious Diseases, San Raffaele Hospital, 20132, Milan, Italy
| | - Michela Sampaolo
- Laboratory of Medical Microbiology and Virology, IRCCS San Raffaele Hospital, 20132, Milan, Italy
| | - Elena Criscuolo
- Laboratory of Medical Microbiology and Virology at "Vita-Salute" San Raffaele University, 20132, Milan, Italy
| | - Sabrina Bagaglio
- Department of Infectious Diseases, San Raffaele Hospital, 20132, Milan, Italy
| | - Emanuela Messina
- Department of Infectious Diseases, San Raffaele Hospital, 20132, Milan, Italy
| | | | - Matteo Castelli
- Laboratory of Medical Microbiology and Virology at "Vita-Salute" San Raffaele University, 20132, Milan, Italy
| | - Roberto Burioni
- Laboratory of Medical Microbiology and Virology at "Vita-Salute" San Raffaele University, 20132, Milan, Italy
| | - Nicasio Mancini
- Laboratory of Medical Microbiology and Virology at "Vita-Salute" San Raffaele University, 20132, Milan, Italy.,Laboratory of Medical Microbiology and Virology, IRCCS San Raffaele Hospital, 20132, Milan, Italy
| | - Massimo Clementi
- Laboratory of Medical Microbiology and Virology at "Vita-Salute" San Raffaele University, 20132, Milan, Italy.,Laboratory of Medical Microbiology and Virology, IRCCS San Raffaele Hospital, 20132, Milan, Italy
| | - Nicola Clementi
- Laboratory of Medical Microbiology and Virology at "Vita-Salute" San Raffaele University, 20132, Milan, Italy. .,Laboratory of Medical Microbiology and Virology, IRCCS San Raffaele Hospital, 20132, Milan, Italy.
| |
Collapse
|
13
|
Deng ZL, Dhingra A, Fritz A, Götting J, Münch PC, Steinbrück L, Schulz TF, Ganzenmüller T, McHardy AC. Evaluating assembly and variant calling software for strain-resolved analysis of large DNA viruses. Brief Bioinform 2020; 22:5868070. [PMID: 34020538 PMCID: PMC8138829 DOI: 10.1093/bib/bbaa123] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2019] [Revised: 05/18/2020] [Accepted: 05/19/2020] [Indexed: 02/06/2023] Open
Abstract
Infection with human cytomegalovirus (HCMV) can cause severe complications in immunocompromised individuals and congenitally infected children. Characterizing heterogeneous viral populations and their evolution by high-throughput sequencing of clinical specimens requires the accurate assembly of individual strains or sequence variants and suitable variant calling methods. However, the performance of most methods has not been assessed for populations composed of low divergent viral strains with large genomes, such as HCMV. In an extensive benchmarking study, we evaluated 15 assemblers and 6 variant callers on 10 lab-generated benchmark data sets created with two different library preparation protocols, to identify best practices and challenges for analyzing such data. Most assemblers, especially metaSPAdes and IVA, performed well across a range of metrics in recovering abundant strains. However, only one, Savage, recovered low abundant strains and in a highly fragmented manner. Two variant callers, LoFreq and VarScan2, excelled across all strain abundances. Both shared a large fraction of false positive variant calls, which were strongly enriched in T to G changes in a 'G.G' context. The magnitude of this context-dependent systematic error is linked to the experimental protocol. We provide all benchmarking data, results and the entire benchmarking workflow named QuasiModo, Quasispecies Metric determination on omics, under the GNU General Public License v3.0 (https://github.com/hzi-bifo/Quasimodo), to enable full reproducibility and further benchmarking on these and other data.
Collapse
Affiliation(s)
- Zhi-Luo Deng
- Department Computational Biology of Infection Research of the Helmholtz Centre for Infection Research
| | | | - Adrian Fritz
- Department Computational Biology of Infection Research of the Helmholtz Centre for Infection Research
| | | | - Philipp C Münch
- Department Computational Biology of Infection Research of the Helmholtz Centre for Infection Research and Max von Pettenkofer Institute in Ludwig Maximilian University of Munich
| | | | | | | | - Alice C McHardy
- Department Computational Biology of Infection Research of the Helmholtz Centre for Infection Research
| |
Collapse
|
14
|
Multi-Laboratory Comparison of Next-Generation to Sanger-Based Sequencing for HIV-1 Drug Resistance Genotyping. Viruses 2020; 12:v12070694. [PMID: 32605062 PMCID: PMC7411816 DOI: 10.3390/v12070694] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Revised: 06/20/2020] [Accepted: 06/24/2020] [Indexed: 11/16/2022] Open
Abstract
Next-generation sequencing (NGS) is increasingly used for HIV-1 drug resistance genotyping. NGS methods have the potential for a more sensitive detection of low-abundance variants (LAV) compared to standard Sanger sequencing (SS) methods. A standardized threshold for reporting LAV that generates data comparable to those derived from SS is needed to allow for the comparability of data from laboratories using NGS and SS. Ten HIV-1 specimens were tested in ten laboratories using Illumina MiSeq-based methods. The consensus sequences for each specimen using LAV thresholds of 5%, 10%, 15%, and 20% were compared to each other and to the consensus of the SS sequences (protease 4-99; reverse transcriptase 38-247). The concordance among laboratories' sequences at different thresholds was evaluated by pairwise sequence comparisons. NGS sequences generated using the 20% threshold were the most similar to the SS consensus (average 99.6% identity, range 96.1-100%), compared to 15% (99.4%, 88.5-100%), 10% (99.2%, 87.4-100%), or 5% (98.5%, 86.4-100%). The average sequence identity between laboratories using thresholds of 20%, 15%, 10%, and 5% was 99.1%, 98.7%, 98.3%, and 97.3%, respectively. Using the 20% threshold, we observed an excellent agreement between NGS and SS, but significant differences at lower thresholds. Understanding how variation in NGS methods influences sequence quality is essential for NGS-based HIV-1 drug resistance genotyping.
Collapse
|
15
|
Noguera-Julian M, Lee ER, Shafer RW, Kantor R, Ji H. Dry Panels Supporting External Quality Assessment Programs for Next Generation Sequencing-Based HIV Drug Resistance Testing. Viruses 2020; 12:v12060666. [PMID: 32575676 PMCID: PMC7354622 DOI: 10.3390/v12060666] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2020] [Revised: 06/18/2020] [Accepted: 06/18/2020] [Indexed: 12/18/2022] Open
Abstract
External quality assessment (EQA) is a keystone element in the validation and implementation of next generation sequencing (NGS)-based HIV drug resistance testing (DRT). Software validation and evaluation is a critical element in NGS EQA programs. While the development, sharing, and adoption of wet lab protocols is coupled with the increasing access to NGS technology worldwide, rendering it easy to produce NGS data for HIV-DRT, bioinformatic data analysis remains a bottleneck for most of the diagnostic laboratories. Several computational tools have been made available, via free or commercial sources, to automate the conversion of raw NGS data into an actionable clinical report. Although different software platforms yield equivalent results when identical raw NGS datasets are analyzed for variations at higher abundance, discrepancies arise when variations at lower frequencies are considered. This implies that validation and performance assessment of the bioinformatics tools applied in NGS HIV-DRT is critical, and the origins of the observed discrepancies should be determined. Well-characterized reference NGS datasets with ground truth on the genotype composition at all examined loci and the exact frequencies of HIV variations they may harbor, so-called dry panels, would be essential in such cases. The strategic design and construction of such panels are challenging but imperative tasks in support of EQA programs for NGS-based HIV-DRT and the validation of relevant bioinformatics tools. Here, we present criteria that can guide the design of such dry panels, which were discussed in the Second International Winnipeg Symposium themed for EQA strategies for NGS HIVDR assays.
Collapse
Affiliation(s)
- Marc Noguera-Julian
- IrsiCaixa AIDS Research Institute, Hospital Germans Trias i Pujol, s/n, Catalonia, 08196 Badalona, Spain
- Chair in AIDS and Related Illnesses, Centre for Health and Social Care Research (CESS), Faculty of Medicine, University of Vic, Central University of Catalonia, Can Baumann. Ctra. de Roda, 70, 08500 Vic, Spain
- Correspondence:
| | - Emma R. Lee
- National HIV and Retrovirology Laboratories, National Microbiology Laboratory at JC Wilt Infectious Diseases Research Centre, Public Health Agency of Canada, Winnipeg, MB R3E 3R2, Canada; (E.R.L.); (H.J.)
| | | | - Rami Kantor
- Division of Infectious Diseases, Brown University Alpert Medical School, Providence, RI 02903, USA;
| | - Hezhao Ji
- National HIV and Retrovirology Laboratories, National Microbiology Laboratory at JC Wilt Infectious Diseases Research Centre, Public Health Agency of Canada, Winnipeg, MB R3E 3R2, Canada; (E.R.L.); (H.J.)
- Department of Medical Microbiology and Infectious Diseases, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, MB R3E 0J9, Canada
| |
Collapse
|
16
|
Quality Control of Next-Generation Sequencing-Based HIV-1 Drug Resistance Data in Clinical Laboratory Information Systems Framework. Viruses 2020; 12:v12060645. [PMID: 32545906 PMCID: PMC7354600 DOI: 10.3390/v12060645] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2020] [Revised: 05/29/2020] [Accepted: 06/11/2020] [Indexed: 01/24/2023] Open
Abstract
Next-generation sequencing (NGS) in HIV drug resistance (HIVDR) testing has the potential to improve both clinical and public health settings, however it challenges the normal operations of quality management systems to be more flexible due to its complexity, massive data generation, and rapidly evolving protocols. While guidelines for quality management in NGS data have previously been outlined, little guidance has been implemented for NGS-based HIVDR testing. This document summarizes quality control procedures for NGS-based HIVDR testing laboratories using a laboratory information systems (LIS) framework. Here, we focus in particular on the quality control measures applied on the final sequencing product aligned with the recommendations from the World Health Organization HIV Drug Resistance Laboratory Network.
Collapse
|
17
|
Ji H, Sandstrom P, Paredes R, Harrigan PR, Brumme CJ, Avila Rios S, Noguera-Julian M, Parkin N, Kantor R. Are We Ready for NGS HIV Drug Resistance Testing? The Second "Winnipeg Consensus" Symposium. Viruses 2020; 12:E586. [PMID: 32471096 PMCID: PMC7354487 DOI: 10.3390/v12060586] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2020] [Revised: 05/13/2020] [Accepted: 05/25/2020] [Indexed: 12/31/2022] Open
Abstract
HIV drug resistance is a major global challenge to successful and sustainable antiretroviral therapy. Next-generation sequencing (NGS)-based HIV drug resistance (HIVDR) assays enable more sensitive and quantitative detection of drug-resistance-associated mutations (DRMs) and outperform Sanger sequencing approaches in detecting lower abundance resistance mutations. While NGS is likely to become the new standard for routine HIVDR testing, many technical and knowledge gaps remain to be resolved before its generalized adoption in regular clinical care, public health, and research. Recognizing this, we conceived and launched an international symposium series on NGS HIVDR, to bring together leading experts in the field to address these issues through in-depth discussions and brainstorming. Following the first symposium in 2018 (Winnipeg, MB Canada, 21-22 February, 2018), a second "Winnipeg Consensus" symposium was held in September 2019 in Winnipeg, Canada, and was focused on external quality assurance strategies for NGS HIVDR assays. In this paper, we summarize this second symposium's goals and highlights.
Collapse
Affiliation(s)
- Hezhao Ji
- National HIV and Retrovirology Laboratories at JC Wilt Infectious Diseases Research Centre, Public Health Agency of Canada, Winnipeg, MB R3E 3R2, Canada;
- Department of Medical Microbiology and Infectious Diseases, University of Manitoba, Winnipeg, MB R3E 0J9, Canada
| | - Paul Sandstrom
- National HIV and Retrovirology Laboratories at JC Wilt Infectious Diseases Research Centre, Public Health Agency of Canada, Winnipeg, MB R3E 3R2, Canada;
- Department of Medical Microbiology and Infectious Diseases, University of Manitoba, Winnipeg, MB R3E 0J9, Canada
| | - Roger Paredes
- IrsiCaixa AIDS Research Institute, Hospital Germans Trias i Pujol, s/n, 08916 Badalona, Catalonia, Spain; (R.P.); (M.N.-J.)
- Infectious Diseases Department, Hospital Germans Trias i Pujol, 08916 Badalona, Catalonia, Spain
| | - P. Richard Harrigan
- Division of AIDS, Department of Medicine, University of British Columbia, Vancouver, BC V5Z 1M9, Canada;
| | - Chanson J. Brumme
- British Columbia Centre for Excellence in HIV/AIDS, Vancouver, BC V6Z 1Y6, Canada;
- Division of Infectious Diseases, Department of Medicine, Faculty of Medicine, University of British Columbia, Vancouver, BC V5Z 1M9, Canada
| | - Santiago Avila Rios
- Centre for Research in Infectious Diseases, National Institute of Respiratory Diseases, Mexico City 14080, Mexico;
| | - Marc Noguera-Julian
- IrsiCaixa AIDS Research Institute, Hospital Germans Trias i Pujol, s/n, 08916 Badalona, Catalonia, Spain; (R.P.); (M.N.-J.)
- Chair in AIDS and Related Illnesses, Centre for Health and Social Care Research (CESS), Faculty of Medicine, University of Vic–Central University of Catalonia (UVic–UCC), Can Baumann, Ctra. de Roda, 70, 08500 Vic, Spain
| | - Neil Parkin
- Data First Consulting Inc., Sebastopol, CA 95472, USA;
| | - Rami Kantor
- Division of Infectious Diseases, Brown University Alpert Medical School, Providence, RI 02906, USA;
| |
Collapse
|
18
|
Sidhu G, Schuster L, Liu L, Tamashiro R, Li E, Langaee T, Wagner R, Wang GP. A single variant sequencing method for sensitive and quantitative detection of HIV-1 minority variants. Sci Rep 2020; 10:8185. [PMID: 32424187 PMCID: PMC7234988 DOI: 10.1038/s41598-020-65085-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2019] [Accepted: 04/20/2020] [Indexed: 10/25/2022] Open
Abstract
HIV drug resistance is a major threat to achieving long-term viral suppression in HIV-positive individuals. Drug resistant HIV variants, including minority variants, can compromise response to antiretroviral therapy. Many studies have investigated the clinical relevance of drug resistant minority variants, but the level at which minority variants become clinically relevant remains unclear. A combination of Primer-ID and deep sequencing is a promising approach that may quantify minority variants more accurately compared to standard deep sequencing. However, most studies that used the Primer-ID method have analyzed clinical samples directly. Thus, its sensitivity and quantitative accuracy have not been adequately validated using known controls. Here, we constructed defined proportions of artificial RNA and virus quasispecies and measured their relative proportions using the Primer-ID based, quantitative single-variant sequencing (qSVS) assay. Our results showed that minority variants present at 1% of quasispecies were detected reproducibly with minimal variations between technical replicates. In addition, the measured frequencies were comparable to the expected frequencies. These data validate the accuracy and reproducibility of the qSVS assay in quantifying authentic HIV minority variants, and support the use of this approach to examine the impacts of minority HIV variants on virologic response and clinical outcome.
Collapse
Affiliation(s)
- Gurjit Sidhu
- Division of Infectious Disease and Global Medicine, Department of Medicine, University of Florida, Gainesville, FL, USA
| | - Layla Schuster
- Division of Infectious Disease and Global Medicine, Department of Medicine, University of Florida, Gainesville, FL, USA.,Medosome Biosciences, Alachua, FL, USA
| | - Lin Liu
- Division of Infectious Disease and Global Medicine, Department of Medicine, University of Florida, Gainesville, FL, USA.,Department of Medicine, St. Luke's Hospital, Chesterfield, MO, USA
| | - Ryan Tamashiro
- Division of Infectious Disease and Global Medicine, Department of Medicine, University of Florida, Gainesville, FL, USA
| | - Eric Li
- Division of Infectious Disease and Global Medicine, Department of Medicine, University of Florida, Gainesville, FL, USA
| | - Taimour Langaee
- Department of Pharmacotherapy and Translational Research, Center for Pharmacogenomics and Precision Medicine, College of Pharmacy, University of Florida, Gainesville, FL, USA
| | | | - Gary P Wang
- Division of Infectious Disease and Global Medicine, Department of Medicine, University of Florida, Gainesville, FL, USA. .,Infectious Diseases Section, Medical Service, North Florida/South Georgia Veterans Health System, Gainesville, FL, USA.
| |
Collapse
|
19
|
External Quality Assessment for Next-Generation Sequencing-Based HIV Drug Resistance Testing: Unique Requirements and Challenges. Viruses 2020; 12:v12050550. [PMID: 32429382 PMCID: PMC7291216 DOI: 10.3390/v12050550] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Revised: 05/09/2020] [Accepted: 05/14/2020] [Indexed: 12/25/2022] Open
Abstract
Over the past decade, there has been an increase in the adoption of next generation sequencing (NGS) technologies for HIV drug resistance (HIVDR) testing. NGS far outweighs conventional Sanger sequencing as it has much higher throughput, lower cost when samples are batched and, most importantly, significantly higher sensitivities for variants present at low frequencies, which may have significant clinical implications. Despite the advantages of NGS, Sanger sequencing remains the gold standard for HIVDR testing, largely due to the lack of standardization of NGS-based HIVDR testing. One important aspect of standardization includes external quality assessment (EQA) strategies and programs. Current EQA for Sanger-based HIVDR testing includes proficiency testing where samples are sent to labs and the performance of the lab conducting such assays is evaluated. The current methods for Sanger-based EQA may not apply to NGS-based tests because of the fundamental differences in their technologies and outputs. Sanger-based genotyping reports drug resistance mutations (DRMs) data as dichotomous, whereas NGS-based HIVDR genotyping also reports DRMs as numerical data (percent abundance). Here we present an overview of the need to develop EQA for NGS-based HIVDR testing and some unique challenges that may be encountered.
Collapse
|
20
|
Performance comparison of next generation sequencing analysis pipelines for HIV-1 drug resistance testing. Sci Rep 2020; 10:1634. [PMID: 32005884 PMCID: PMC6994664 DOI: 10.1038/s41598-020-58544-z] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2019] [Accepted: 01/09/2020] [Indexed: 01/13/2023] Open
Abstract
Next generation sequencing (NGS) is a trending new standard for genotypic HIV-1 drug resistance (HIVDR) testing. Many NGS HIVDR data analysis pipelines have been independently developed, each with variable outputs and data management protocols. Standardization of such analytical methods and comparison of available pipelines are lacking, yet may impact subsequent HIVDR interpretation and other downstream applications. Here we compared the performance of five NGS HIVDR pipelines using proficiency panel samples from NIAID Virology Quality Assurance (VQA) program. Ten VQA panel specimens were genotyped by each of six international laboratories using their own in-house NGS assays. Raw NGS data were then processed using each of the five different pipelines including HyDRA, MiCall, PASeq, Hivmmer and DEEPGEN. All pipelines detected amino acid variants (AAVs) at full range of frequencies (1~100%) and demonstrated good linearity as compared to the reference frequency values. While the sensitivity in detecting low abundance AAVs, with frequencies between 1~20%, is less a concern for all pipelines, their specificity dramatically decreased at AAV frequencies <2%, suggesting that 2% threshold may be a more reliable reporting threshold for ensured specificity in AAV calling and reporting. More variations were observed among the pipelines when low abundance AAVs are concerned, likely due to differences in their NGS read quality control strategies. Findings from this study highlight the need for standardized strategies for NGS HIVDR data analysis, especially for the detection of minority HIVDR variants.
Collapse
|
21
|
Lambrechts L, Cole B, Rutsaert S, Trypsteen W, Vandekerckhove L. Emerging PCR-Based Techniques to Study HIV-1 Reservoir Persistence. Viruses 2020; 12:E149. [PMID: 32012811 PMCID: PMC7077278 DOI: 10.3390/v12020149] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Revised: 01/24/2020] [Accepted: 01/26/2020] [Indexed: 02/04/2023] Open
Abstract
While current antiretroviral therapies are able to halt HIV-1 progression, they are not curative, as an interruption of treatment usually leads to viral rebound. The persistence of this stable HIV-1 latent reservoir forms the major barrier in HIV-1 cure research. The need for a better understanding of the mechanisms behind reservoir persistence resulted in the development of several novel assays allowing to perform an extensive in-depth characterization. The objective of this review is to present an overview of the current state-of-the-art PCR-based technologies to study the replication-competent HIV-1 reservoir. Here, we outline the advantages, limitations, and clinical relevance of different approaches. Future HIV-1 eradication studies would benefit from information-rich, high-throughput assays as they provide a more efficient and standardized way of characterizing the persisting HIV-1 reservoir.
Collapse
Affiliation(s)
- Laurens Lambrechts
- HIV Cure Research Center, Department of Internal Medicine and Pediatrics, Ghent University, 9000 Ghent, Belgium; (L.L.); (B.C.); (S.R.); (W.T.)
- BioBix, Department of Data Analysis and Mathematical Modelling, Faculty of Bioscience Engineering, Ghent University, 9000 Ghent, Belgium
| | - Basiel Cole
- HIV Cure Research Center, Department of Internal Medicine and Pediatrics, Ghent University, 9000 Ghent, Belgium; (L.L.); (B.C.); (S.R.); (W.T.)
| | - Sofie Rutsaert
- HIV Cure Research Center, Department of Internal Medicine and Pediatrics, Ghent University, 9000 Ghent, Belgium; (L.L.); (B.C.); (S.R.); (W.T.)
| | - Wim Trypsteen
- HIV Cure Research Center, Department of Internal Medicine and Pediatrics, Ghent University, 9000 Ghent, Belgium; (L.L.); (B.C.); (S.R.); (W.T.)
| | - Linos Vandekerckhove
- HIV Cure Research Center, Department of Internal Medicine and Pediatrics, Ghent University, 9000 Ghent, Belgium; (L.L.); (B.C.); (S.R.); (W.T.)
| |
Collapse
|
22
|
Pérez-Losada M, Arenas M, Galán JC, Bracho MA, Hillung J, García-González N, González-Candelas F. High-throughput sequencing (HTS) for the analysis of viral populations. INFECTION GENETICS AND EVOLUTION 2020; 80:104208. [PMID: 32001386 DOI: 10.1016/j.meegid.2020.104208] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Revised: 01/21/2020] [Accepted: 01/24/2020] [Indexed: 12/12/2022]
Abstract
The development of High-Throughput Sequencing (HTS) technologies is having a major impact on the genomic analysis of viral populations. Current HTS platforms can capture nucleic acid variation across millions of genes for both selected amplicons and full viral genomes. HTS has already facilitated the discovery of new viruses, hinted new taxonomic classifications and provided a deeper and broader understanding of their diversity, population and genetic structure. Hence, HTS has already replaced standard Sanger sequencing in basic and applied research fields, but the next step is its implementation as a routine technology for the analysis of viruses in clinical settings. The most likely application of this implementation will be the analysis of viral genomics, because the huge population sizes, high mutation rates and very fast replacement of viral populations have demonstrated the limited information obtained with Sanger technology. In this review, we describe new technologies and provide guidelines for the high-throughput sequencing and genetic and evolutionary analyses of viral populations and metaviromes, including software applications. With the development of new HTS technologies, new and refurbished molecular and bioinformatic tools are also constantly being developed to process and integrate HTS data. These allow assembling viral genomes and inferring viral population diversity and dynamics. Finally, we also present several applications of these approaches to the analysis of viral clinical samples including transmission clusters and outbreak characterization.
Collapse
Affiliation(s)
- Marcos Pérez-Losada
- Computational Biology Institute, Milken Institute School of Public Health, George Washington University, Washington, DC, USA; CIBIO-InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto, Campus Agrário de Vairão, Vairão 4485-661, Portugal
| | - Miguel Arenas
- Department of Biochemistry, Genetics and Immunology, University of Vigo, 36310 Vigo, Spain; Biomedical Research Center (CINBIO), University of Vigo, 36310 Vigo, Spain.
| | - Juan Carlos Galán
- Microbiology Service, Hospital Ramón y Cajal, Madrid, Spain; CIBER in Epidemiology and Public Health, Spain.
| | - Mª Alma Bracho
- CIBER in Epidemiology and Public Health, Spain; Joint Research Unit "Infection and Public Health" FISABIO-University of Valencia, Valencia, Spain.
| | - Julia Hillung
- Joint Research Unit "Infection and Public Health" FISABIO-University of Valencia, Valencia, Spain; Institute for Integrative Systems Biology (I2SysBio), CSIC-University of Valencia, Valencia, Spain.
| | - Neris García-González
- Joint Research Unit "Infection and Public Health" FISABIO-University of Valencia, Valencia, Spain; Institute for Integrative Systems Biology (I2SysBio), CSIC-University of Valencia, Valencia, Spain.
| | - Fernando González-Candelas
- CIBER in Epidemiology and Public Health, Spain; Joint Research Unit "Infection and Public Health" FISABIO-University of Valencia, Valencia, Spain; Institute for Integrative Systems Biology (I2SysBio), CSIC-University of Valencia, Valencia, Spain.
| |
Collapse
|