1
|
Leoni G, Petrillo M, Ruiz-Serra V, Querci M, Coecke S, Wiesenthal T. PathoSeq-QC: a decision support bioinformatics workflow for robust genomic surveillance. Bioinformatics 2025; 41:btaf102. [PMID: 40053686 PMCID: PMC11961196 DOI: 10.1093/bioinformatics/btaf102] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2024] [Revised: 12/10/2024] [Accepted: 03/03/2025] [Indexed: 03/09/2025] Open
Abstract
MOTIVATION Recommendations on the use of genomics for pathogens surveillance are evidence that high-throughput genomic sequencing plays a key role to fight global health threats. Coupled with bioinformatics and other data types (e.g., epidemiological information), genomics is used to obtain knowledge on health pathogenic threats and insights on their evolution, to monitor pathogens spread, and to evaluate the effectiveness of countermeasures. From a decision-making policy perspective, it is essential to ensure the entire process's quality before relying on analysis results as evidence. Available workflows usually offer quality assessment tools that are primarily focused on the quality of raw NGS reads but often struggle to keep pace with new technologies and threats, and fail to provide a robust consensus on results, necessitating manual evaluation of multiple tool outputs. RESULTS We present PathoSeq-QC, a bioinformatics decision support workflow developed to improve the trustworthiness of genomic surveillance analyses and conclusions. Designed for SARS-CoV-2, it is suitable for any viral threat. In the specific case of SARS-CoV-2, PathoSeq-QC: (i) evaluates the quality of the raw data; (ii) assesses whether the analysed sample is composed by single or multiple lineages; (iii) produces robust variant calling results via multi-tool comparison; (iv) reports whether the produced data are in support of a recombinant virus, a novel or an already known lineage. The tool is modular, which will allow easy functionalities extension. AVAILABILITY AND IMPLEMENTATION PathoSeq-QC is a command-line tool written in Python and R. The code is available at https://code.europa.eu/dighealth/pathoseq-qc.
Collapse
Affiliation(s)
- Gabriele Leoni
- European Commission, Joint Research Centre (JRC), Ispra, 21027, Italy
| | | | | | - Maddalena Querci
- European Commission, Joint Research Centre (JRC), Ispra, 21027, Italy
| | - Sandra Coecke
- European Commission, Joint Research Centre (JRC), Ispra, 21027, Italy
| | - Tobias Wiesenthal
- European Commission, Joint Research Centre (JRC), Geel, 2440, Belgium
| |
Collapse
|
2
|
Lau KA, Foster CSP, Theis T, Draper J, Sullivan MJ, Ballard S, Rawlinson WD. Continued improvement in the development of the SARS-CoV-2 whole genome sequencing proficiency testing program. Pathology 2024; 56:717-725. [PMID: 38729860 DOI: 10.1016/j.pathol.2024.02.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Revised: 01/11/2024] [Accepted: 02/07/2024] [Indexed: 05/12/2024]
Abstract
Application of whole genome sequencing (WGS) has allowed monitoring of the emergence of variants of concern (VOC) of severe acute respiratory syndrome-related coronavirus 2 (SARS-CoV-2) globally. Genomic investigation of emerging variants and surveillance of clinical progress has reduced the public health impact of infection during the COVID-19 pandemic. These steps required developing and implementing a proficiency testing program (PTP), as WGS has been incorporated into routine reference laboratory practice. In this study, we describe how the PTP evaluated the capacity and capability of one New Zealand and 14 Australian public health laboratories to perform WGS of SARS-CoV-2 in 2022. The participants' performances in characterising a specimen panel of known SARS-CoV-2 isolates in the PTP were assessed based on: (1) genome coverage, (2) Pango lineage, and (3) sequence quality, with the choice of assessment metrics refined based on a previously reported assessment conducted in 2021. The participants' performances in 2021 and 2022 were also compared after reassessing the 2021 results using the more stringent metrics adopted in 2022. We found that more participants would have failed the 2021 assessment for all survey samples and a significantly higher fail rate per sample in 2021 compared to 2022. This study highlights the importance of choosing appropriate performance metrics to reflect better the laboratories' capacity to perform SARS-CoV-2 WGS, as was done in the 2022 PTP. It also displays the need for a PTP for WGS of SARS-CoV-2 to be available to public health laboratories ongoing, with continuous refinements in the design and provision of the PTP to account for the dynamic nature of the COVID-19 pandemic as SARS-CoV-2 continues to evolve.
Collapse
Affiliation(s)
| | - Charles S P Foster
- University of NSW (UNSW) School of Biomedical Sciences, Sydney, NSW, Australia; Serology and Virology Division (SAViD) Department of Microbiology, NSW Health Pathology, SOMS, BABS, Women's and Children's, University of New South Wales, Sydney, NSW, Australia
| | | | - Jenny Draper
- Institute for Clinical Pathology and Medical Research, New South Wales Health Pathology, Westmead, NSW, Australia; Faculty of Medicine and Health, University of Sydney, Sydney, NSW, Australia; Sydney Infectious Diseases Institute, University of Sydney, Sydney, NSW, Australia
| | - Mitchell J Sullivan
- Queensland Public Health and Infectious Diseases Reference Genomics, Public and Environmental Health, Forensic and Scientific Services, Queensland Health, Brisbane, Qld, Australia
| | - Susan Ballard
- Microbiological Diagnostic Unit Public Health Laboratory, The University of Melbourne, Peter Doherty Institute for Infection and Immunity, Melbourne, Vic, Australia
| | - William D Rawlinson
- University of NSW (UNSW) School of Biomedical Sciences, Sydney, NSW, Australia; Serology and Virology Division (SAViD) Department of Microbiology, NSW Health Pathology, SOMS, BABS, Women's and Children's, University of New South Wales, Sydney, NSW, Australia
| |
Collapse
|
3
|
Soller J, Jennings W, Schoen M, Boehm A, Wigginton K, Gonzalez R, Graham KE, McBride G, Kirby A, Mattioli M. Modeling infection from SARS-CoV-2 wastewater concentrations: promise, limitations, and future directions. JOURNAL OF WATER AND HEALTH 2022; 20:1197-1211. [PMID: 36044189 PMCID: PMC10911093 DOI: 10.2166/wh.2022.094] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Estimating total infection levels, including unreported and asymptomatic infections, is important for understanding community disease transmission. Wastewater can provide a pooled community sample to estimate total infections that is independent of case reporting biases toward individuals with moderate to severe symptoms and by test-seeking behavior and access. We derive three mechanistic models for estimating community infection levels from wastewater measurements based on a description of the processes that generate SARS-CoV-2 RNA signals in wastewater and accounting for the fecal strength of wastewater through endogenous microbial markers, daily flow, and per-capita wastewater generation estimates. The models are illustrated through two case studies of wastewater data collected during 2020-2021 in Virginia Beach, VA, and Santa Clara County, CA. Median simulated infection levels generally were higher than reported cases, but at times, were lower, suggesting a discrepancy between the reported cases and wastewater data, or inaccurate modeling results. Daily simulated infection estimates showed large ranges, in part due to dependence on highly variable clinical viral fecal shedding data. Overall, the wastewater-based mechanistic models are useful for normalization of wastewater measurements and for understanding wastewater-based surveillance data for public health decision-making but are currently limited by lack of robust SARS-CoV-2 fecal shedding data.
Collapse
Affiliation(s)
- Jeffrey Soller
- Soller Environmental, LLC, 3022 King St, Berkeley, CA 94703, USA
| | - Wiley Jennings
- Waterborne Disease Prevention Branch, Division of Foodborne, Waterborne, and Environmental Diseases, National Center for Emerging and Zoonotic Infectious Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia, USA E-mail:
| | - Mary Schoen
- Soller Environmental, LLC, 3022 King St, Berkeley, CA 94703, USA
| | - Alexandria Boehm
- Stanford University Department of Civil and Environmental Engineering, Stanford, California, USA
| | - Krista Wigginton
- Department of Civil and Environmental Engineering, University of Michigan, Ann Arbor 48109, Michigan, USA
| | - Raul Gonzalez
- Hampton Roads Sanitation District, 1434 Air Rail Avenue, Virginia Beach, VA 23455, USA
| | - Katherine E Graham
- Stanford University Department of Civil and Environmental Engineering, Stanford, California, USA
| | - Graham McBride
- National Institute of Water & Atmospheric Research Ltd (NIWA), Hillcrest, Hamilton, New Zealand
| | - Amy Kirby
- Waterborne Disease Prevention Branch, Division of Foodborne, Waterborne, and Environmental Diseases, National Center for Emerging and Zoonotic Infectious Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia, USA E-mail:
| | - Mia Mattioli
- Waterborne Disease Prevention Branch, Division of Foodborne, Waterborne, and Environmental Diseases, National Center for Emerging and Zoonotic Infectious Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia, USA E-mail:
| |
Collapse
|