1
|
Rahman N, O'Cathail C, Zyoud A, Sokolov A, Oude Munnink B, Grüning B, Cummins C, Amid C, Nieuwenhuijse DF, Visontai D, Yuan DY, Gupta D, Prasad DK, Gulyás GM, Rinck G, McKinnon J, Rajan J, Knaggs J, Skiby JE, Stéger J, Szarvas J, Gueye K, Papp K, Hoek M, Kumar M, Ventouratou MA, Bouquieaux MC, Koliba M, Mansurova M, Haseeb M, Worp N, Harrison PW, Leinonen R, Thorne R, Selvakumar S, Hunt S, Venkataraman S, Jayathilaka S, Cezard T, Maier W, Waheed Z, Iqbal Z, Aarestrup FM, Csabai I, Koopmans M, Burdett T, Cochrane G. Mobilisation and analyses of publicly available SARS-CoV-2 data for pandemic responses. Microb Genom 2024; 10:001188. [PMID: 38358325 PMCID: PMC10926692 DOI: 10.1099/mgen.0.001188] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Accepted: 01/14/2024] [Indexed: 02/16/2024] Open
Abstract
The COVID-19 pandemic has seen large-scale pathogen genomic sequencing efforts, becoming part of the toolbox for surveillance and epidemic research. This resulted in an unprecedented level of data sharing to open repositories, which has actively supported the identification of SARS-CoV-2 structure, molecular interactions, mutations and variants, and facilitated vaccine development and drug reuse studies and design. The European COVID-19 Data Platform was launched to support this data sharing, and has resulted in the deposition of several million SARS-CoV-2 raw reads. In this paper we describe (1) open data sharing, (2) tools for submission, analysis, visualisation and data claiming (e.g. ORCiD), (3) the systematic analysis of these datasets, at scale via the SARS-CoV-2 Data Hubs as well as (4) lessons learnt. This paper describes a component of the Platform, the SARS-CoV-2 Data Hubs, which enable the extension and set up of infrastructure that we intend to use more widely in the future for pathogen surveillance and pandemic preparedness.
Collapse
Affiliation(s)
- Nadim Rahman
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Colman O'Cathail
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Ahmad Zyoud
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Alexey Sokolov
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Bas Oude Munnink
- Erasmus Medical Center, Wytemaweg 80, 3015 CN Rotterdam, Netherlands
| | - Björn Grüning
- University of Freiburg, Friedrichstr. 39, 79098 Freiburg, Germany
| | - Carla Cummins
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Clara Amid
- Erasmus Medical Center, Wytemaweg 80, 3015 CN Rotterdam, Netherlands
| | | | - Dávid Visontai
- Eötvös Loránd University, H-1053 Budapest, Egyetem tér 1-3, Hungary
| | - David Yu Yuan
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Dipayan Gupta
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Divyae K. Prasad
- Erasmus Medical Center, Wytemaweg 80, 3015 CN Rotterdam, Netherlands
| | - Gábor Máté Gulyás
- Technical University of Denmark, Anker Engelunds Vej 101, 2800 Kongens Lyngby, Denmark
| | - Gabriele Rinck
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Jasmine McKinnon
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Jeena Rajan
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Jeff Knaggs
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Jeffrey Edward Skiby
- Technical University of Denmark, Anker Engelunds Vej 101, 2800 Kongens Lyngby, Denmark
| | - József Stéger
- Eötvös Loránd University, H-1053 Budapest, Egyetem tér 1-3, Hungary
| | - Judit Szarvas
- Technical University of Denmark, Anker Engelunds Vej 101, 2800 Kongens Lyngby, Denmark
| | - Khadim Gueye
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Krisztián Papp
- Eötvös Loránd University, H-1053 Budapest, Egyetem tér 1-3, Hungary
| | - Maarten Hoek
- Erasmus Medical Center, Wytemaweg 80, 3015 CN Rotterdam, Netherlands
| | - Manish Kumar
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Marianna A. Ventouratou
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | | | - Martin Koliba
- Technical University of Denmark, Anker Engelunds Vej 101, 2800 Kongens Lyngby, Denmark
| | - Milena Mansurova
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Muhammad Haseeb
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Nathalie Worp
- Erasmus Medical Center, Wytemaweg 80, 3015 CN Rotterdam, Netherlands
| | - Peter W. Harrison
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Rasko Leinonen
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Ross Thorne
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Sandeep Selvakumar
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Sarah Hunt
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Sundar Venkataraman
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Suran Jayathilaka
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Timothée Cezard
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Wolfgang Maier
- University of Freiburg, Friedrichstr. 39, 79098 Freiburg, Germany
| | - Zahra Waheed
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Zamin Iqbal
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | | | - Istvan Csabai
- Eötvös Loránd University, H-1053 Budapest, Egyetem tér 1-3, Hungary
| | - Marion Koopmans
- Erasmus Medical Center, Wytemaweg 80, 3015 CN Rotterdam, Netherlands
| | - Tony Burdett
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| | - Guy Cochrane
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
| |
Collapse
|
2
|
Timme RE, Karsch-Mizrachi I, Waheed Z, Arita M, MacCannell D, Maguire F, Petit III R, Page AJ, Mendes CI, Nasar MI, Oluniyi P, Tyler AD, Raphenya AR, Guthrie JL, Olawoye I, Rinck G, O’Cathail C, Lees J, Cochrane G, Cummins C, Brister JR, Klimke W, Feldgarden M, Griffiths E. Putting everything in its place: using the INSDC compliant Pathogen Data Object Model to better structure genomic data submitted for public health applications. Microb Genom 2023; 9:001145. [PMID: 38085797 PMCID: PMC10763499 DOI: 10.1099/mgen.0.001145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Accepted: 11/13/2023] [Indexed: 12/18/2023] Open
Abstract
Fast, efficient public health actions require well-organized and coordinated systems that can supply timely and accurate knowledge. Public databases of pathogen genomic data, such as the International Nucleotide Sequence Database Collaboration (INSDC), have become essential tools for efficient public health decisions. However, these international resources began primarily for academic purposes, rather than for surveillance or interventions. Now, queries need to access not only the whole genomes of multiple pathogens but also make connections using robust contextual metadata to identify issues of public health relevance. Databases that over time developed a patchwork of submission formats and requirements need to be consistently organized and coordinated internationally to allow effective searches.To help resolve these issues, we propose a common pathogen data structure called the Pathogen Data Object Model (DOM) that will formalize the minimum pieces of sequence data and contextual data necessary for general public health uses, while recognizing that submitters will likely withhold a wide range of non-public contextual data. Further, we propose contributors use the Pathogen DOM for all pathogen submissions (bacterial, viral, fungal, and parasites), which will simplify data submissions and provide a consistent and transparent data structure for downstream data analyses. We also highlight how improved submission tools can support the Pathogen DOM, offering users additional easy-to-use methods to ensure this structure is followed.
Collapse
Affiliation(s)
- Ruth E. Timme
- Center for Food Safety and Applied Nutrition, U.S. Food and Drug Administration, College Park, MD, USA
| | - Ilene Karsch-Mizrachi
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Zahra Waheed
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Masanori Arita
- DNA Data Bank of Japan, National Institute of Genetics, Mishima, Japan
| | - Duncan MacCannell
- National Center for Emerging and Zoonotic Infectious Diseases, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Finlay Maguire
- Department of Community Health & Epidemiology, Faculty of Medicine, Dalhousie University, Halifax, Canada
- Faculty of Computer Science, Dalhousie University, Halifax, Canada
| | | | - Andrew J. Page
- Quadram Institute Bioscience, Norwich, Norfolk, UK
- Theiagen Genomics LLC, Highlands Ranch, CO, USA
| | | | - Muhammad Ibtisam Nasar
- Department of Biology, College of Science, United Arab Emirates University- Al Ain, Abu Dhabi, UAE
| | - Paul Oluniyi
- Chan Zuckerberg Biohub Network, San Francisco, CA, USA
| | - Andrea D. Tyler
- Science Technology Cores and Services, National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, Canada
| | - Amogelang R. Raphenya
- Department of Biochemistry and Biomedical Sciences and the Michael G. DeGroote Institute for Infectious Disease Research, McMaster University, Hamilton, Ontario, Canada
| | - Jennifer L. Guthrie
- Schulich School of Medicine & Dentistry, University of Western Ontario, London, Ontario, Canada
| | - Idowu Olawoye
- Schulich School of Medicine & Dentistry, University of Western Ontario, London, Ontario, Canada
| | - Gabriele Rinck
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Colman O’Cathail
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - John Lees
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Guy Cochrane
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Carla Cummins
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - J. Rodney Brister
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - William Klimke
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Michael Feldgarden
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Emma Griffiths
- Faculty of Health Sciences, Simon Fraser University, Burnaby, British Columbia, Canada
| |
Collapse
|
3
|
Neves A, Cuesta I, Hjerde E, Klemetsen T, Salgado D, van Helden J, Rahman N, Fatima N, Karathanasis N, Zmora P, Åkerström WN, Grellscheid SN, Waheed Z, Blomberg N. FAIR+E pathogen data for surveillance and research: lessons from COVID-19. Front Public Health 2023; 11:1289945. [PMID: 38074768 PMCID: PMC10703184 DOI: 10.3389/fpubh.2023.1289945] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Accepted: 10/30/2023] [Indexed: 12/18/2023] Open
Abstract
The COVID-19 pandemic has exemplified the importance of interoperable and equitable data sharing for global surveillance and to support research. While many challenges could be overcome, at least in some countries, many hurdles within the organizational, scientific, technical and cultural realms still remain to be tackled to be prepared for future threats. We propose to (i) continue supporting global efforts that have proven to be efficient and trustworthy toward addressing challenges in pathogen molecular data sharing; (ii) establish a distributed network of Pathogen Data Platforms to (a) ensure high quality data, metadata standardization and data analysis, (b) perform data brokering on behalf of data providers both for research and surveillance, (c) foster capacity building and continuous improvements, also for pandemic preparedness; (iii) establish an International One Health Pathogens Portal, connecting pathogen data isolated from various sources (human, animal, food, environment), in a truly One Health approach and following FAIR principles. To address these challenging endeavors, we have started an ELIXIR Focus Group where we invite all interested experts to join in a concerted, expert-driven effort toward sustaining and ensuring high-quality data for global surveillance and research.
Collapse
Affiliation(s)
- Aitana Neves
- SIB Swiss Institute of Bioinformatics, Clinical Bioinformatics, Geneva, Switzerland
| | - Isabel Cuesta
- Bioinformatics Unit, Institute of Health Carlos III, Madrid, Spain
| | - Erik Hjerde
- Institute of Chemistry, The Arctic University of Norway, Tromsø, Norway
| | - Terje Klemetsen
- Institute of Chemistry, The Arctic University of Norway, Tromsø, Norway
| | - David Salgado
- CNRS, Institut Français de Bioinformatique, IFB-core, UMS 3601, Evry, France
| | - Jacques van Helden
- CNRS, Institut Français de Bioinformatique, IFB-core, UMS 3601, Evry, France
- Aix-Marseille Univ, INSERM, Lab. Theory and Approaches of Genome Complexity (TAGC), Marseille, France
| | - Nadim Rahman
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - Nazeefa Fatima
- ELIXIR Norway, Centre for Bioinformatics, University of Oslo, Oslo, Norway
| | - Nestoras Karathanasis
- Bioinformatics Department, The Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus
| | - Pawel Zmora
- Department of Molecular Virology, Institute of Bioorganic Chemistry Polish Academy of Sciences, Poznan, Poland
| | - Wolmar Nyberg Åkerström
- NBIS National Bioinformatics Infrastructure Sweden, SciLifeLab, Uppsala University, Uppsala, Sweden
| | | | - Zahra Waheed
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - Niklas Blomberg
- ELIXIR Hub, Wellcome Genome Campus, Cambridge, United Kingdom
| |
Collapse
|
4
|
Alvarez-Romero C, Martínez-García A, Bernabeu-Wittel M, Parra-Calderón CL. Health data hubs: an analysis of existing data governance features for research. Health Res Policy Syst 2023; 21:70. [PMID: 37430347 DOI: 10.1186/s12961-023-01026-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Accepted: 06/25/2023] [Indexed: 07/12/2023] Open
Abstract
BACKGROUND Digital transformation in healthcare and the growth of health data generation and collection are important challenges for the secondary use of healthcare records in the health research field. Likewise, due to the ethical and legal constraints for using sensitive data, understanding how health data are managed by dedicated infrastructures called data hubs is essential to facilitating data sharing and reuse. METHODS To capture the different data governance behind health data hubs across Europe, a survey focused on analysing the feasibility of linking individual-level data between data collections and the generation of health data governance patterns was carried out. The target audience of this study was national, European, and global data hubs. In total, the designed survey was sent to a representative list of 99 health data hubs in January 2022. RESULTS In total, 41 survey responses received until June 2022 were analysed. Stratification methods were performed to cover the different levels of granularity identified in some data hubs' characteristics. Firstly, a general pattern of data governance for data hubs was defined. Afterward, specific profiles were defined, generating specific data governance patterns through the stratifications in terms of the kind of organization (centralized versus decentralized) and role (data controller or data processor) of the health data hub respondents. CONCLUSIONS The analysis of the responses from health data hub respondents across Europe provided a list of the most frequent aspects, which concluded with a set of specific best practices on data management and governance, taking into account the constraints of sensitive data. In summary, a data hub should work in a centralized way, providing a Data Processing Agreement and a formal procedure to identify data providers, as well as data quality control, data integrity and anonymization methods.
Collapse
Affiliation(s)
- Celia Alvarez-Romero
- Computational Health Informatics Group, Institute of Biomedicine of Seville, IBiS/Virgen del Rocío University Hospital/CSIC/University of Seville, Avenue Manuel Siurot S/N, 41013, Seville, Spain.
| | - Alicia Martínez-García
- Computational Health Informatics Group, Institute of Biomedicine of Seville, IBiS/Virgen del Rocío University Hospital/CSIC/University of Seville, Avenue Manuel Siurot S/N, 41013, Seville, Spain
| | | | - Carlos Luis Parra-Calderón
- Computational Health Informatics Group, Institute of Biomedicine of Seville, IBiS/Virgen del Rocío University Hospital/CSIC/University of Seville, Avenue Manuel Siurot S/N, 41013, Seville, Spain
| |
Collapse
|
5
|
Huang W, Guo Y, Lysen C, Wang Y, Tang K, Seabolt MH, Yang F, Cebelinski E, Gonzalez-Moreno O, Hou T, Chen C, Chen M, Wan M, Li N, Hlavsa MC, Roellig DM, Feng Y, Xiao L. Multiple introductions and recombination events underlie the emergence of a hyper-transmissible Cryptosporidium hominis subtype in the USA. Cell Host Microbe 2023; 31:112-123.e4. [PMID: 36521488 PMCID: PMC10124589 DOI: 10.1016/j.chom.2022.11.013] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Revised: 10/20/2022] [Accepted: 11/22/2022] [Indexed: 12/15/2022]
Abstract
The parasite Cryptosporidium hominis is a leading cause of the diarrheal disease cryptosporidiosis, whose incidence in the United States has increased since 2005. Here, we show that the newly emerged and hyper-transmissible subtype IfA12G1R5 is now dominant in the United States. In a comparative analysis of 127 newly sequenced and 95 published C. hominis genomes, IfA12G1R5 isolates from the United States place into three of the 14 clusters (Pop6, Pop13, and Pop14), indicating that this subtype has multiple ancestral origins. Pop6 (IfA12G1R5a) has an East Africa origin and has recombined with autochthonous subtypes after its arrival. Pop13 (IfA12G1R5b) is imported from Europe, where it has recombined with the prevalent local subtype, whereas Pop14 (IfA12G1R5c) is a progeny of secondary recombination between Pop6 and Pop13. Selective sweeps in invasion-associated genes have accompanied the emergence of the dominant Pop14. These observations offer insights into the emergence and evolution of hyper-transmissible pathogens.
Collapse
Affiliation(s)
- Wanyi Huang
- Guangdong Laboratory for Lingnan Modern Agriculture, Center for Emerging and Zoonotic Diseases, College of Veterinary Medicine, South China Agricultural University, Guangzhou 510642, China
| | - Yaqiong Guo
- Guangdong Laboratory for Lingnan Modern Agriculture, Center for Emerging and Zoonotic Diseases, College of Veterinary Medicine, South China Agricultural University, Guangzhou 510642, China
| | - Colleen Lysen
- Division of Foodborne, Waterborne, and Environmental Diseases, Centers for Disease Control and Prevention, Atlanta, GA 30341, USA
| | - Yuanfei Wang
- Division of Foodborne, Waterborne, and Environmental Diseases, Centers for Disease Control and Prevention, Atlanta, GA 30341, USA
| | - Kevin Tang
- Division of Scientific Resources, Centers for Disease Control and Prevention, Atlanta, GA 30341, USA
| | - Matthew H Seabolt
- Division of Foodborne, Waterborne, and Environmental Diseases, Centers for Disease Control and Prevention, Atlanta, GA 30341, USA
| | - Fengkun Yang
- Division of Foodborne, Waterborne, and Environmental Diseases, Centers for Disease Control and Prevention, Atlanta, GA 30341, USA
| | - Elizabeth Cebelinski
- Infectious Disease Laboratory, Minnesota Department of Health, St. Paul, MN 55101, USA
| | | | - Tianyi Hou
- Guangdong Laboratory for Lingnan Modern Agriculture, Center for Emerging and Zoonotic Diseases, College of Veterinary Medicine, South China Agricultural University, Guangzhou 510642, China
| | - Chengyi Chen
- Guangdong Laboratory for Lingnan Modern Agriculture, Center for Emerging and Zoonotic Diseases, College of Veterinary Medicine, South China Agricultural University, Guangzhou 510642, China
| | - Ming Chen
- Guangdong Laboratory for Lingnan Modern Agriculture, Center for Emerging and Zoonotic Diseases, College of Veterinary Medicine, South China Agricultural University, Guangzhou 510642, China
| | - Muchun Wan
- Guangdong Laboratory for Lingnan Modern Agriculture, Center for Emerging and Zoonotic Diseases, College of Veterinary Medicine, South China Agricultural University, Guangzhou 510642, China
| | - Na Li
- Guangdong Laboratory for Lingnan Modern Agriculture, Center for Emerging and Zoonotic Diseases, College of Veterinary Medicine, South China Agricultural University, Guangzhou 510642, China
| | - Michele C Hlavsa
- Division of Foodborne, Waterborne, and Environmental Diseases, Centers for Disease Control and Prevention, Atlanta, GA 30341, USA
| | - Dawn M Roellig
- Division of Foodborne, Waterborne, and Environmental Diseases, Centers for Disease Control and Prevention, Atlanta, GA 30341, USA.
| | - Yaoyu Feng
- Guangdong Laboratory for Lingnan Modern Agriculture, Center for Emerging and Zoonotic Diseases, College of Veterinary Medicine, South China Agricultural University, Guangzhou 510642, China.
| | - Lihua Xiao
- Guangdong Laboratory for Lingnan Modern Agriculture, Center for Emerging and Zoonotic Diseases, College of Veterinary Medicine, South China Agricultural University, Guangzhou 510642, China.
| |
Collapse
|
6
|
Moco V, Cazenave D, Garnier M, Pot M, Marcelino I, Talarmin A, Guyomard-Rabenirina S, Breurec S, Ferdinand S, Dereeper A, Reynaud Y, Couvin D. getSequenceInfo: a suite of tools allowing to get genome sequence information from public repositories. BMC Bioinformatics 2022; 23:268. [PMID: 35804320 PMCID: PMC9264741 DOI: 10.1186/s12859-022-04809-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2021] [Accepted: 06/23/2022] [Indexed: 12/02/2022] Open
Abstract
Background Biological sequences are increasing rapidly and exponentially worldwide. Nucleotide sequence databases play an important role in providing meaningful genomic information on a variety of biological organisms. Results The getSequenceInfo software tool allows to access sequence information from various public repositories (GenBank, RefSeq, and the European Nucleotide Archive), and is compatible with different operating systems (Linux, MacOS, and Microsoft Windows) in a programmatic way (command line) or as a graphical user interface. getSequenceInfo or gSeqI v1.0 should help users to get some information on queried sequences that could be useful for specific studies (e.g. the country of origin/isolation or the release date of queried sequences). Queries can be made to retrieve sequence data based on a given kingdom and species, or from a given date. This program allows the separation between chromosomes and plasmids (or other genetic elements/components) by arranging each component in a given folder. Some basic statistics are also performed by the program (such as the calculation of GC content for queried assemblies). An empirically designed nucleotide ratio is calculated using nucleotide information in order to tentatively provide a “NucleScore” for studied genome assemblies. Besides the main gSeqI tool, other additional tools have been developed to perform various tasks related to sequence analysis. Conclusion The aim of this study is to democratize the use of public repositories in programmatic ways, and to facilitate sequence data analysis in a pedagogical perspective. Output results are available in FASTA, FASTQ, Excel/TSV or HTML formats. The program is freely available at: https://github.com/karubiotools/getSequenceInfo. getSequenceInfo and supplementary tools are partly available through the recently released Galaxy KaruBioNet platform (http://calamar.univ-ag.fr/c3i/galaxy_karubionet.html). Supplementary Information The online version contains supplementary material available at 10.1186/s12859-022-04809-5.
Collapse
Affiliation(s)
- Vincent Moco
- Unité Transmission, Réservoir et Diversité des Pathogènes, Institut Pasteur de Guadeloupe, Les Abymes, Guadeloupe, France
| | - Damien Cazenave
- Unité Transmission, Réservoir et Diversité des Pathogènes, Institut Pasteur de Guadeloupe, Les Abymes, Guadeloupe, France
| | - Maëlle Garnier
- Unité Transmission, Réservoir et Diversité des Pathogènes, Institut Pasteur de Guadeloupe, Les Abymes, Guadeloupe, France
| | - Matthieu Pot
- Unité Transmission, Réservoir et Diversité des Pathogènes, Institut Pasteur de Guadeloupe, Les Abymes, Guadeloupe, France
| | - Isabel Marcelino
- Unité Transmission, Réservoir et Diversité des Pathogènes, Institut Pasteur de Guadeloupe, Les Abymes, Guadeloupe, France
| | - Antoine Talarmin
- Unité Transmission, Réservoir et Diversité des Pathogènes, Institut Pasteur de Guadeloupe, Les Abymes, Guadeloupe, France
| | - Stéphanie Guyomard-Rabenirina
- Unité Transmission, Réservoir et Diversité des Pathogènes, Institut Pasteur de Guadeloupe, Les Abymes, Guadeloupe, France
| | - Sébastien Breurec
- Unité Transmission, Réservoir et Diversité des Pathogènes, Institut Pasteur de Guadeloupe, Les Abymes, Guadeloupe, France.,Faculté de Médecine Hyacinthe Bastaraud, Université des Antilles, Pointe-à-Pitre, France.,Centre d'Investigation Clinique Antilles Guyane, Inserm CIC 1424, Pointe-à-Pitre, France
| | - Séverine Ferdinand
- Unité Transmission, Réservoir et Diversité des Pathogènes, Institut Pasteur de Guadeloupe, Les Abymes, Guadeloupe, France
| | - Alexis Dereeper
- Unité Transmission, Réservoir et Diversité des Pathogènes, Institut Pasteur de Guadeloupe, Les Abymes, Guadeloupe, France
| | - Yann Reynaud
- Unité Transmission, Réservoir et Diversité des Pathogènes, Institut Pasteur de Guadeloupe, Les Abymes, Guadeloupe, France
| | - David Couvin
- Unité Transmission, Réservoir et Diversité des Pathogènes, Institut Pasteur de Guadeloupe, Les Abymes, Guadeloupe, France.
| |
Collapse
|
7
|
Lau KA, Horan K, Gonçalves da Silva A, Kaufer A, Theis T, Ballard SA, Rawlinson WD. Proficiency testing for SARS-CoV-2 whole genome sequencing. Pathology 2022; 54:615-622. [PMID: 35778290 PMCID: PMC9239710 DOI: 10.1016/j.pathol.2022.04.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Revised: 04/12/2022] [Accepted: 04/20/2022] [Indexed: 11/15/2022]
Abstract
Extensive studies and analyses into the molecular features of severe acute respiratory syndrome related coronavirus 2 (SARS-CoV-2) have enhanced the surveillance and investigation of its clusters and transmission worldwide. The whole genome sequencing (WGS) approach is crucial in identifying the source of infection and transmission routes by monitoring the emergence of variants over time and through communities. Varying SARS-CoV-2 genomics capacity and capability levels have been established in public health laboratories across different Australian states and territories. Therefore, laboratories performing SARS-CoV-2 WGS for public health purposes are recommended to participate in an external proficiency testing program (PTP). This study describes the development of a SARS-CoV-2 WGS PTP. The PTP assessed the performance of laboratories while providing valuable insight into the current state of SARS-CoV-2 genomics in public health across Australia. Part 1 of the PTP contained eight simulated SARS-CoV-2 positive and negative specimens to assess laboratories' wet and dry laboratory capacity. Part 2 involved the analysis of a genomic dataset that consisted of a multi-FASTA file of 70 consensus genomes of SARS-CoV-2. Participating laboratories were required to (1) submit raw data for independent bioinformatics analysis, (2) analyse the data with their processes, and (3) answer relevant questions about the data. The performance of the laboratories was commendable, despite some variation in the reported results due to the different sequencing and bioinformatics approaches used by laboratories. The overall outcome is positive and demonstrates the critical role of the PTP in supporting the implementation and validation of SARS-CoV-2 WGS processes. The data derived from this PTP will contribute to the development of SARS-CoV-2 bioinformatic quality control (QC) and performance benchmarking for accreditation.
Collapse
Affiliation(s)
| | - Kristy Horan
- Communicable Diseases Genomics Network (CDGN), Public Health Laboratory Network (PHLN), Sydney, NSW, Australia; Microbiological Diagnostic Unit Public Health Laboratory (MDU PHL), The University of Melbourne at The Peter Doherty Institute for Immunity and Infection, Melbourne, Vic, Australia
| | - Anders Gonçalves da Silva
- Communicable Diseases Genomics Network (CDGN), Public Health Laboratory Network (PHLN), Sydney, NSW, Australia; Microbiological Diagnostic Unit Public Health Laboratory (MDU PHL), The University of Melbourne at The Peter Doherty Institute for Immunity and Infection, Melbourne, Vic, Australia
| | - Alexa Kaufer
- RCPAQAP Biosecurity, St Leonards, NSW, Australia
| | | | - Susan A Ballard
- Communicable Diseases Genomics Network (CDGN), Public Health Laboratory Network (PHLN), Sydney, NSW, Australia; Microbiological Diagnostic Unit Public Health Laboratory (MDU PHL), The University of Melbourne at The Peter Doherty Institute for Immunity and Infection, Melbourne, Vic, Australia
| | - William D Rawlinson
- Serology and Virology Division (SAViD) SEALS Microbiology, NSW Health Pathology, SOMS, BABS, Women's and Children's, University of NSW, Sydney, NSW, Australia
| |
Collapse
|
8
|
van Kampen JJA, Dalm VASH, Fraaij PLA, Oude Munnink BB, Schapendonk CME, Izquierdo-Lara RW, Villabruna N, Ettayebi K, Estes MK, Koopmans MPG, de Graaf M. Clinical and In Vitro Evidence Favoring Immunoglobulin Treatment of a Chronic Norovirus Infection in a Patient With Common Variable Immunodeficiency. J Infect Dis 2022; 226:1781-1789. [PMID: 35255136 PMCID: PMC9650502 DOI: 10.1093/infdis/jiac085] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2021] [Accepted: 03/04/2022] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND Immunocompromised individuals can become chronically infected with norovirus, but effective antiviral therapies are not yet available. METHODS Treatments with nitazoxanide, ribavirin, interferon alpha-2a, and nasoduodenally administered immunoglobulins were evaluated sequentially in an immunocompromised patient chronically infected with norovirus. In support, these components were also applied to measure norovirus inhibition in intestinal enteroid cultures in vitro. Viral RNA levels were determined in fecal and plasma samples during each treatment and viral genomes were sequenced. RESULTS None of the antivirals resulted in a reduction of viral RNA levels in feces or plasma. However, during ribavirin treatment, there was an increased accumulation of virus genome mutations. In vitro, an effect of interferon alpha-2a on virus replication was observed and a genetically related strain was neutralized effectively in vitro using immunoglobulins and post-norovirus-infection antiserum. In agreement, after administration of immunoglobulins, the patient cleared the infection. CONCLUSIONS Intestinal enteroid cultures provide a relevant system to evaluate antivirals and the neutralizing potential of immunoglobulins. We successfully treated a chronically infected patient with immunoglobulins, despite varying results reported by others. This case study provides in-depth, multifaceted exploration of norovirus treatment that can be used as a guidance for further research towards norovirus treatments.
Collapse
Affiliation(s)
| | | | - Pieter L A Fraaij
- Department of Viroscience, Erasmus University Medical Center, Rotterdam, The Netherlands
| | - Bas B Oude Munnink
- Department of Viroscience, Erasmus University Medical Center, Rotterdam, The Netherlands
| | | | - Ray W Izquierdo-Lara
- Department of Viroscience, Erasmus University Medical Center, Rotterdam, The Netherlands
| | - Nele Villabruna
- Department of Viroscience, Erasmus University Medical Center, Rotterdam, The Netherlands
| | - Khalil Ettayebi
- Department of Molecular Virology and Microbiology, Baylor College of Medicine, Houston, Texas, USA
| | - Mary K Estes
- Department of Molecular Virology and Microbiology, Baylor College of Medicine, Houston, Texas, USA,Department of Medicine, Baylor College of Medicine, Houston, Texas, USA
| | - Marion P G Koopmans
- Department of Viroscience, Erasmus University Medical Center, Rotterdam, The Netherlands
| | - Miranda de Graaf
- Correspondence: Miranda de Graaf, PhD, Erasmus University Medical Center, PO Box 1738, 3000 DR Rotterdam, the Netherlands ()
| |
Collapse
|
9
|
Moreno P, Fexova S, George N, Manning JR, Miao Z, Mohammed S, Muñoz-Pomer A, Fullgrabe A, Bi Y, Bush N, Iqbal H, Kumbham U, Solovyev A, Zhao L, Prakash A, García-Seisdedos D, Kundu DJ, Wang S, Walzer M, Clarke L, Osumi-Sutherland D, Tello-Ruiz MK, Kumari S, Ware D, Eliasova J, Arends MJ, Nawijn MC, Meyer K, Burdett T, Marioni J, Teichmann S, Vizcaíno JA, Brazma A, Papatheodorou I. Expression Atlas update: gene and protein expression in multiple species. Nucleic Acids Res 2022; 50:D129-D140. [PMID: 34850121 PMCID: PMC8728300 DOI: 10.1093/nar/gkab1030] [Citation(s) in RCA: 69] [Impact Index Per Article: 34.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Revised: 10/11/2021] [Accepted: 11/19/2021] [Indexed: 01/21/2023] Open
Abstract
The EMBL-EBI Expression Atlas is an added value knowledge base that enables researchers to answer the question of where (tissue, organism part, developmental stage, cell type) and under which conditions (disease, treatment, gender, etc) a gene or protein of interest is expressed. Expression Atlas brings together data from >4500 expression studies from >65 different species, across different conditions and tissues. It makes these data freely available in an easy to visualise form, after expert curation to accurately represent the intended experimental design, re-analysed via standardised pipelines that rely on open-source community developed tools. Each study's metadata are annotated using ontologies. The data are re-analyzed with the aim of reproducing the original conclusions of the underlying experiments. Expression Atlas is currently divided into Bulk Expression Atlas and Single Cell Expression Atlas. Expression Atlas contains data from differential studies (microarray and bulk RNA-Seq) and baseline studies (bulk RNA-Seq and proteomics), whereas Single Cell Expression Atlas is currently dedicated to Single Cell RNA-Sequencing (scRNA-Seq) studies. The resource has been in continuous development since 2009 and it is available at https://www.ebi.ac.uk/gxa.
Collapse
Affiliation(s)
- Pablo Moreno
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - Silvie Fexova
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - Nancy George
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - Jonathan R Manning
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - Zhichiao Miao
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - Suhaib Mohammed
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - Alfonso Muñoz-Pomer
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - Anja Fullgrabe
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - Yalan Bi
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - Natassja Bush
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - Haider Iqbal
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - Upendra Kumbham
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - Andrey Solovyev
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - Lingyun Zhao
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - Ananth Prakash
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - David García-Seisdedos
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - Deepti J Kundu
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - Shengbo Wang
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - Mathias Walzer
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - Laura Clarke
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - David Osumi-Sutherland
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | | | - Sunita Kumari
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Doreen Ware
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
- USDA ARS NEA, Plant Soil & Nutrition Laboratory Research Unit, Ithaca, NY 14853, USA
| | - Jana Eliasova
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
| | - Mark J Arends
- Edinburgh Pathology, University of Edinburgh, Institute of Genetics & Cancer, Edinburgh, UK
| | - Martijn C Nawijn
- Department of Pathology and Medical Biology, GRIAC research institute, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
| | - Kerstin Meyer
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
| | - Tony Burdett
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - John Marioni
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - Sarah Teichmann
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
| | - Juan Antonio Vizcaíno
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - Alvis Brazma
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| | - Irene Papatheodorou
- European Molecular Biology Laboratory, European Bioinformatics Institute, EMBL-EBI, Wellcome Genome Campus, Hinxton, UK
| |
Collapse
|
10
|
viromeBrowser: A Shiny App for Browsing Virome Sequencing Analysis Results. Viruses 2021; 13:v13030437. [PMID: 33803225 PMCID: PMC7999463 DOI: 10.3390/v13030437] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Revised: 02/26/2021] [Accepted: 03/04/2021] [Indexed: 11/16/2022] Open
Abstract
Experiments in which complex virome sequencing data is generated remain difficult to explore and unpack for scientists without a background in data science. The processing of raw sequencing data by high throughput sequencing workflows usually results in contigs in FASTA format coupled to an annotation file linking the contigs to a reference sequence or taxonomic identifier. The next step is to compare the virome of different samples based on the metadata of the experimental setup and extract sequences of interest that can be used in subsequent analyses. The viromeBrowser is an application written in the opensource R shiny framework that was developed in collaboration with end-users and is focused on three common data analysis steps. First, the application allows interactive filtering of annotations by default or custom quality thresholds. Next, multiple samples can be visualized to facilitate comparison of contig annotations based on sample specific metadata values. Last, the application makes it easy for users to extract sequences of interest in FASTA format. With the interactive features in the viromeBrowser we aim to enable scientists without a data science background to compare and extract annotation data and sequences from virome sequencing analysis results.
Collapse
|
11
|
Vandenberg O, Martiny D, Rochas O, van Belkum A, Kozlakidis Z. Considerations for diagnostic COVID-19 tests. Nat Rev Microbiol 2021; 19:171-183. [PMID: 33057203 PMCID: PMC7556561 DOI: 10.1038/s41579-020-00461-z] [Citation(s) in RCA: 438] [Impact Index Per Article: 146.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/17/2020] [Indexed: 02/07/2023]
Abstract
During the early phase of the coronavirus disease 2019 (COVID-19) pandemic, design, development, validation, verification and implementation of diagnostic tests were actively addressed by a large number of diagnostic test manufacturers. Hundreds of molecular tests and immunoassays were rapidly developed, albeit many still await clinical validation and formal approval. In this Review, we summarize the crucial role of diagnostic tests during the first global wave of COVID-19. We explore the technical and implementation problems encountered during this early phase in the pandemic, and try to define future directions for the progressive and better use of (syndromic) diagnostics during a possible resurgence of COVID-19 in future global waves or regional outbreaks. Continuous global improvement in diagnostic test preparedness is essential for more rapid detection of patients, possibly at the point of care, and for optimized prevention and treatment, in both industrialized countries and low-resource settings.
Collapse
Affiliation(s)
- Olivier Vandenberg
- Innovation and Business Development Unit, Laboratoire Hospitalier Universtaire de Bruxelles - Universitair Laboratorium Brussel, Université Libre de Bruxelles, Brussels, Belgium.
- Center for Environmental Health and Occupational Health, School of Public Health, Université Libre de Bruxelles, Brussels, Belgium.
- Division of Infection and Immunity, Faculty of Medical Sciences, University College London, London, UK.
| | - Delphine Martiny
- Department of Microbiology, Laboratoire Hospitalier Universtaire de Bruxelles - Universitair Laboratorium Brussel, Université Libre de Bruxelles, Brussels, Belgium
| | - Olivier Rochas
- Strategic Intelligence, Corporate Business Development, bioMérieux, Chemin de L'Orme, France
| | - Alex van Belkum
- Open Innovation and Partnerships, bioMérieux, La Balme Les Grottes, France.
| | - Zisis Kozlakidis
- Laboratory Services and Biobank Group, International Agency for Research on Cancer, World Health Organization, Lyon, France
| |
Collapse
|
12
|
Harrison PW, Ahamed A, Aslam R, Alako BTF, Burgin J, Buso N, Courtot M, Fan J, Gupta D, Haseeb M, Holt S, Ibrahim T, Ivanov E, Jayathilaka S, Balavenkataraman Kadhirvelu V, Kumar M, Lopez R, Kay S, Leinonen R, Liu X, O'Cathail C, Pakseresht A, Park Y, Pesant S, Rahman N, Rajan J, Sokolov A, Vijayaraja S, Waheed Z, Zyoud A, Burdett T, Cochrane G. The European Nucleotide Archive in 2020. Nucleic Acids Res 2021; 49:D82-D85. [PMID: 33175160 PMCID: PMC7778925 DOI: 10.1093/nar/gkaa1028] [Citation(s) in RCA: 72] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Accepted: 10/20/2020] [Indexed: 11/12/2022] Open
Abstract
The European Nucleotide Archive (ENA; https://www.ebi.ac.uk/ena), provided by the European Molecular Biology Laboratory's European Bioinformatics Institute (EMBL-EBI), has for almost forty years continued in its mission to freely archive and present the world's public sequencing data for the benefit of the entire scientific community and for the acceleration of the global research effort. Here we highlight the major developments to ENA services and content in 2020, focussing in particular on the recently released updated ENA browser, modernisation of our release process and our data coordination collaborations with specific research communities.
Collapse
Affiliation(s)
- Peter W Harrison
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Alisha Ahamed
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Raheela Aslam
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Blaise T F Alako
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Josephine Burgin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Nicola Buso
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Mélanie Courtot
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jun Fan
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Dipayan Gupta
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Muhammad Haseeb
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Sam Holt
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Talal Ibrahim
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Eugene Ivanov
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Suran Jayathilaka
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | | | - Manish Kumar
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Rodrigo Lopez
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Simon Kay
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Rasko Leinonen
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Xin Liu
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Colman O'Cathail
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Amir Pakseresht
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Youngmi Park
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Stephane Pesant
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Nadim Rahman
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jeena Rajan
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Alexey Sokolov
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Senthilnathan Vijayaraja
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Zahra Waheed
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Ahmad Zyoud
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Tony Burdett
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Guy Cochrane
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| |
Collapse
|
13
|
Cantelli G, Cochrane G, Brooksbank C, McDonagh E, Flicek P, McEntyre J, Birney E, Apweiler R. The European Bioinformatics Institute: empowering cooperation in response to a global health crisis. Nucleic Acids Res 2021; 49:D29-D37. [PMID: 33245775 PMCID: PMC7778996 DOI: 10.1093/nar/gkaa1077] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Revised: 10/20/2020] [Accepted: 10/22/2020] [Indexed: 02/06/2023] Open
Abstract
The European Bioinformatics Institute (EMBL-EBI; https://www.ebi.ac.uk/) provides freely available data and bioinformatics services to the scientific community, alongside its research activity and training provision. The 2020 COVID-19 pandemic has brought to the forefront a need for the scientific community to work even more cooperatively to effectively tackle a global health crisis. EMBL-EBI has been able to build on its position to contribute to the fight against COVID-19 in a number of ways. Firstly, EMBL-EBI has used its infrastructure, expertise and network of international collaborations to help build the European COVID-19 Data Platform (https://www.covid19dataportal.org/), which brings together COVID-19 biomolecular data and connects it to researchers, clinicians and public health professionals. By September 2020, the COVID-19 Data Platform has integrated in excess of 170 000 COVID-19 biomolecular data and literature records, collected through a number of EMBL-EBI resources. Secondly, EMBL-EBI has strived to continue its support of the life science communities through the crisis, with updated Training provision and improved service provision throughout its resources. The COVID-19 pandemic has highlighted the importance of EMBL-EBI's core principles, including international cooperation, resource sharing and central data brokering, and has further empowered scientific cooperation.
Collapse
Affiliation(s)
- Gaia Cantelli
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Guy Cochrane
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Cath Brooksbank
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Ellen McDonagh
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- Open Targets, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Johanna McEntyre
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Ewan Birney
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Rolf Apweiler
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| |
Collapse
|
14
|
Bai R, Sikkema RS, Munnink BBO, Li CR, Wu J, Zou L, Jing Y, Lu J, Yuan RY, Liao M, Koopmans M, Ke CW. Exploring utility of genomic epidemiology to trace origins of highly pathogenic influenza A/H7N9 in Guangdong. Virus Evol 2021; 6:veaa097. [PMID: 33391821 PMCID: PMC7758296 DOI: 10.1093/ve/veaa097] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
The first highly pathogenic (HP) influenza A/H7N9 was reported in Guangdong in January 2017. To investigate the emergence and spread of HP A/H7N9 in Guangdong province, we sequenced 297 viruses (58 HP A/H7N9, 19 low pathogenic (LP) A/H7N9, and 220 A/H9N2) during 2016–2017. Our analysis showed that during the fifth wave, three A/H7N9 lineages were co-circulating in Guangdong: the local LP Pearl River Delta (PRD) lineage (13%), the newly imported LP Yangtze River Delta (YRD) lineage (23%), and the HP YRD lineage (64%). Previously circulating YRD-lineage LP during the third wave evolved to the YRD-lineage HP A/H7N9 in Guangdong. All YRD-lineage LP detected during the fifth wave most likely originated from newly imported viruses into Guangdong. Genotype comparison of HP A/H7N9 suggests limited outward spread of HP A/H7N9 to other provinces. The distribution of HP A/H7N9 cleavage site variants on live poultry markets differed from that found in humans, suggesting a V1-type cleavage site may facilitate human infections.
Collapse
Affiliation(s)
| | - Reina S Sikkema
- Department of Viroscience, Erasmus University Medical Center, P.O. Box 2040, 3000CA Rotterdam, The Netherlands
| | - Bas B Oude Munnink
- Department of Viroscience, Erasmus University Medical Center, P.O. Box 2040, 3000CA Rotterdam, The Netherlands
| | - Cong Rong Li
- Biosafety Laboratory, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Jie Wu
- Department of Pathogenic Microbiolgy, Guangdong Provincial Center for Disease Control and Prevention, Guangzhou, 511430, China
| | - Lirong Zou
- Department of Pathogenic Microbiolgy, Guangdong Provincial Center for Disease Control and Prevention, Guangzhou, 511430, China
| | - Yi Jing
- School of Public Health, Southern Medical University, Guangzhou, 510515, China
| | - Jing Lu
- Department of Pathogenic Microbiolgy, Guangdong Provincial Center for Disease Control and Prevention, Guangzhou, 511430, China
| | - Run Yu Yuan
- Department of Pathogenic Microbiolgy, Guangdong Provincial Center for Disease Control and Prevention, Guangzhou, 511430, China
| | - Ming Liao
- Biosafety Laboratory, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Marion Koopmans
- Department of Viroscience, Erasmus University Medical Center, P.O. Box 2040, 3000CA Rotterdam, The Netherlands
| | - Chang-Wen Ke
- Department of Pathogenic Microbiolgy, Guangdong Provincial Center for Disease Control and Prevention, Guangzhou, 511430, China
| |
Collapse
|
15
|
Sala C, Mordhorst H, Grützke J, Brinkmann A, Petersen TN, Poulsen C, Cotter PD, Crispie F, Ellis RJ, Castellani G, Amid C, Hakhverdyan M, Guyader SL, Manfreda G, Mossong J, Nitsche A, Ragimbeau C, Schaeffer J, Schlundt J, Tay MYF, Aarestrup FM, Hendriksen RS, Pamp SJ, De Cesare A. Metagenomics-Based Proficiency Test of Smoked Salmon Spiked with a Mock Community. Microorganisms 2020; 8:microorganisms8121861. [PMID: 33255715 PMCID: PMC7760972 DOI: 10.3390/microorganisms8121861] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2020] [Revised: 11/17/2020] [Accepted: 11/23/2020] [Indexed: 12/13/2022] Open
Abstract
An inter-laboratory proficiency test was organized to assess the ability of participants to perform shotgun metagenomic sequencing of cold smoked salmon, experimentally spiked with a mock community composed of six bacteria, one parasite, one yeast, one DNA, and two RNA viruses. Each participant applied its in-house wet-lab workflow(s) to obtain the metagenomic dataset(s), which were then collected and analyzed using MG-RAST. A total of 27 datasets were analyzed. Sample pre-processing, DNA extraction protocol, library preparation kit, and sequencing platform, influenced the abundance of specific microorganisms of the mock community. Our results highlight that despite differences in wet-lab protocols, the reads corresponding to the mock community members spiked in the cold smoked salmon, were both detected and quantified in terms of relative abundance, in the metagenomic datasets, proving the suitability of shotgun metagenomic sequencing as a genomic tool to detect microorganisms belonging to different domains in the same food matrix. The implementation of standardized wet-lab protocols would highly facilitate the comparability of shotgun metagenomic sequencing dataset across laboratories and sectors. Moreover, there is a need for clearly defining a sequencing reads threshold, to consider pathogens as detected or undetected in a food sample.
Collapse
Affiliation(s)
- Claudia Sala
- Department of Physics and Astronomy, University of Bologna, 40127 Bologna, Italy;
| | - Hanne Mordhorst
- Research Group for Genomic Epidemiology, National Food Institute, Technical University of Denmark, Kemitorvet, DK-2800 Kgs, 2800 Lyngby, Denmark; (H.M.); (T.N.P.); (C.P.); (F.M.A.); (R.S.H.); (S.J.P.)
| | - Josephine Grützke
- German Federal Institute for Risk Assessment, Department of Biological Safety, 12277 Berlin, Germany;
| | - Annika Brinkmann
- Highly Pathogenic Viruses, ZBS 1, Centre for Biological Threats and Special Pathogens, Robert Koch Institute, 13353 Berlin, Germany; (A.B.); (A.N.)
| | - Thomas N. Petersen
- Research Group for Genomic Epidemiology, National Food Institute, Technical University of Denmark, Kemitorvet, DK-2800 Kgs, 2800 Lyngby, Denmark; (H.M.); (T.N.P.); (C.P.); (F.M.A.); (R.S.H.); (S.J.P.)
| | - Casper Poulsen
- Research Group for Genomic Epidemiology, National Food Institute, Technical University of Denmark, Kemitorvet, DK-2800 Kgs, 2800 Lyngby, Denmark; (H.M.); (T.N.P.); (C.P.); (F.M.A.); (R.S.H.); (S.J.P.)
| | - Paul D. Cotter
- Teagasc Food Research Centre, Moorepark, APC Microbiome Ireland and Vistamilk, T12 YN60 Co. Cork, Ireland; (P.D.C.); (F.C.)
| | - Fiona Crispie
- Teagasc Food Research Centre, Moorepark, APC Microbiome Ireland and Vistamilk, T12 YN60 Co. Cork, Ireland; (P.D.C.); (F.C.)
| | - Richard J. Ellis
- Surveillance and Laboratory Services Department, Animal and Plant Health Agency, APHA Weybridge, Addlestone, Surrey, KT15 3NB, UK;
| | - Gastone Castellani
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, 40127 Bologna, Italy;
| | - Clara Amid
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK;
| | | | - Soizick Le Guyader
- Laboratoire de Microbiologie, CEDEX 03, 44311 Nantes, France; (S.L.G.); (J.S.)
| | - Gerardo Manfreda
- Department of Agricultural and Food Sciences, University of Bologna, 40064 Ozzano dell’Emilia, Italy;
| | - Joël Mossong
- Epidemiology and Microbial Genomics, Laboratoire National de Santé, L-3555 Dudelange, Luxembourg; (J.M.); (C.R.)
| | - Andreas Nitsche
- Highly Pathogenic Viruses, ZBS 1, Centre for Biological Threats and Special Pathogens, Robert Koch Institute, 13353 Berlin, Germany; (A.B.); (A.N.)
| | - Catherine Ragimbeau
- Epidemiology and Microbial Genomics, Laboratoire National de Santé, L-3555 Dudelange, Luxembourg; (J.M.); (C.R.)
| | - Julien Schaeffer
- Laboratoire de Microbiologie, CEDEX 03, 44311 Nantes, France; (S.L.G.); (J.S.)
| | - Joergen Schlundt
- Nanyang Technological University Food Technology Centre (NAFTEC), Nanyang Technological University (NTU), 62 Nanyang Dr, Singapore 637459, Singapore; (J.S.); (M.Y.F.T.)
| | - Moon Y. F. Tay
- Nanyang Technological University Food Technology Centre (NAFTEC), Nanyang Technological University (NTU), 62 Nanyang Dr, Singapore 637459, Singapore; (J.S.); (M.Y.F.T.)
| | - Frank M. Aarestrup
- Research Group for Genomic Epidemiology, National Food Institute, Technical University of Denmark, Kemitorvet, DK-2800 Kgs, 2800 Lyngby, Denmark; (H.M.); (T.N.P.); (C.P.); (F.M.A.); (R.S.H.); (S.J.P.)
| | - Rene S. Hendriksen
- Research Group for Genomic Epidemiology, National Food Institute, Technical University of Denmark, Kemitorvet, DK-2800 Kgs, 2800 Lyngby, Denmark; (H.M.); (T.N.P.); (C.P.); (F.M.A.); (R.S.H.); (S.J.P.)
| | - Sünje Johanna Pamp
- Research Group for Genomic Epidemiology, National Food Institute, Technical University of Denmark, Kemitorvet, DK-2800 Kgs, 2800 Lyngby, Denmark; (H.M.); (T.N.P.); (C.P.); (F.M.A.); (R.S.H.); (S.J.P.)
| | - Alessandra De Cesare
- Department of Veterinary Medical Sciences, University of Bologna, Via Tolara di Sopra 50, 40064 Ozzano dell’Emilia, Italy
- Correspondence:
| |
Collapse
|
16
|
Oude Munnink BB, Nieuwenhuijse DF, Stein M, O'Toole Á, Haverkate M, Mollers M, Kamga SK, Schapendonk C, Pronk M, Lexmond P, van der Linden A, Bestebroer T, Chestakova I, Overmars RJ, van Nieuwkoop S, Molenkamp R, van der Eijk AA, GeurtsvanKessel C, Vennema H, Meijer A, Rambaut A, van Dissel J, Sikkema RS, Timen A, Koopmans M. Rapid SARS-CoV-2 whole-genome sequencing and analysis for informed public health decision-making in the Netherlands. Nat Med 2020; 26:1405-1410. [PMID: 32678356 DOI: 10.1038/s41591-020-0997-y] [Citation(s) in RCA: 208] [Impact Index Per Article: 52.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Accepted: 06/26/2020] [Indexed: 11/09/2022]
Abstract
In late December 2019, a cluster of cases of pneumonia of unknown etiology were reported linked to a market in Wuhan, China1. The causative agent was identified as the species Severe acute respiratory syndrome-related coronavirus and was named SARS-CoV-2 (ref. 2). By 16 April the virus had spread to 185 different countries, infected over 2,000,000 people and resulted in over 130,000 deaths3. In the Netherlands, the first case of SARS-CoV-2 was notified on 27 February. The outbreak started with several different introductory events from Italy, Austria, Germany and France followed by local amplification in, and later also outside, the south of the Netherlands. The combination of near to real-time whole-genome sequence analysis and epidemiology resulted in reliable assessments of the extent of SARS-CoV-2 transmission in the community, facilitating early decision-making to control local transmission of SARS-CoV-2 in the Netherlands. We demonstrate how these data were generated and analyzed, and how SARS-CoV-2 whole-genome sequencing, in combination with epidemiological data, was used to inform public health decision-making in the Netherlands.
Collapse
Affiliation(s)
- Bas B Oude Munnink
- ErasmusMC, Department of Viroscience, WHO Collaborating Centre for Arbovirus and Viral Hemorrhagic Fever Reference and Research, Rotterdam, the Netherlands
| | - David F Nieuwenhuijse
- ErasmusMC, Department of Viroscience, WHO Collaborating Centre for Arbovirus and Viral Hemorrhagic Fever Reference and Research, Rotterdam, the Netherlands
| | - Mart Stein
- Centre for Infectious Disease Control, National Institute for Public Health and the Environment, Bilthoven, the Netherlands
| | - Áine O'Toole
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, UK
| | - Manon Haverkate
- Centre for Infectious Disease Control, National Institute for Public Health and the Environment, Bilthoven, the Netherlands
| | - Madelief Mollers
- Centre for Infectious Disease Control, National Institute for Public Health and the Environment, Bilthoven, the Netherlands
| | - Sandra K Kamga
- Centre for Infectious Disease Control, National Institute for Public Health and the Environment, Bilthoven, the Netherlands
| | - Claudia Schapendonk
- ErasmusMC, Department of Viroscience, WHO Collaborating Centre for Arbovirus and Viral Hemorrhagic Fever Reference and Research, Rotterdam, the Netherlands
| | - Mark Pronk
- ErasmusMC, Department of Viroscience, WHO Collaborating Centre for Arbovirus and Viral Hemorrhagic Fever Reference and Research, Rotterdam, the Netherlands
| | - Pascal Lexmond
- ErasmusMC, Department of Viroscience, WHO Collaborating Centre for Arbovirus and Viral Hemorrhagic Fever Reference and Research, Rotterdam, the Netherlands
| | - Anne van der Linden
- ErasmusMC, Department of Viroscience, WHO Collaborating Centre for Arbovirus and Viral Hemorrhagic Fever Reference and Research, Rotterdam, the Netherlands
| | - Theo Bestebroer
- ErasmusMC, Department of Viroscience, WHO Collaborating Centre for Arbovirus and Viral Hemorrhagic Fever Reference and Research, Rotterdam, the Netherlands
| | - Irina Chestakova
- ErasmusMC, Department of Viroscience, WHO Collaborating Centre for Arbovirus and Viral Hemorrhagic Fever Reference and Research, Rotterdam, the Netherlands
| | - Ronald J Overmars
- ErasmusMC, Department of Viroscience, WHO Collaborating Centre for Arbovirus and Viral Hemorrhagic Fever Reference and Research, Rotterdam, the Netherlands
| | - Stefan van Nieuwkoop
- ErasmusMC, Department of Viroscience, WHO Collaborating Centre for Arbovirus and Viral Hemorrhagic Fever Reference and Research, Rotterdam, the Netherlands
| | - Richard Molenkamp
- ErasmusMC, Department of Viroscience, WHO Collaborating Centre for Arbovirus and Viral Hemorrhagic Fever Reference and Research, Rotterdam, the Netherlands
| | - Annemiek A van der Eijk
- ErasmusMC, Department of Viroscience, WHO Collaborating Centre for Arbovirus and Viral Hemorrhagic Fever Reference and Research, Rotterdam, the Netherlands
| | - Corine GeurtsvanKessel
- ErasmusMC, Department of Viroscience, WHO Collaborating Centre for Arbovirus and Viral Hemorrhagic Fever Reference and Research, Rotterdam, the Netherlands
| | - Harry Vennema
- Centre for Infectious Disease Control, National Institute for Public Health and the Environment, Bilthoven, the Netherlands
| | - Adam Meijer
- Centre for Infectious Disease Control, National Institute for Public Health and the Environment, Bilthoven, the Netherlands
| | - Andrew Rambaut
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, UK
| | - Jaap van Dissel
- Centre for Infectious Disease Control, National Institute for Public Health and the Environment, Bilthoven, the Netherlands
| | - Reina S Sikkema
- ErasmusMC, Department of Viroscience, WHO Collaborating Centre for Arbovirus and Viral Hemorrhagic Fever Reference and Research, Rotterdam, the Netherlands
| | - Aura Timen
- Centre for Infectious Disease Control, National Institute for Public Health and the Environment, Bilthoven, the Netherlands
| | - Marion Koopmans
- ErasmusMC, Department of Viroscience, WHO Collaborating Centre for Arbovirus and Viral Hemorrhagic Fever Reference and Research, Rotterdam, the Netherlands.
| | | |
Collapse
|
17
|
Egli A, Schrenzel J, Greub G. Digital microbiology. Clin Microbiol Infect 2020; 26:1324-1331. [PMID: 32603804 PMCID: PMC7320868 DOI: 10.1016/j.cmi.2020.06.023] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2019] [Revised: 06/15/2020] [Accepted: 06/20/2020] [Indexed: 12/12/2022]
Abstract
BACKGROUND Digitalization and artificial intelligence have an important impact on the way microbiology laboratories will work in the near future. Opportunities and challenges lie ahead to digitalize the microbiological workflows. Making efficient use of big data, machine learning, and artificial intelligence in clinical microbiology requires a profound understanding of data handling aspects. OBJECTIVE This review article summarizes the most important concepts of digital microbiology. The article gives microbiologists, clinicians and data scientists a viewpoint and practical examples along the diagnostic process. SOURCES We used peer-reviewed literature identified by a PubMed search for digitalization, machine learning, artificial intelligence and microbiology. CONTENT We describe the opportunities and challenges of digitalization in microbiological diagnostic processes with various examples. We also provide in this context key aspects of data structure and interoperability, as well as legal aspects. Finally, we outline the way for applications in a modern microbiology laboratory. IMPLICATIONS We predict that digitalization and the usage of machine learning will have a profound impact on the daily routine of laboratory staff. Along the analytical process, the most important steps should be identified, where digital technologies can be applied and provide a benefit. The education of all staff involved should be adapted to prepare for the advances in digital microbiology.
Collapse
Affiliation(s)
- A Egli
- Clinical Bacteriology and Mycology, University Hospital Basel, Basel, Switzerland; Applied Microbiology Research, Department of Biomedicine, University of Basel, Basel, Switzerland.
| | - J Schrenzel
- Laboratory of Bacteriology, University Hospitals of Geneva, Geneva, Switzerland
| | - G Greub
- Institute of Medical Microbiology, University Hospital Lausanne, Lausanne, Switzerland
| |
Collapse
|
18
|
Desdouits M, de Graaf M, Strubbia S, Oude Munnink BB, Kroneman A, Le Guyader FS, Koopmans MPG. Novel opportunities for NGS-based one health surveillance of foodborne viruses. ONE HEALTH OUTLOOK 2020; 2:14. [PMID: 33829135 PMCID: PMC7993515 DOI: 10.1186/s42522-020-00015-6] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/29/2020] [Accepted: 05/01/2020] [Indexed: 05/15/2023]
Abstract
Foodborne viral infections rank among the top 5 causes of disease, with noroviruses and hepatitis A causing the greatest burden globally. Contamination of foods by infected food handlers or through environmental pollution are the main sources of foodborne illness, with a lesser role for consumption of products from infected animals. Viral partial genomic sequencing has been used for more than two decades to track foodborne outbreaks and whole genome or metagenomics next-generation-sequencing (NGS) are new additions to the toolbox of food microbiology laboratories. We discuss developments in the field of targeted and metagenomic NGS, with an emphasis on application in food virology, the challenges and possible solutions towards future routine application.
Collapse
Affiliation(s)
- Marion Desdouits
- IFREMER, Laboratoire de Microbiologie, LSEM/SG2M, Nantes, France
| | - Miranda de Graaf
- Viroscience Department, Erasmus Medical Centre, Molewaterplein 40, 3015 GD Rotterdam, The Netherlands
| | - Sofia Strubbia
- IFREMER, Laboratoire de Microbiologie, LSEM/SG2M, Nantes, France
| | - Bas B. Oude Munnink
- Viroscience Department, Erasmus Medical Centre, Molewaterplein 40, 3015 GD Rotterdam, The Netherlands
| | - Annelies Kroneman
- Centre for Infectious Disease Control, National Institute of Public Health and the Environment, Bilthoven, The Netherlands
| | | | - Marion P. G. Koopmans
- Viroscience Department, Erasmus Medical Centre, Molewaterplein 40, 3015 GD Rotterdam, The Netherlands
| |
Collapse
|
19
|
Amid C, Alako BTF, Balavenkataraman Kadhirvelu V, Burdett T, Burgin J, Fan J, Harrison PW, Holt S, Hussein A, Ivanov E, Jayathilaka S, Kay S, Keane T, Leinonen R, Liu X, Martinez-Villacorta J, Milano A, Pakseresht A, Rahman N, Rajan J, Reddy K, Richards E, Smirnov D, Sokolov A, Vijayaraja S, Cochrane G. The European Nucleotide Archive in 2019. Nucleic Acids Res 2020; 48:D70-D76. [PMID: 31722421 PMCID: PMC7145635 DOI: 10.1093/nar/gkz1063] [Citation(s) in RCA: 55] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2019] [Revised: 10/25/2019] [Accepted: 11/07/2019] [Indexed: 11/12/2022] Open
Abstract
The European Nucleotide Archive (ENA, https://www.ebi.ac.uk/ena) at the European Molecular Biology Laboratory's European Bioinformatics Institute provides open and freely available data deposition and access services across the spectrum of nucleotide sequence data types. Making the world's public sequencing datasets available to the scientific community, the ENA represents a globally comprehensive nucleotide sequence resource. Here, we outline ENA services and content in 2019 and provide an insight into selected key areas of development in this period.
Collapse
Affiliation(s)
- Clara Amid
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Blaise T F Alako
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | | | - Tony Burdett
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Josephine Burgin
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jun Fan
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Peter W Harrison
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Sam Holt
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Abdulrahman Hussein
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Eugene Ivanov
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Suran Jayathilaka
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Simon Kay
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Thomas Keane
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Rasko Leinonen
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Xin Liu
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Josue Martinez-Villacorta
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Annalisa Milano
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Amir Pakseresht
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Nadim Rahman
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Jeena Rajan
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Kethi Reddy
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Edward Richards
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Dmitriy Smirnov
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Alexey Sokolov
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Senthilnathan Vijayaraja
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Guy Cochrane
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| |
Collapse
|
20
|
Matamoros S, Hendriksen RS, Pataki BÁ, Pakseresht N, Rossello M, Silvester N, Amid C, Aarestrup FM, Koopmans M, Cochrane G, Csabai I, Lund O, Schultsz C. Accelerating surveillance and research of antimicrobial resistance - an online repository for sharing of antimicrobial susceptibility data associated with whole-genome sequences. Microb Genom 2020; 6:e000342. [PMID: 32255760 PMCID: PMC7371118 DOI: 10.1099/mgen.0.000342] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2019] [Accepted: 01/31/2020] [Indexed: 11/29/2022] Open
Abstract
Antimicrobial resistance (AMR) is an emerging threat to modern medicine. Improved diagnostics and surveillance of resistant bacteria require the development of next-generation analysis tools and collaboration between international partners. Here, we present the 'AMR Data Hub', an online infrastructure for storage and sharing of structured phenotypic AMR data linked to bacterial whole-genome sequences. Leveraging infrastructure built by the European COMPARE Consortium and structured around the European Nucleotide Archive (ENA), the AMR Data Hub already provides an extensive data collection of more than 2500 isolates with linked genome and AMR data. Representing these data in standardized formats, we provide tools for the validation and submission of new data and services supporting search, browse and retrieval. The current collection was created through a collaboration by several partners from the European COMPARE Consortium, demonstrating the capacities and utility of the AMR Data Hub and its associated tools. We anticipate growth of content and offer the hub as a basis for future research into methods to explore and predict AMR.
Collapse
Affiliation(s)
- Sébastien Matamoros
- Amsterdam UMC, University of Amsterdam, Department of Medical Microbiology, Amsterdam, The Netherlands
| | - Rene S. Hendriksen
- National Food Institute, Technical University of Denmark, Lyngby, Denmark
| | - Bálint Ármin Pataki
- Department of Physics of Complex Systems, ELTE Eötvös Loránd University, Budapest, Hungary
- Department of Computational Sciences, Wigner Research Centre for Physics of the HAS, Budapest, Hungary
| | - Nima Pakseresht
- European Molecular Biology Laboratory (EMBL), European Bioinformatics Institute (EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Marc Rossello
- European Molecular Biology Laboratory (EMBL), European Bioinformatics Institute (EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Nicole Silvester
- European Molecular Biology Laboratory (EMBL), European Bioinformatics Institute (EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Clara Amid
- European Molecular Biology Laboratory (EMBL), European Bioinformatics Institute (EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Frank M. Aarestrup
- National Food Institute, Technical University of Denmark, Lyngby, Denmark
| | - Marion Koopmans
- Department of Viroscience, Erasmus University Medical Center, Rotterdam, The Netherlands
| | - Guy Cochrane
- European Molecular Biology Laboratory (EMBL), European Bioinformatics Institute (EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Istvan Csabai
- Department of Physics of Complex Systems, ELTE Eötvös Loránd University, Budapest, Hungary
- Department of Computational Sciences, Wigner Research Centre for Physics of the HAS, Budapest, Hungary
| | - Ole Lund
- National Food Institute, Technical University of Denmark, Lyngby, Denmark
| | - Constance Schultsz
- Amsterdam UMC, University of Amsterdam, Department of Medical Microbiology, Amsterdam, The Netherlands
- Amsterdam UMC, University of Amsterdam, Department of Global Health, Amsterdam Institute for Global Health and Development, Amsterdam, The Netherlands
| |
Collapse
|
21
|
Munck N, Leekitcharoenphon P, Litrup E, Kaas R, Meinen A, Guillier L, Tang Y, Malorny B, Palma F, Borowiak M, Gourmelon M, Simon S, Banerji S, Petrovska L, Dallman TJ, Hald T. Four European Salmonella Typhimurium datasets collected to develop WGS-based source attribution methods. Sci Data 2020; 7:75. [PMID: 32127544 PMCID: PMC7054362 DOI: 10.1038/s41597-020-0417-7] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2019] [Accepted: 02/03/2020] [Indexed: 11/22/2022] Open
Abstract
Zoonotic Salmonella causes millions of human salmonellosis infections worldwide each year. Information about the source of the bacteria guides risk managers on control and preventive strategies. Source attribution is the effort to quantify the number of sporadic human cases of a specific illness to specific sources and animal reservoirs. Source attribution methods for Salmonella have so far been based on traditional wet-lab typing methods. With the change to whole genome sequencing there is a need to develop new methods for source attribution based on sequencing data. Four European datasets collected in Denmark (DK), Germany (DE), the United Kingdom (UK) and France (FR) are presented in this descriptor. The datasets contain sequenced samples of Salmonella Typhimurium and its monophasic variants isolated from human, food, animal and the environment. The objective of the datasets was either to attribute the human salmonellosis cases to animal reservoirs or to investigate contamination of the environment by attributing the environmental isolates to different animal reservoirs.
Collapse
Affiliation(s)
- Nanna Munck
- Research Group for Genomic Epidemiology, National Food Institute, Technical University of Denmark, Kgs. Lyngby, Denmark.
| | - Pimlapas Leekitcharoenphon
- Research Group for Genomic Epidemiology, National Food Institute, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Eva Litrup
- Foodborne Infections, Department of Bacteria, Parasites and Fungi, Statens Serum Institute, Copenhagen, Denmark
| | - Rolf Kaas
- Research Group for Genomic Epidemiology, National Food Institute, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Anika Meinen
- Department for Infectious Disease Epidemiology, Robert Koch Institute, Berlin, Germany
| | - Laurent Guillier
- Université Paris Est, ANSES, Laboratory for Food Safety, F-94701, Maisons-Alfort, France
| | - Yue Tang
- Department of Bacteriology, Animal and Plant Health Agency, Weybridge, Surrey, UK
| | - Burkhard Malorny
- Department of Biological Safety, German Federal Institute for Risk Assessment, Berlin, Germany
| | - Federica Palma
- Université Paris Est, ANSES, Laboratory for Food Safety, F-94701, Maisons-Alfort, France
| | - Maria Borowiak
- Department of Biological Safety, German Federal Institute for Risk Assessment, Berlin, Germany
| | - Michèle Gourmelon
- Ifremer, Environment and Microbiology Laboratory, RBE, SGMM, Plouzané, France
| | - Sandra Simon
- National Reference Center for Salmonella and other bacterial enteric pathogens, Robert Koch Institute, Wernigerode, Germany
| | - Sangeeta Banerji
- National Reference Center for Salmonella and other bacterial enteric pathogens, Robert Koch Institute, Wernigerode, Germany
| | - Liljana Petrovska
- Department of Bacteriology, Animal and Plant Health Agency, Weybridge, Surrey, UK
| | | | - Tine Hald
- Research Group for Genomic Epidemiology, National Food Institute, Technical University of Denmark, Kgs. Lyngby, Denmark
| |
Collapse
|
22
|
Poen MJ, Pohlmann A, Amid C, Bestebroer TM, Brookes SM, Brown IH, Everett H, Schapendonk CME, Scheuer RD, Smits SL, Beer M, Fouchier RAM, Ellis RJ. Comparison of sequencing methods and data processing pipelines for whole genome sequencing and minority single nucleotide variant (mSNV) analysis during an influenza A/H5N8 outbreak. PLoS One 2020; 15:e0229326. [PMID: 32078666 PMCID: PMC7032710 DOI: 10.1371/journal.pone.0229326] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2019] [Accepted: 12/30/2019] [Indexed: 12/12/2022] Open
Abstract
As high-throughput sequencing technologies are becoming more widely adopted for analysing pathogens in disease outbreaks there needs to be assurance that the different sequencing technologies and approaches to data analysis will yield reliable and comparable results. Conversely, understanding where agreement cannot be achieved provides insight into the limitations of these approaches and also allows efforts to be focused on areas of the process that need improvement. This manuscript describes the next-generation sequencing of three closely related viruses, each analysed using different sequencing strategies, sequencing instruments and data processing pipelines. In order to determine the comparability of consensus sequences and minority (sub-consensus) single nucleotide variant (mSNV) identification, the biological samples, the sequence data from 3 sequencing platforms and the *.bam quality-trimmed alignment files of raw data of 3 influenza A/H5N8 viruses were shared. This analysis demonstrated that variation in the final result could be attributed to all stages in the process, but the most critical were the well-known homopolymer errors introduced by 454 sequencing, and the alignment processes in the different data processing pipelines which affected the consistency of mSNV detection. However, homopolymer errors aside, there was generally a good agreement between consensus sequences that were obtained for all combinations of sequencing platforms and data processing pipelines. Nevertheless, minority variant analysis will need a different level of careful standardization and awareness about the possible limitations, as shown in this study.
Collapse
Affiliation(s)
| | - Anne Pohlmann
- Institute of Diagnostic Virology, Friedrich-Loeffler-Institute, Insel Riems, Germany
| | - Clara Amid
- European Molecular Biology Laboratory (EMBL), European Bioinformatics Institute (EBI), Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom
| | | | - Sharon M. Brookes
- Animal and Plant Health Agency (APHA) - Weybridge, Addlestone, Surrey, United Kingdom
| | - Ian H. Brown
- Animal and Plant Health Agency (APHA) - Weybridge, Addlestone, Surrey, United Kingdom
| | - Helen Everett
- Animal and Plant Health Agency (APHA) - Weybridge, Addlestone, Surrey, United Kingdom
| | | | | | - Saskia L. Smits
- Erasmus MC, Department of Viroscience, Rotterdam, the Netherlands
| | - Martin Beer
- Institute of Diagnostic Virology, Friedrich-Loeffler-Institute, Insel Riems, Germany
| | | | - Richard J. Ellis
- Animal and Plant Health Agency (APHA) - Weybridge, Addlestone, Surrey, United Kingdom
- * E-mail:
| |
Collapse
|
23
|
Uelze L, Grützke J, Borowiak M, Hammerl JA, Juraschek K, Deneke C, Tausch SH, Malorny B. Typing methods based on whole genome sequencing data. ONE HEALTH OUTLOOK 2020; 2:3. [PMID: 33829127 PMCID: PMC7993478 DOI: 10.1186/s42522-020-0010-1] [Citation(s) in RCA: 53] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/04/2019] [Accepted: 01/08/2020] [Indexed: 05/12/2023]
Abstract
Whole genome sequencing (WGS) of foodborne pathogens has become an effective method for investigating the information contained in the genome sequence of bacterial pathogens. In addition, its highly discriminative power enables the comparison of genetic relatedness between bacteria even on a sub-species level. For this reason, WGS is being implemented worldwide and across sectors (human, veterinary, food, and environment) for the investigation of disease outbreaks, source attribution, and improved risk characterization models. In order to extract relevant information from the large quantity and complex data produced by WGS, a host of bioinformatics tools has been developed, allowing users to analyze and interpret sequencing data, starting from simple gene-searches to complex phylogenetic studies. Depending on the research question, the complexity of the dataset and their bioinformatics skill set, users can choose between a great variety of tools for the analysis of WGS data. In this review, we describe the relevant approaches for phylogenomic studies for outbreak studies and give an overview of selected tools for the characterization of foodborne pathogens based on WGS data. Despite the efforts of the last years, harmonization and standardization of typing tools are still urgently needed to allow for an easy comparison of data between laboratories, moving towards a one health worldwide surveillance system for foodborne pathogens.
Collapse
Affiliation(s)
- Laura Uelze
- Department for Biological Safety, German Federal Institute for Risk Assessment, BfR, Max-Dohrn Straße 8-10, 10589 Berlin, Germany
| | - Josephine Grützke
- Department for Biological Safety, German Federal Institute for Risk Assessment, BfR, Max-Dohrn Straße 8-10, 10589 Berlin, Germany
| | - Maria Borowiak
- Department for Biological Safety, German Federal Institute for Risk Assessment, BfR, Max-Dohrn Straße 8-10, 10589 Berlin, Germany
| | - Jens Andre Hammerl
- Department for Biological Safety, German Federal Institute for Risk Assessment, BfR, Max-Dohrn Straße 8-10, 10589 Berlin, Germany
| | - Katharina Juraschek
- Department for Biological Safety, German Federal Institute for Risk Assessment, BfR, Max-Dohrn Straße 8-10, 10589 Berlin, Germany
| | - Carlus Deneke
- Department for Biological Safety, German Federal Institute for Risk Assessment, BfR, Max-Dohrn Straße 8-10, 10589 Berlin, Germany
| | - Simon H. Tausch
- Department for Biological Safety, German Federal Institute for Risk Assessment, BfR, Max-Dohrn Straße 8-10, 10589 Berlin, Germany
| | - Burkhard Malorny
- Department for Biological Safety, German Federal Institute for Risk Assessment, BfR, Max-Dohrn Straße 8-10, 10589 Berlin, Germany
| |
Collapse
|