2
|
Gill EE, Jia B, Murall CL, Poujol R, Anwar MZ, John NS, Richardsson J, Hobb A, Olabode AS, Lepsa A, Duggan AT, Tyler AD, N'Guessan A, Kachru A, Chan B, Yoshida C, Yung CK, Bujold D, Andric D, Su E, Griffiths EJ, Van Domselaar G, Jolly GW, Ward HKE, Feher H, Baker J, Simpson JT, Uddin J, Ragoussis J, Eubank J, Fritz JH, Gálvez JH, Fang K, Cullion K, Rivera L, Xiang L, Croxen MA, Shiell M, Prystajecky N, Quirion PO, Bajari R, Rich S, Mubareka S, Moreira S, Cain S, Sutcliffe SG, Kraemer SA, Alturmessov Y, Joly Y, Fiume M, Snutch TP, Bell C, Lopez-Correa C, Hussin JG, Joy JB, Colijn C, Gordon PMK, Hsiao WWL, Poon AFY, Knox NC, Courtot M, Stein L, Otto SP, Bourque G, Shapiro BJ, Brinkman FSL. The Canadian VirusSeq Data Portal and Duotang: open resources for SARS-CoV-2 viral sequences and genomic epidemiology. Microb Genom 2024; 10:001293. [PMID: 39401061 PMCID: PMC11472881 DOI: 10.1099/mgen.0.001293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Accepted: 08/20/2024] [Indexed: 10/15/2024] Open
Abstract
The COVID-19 pandemic led to a large global effort to sequence SARS-CoV-2 genomes from patient samples to track viral evolution and inform the public health response. Millions of SARS-CoV-2 genome sequences have been deposited in global public repositories. The Canadian COVID-19 Genomics Network (CanCOGeN - VirusSeq), a consortium tasked with coordinating expanded sequencing of SARS-CoV-2 genomes across Canada early in the pandemic, created the Canadian VirusSeq Data Portal, with associated data pipelines and procedures, to support these efforts. The goal of VirusSeq was to allow open access to Canadian SARS-CoV-2 genomic sequences and enhanced, standardized contextual data that were unavailable in other repositories and that meet FAIR standards (Findable, Accessible, Interoperable and Reusable). In addition, the portal data submission pipeline contains data quality checking procedures and appropriate acknowledgement of data generators that encourages collaboration. From inception to execution, the portal was developed with a conscientious focus on strong data governance principles and practices. Extensive efforts ensured a commitment to Canadian privacy laws, data security standards, and organizational processes. This portal has been coupled with other resources, such as Viral AI, and was further leveraged by the Coronavirus Variants Rapid Response Network (CoVaRR-Net) to produce a suite of continually updated analytical tools and notebooks. Here we highlight this portal (https://virusseq-dataportal.ca/), including its contextual data not available elsewhere, and the Duotang (https://covarr-net.github.io/duotang/duotang.html), a web platform that presents key genomic epidemiology and modelling analyses on circulating and emerging SARS-CoV-2 variants in Canada. Duotang presents dynamic changes in variant composition of SARS-CoV-2 in Canada and by province, estimates variant growth, and displays complementary interactive visualizations, with a text overview of the current situation. The VirusSeq Data Portal and Duotang resources, alongside additional analyses and resources computed from the portal (COVID-MVP, CoVizu), are all open source and freely available. Together, they provide an updated picture of SARS-CoV-2 evolution to spur scientific discussions, inform public discourse, and support communication with and within public health authorities. They also serve as a framework for other jurisdictions interested in open, collaborative sequence data sharing and analyses.
Collapse
Affiliation(s)
- Erin E. Gill
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada
| | - Baofeng Jia
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada
| | - Carmen Lia Murall
- Department of Microbiology and Immunology, McGill University, Montreal, QC, Canada
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, MB, Canada
| | - Raphaël Poujol
- Research Centre, Montréal Heart Institute, Montréal, QC, Canada
| | - Muhammad Zohaib Anwar
- Centre for Infectious Disease Genomics and One Health, Faculty of Health Sciences, Simon Fraser University, Burnaby, BC, Canada
| | - Nithu Sara John
- Centre for Infectious Disease Genomics and One Health, Faculty of Health Sciences, Simon Fraser University, Burnaby, BC, Canada
| | | | | | - Abayomi S. Olabode
- Department of Pathology and Laboratory Medicine, Western University, London, ON, Canada
| | | | - Ana T. Duggan
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, MB, Canada
| | - Andrea D. Tyler
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, MB, Canada
| | - Arnaud N'Guessan
- Département de Biochimie et Médecine Moléculaire, Université de Montréal, Montreal, QC, Canada
- McGill Genome Centre, McGill University, Montréal, QC, Canada
| | - Atul Kachru
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Brandon Chan
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Catherine Yoshida
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, MB, Canada
| | - Christina K. Yung
- Ontario Institute for Cancer Research, Toronto, ON, Canada
- Indoc Systems, Toronto, ON, Canada
| | - David Bujold
- Department of Human Genetics, McGill University, Montréal, QC, Canada
- Canadian Centre for Computational Genomics, Montréal, QC, Canada
| | - Dusan Andric
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Edmund Su
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Emma J. Griffiths
- Centre for Infectious Disease Genomics and One Health, Faculty of Health Sciences, Simon Fraser University, Burnaby, BC, Canada
| | - Gary Van Domselaar
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, MB, Canada
| | - Gordon W. Jolly
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, MB, Canada
| | | | - Henrich Feher
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Jared Baker
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | | | - Jaser Uddin
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | | | - Jon Eubank
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Jörg H. Fritz
- Department of Microbiology and Immunology, McGill Research Center on Complex Traits (MRCCT), Dahdaleh Institute of Genomic Medicine (DIGM), McGill University, Montréal, QC, Canada
| | | | | | - Kim Cullion
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | | | - Linda Xiang
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Matthew A. Croxen
- Alberta Precision Laboratories, Public Health Laboratory, Edmonton, AB, Canada
- Department of Laboratory Medicine and Pathology, University of Alberta, Edmonton, AB, Canada
- Li Ka Shing Institute of Virology, University of Alberta, Edmonton, AB, Canada
- Women and Children’s Health Research Institute, University of Alberta, Edmonton, AB, Canada
| | | | - Natalie Prystajecky
- British Columbia Centre for Disease Control Public Health Laboratory, Vancouver, BC, Canada
- Department of Pathology and Laboratory Medicine, Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
| | | | - Rosita Bajari
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Samantha Rich
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Samira Mubareka
- Sunnybrook Research Institute, Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada
| | | | - Scott Cain
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Steven G. Sutcliffe
- Department of Microbiology and Immunology, McGill University, Montréal, QC, Canada
| | - Susanne A. Kraemer
- McGill Genome Centre, McGill University, Montréal, QC, Canada
- Aquatic Contaminants Research Division, ECCC, Montréal, QC, Canada
| | | | - Yann Joly
- Centre of Genomics and Policy, McGill University, Montréal, QC, Canada
| | - CPHLN Consortium**
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada
- Department of Microbiology and Immunology, McGill University, Montreal, QC, Canada
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, MB, Canada
- Research Centre, Montréal Heart Institute, Montréal, QC, Canada
- Centre for Infectious Disease Genomics and One Health, Faculty of Health Sciences, Simon Fraser University, Burnaby, BC, Canada
- Ontario Institute for Cancer Research, Toronto, ON, Canada
- DNAstack, Toronto, ON, Canada
- Department of Pathology and Laboratory Medicine, Western University, London, ON, Canada
- Département de Biochimie et Médecine Moléculaire, Université de Montréal, Montreal, QC, Canada
- McGill Genome Centre, McGill University, Montréal, QC, Canada
- Indoc Systems, Toronto, ON, Canada
- Department of Human Genetics, McGill University, Montréal, QC, Canada
- Canadian Centre for Computational Genomics, Montréal, QC, Canada
- Department of Microbiology and Immunology, McGill Research Center on Complex Traits (MRCCT), Dahdaleh Institute of Genomic Medicine (DIGM), McGill University, Montréal, QC, Canada
- Alberta Precision Laboratories, Public Health Laboratory, Edmonton, AB, Canada
- Department of Laboratory Medicine and Pathology, University of Alberta, Edmonton, AB, Canada
- Li Ka Shing Institute of Virology, University of Alberta, Edmonton, AB, Canada
- Women and Children’s Health Research Institute, University of Alberta, Edmonton, AB, Canada
- British Columbia Centre for Disease Control Public Health Laboratory, Vancouver, BC, Canada
- Department of Pathology and Laboratory Medicine, Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
- Sunnybrook Research Institute, Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada
- Université de Montréal, Montréal, QC, Canada
- Department of Microbiology and Immunology, McGill University, Montréal, QC, Canada
- Aquatic Contaminants Research Division, ECCC, Montréal, QC, Canada
- Centre of Genomics and Policy, McGill University, Montréal, QC, Canada
- Michael Smith Laboratories and Djavad Mowafaghian Centre for Brain Health, University of British Columbia, Vancouver, BC, Canada
- Genome Canada, 150 Metcalfe Street, Suite 2100, Ottawa, ON, Canada
- Research Centre, Montréal Heart Institute, Montréal, QC, Canada
- Mila-Québec AI institute, Montréal, QC, Canada
- Molecular Epidemiology and Evolutionary Genetics, BC Centre for Excellence in HIV/AIDS, Vancouver, BC, Canada
- Infectious Diseases, Department of Medicine, University of British Columbia, Vancouver, BC, Canada
- Bioinformatics Programme, University of British Columbia, Vancouver, BC, Canada
- Department of Mathematics, Simon Fraser University, Burnaby, BC, Canada
- Centre for Health Genomics and Informatics, University of Calgary, Calgary, AB, Canada
- Department of Medical BioPhysics, University of Toronto, ON, Canada
- Department of Zoology and Biodiversity Research Centre, University of British Columbia, Vancouver, BC, Canada
| | - CanCOGeN Consortium**
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada
- Department of Microbiology and Immunology, McGill University, Montreal, QC, Canada
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, MB, Canada
- Research Centre, Montréal Heart Institute, Montréal, QC, Canada
- Centre for Infectious Disease Genomics and One Health, Faculty of Health Sciences, Simon Fraser University, Burnaby, BC, Canada
- Ontario Institute for Cancer Research, Toronto, ON, Canada
- DNAstack, Toronto, ON, Canada
- Department of Pathology and Laboratory Medicine, Western University, London, ON, Canada
- Département de Biochimie et Médecine Moléculaire, Université de Montréal, Montreal, QC, Canada
- McGill Genome Centre, McGill University, Montréal, QC, Canada
- Indoc Systems, Toronto, ON, Canada
- Department of Human Genetics, McGill University, Montréal, QC, Canada
- Canadian Centre for Computational Genomics, Montréal, QC, Canada
- Department of Microbiology and Immunology, McGill Research Center on Complex Traits (MRCCT), Dahdaleh Institute of Genomic Medicine (DIGM), McGill University, Montréal, QC, Canada
- Alberta Precision Laboratories, Public Health Laboratory, Edmonton, AB, Canada
- Department of Laboratory Medicine and Pathology, University of Alberta, Edmonton, AB, Canada
- Li Ka Shing Institute of Virology, University of Alberta, Edmonton, AB, Canada
- Women and Children’s Health Research Institute, University of Alberta, Edmonton, AB, Canada
- British Columbia Centre for Disease Control Public Health Laboratory, Vancouver, BC, Canada
- Department of Pathology and Laboratory Medicine, Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
- Sunnybrook Research Institute, Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada
- Université de Montréal, Montréal, QC, Canada
- Department of Microbiology and Immunology, McGill University, Montréal, QC, Canada
- Aquatic Contaminants Research Division, ECCC, Montréal, QC, Canada
- Centre of Genomics and Policy, McGill University, Montréal, QC, Canada
- Michael Smith Laboratories and Djavad Mowafaghian Centre for Brain Health, University of British Columbia, Vancouver, BC, Canada
- Genome Canada, 150 Metcalfe Street, Suite 2100, Ottawa, ON, Canada
- Research Centre, Montréal Heart Institute, Montréal, QC, Canada
- Mila-Québec AI institute, Montréal, QC, Canada
- Molecular Epidemiology and Evolutionary Genetics, BC Centre for Excellence in HIV/AIDS, Vancouver, BC, Canada
- Infectious Diseases, Department of Medicine, University of British Columbia, Vancouver, BC, Canada
- Bioinformatics Programme, University of British Columbia, Vancouver, BC, Canada
- Department of Mathematics, Simon Fraser University, Burnaby, BC, Canada
- Centre for Health Genomics and Informatics, University of Calgary, Calgary, AB, Canada
- Department of Medical BioPhysics, University of Toronto, ON, Canada
- Department of Zoology and Biodiversity Research Centre, University of British Columbia, Vancouver, BC, Canada
| | - VirusSeq Data Portal Academic and Health Network**
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada
- Department of Microbiology and Immunology, McGill University, Montreal, QC, Canada
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, MB, Canada
- Research Centre, Montréal Heart Institute, Montréal, QC, Canada
- Centre for Infectious Disease Genomics and One Health, Faculty of Health Sciences, Simon Fraser University, Burnaby, BC, Canada
- Ontario Institute for Cancer Research, Toronto, ON, Canada
- DNAstack, Toronto, ON, Canada
- Department of Pathology and Laboratory Medicine, Western University, London, ON, Canada
- Département de Biochimie et Médecine Moléculaire, Université de Montréal, Montreal, QC, Canada
- McGill Genome Centre, McGill University, Montréal, QC, Canada
- Indoc Systems, Toronto, ON, Canada
- Department of Human Genetics, McGill University, Montréal, QC, Canada
- Canadian Centre for Computational Genomics, Montréal, QC, Canada
- Department of Microbiology and Immunology, McGill Research Center on Complex Traits (MRCCT), Dahdaleh Institute of Genomic Medicine (DIGM), McGill University, Montréal, QC, Canada
- Alberta Precision Laboratories, Public Health Laboratory, Edmonton, AB, Canada
- Department of Laboratory Medicine and Pathology, University of Alberta, Edmonton, AB, Canada
- Li Ka Shing Institute of Virology, University of Alberta, Edmonton, AB, Canada
- Women and Children’s Health Research Institute, University of Alberta, Edmonton, AB, Canada
- British Columbia Centre for Disease Control Public Health Laboratory, Vancouver, BC, Canada
- Department of Pathology and Laboratory Medicine, Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
- Sunnybrook Research Institute, Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada
- Université de Montréal, Montréal, QC, Canada
- Department of Microbiology and Immunology, McGill University, Montréal, QC, Canada
- Aquatic Contaminants Research Division, ECCC, Montréal, QC, Canada
- Centre of Genomics and Policy, McGill University, Montréal, QC, Canada
- Michael Smith Laboratories and Djavad Mowafaghian Centre for Brain Health, University of British Columbia, Vancouver, BC, Canada
- Genome Canada, 150 Metcalfe Street, Suite 2100, Ottawa, ON, Canada
- Research Centre, Montréal Heart Institute, Montréal, QC, Canada
- Mila-Québec AI institute, Montréal, QC, Canada
- Molecular Epidemiology and Evolutionary Genetics, BC Centre for Excellence in HIV/AIDS, Vancouver, BC, Canada
- Infectious Diseases, Department of Medicine, University of British Columbia, Vancouver, BC, Canada
- Bioinformatics Programme, University of British Columbia, Vancouver, BC, Canada
- Department of Mathematics, Simon Fraser University, Burnaby, BC, Canada
- Centre for Health Genomics and Informatics, University of Calgary, Calgary, AB, Canada
- Department of Medical BioPhysics, University of Toronto, ON, Canada
- Department of Zoology and Biodiversity Research Centre, University of British Columbia, Vancouver, BC, Canada
| | | | - Terrance P. Snutch
- Michael Smith Laboratories and Djavad Mowafaghian Centre for Brain Health, University of British Columbia, Vancouver, BC, Canada
| | - Cindy Bell
- Genome Canada, 150 Metcalfe Street, Suite 2100, Ottawa, ON, Canada
| | | | - Julie G. Hussin
- Département de Biochimie et Médecine Moléculaire, Université de Montréal, Montreal, QC, Canada
- Research Centre, Montréal Heart Institute, Montréal, QC, Canada
- Mila-Québec AI institute, Montréal, QC, Canada
| | - Jeffrey B. Joy
- Molecular Epidemiology and Evolutionary Genetics, BC Centre for Excellence in HIV/AIDS, Vancouver, BC, Canada
- Infectious Diseases, Department of Medicine, University of British Columbia, Vancouver, BC, Canada
- Bioinformatics Programme, University of British Columbia, Vancouver, BC, Canada
| | - Caroline Colijn
- Department of Mathematics, Simon Fraser University, Burnaby, BC, Canada
| | - Paul M. K. Gordon
- Centre for Health Genomics and Informatics, University of Calgary, Calgary, AB, Canada
| | - William W. L. Hsiao
- Centre for Infectious Disease Genomics and One Health, Faculty of Health Sciences, Simon Fraser University, Burnaby, BC, Canada
| | - Art F. Y. Poon
- Department of Pathology and Laboratory Medicine, Western University, London, ON, Canada
| | - Natalie C. Knox
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, MB, Canada
| | - Mélanie Courtot
- Ontario Institute for Cancer Research, Toronto, ON, Canada
- Department of Medical BioPhysics, University of Toronto, ON, Canada
| | - Lincoln Stein
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Sarah P. Otto
- Department of Zoology and Biodiversity Research Centre, University of British Columbia, Vancouver, BC, Canada
| | - Guillaume Bourque
- Department of Human Genetics, McGill University, Montréal, QC, Canada
- Canadian Centre for Computational Genomics, Montréal, QC, Canada
| | - B. Jesse Shapiro
- Department of Microbiology and Immunology, McGill University, Montréal, QC, Canada
| | - Fiona S. L. Brinkman
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada
| | - CPHLN consortium
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada
- Department of Microbiology and Immunology, McGill University, Montreal, QC, Canada
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, MB, Canada
- Research Centre, Montréal Heart Institute, Montréal, QC, Canada
- Centre for Infectious Disease Genomics and One Health, Faculty of Health Sciences, Simon Fraser University, Burnaby, BC, Canada
- Ontario Institute for Cancer Research, Toronto, ON, Canada
- DNAstack, Toronto, ON, Canada
- Department of Pathology and Laboratory Medicine, Western University, London, ON, Canada
- Département de Biochimie et Médecine Moléculaire, Université de Montréal, Montreal, QC, Canada
- McGill Genome Centre, McGill University, Montréal, QC, Canada
- Indoc Systems, Toronto, ON, Canada
- Department of Human Genetics, McGill University, Montréal, QC, Canada
- Canadian Centre for Computational Genomics, Montréal, QC, Canada
- Department of Microbiology and Immunology, McGill Research Center on Complex Traits (MRCCT), Dahdaleh Institute of Genomic Medicine (DIGM), McGill University, Montréal, QC, Canada
- Alberta Precision Laboratories, Public Health Laboratory, Edmonton, AB, Canada
- Department of Laboratory Medicine and Pathology, University of Alberta, Edmonton, AB, Canada
- Li Ka Shing Institute of Virology, University of Alberta, Edmonton, AB, Canada
- Women and Children’s Health Research Institute, University of Alberta, Edmonton, AB, Canada
- British Columbia Centre for Disease Control Public Health Laboratory, Vancouver, BC, Canada
- Department of Pathology and Laboratory Medicine, Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
- Sunnybrook Research Institute, Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada
- Université de Montréal, Montréal, QC, Canada
- Department of Microbiology and Immunology, McGill University, Montréal, QC, Canada
- Aquatic Contaminants Research Division, ECCC, Montréal, QC, Canada
- Centre of Genomics and Policy, McGill University, Montréal, QC, Canada
- Michael Smith Laboratories and Djavad Mowafaghian Centre for Brain Health, University of British Columbia, Vancouver, BC, Canada
- Genome Canada, 150 Metcalfe Street, Suite 2100, Ottawa, ON, Canada
- Research Centre, Montréal Heart Institute, Montréal, QC, Canada
- Mila-Québec AI institute, Montréal, QC, Canada
- Molecular Epidemiology and Evolutionary Genetics, BC Centre for Excellence in HIV/AIDS, Vancouver, BC, Canada
- Infectious Diseases, Department of Medicine, University of British Columbia, Vancouver, BC, Canada
- Bioinformatics Programme, University of British Columbia, Vancouver, BC, Canada
- Department of Mathematics, Simon Fraser University, Burnaby, BC, Canada
- Centre for Health Genomics and Informatics, University of Calgary, Calgary, AB, Canada
- Department of Medical BioPhysics, University of Toronto, ON, Canada
- Department of Zoology and Biodiversity Research Centre, University of British Columbia, Vancouver, BC, Canada
| |
Collapse
|
3
|
Gill EE, Jia B, Murall CL, Poujol R, Anwar MZ, John NS, Richardsson J, Hobb A, Olabode AS, Lepsa A, Duggan AT, Tyler AD, N’Guessan A, Kachru A, Chan B, Yoshida C, Yung CK, Bujold D, Andric D, Su E, Griffiths EJ, Van Domselaar G, Jolly GW, Ward HK, Feher H, Baker J, Simpson JT, Uddin J, Ragoussis J, Eubank J, Fritz JH, Gálvez JH, Fang K, Cullion K, Rivera L, Xiang L, Croxen MA, Shiell M, Prystajecky N, Quirion PO, Bajari R, Rich S, Mubareka S, Moreira S, Cain S, Sutcliffe SG, Kraemer SA, Joly Y, Alturmessov Y, consortium CPHLN, consortium C, Fiume M, Snutch TP, Bell C, Lopez-Correa C, Hussin JG, Joy JB, Colijn C, Gordon PM, Hsiao WW, Poon AF, Knox NC, Courtot M, Stein L, Otto SP, Bourque G, Shapiro BJ, Brinkman FS. The Canadian VirusSeq Data Portal & Duotang: open resources for SARS-CoV-2 viral sequences and genomic epidemiology. ARXIV 2024:arXiv:2405.04734v1. [PMID: 38764594 PMCID: PMC11100916] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 05/21/2024]
Abstract
The COVID-19 pandemic led to a large global effort to sequence SARS-CoV-2 genomes from patient samples to track viral evolution and inform public health response. Millions of SARS-CoV-2 genome sequences have been deposited in global public repositories. The Canadian COVID-19 Genomics Network (CanCOGeN - VirusSeq), a consortium tasked with coordinating expanded sequencing of SARS-CoV-2 genomes across Canada early in the pandemic, created the Canadian VirusSeq Data Portal, with associated data pipelines and procedures, to support these efforts. The goal of VirusSeq was to allow open access to Canadian SARS-CoV-2 genomic sequences and enhanced, standardized contextual data that were unavailable in other repositories and that meet FAIR standards (Findable, Accessible, Interoperable and Reusable). In addition, the Portal data submission pipeline contains data quality checking procedures and appropriate acknowledgement of data generators that encourages collaboration. From inception to execution, the portal was developed with a conscientious focus on strong data governance principles and practices. Extensive efforts ensured a commitment to Canadian privacy laws, data security standards, and organizational processes. This Portal has been coupled with other resources like Viral AI and was further leveraged by the Coronavirus Variants Rapid Response Network (CoVaRR-Net) to produce a suite of continually updated analytical tools and notebooks. Here we highlight this Portal, including its contextual data not available elsewhere, and the 'Duotang', a web platform that presents key genomic epidemiology and modeling analyses on circulating and emerging SARS-CoV-2 variants in Canada. Duotang presents dynamic changes in variant composition of SARS-CoV-2 in Canada and by province, estimates variant growth, and displays complementary interactive visualizations, with a text overview of the current situation. The VirusSeq Data Portal and Duotang resources, alongside additional analyses and resources computed from the Portal (COVID-MVP, CoVizu), are all open-source and freely available. Together, they provide an updated picture of SARS-CoV-2 evolution to spur scientific discussions, inform public discourse, and support communication with and within public health authorities. They also serve as a framework for other jurisdictions interested in open, collaborative sequence data sharing and analyses.
Collapse
Affiliation(s)
- Erin E. Gill
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada
| | - Baofeng Jia
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada
| | - Carmen Lia Murall
- Department of Microbiology and Immunology, McGill University, Montreal, QC, Canada
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, MB, Canada
| | - Raphaël Poujol
- Research Centre, Montréal Heart Institute, Montréal, QC, Canada
| | - Muhammad Zohaib Anwar
- Centre for Infectious Disease Genomics and One Health, Faculty of Health Sciences, Simon Fraser University, Burnaby, BC, Canada
| | - Nithu Sara John
- Centre for Infectious Disease Genomics and One Health, Faculty of Health Sciences, Simon Fraser University, Burnaby, BC, Canada
| | | | | | - Abayomi S. Olabode
- Department of Pathology and Laboratory Medicine, Western University, ON Canada
| | | | - Ana T. Duggan
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, MB, Canada
| | - Andrea D. Tyler
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, MB, Canada
| | - Arnaud N’Guessan
- Département de Biochimie et Médecine Moléculaire, Université de Montréal, Montreal, QC, Canada
- McGill Genome Centre, McGill University, Montréal, QC, Canada
| | - Atul Kachru
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Brandon Chan
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Catherine Yoshida
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, MB, Canada
| | - Christina K. Yung
- Ontario Institute for Cancer Research, Toronto, ON, Canada
- Indoc Systems, Toronto, ON, Canada
| | - David Bujold
- Department of Human Genetics, McGill University, Montréal, QC, Canada
- Canadian Centre for Computational Genomics, Montréal, QC, Canada
| | - Dusan Andric
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Edmund Su
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Emma J. Griffiths
- Centre for Infectious Disease Genomics and One Health, Faculty of Health Sciences, Simon Fraser University, Burnaby, BC, Canada
| | - Gary Van Domselaar
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, MB, Canada
| | - Gordon W. Jolly
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, MB, Canada
| | | | - Henrich Feher
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Jared Baker
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | | | - Jaser Uddin
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | | | - Jon Eubank
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Jörg H. Fritz
- Department of Microbiology and Immunology, McGill Research Center on Complex Traits (MRCCT), Dahdaleh Institute of Genomic Medicine (DIGM), McGill University, Montréal, QC, Canada
| | | | | | - Kim Cullion
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | | | - Linda Xiang
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Matthew A. Croxen
- Alberta Precision Laboratories, Public Health Laboratory, Edmonton, AB, Canada
- Department of Laboratory Medicine and Pathology, University of Alberta, Edmonton, AB, Canada
- Li Ka Shing Institute of Virology, University of Alberta, Edmonton, AB, Canada
- Women and Children’s Health Research Institute, University of Alberta, Edmonton, AB, Canada
| | | | - Natalie Prystajecky
- British Columbia Centre for Disease Control Public Health Laboratory, Vancouver, BC Canada
- Department of Pathology and Laboratory Medicine, Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
| | | | - Rosita Bajari
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Samantha Rich
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Samira Mubareka
- Sunnybrook Research Institute; Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada
| | | | - Scott Cain
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Steven G. Sutcliffe
- Department of Microbiology and Immunology, McGill University, Montréal, QC, Canada
| | - Susanne A. Kraemer
- McGill Genome Centre, McGill University, Montréal, QC, Canada
- Aquatic Contaminants Research Division, ECCC, Montréal, QC, Canada
| | - Yann Joly
- Centre of Genomics and Policy, McGill University, Montréal, QC, Canada
| | | | | | | | | | | | - Terrance P. Snutch
- Michael Smith Laboratories and Djavad Mowafaghian Centre for Brain Health, University of British Columbia, Vancouver, BC, Canada
| | - Cindy Bell
- Genome Canada, 150 Metcalfe Street, Suite 2100, Ottawa, ON, Canada
| | | | - Julie G. Hussin
- Département de Biochimie et Médecine Moléculaire, Université de Montréal, Montreal, QC, Canada
- Research Centre, Montréal Heart Institute, Montréal, QC, Canada
- Mila-Québec AI institute, Montréal, QC, Canada
| | - Jeffrey B. Joy
- Molecular Epidemiology and Evolutionary Genetics, BC Centre for Excellence in HIV/AIDS, Vancouver, BC, Canada
- Infectious Diseases, Department of Medicine, University of British Columbia, Vancouver, BC, Canada
- Bioinformatics Programme, University of British Columbia, Vancouver, BC, Canada
| | - Caroline Colijn
- Department of Mathematics, Simon Fraser University, Burnaby, BC, Canada
| | - Paul M.K. Gordon
- Centre for Health Genomics and Informatics, University of Calgary, Calgary, AB, Canada
| | - William W.L. Hsiao
- Centre for Infectious Disease Genomics and One Health, Faculty of Health Sciences, Simon Fraser University, Burnaby, BC, Canada
| | - Art F.Y. Poon
- Department of Pathology and Laboratory Medicine, Western University, ON Canada
| | - Natalie C. Knox
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, MB, Canada
| | - Mélanie Courtot
- Ontario Institute for Cancer Research, Toronto, ON, Canada
- Department of Medical BioPhysics, University of Toronto, ON, Canada
| | - Lincoln Stein
- Ontario Institute for Cancer Research, Toronto, ON, Canada
| | - Sarah P. Otto
- Department of Zoology & Biodiversity Research Centre, University of British Columbia, Vancouver BC Canada
| | - Guillaume Bourque
- Department of Human Genetics, McGill University, Montréal, QC, Canada
- Canadian Centre for Computational Genomics, Montréal, QC, Canada
| | - B. Jesse Shapiro
- Department of Microbiology and Immunology, McGill University, Montréal, QC, Canada
| | - Fiona S.L. Brinkman
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada
| |
Collapse
|