1
|
Nash D, Palermo CN, Inamoto I, Charles TC, Nissimov JI, Short SM. Hybrid sequencing reveals the genome of a Chrysochromulina parva virus and highlight its distinct replication strategy. BMC Genomics 2025; 26:498. [PMID: 40382578 PMCID: PMC12085832 DOI: 10.1186/s12864-025-11700-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2025] [Accepted: 05/12/2025] [Indexed: 05/20/2025] Open
Abstract
Chrysochromulina parva (C. parva) is a eukaryotic freshwater haptophyte algae found in lakes and rivers worldwide. It is known to be infected by viruses, yet knowledge of the diversity and activity of these viruses is still very limited. Based on sequences of PCR-amplified DNA polymerase B (polB) gene fragments, Chrysochromulina parva virus BQ1 (CpV-BQ1) was the first known lytic agent of C. parva, and was considered a member of the virus family Phycodnaviridae, order Algavirales. However, the genome of a different C. parva-infecting virus (CpV-BQ2, or Tethysvirus ontarioense) from another virus family, the Mesomimiviridae, order Imitervirales, was the first sequenced. Here, we report the complete genome sequence of the putative phycodnavirus CpV-BQ1, accession PQ783904. The complete CpV-BQ1 genome sequence is 165,454 bp with a GC content of 32.32% and it encodes 193 open reading frames. Phylogenetic analyses of several virus hallmark genes including the polB, the late gene transcription factor (VLTF-3), and the putative A32-like virion packaging ATPase (Viral A32) all demonstrate that CpV-BQ1 is most closely related to other viruses in the phylum Megaviricetes within the order Algavirales, family Phycodnaviridae.
Collapse
Affiliation(s)
- Delaney Nash
- Department of Biology, University of Waterloo, Waterloo, ON, N2L 3G1, Canada.
| | - Christine N Palermo
- Department of Biology, University of Toronto Mississauga, Mississauga, ON, L5L 1C6, Canada
| | - Ichiro Inamoto
- Department of Biology, University of Toronto Mississauga, Mississauga, ON, L5L 1C6, Canada
| | - Trevor C Charles
- Department of Biology, University of Waterloo, Waterloo, ON, N2L 3G1, Canada
| | - Jozef I Nissimov
- Department of Biology, University of Waterloo, Waterloo, ON, N2L 3G1, Canada.
| | - Steven M Short
- Department of Biology, University of Toronto Mississauga, Mississauga, ON, L5L 1C6, Canada.
| |
Collapse
|
2
|
Gladysheva A, Osinkina I, Radchenko N, Alkhireenko D, Agafonov A. Tertiary Structures of Haseki Tick Virus Nonstructural Proteins Are Similar to Those of Orthoflaviviruses. Int J Mol Sci 2024; 25:13654. [PMID: 39769413 PMCID: PMC11678601 DOI: 10.3390/ijms252413654] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2024] [Revised: 12/12/2024] [Accepted: 12/13/2024] [Indexed: 01/11/2025] Open
Abstract
Currently, a large number of novel tick-borne viruses potentially pathogenic to humans are discovered. Studying many of them by classical methods of virology is difficult due to the absence of live viral particles or a sufficient amount of their genetic material. In this case, the use of modern methods of bioinformatics and synthetic and structural biology can help. Haseki tick virus (HSTV) is a recently discovered tick-borne unclassified ssRNA(+) virus. HSTV-positive patients experienced fever and an elevated temperature. However, at the moment, there is no information on the tertiary structure and functions of its proteins. In this work, we used AlphaFold 3 and other bioinformatic tools for the annotation of HSTV nonstructural proteins, based on the principle that the tertiary structure of a protein is inextricably linked with its molecular function. We were the first to obtain models of tertiary structures and describe the putative functions of HSTV nonstructural proteins (NS3 helicase, NS3 protease, NS5 RNA-dependent RNA-polymerase, and NS5 methyltransferase), which play a key role in viral genome replication. Our results may help in further taxonomic identification of HSTV and the development of direct-acting antiviral drugs, POC tests, and vaccines.
Collapse
Affiliation(s)
- Anastasia Gladysheva
- State Research Center of Virology and Biotechnology “Vector”, 630559 Kol’tsovo, Russia; (I.O.); (N.R.); (D.A.); (A.A.)
| | - Irina Osinkina
- State Research Center of Virology and Biotechnology “Vector”, 630559 Kol’tsovo, Russia; (I.O.); (N.R.); (D.A.); (A.A.)
| | - Nikita Radchenko
- State Research Center of Virology and Biotechnology “Vector”, 630559 Kol’tsovo, Russia; (I.O.); (N.R.); (D.A.); (A.A.)
- Natural Sciences Department, Novosibirsk State University, 630090 Novosibirsk, Russia
| | - Daria Alkhireenko
- State Research Center of Virology and Biotechnology “Vector”, 630559 Kol’tsovo, Russia; (I.O.); (N.R.); (D.A.); (A.A.)
- Natural Sciences Department, Novosibirsk State University, 630090 Novosibirsk, Russia
| | - Alexander Agafonov
- State Research Center of Virology and Biotechnology “Vector”, 630559 Kol’tsovo, Russia; (I.O.); (N.R.); (D.A.); (A.A.)
| |
Collapse
|
3
|
Flamholz ZN, Li C, Kelly L. Improving viral annotation with artificial intelligence. mBio 2024; 15:e0320623. [PMID: 39230289 PMCID: PMC11481560 DOI: 10.1128/mbio.03206-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/05/2024] Open
Abstract
Viruses of bacteria, "phages," are fundamental, poorly understood components of microbial community structure and function. Additionally, their dependence on hosts for replication positions phages as unique sensors of ecosystem features and environmental pressures. High-throughput sequencing approaches have begun to give us access to the diversity and range of phage populations in complex microbial community samples, and metagenomics is currently the primary tool with which we study phage populations. The study of phages by metagenomic sequencing, however, is fundamentally limited by viral diversity, which results in the vast majority of viral genomes and metagenome-annotated genomes lacking annotation. To harness bacteriophages for applications in human and environmental health and disease, we need new methods to organize and annotate viral sequence diversity. We recently demonstrated that methods that leverage self-supervised representation learning can supplement statistical sequence representations for remote viral protein homology detection in the ocean virome and propose that consideration of the functional content of viral sequences allows for the identification of similarity in otherwise sequence-diverse viruses and viral-like elements for biological discovery. In this review, we describe the potential and pitfalls of large language models for viral annotation. We describe the need for new approaches to annotate viral sequences in metagenomes, the fundamentals of what protein language models are and how one can use them for sequence annotation, the strengths and weaknesses of these models, and future directions toward developing better models for viral annotation more broadly.
Collapse
Affiliation(s)
- Zachary N. Flamholz
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, New York, USA
| | - Charlotte Li
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, New York, USA
| | - Libusha Kelly
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, New York, USA
- Department of Microbiology and Immunology, Albert Einstein College of Medicine, Bronx, New York, USA
| |
Collapse
|
4
|
Nomburg J, Doherty EE, Price N, Bellieny-Rabelo D, Zhu YK, Doudna JA. Birth of protein folds and functions in the virome. Nature 2024; 633:710-717. [PMID: 39187718 PMCID: PMC11410667 DOI: 10.1038/s41586-024-07809-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Accepted: 07/10/2024] [Indexed: 08/28/2024]
Abstract
The rapid evolution of viruses generates proteins that are essential for infectivity and replication but with unknown functions, due to extreme sequence divergence1. Here, using a database of 67,715 newly predicted protein structures from 4,463 eukaryotic viral species, we found that 62% of viral proteins are structurally distinct and lack homologues in the AlphaFold database2,3. Among the remaining 38% of viral proteins, many have non-viral structural analogues that revealed surprising similarities between human pathogens and their eukaryotic hosts. Structural comparisons suggested putative functions for up to 25% of unannotated viral proteins, including those with roles in the evasion of innate immunity. In particular, RNA ligase T-like phosphodiesterases were found to resemble phage-encoded proteins that hydrolyse the host immune-activating cyclic dinucleotides 3',3'- and 2',3'-cyclic GMP-AMP (cGAMP). Experimental analysis showed that RNA ligase T homologues encoded by avian poxviruses similarly hydrolyse cGAMP, showing that RNA ligase T-mediated targeting of cGAMP is an evolutionarily conserved mechanism of immune evasion that is present in both bacteriophage and eukaryotic viruses. Together, the viral protein structural database and analyses presented here afford new opportunities to identify mechanisms of virus-host interactions that are common across the virome.
Collapse
Affiliation(s)
- Jason Nomburg
- Gladstone-UCSF Institute of Data Science and Biotechnology, San Francisco, CA, USA
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
- Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA, USA
| | - Erin E Doherty
- Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA, USA
- California Institute for Quantitative Biosciences, University of California, Berkeley, Berkeley, CA, USA
| | - Nathan Price
- Gladstone-UCSF Institute of Data Science and Biotechnology, San Francisco, CA, USA
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
- Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA, USA
| | - Daniel Bellieny-Rabelo
- Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA, USA
- California Institute for Quantitative Biosciences, University of California, Berkeley, Berkeley, CA, USA
| | - Yong K Zhu
- Gladstone-UCSF Institute of Data Science and Biotechnology, San Francisco, CA, USA
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
- Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA, USA
| | - Jennifer A Doudna
- Gladstone-UCSF Institute of Data Science and Biotechnology, San Francisco, CA, USA.
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA.
- Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA, USA.
- California Institute for Quantitative Biosciences, University of California, Berkeley, Berkeley, CA, USA.
- Howard Hughes Medical Institute, University of California, Berkeley, Berkeley, CA, USA.
- Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
- Department of Chemistry, University of California, Berkeley, Berkeley, CA, USA.
| |
Collapse
|
5
|
Camargo AP, Roux S, Schulz F, Babinski M, Xu Y, Hu B, Chain PSG, Nayfach S, Kyrpides NC. Identification of mobile genetic elements with geNomad. Nat Biotechnol 2024; 42:1303-1312. [PMID: 37735266 PMCID: PMC11324519 DOI: 10.1038/s41587-023-01953-y] [Citation(s) in RCA: 207] [Impact Index Per Article: 207.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Accepted: 08/17/2023] [Indexed: 09/23/2023]
Abstract
Identifying and characterizing mobile genetic elements in sequencing data is essential for understanding their diversity, ecology, biotechnological applications and impact on public health. Here we introduce geNomad, a classification and annotation framework that combines information from gene content and a deep neural network to identify sequences of plasmids and viruses. geNomad uses a dataset of more than 200,000 marker protein profiles to provide functional gene annotation and taxonomic assignment of viral genomes. Using a conditional random field model, geNomad also detects proviruses integrated into host genomes with high precision. In benchmarks, geNomad achieved high classification performance for diverse plasmids and viruses (Matthews correlation coefficient of 77.8% and 95.3%, respectively), substantially outperforming other tools. Leveraging geNomad's speed and scalability, we processed over 2.7 trillion base pairs of sequencing data, leading to the discovery of millions of viruses and plasmids that are available through the IMG/VR and IMG/PR databases. geNomad is available at https://portal.nersc.gov/genomad .
Collapse
Affiliation(s)
- Antonio Pedro Camargo
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| | - Simon Roux
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Frederik Schulz
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Michal Babinski
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Yan Xu
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Bin Hu
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Patrick S G Chain
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Stephen Nayfach
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Nikos C Kyrpides
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| |
Collapse
|
6
|
Flamholz ZN, Biller SJ, Kelly L. Large language models improve annotation of prokaryotic viral proteins. Nat Microbiol 2024; 9:537-549. [PMID: 38287147 PMCID: PMC11311208 DOI: 10.1038/s41564-023-01584-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2023] [Accepted: 12/08/2023] [Indexed: 01/31/2024]
Abstract
Viral genomes are poorly annotated in metagenomic samples, representing an obstacle to understanding viral diversity and function. Current annotation approaches rely on alignment-based sequence homology methods, which are limited by the paucity of characterized viral proteins and divergence among viral sequences. Here we show that protein language models can capture prokaryotic viral protein function, enabling new portions of viral sequence space to be assigned biologically meaningful labels. When applied to global ocean virome data, our classifier expanded the annotated fraction of viral protein families by 29%. Among previously unannotated sequences, we highlight the identification of an integrase defining a mobile element in marine picocyanobacteria and a capsid protein that anchors globally widespread viral elements. Furthermore, improved high-level functional annotation provides a means to characterize similarities in genomic organization among diverse viral sequences. Protein language models thus enhance remote homology detection of viral proteins, serving as a useful complement to existing approaches.
Collapse
Affiliation(s)
- Zachary N Flamholz
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Steven J Biller
- Department of Biological Sciences, Wellesley College, Wellesley, MA, USA
| | - Libusha Kelly
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, NY, USA.
- Department of Microbiology and Immunology, Albert Einstein College of Medicine, Bronx, NY, USA.
| |
Collapse
|
7
|
Yan M, Pratama AA, Somasundaram S, Li Z, Jiang Y, Sullivan MB, Yu Z. Interrogating the viral dark matter of the rumen ecosystem with a global virome database. Nat Commun 2023; 14:5254. [PMID: 37644066 PMCID: PMC10465536 DOI: 10.1038/s41467-023-41075-2] [Citation(s) in RCA: 35] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Accepted: 08/21/2023] [Indexed: 08/31/2023] Open
Abstract
The diverse rumen virome can modulate the rumen microbiome, but it remains largely unexplored. Here, we mine 975 published rumen metagenomes for viral sequences, create a global rumen virome database (RVD), and analyze the rumen virome for diversity, virus-host linkages, and potential roles in affecting rumen functions. Containing 397,180 species-level viral operational taxonomic units (vOTUs), RVD substantially increases the detection rate of rumen viruses from metagenomes compared with IMG/VR V3. Most of the classified vOTUs belong to Caudovirales, differing from those found in the human gut. The rumen virome is predicted to infect the core rumen microbiome, including fiber degraders and methanogens, carries diverse auxiliary metabolic genes, and thus likely impacts the rumen ecosystem in both a top-down and a bottom-up manner. RVD and the findings provide useful resources and a baseline framework for future research to investigate how viruses may impact the rumen ecosystem and digestive physiology.
Collapse
Affiliation(s)
- Ming Yan
- Department of Animal Sciences, The Ohio State University, Columbus, OH, USA
- Center of Microbiome Science, The Ohio State University, Columbus, OH, USA
| | - Akbar Adjie Pratama
- Center of Microbiome Science, The Ohio State University, Columbus, OH, USA
- Department of Microbiology, The Ohio State University, Columbus, OH, USA
| | - Sripoorna Somasundaram
- Department of Animal Sciences, The Ohio State University, Columbus, OH, USA
- Center of Microbiome Science, The Ohio State University, Columbus, OH, USA
| | - Zongjun Li
- College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Yu Jiang
- College of Animal Science and Technology, Northwest A&F University, Yangling, China
| | - Matthew B Sullivan
- Center of Microbiome Science, The Ohio State University, Columbus, OH, USA
- Department of Microbiology, The Ohio State University, Columbus, OH, USA
- Department of Civil, Environmental, and Geodetic Engineering, The Ohio State University, Columbus, OH, USA
| | - Zhongtang Yu
- Department of Animal Sciences, The Ohio State University, Columbus, OH, USA.
- Center of Microbiome Science, The Ohio State University, Columbus, OH, USA.
| |
Collapse
|
8
|
Flamholz ZN, Biller SJ, Kelly L. Large language models improve annotation of viral proteins. RESEARCH SQUARE 2023:rs.3.rs-2852098. [PMID: 37205395 PMCID: PMC10187409 DOI: 10.21203/rs.3.rs-2852098/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
Viral sequences are poorly annotated in environmental samples, a major roadblock to understanding how viruses influence microbial community structure. Current annotation approaches rely on alignment-based sequence ho-mology methods, which are limited by available viral sequences and sequence divergence in viral proteins. Here, we show that protein language model representations capture viral protein function beyond the limits of remote sequence homology by targeting two axes of viral sequence annotation: systematic labeling of protein families and function identification for biologic discovery. Protein language model representations capture protein functional properties specific to viruses and expand the annotated fraction of ocean virome viral protein sequences by 37%. Among unannotated viral protein families, we identify a novel DNA editing protein family that defines a new mobile element in marine picocyanobacteria. Protein language models thus significantly enhance remote homology detection of viral proteins and can be utilized to enable new biological discovery across diverse functional categories.
Collapse
Affiliation(s)
- Zachary N. Flamholz
- Department of Systems and Computational Biology, Albert Einstein College of Medicine; Bronx, NY, USA
| | - Steve J. Biller
- Department of Biological Sciences, Wellesley College; Wellesley, MA USA
| | - Libusha Kelly
- Department of Systems and Computational Biology, Albert Einstein College of Medicine; Bronx, NY, USA
- Department of Microbiology and Immunology, Albert Einstein College of Medicine; Bronx, NY, USA
| |
Collapse
|
9
|
Moraru C. VirClust-A Tool for Hierarchical Clustering, Core Protein Detection and Annotation of ( Prokaryotic) Viruses. Viruses 2023; 15:v15041007. [PMID: 37112988 PMCID: PMC10143988 DOI: 10.3390/v15041007] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2023] [Revised: 04/16/2023] [Accepted: 04/18/2023] [Indexed: 04/29/2023] Open
Abstract
Recent years have seen major changes in the classification criteria and taxonomy of viruses. The current classification scheme, also called "megataxonomy of viruses", recognizes six different viral realms, defined based on the presence of viral hallmark genes (VHGs). Within the realms, viruses are classified into hierarchical taxons, ideally defined by the phylogeny of their shared genes. To enable the detection of shared genes, viruses have first to be clustered, and there is currently a need for tools to assist with virus clustering and classification. Here, VirClust is presented. It is a novel, reference-free tool capable of performing: (i) protein clustering, based on BLASTp and Hidden Markov Models (HMMs) similarities; (ii) hierarchical clustering of viruses based on intergenomic distances calculated from their shared protein content; (iii) identification of core proteins and (iv) annotation of viral proteins. VirClust has flexible parameters both for protein clustering and for splitting the viral genome tree into smaller genome clusters, corresponding to different taxonomic levels. Benchmarking on a phage dataset showed that the genome trees produced by VirClust match the current ICTV classification at family, sub-family and genus levels. VirClust is freely available, as a web-service and stand-alone tool.
Collapse
Affiliation(s)
- Cristina Moraru
- Institute for Chemistry and Biology of the Marine Environment, Carl-von-Ossietzky-Str. 9-11, 26111 Oldenburg, Germany
| |
Collapse
|
10
|
Oliveira LS, Reyes A, Dutilh BE, Gruber A. Rational Design of Profile HMMs for Sensitive and Specific Sequence Detection with Case Studies Applied to Viruses, Bacteriophages, and Casposons. Viruses 2023; 15:519. [PMID: 36851733 PMCID: PMC9966878 DOI: 10.3390/v15020519] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2022] [Revised: 02/01/2023] [Accepted: 02/09/2023] [Indexed: 02/15/2023] Open
Abstract
Profile hidden Markov models (HMMs) are a powerful way of modeling biological sequence diversity and constitute a very sensitive approach to detecting divergent sequences. Here, we report the development of protocols for the rational design of profile HMMs. These methods were implemented on TABAJARA, a program that can be used to either detect all biological sequences of a group or discriminate specific groups of sequences. By calculating position-specific information scores along a multiple sequence alignment, TABAJARA automatically identifies the most informative sequence motifs and uses them to construct profile HMMs. As a proof-of-principle, we applied TABAJARA to generate profile HMMs for the detection and classification of two viral groups presenting different evolutionary rates: bacteriophages of the Microviridae family and viruses of the Flavivirus genus. We obtained conserved models for the generic detection of any Microviridae or Flavivirus sequence, and profile HMMs that can specifically discriminate Microviridae subfamilies or Flavivirus species. In another application, we constructed Cas1 endonuclease-derived profile HMMs that can discriminate CRISPRs and casposons, two evolutionarily related transposable elements. We believe that the protocols described here, and implemented on TABAJARA, constitute a generic toolbox for generating profile HMMs for the highly sensitive and specific detection of sequence classes.
Collapse
Affiliation(s)
- Liliane S. Oliveira
- Department of Parasitology, Instituto de Ciências Biomédicas, Universidade de São Paulo, São Paulo 05508-000, SP, Brazil
| | - Alejandro Reyes
- Max Planck Tandem Group in Computational Biology, Department of Biological Sciences, Universidad de los Andes, Bogotá 111711, Colombia
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO 63108, USA
| | - Bas E. Dutilh
- Institute of Biodiversity, Faculty of Biological Sciences, Cluster of Excellence Balance of the Microverse, Friedrich-Schiller-University Jena, 07743 Jena, Germany
- Theoretical Biology and Bioinformatics, Science for Life, Utrecht University, Padualaan 8, 3584 CH Utrecht, The Netherlands
- European Virus Bioinformatics Center, Leutragraben 1, 07743 Jena, Germany
| | - Arthur Gruber
- Department of Parasitology, Instituto de Ciências Biomédicas, Universidade de São Paulo, São Paulo 05508-000, SP, Brazil
- European Virus Bioinformatics Center, Leutragraben 1, 07743 Jena, Germany
| |
Collapse
|
11
|
Zucker F, Bischoff V, Olo Ndela E, Heyerhoff B, Poehlein A, Freese HM, Roux S, Simon M, Enault F, Moraru C. New Microviridae isolated from Sulfitobacter reveals two cosmopolitan subfamilies of single-stranded DNA phages infecting marine and terrestrial Alphaproteobacteria. Virus Evol 2022; 8:veac070. [PMID: 36533142 PMCID: PMC9753089 DOI: 10.1093/ve/veac070] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Revised: 05/24/2022] [Accepted: 07/28/2022] [Indexed: 10/01/2023] Open
Abstract
The Microviridae family represents one of the major clades of single-stranded DNA (ssDNA) phages. Their cultivated members are lytic and infect Proteobacteria, Bacteroidetes, and Chlamydiae. Prophages have been predicted in the genomes from Bacteroidales, Hyphomicrobiales, and Enterobacteriaceae and cluster within the 'Alpavirinae', 'Amoyvirinae', and Gokushovirinae. We have isolated 'Ascunsovirus oldenburgi' ICBM5, a novel phage distantly related to known Microviridae. It infects Sulfitobacter dubius SH24-1b and uses both a lytic and a carrier-state life strategy. Using ICBM5 proteins as a query, we uncovered in publicly available resources sixty-five new Microviridae prophages and episomes in bacterial genomes and retrieved forty-seven environmental viral genomes (EVGs) from various viromes. Genome clustering based on protein content and phylogenetic analysis showed that ICBM5, together with Rhizobium phages, new prophages, episomes, and EVGs cluster within two new phylogenetic clades, here tentatively assigned the rank of subfamily and named 'Tainavirinae' and 'Occultatumvirinae'. They both infect Rhodobacterales. Occultatumviruses also infect Hyphomicrobiales, including nitrogen-fixing endosymbionts from cosmopolitan legumes. A biogeographical assessment showed that tainaviruses and occultatumviruses are spread worldwide, in terrestrial and marine environments. The new phage isolated here sheds light onto new and diverse branches of the Microviridae tree, suggesting that much of the ssDNA phage diversity remains in the dark.
Collapse
Affiliation(s)
- Falk Zucker
- Institute for Chemistry and Biology of the Marine Environment, University of Oldenburg, Carl-von-Ossietzky-Str. 9−11, Oldenburg D-26111, Germany
| | - Vera Bischoff
- Institute for Chemistry and Biology of the Marine Environment, University of Oldenburg, Carl-von-Ossietzky-Str. 9−11, Oldenburg D-26111, Germany
| | - Eric Olo Ndela
- Laboratoire Microorganismes: Genome Environment (LMGE), Université Clermont Auvergne, CNRS, 1 Imp. Amélie Murat, Aubière 63170, Frankreich
| | - Benedikt Heyerhoff
- Institute for Chemistry and Biology of the Marine Environment, University of Oldenburg, Carl-von-Ossietzky-Str. 9−11, Oldenburg D-26111, Germany
| | - Anja Poehlein
- Department of Genomic and Applied Microbiology & Göttingen Genomics Laboratory, Georg-August-University Göttingen, Institute of Microbiology and Genetics, Grisebachstr. 8, Göttingen D-37077, Germany
| | - Heike M Freese
- Leibniz-Institut DSMZ, Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH, Inhoffenstraße 7 B, Braunschweig D-38124, Germany
| | - Simon Roux
- Lawrence Berkeley National Laboratory, DOE Joint Genome Institute, Berkeley, CA 94720, USA
| | - Meinhard Simon
- Institute for Chemistry and Biology of the Marine Environment, University of Oldenburg, Carl-von-Ossietzky-Str. 9−11, Oldenburg D-26111, Germany
| | - Francois Enault
- Laboratoire Microorganismes: Genome Environment (LMGE), Université Clermont Auvergne, CNRS, 1 Imp. Amélie Murat, Aubière 63170, Frankreich
| | - Cristina Moraru
- Institute for Chemistry and Biology of the Marine Environment, University of Oldenburg, Carl-von-Ossietzky-Str. 9−11, Oldenburg D-26111, Germany
| |
Collapse
|
12
|
Abril AG, Carrera M, Notario V, Sánchez-Pérez Á, Villa TG. The Use of Bacteriophages in Biotechnology and Recent Insights into Proteomics. Antibiotics (Basel) 2022; 11:653. [PMID: 35625297 PMCID: PMC9137636 DOI: 10.3390/antibiotics11050653] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Revised: 05/11/2022] [Accepted: 05/12/2022] [Indexed: 12/10/2022] Open
Abstract
Phages have certain features, such as their ability to form protein-protein interactions, that make them good candidates for use in a variety of beneficial applications, such as in human or animal health, industry, food science, food safety, and agriculture. It is essential to identify and characterize the proteins produced by particular phages in order to use these viruses in a variety of functional processes, such as bacterial detection, as vehicles for drug delivery, in vaccine development, and to combat multidrug resistant bacterial infections. Furthermore, phages can also play a major role in the design of a variety of cheap and stable sensors as well as in diagnostic assays that can either specifically identify specific compounds or detect bacteria. This article reviews recently developed phage-based techniques, such as the use of recombinant tempered phages, phage display and phage amplification-based detection. It also encompasses the application of phages as capture elements, biosensors and bioreceptors, with a special emphasis on novel bacteriophage-based mass spectrometry (MS) applications.
Collapse
Affiliation(s)
- Ana G. Abril
- Department of Microbiology and Parasitology, Faculty of Pharmacy, University of Santiago de Compostela, 15898 Santiago de Compostela, Spain;
- Department of Food Technology, Spanish National Research Council (CSIC), Marine Research Institute (IIM), 36208 Vigo, Spain;
| | - Mónica Carrera
- Department of Food Technology, Spanish National Research Council (CSIC), Marine Research Institute (IIM), 36208 Vigo, Spain;
| | - Vicente Notario
- Lombardi Comprehensive Cancer Center, Georgetown University, Washington, DC 20057, USA;
| | - Ángeles Sánchez-Pérez
- Sydney School of Veterinary Science, Faculty of Science, University of Sydney, Sydney, NSW 2006, Australia;
| | - Tomás G. Villa
- Department of Microbiology and Parasitology, Faculty of Pharmacy, University of Santiago de Compostela, 15898 Santiago de Compostela, Spain;
| |
Collapse
|
13
|
Hufsky F, Beslic D, Boeckaerts D, Duchene S, González-Tortuero E, Gruber AJ, Guo J, Jansen D, Juma J, Kongkitimanon K, Luque A, Ritsch M, Lencioni Lovate G, Nishimura L, Pas C, Domingo E, Hodcroft E, Lemey P, Sullivan MB, Weber F, González-Candelas F, Krautwurst S, Pérez-Cataluña A, Randazzo W, Sánchez G, Marz M. The International Virus Bioinformatics Meeting 2022. Viruses 2022; 14:973. [PMID: 35632715 PMCID: PMC9144528 DOI: 10.3390/v14050973] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Accepted: 04/28/2022] [Indexed: 12/22/2022] Open
Abstract
The International Virus Bioinformatics Meeting 2022 took place online, on 23-25 March 2022, and has attracted about 380 participants from all over the world. The goal of the meeting was to provide a meaningful and interactive scientific environment to promote discussion and collaboration and to inspire and suggest new research directions and questions. The participants created a highly interactive scientific environment even without physical face-to-face interactions. This meeting is a focal point to gain an insight into the state-of-the-art of the virus bioinformatics research landscape and to interact with researchers in the forefront as well as aspiring young scientists. The meeting featured eight invited and 18 contributed talks in eight sessions on three days, as well as 52 posters, which were presented during three virtual poster sessions. The main topics were: SARS-CoV-2, viral emergence and surveillance, virus-host interactions, viral sequence analysis, virus identification and annotation, phages, and viral diversity. This report summarizes the main research findings and highlights presented at the meeting.
Collapse
Affiliation(s)
- Franziska Hufsky
- European Virus Bioinformatics Center, 07743 Jena, Germany; (E.G.-T.); (A.J.G.); (J.G.); (D.J.); (K.K.); (A.L.); (M.R.); (G.L.L.); (L.N.); (E.D.); (E.H.); (P.L.); (M.B.S.); (F.W.); (F.G.-C.); (A.P.-C.); (W.R.); (G.S.)
- RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University Jena, 07743 Jena, Germany;
| | - Denis Beslic
- Methodology and Research Infrastructure, MF1 Bioinformatics, Robert Koch Institute, 13353 Berlin, Germany;
| | - Dimitri Boeckaerts
- Laboratory of Applied Biotechnology, Department of Biotechnology, Ghent University, 9000 Ghent, Belgium; (D.B.); (C.P.)
- KERMIT, Department of Data Analysis and Mathematical Modelling, Ghent University, 9000 Ghent, Belgium
| | - Sebastian Duchene
- Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne 3000, Australia;
| | - Enrique González-Tortuero
- European Virus Bioinformatics Center, 07743 Jena, Germany; (E.G.-T.); (A.J.G.); (J.G.); (D.J.); (K.K.); (A.L.); (M.R.); (G.L.L.); (L.N.); (E.D.); (E.H.); (P.L.); (M.B.S.); (F.W.); (F.G.-C.); (A.P.-C.); (W.R.); (G.S.)
- School of Science, Engineering and Environment (SEE), University of Salford, Salford M5 4WT, UK
| | - Andreas J. Gruber
- European Virus Bioinformatics Center, 07743 Jena, Germany; (E.G.-T.); (A.J.G.); (J.G.); (D.J.); (K.K.); (A.L.); (M.R.); (G.L.L.); (L.N.); (E.D.); (E.H.); (P.L.); (M.B.S.); (F.W.); (F.G.-C.); (A.P.-C.); (W.R.); (G.S.)
- Department of Biology, University of Konstanz, 78464 Konstanz, Germany
| | - Jiarong Guo
- European Virus Bioinformatics Center, 07743 Jena, Germany; (E.G.-T.); (A.J.G.); (J.G.); (D.J.); (K.K.); (A.L.); (M.R.); (G.L.L.); (L.N.); (E.D.); (E.H.); (P.L.); (M.B.S.); (F.W.); (F.G.-C.); (A.P.-C.); (W.R.); (G.S.)
- Departments of Microbiology, and Civil, Environmental, and Geodetic Engineering, Ohio State University, Columbus, OH 43210, USA
| | - Daan Jansen
- European Virus Bioinformatics Center, 07743 Jena, Germany; (E.G.-T.); (A.J.G.); (J.G.); (D.J.); (K.K.); (A.L.); (M.R.); (G.L.L.); (L.N.); (E.D.); (E.H.); (P.L.); (M.B.S.); (F.W.); (F.G.-C.); (A.P.-C.); (W.R.); (G.S.)
- Department of Microbiology, Immunology and Transplantation, Rega Institute, Laboratory of Viral Metagenomics, KU Leuven, 3000 Leuven, Belgium
| | - John Juma
- International Livestock Research Institute (ILRI), Nairobi 00100, Kenya;
- South African National Bioinformatics Institute, South African MRC Bioinformatics Unit, Cape Town 7530, South Africa
| | - Kunaphas Kongkitimanon
- European Virus Bioinformatics Center, 07743 Jena, Germany; (E.G.-T.); (A.J.G.); (J.G.); (D.J.); (K.K.); (A.L.); (M.R.); (G.L.L.); (L.N.); (E.D.); (E.H.); (P.L.); (M.B.S.); (F.W.); (F.G.-C.); (A.P.-C.); (W.R.); (G.S.)
- Methodology and Research Infrastructure, MF1 Bioinformatics, Robert Koch Institute, 13353 Berlin, Germany;
| | - Antoni Luque
- European Virus Bioinformatics Center, 07743 Jena, Germany; (E.G.-T.); (A.J.G.); (J.G.); (D.J.); (K.K.); (A.L.); (M.R.); (G.L.L.); (L.N.); (E.D.); (E.H.); (P.L.); (M.B.S.); (F.W.); (F.G.-C.); (A.P.-C.); (W.R.); (G.S.)
- Viral Information Institute, San Diego State University, San Diego, CA 92116, USA
- Computational Science Research Center, San Diego State University, San Diego, CA 92116, USA
- Department of Mathematics and Statistics, San Diego State University, San Diego, CA 92116, USA
| | - Muriel Ritsch
- European Virus Bioinformatics Center, 07743 Jena, Germany; (E.G.-T.); (A.J.G.); (J.G.); (D.J.); (K.K.); (A.L.); (M.R.); (G.L.L.); (L.N.); (E.D.); (E.H.); (P.L.); (M.B.S.); (F.W.); (F.G.-C.); (A.P.-C.); (W.R.); (G.S.)
- RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University Jena, 07743 Jena, Germany;
| | - Gabriel Lencioni Lovate
- European Virus Bioinformatics Center, 07743 Jena, Germany; (E.G.-T.); (A.J.G.); (J.G.); (D.J.); (K.K.); (A.L.); (M.R.); (G.L.L.); (L.N.); (E.D.); (E.H.); (P.L.); (M.B.S.); (F.W.); (F.G.-C.); (A.P.-C.); (W.R.); (G.S.)
- RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University Jena, 07743 Jena, Germany;
- JRG Analytical MicroBioinformatics, Friedrich Schiller University Jena, 07743 Jena, Germany
| | - Luca Nishimura
- European Virus Bioinformatics Center, 07743 Jena, Germany; (E.G.-T.); (A.J.G.); (J.G.); (D.J.); (K.K.); (A.L.); (M.R.); (G.L.L.); (L.N.); (E.D.); (E.H.); (P.L.); (M.B.S.); (F.W.); (F.G.-C.); (A.P.-C.); (W.R.); (G.S.)
- Department of Genetics, School of Life Science, The Graduate University for Advanced Studies (SOKENDAI), Mishima 411-8540, Japan
- Human Genetics Laboratory, National Institute of Genetics, Mishima 411-8540, Japan
| | - Célia Pas
- Laboratory of Applied Biotechnology, Department of Biotechnology, Ghent University, 9000 Ghent, Belgium; (D.B.); (C.P.)
| | - Esteban Domingo
- European Virus Bioinformatics Center, 07743 Jena, Germany; (E.G.-T.); (A.J.G.); (J.G.); (D.J.); (K.K.); (A.L.); (M.R.); (G.L.L.); (L.N.); (E.D.); (E.H.); (P.L.); (M.B.S.); (F.W.); (F.G.-C.); (A.P.-C.); (W.R.); (G.S.)
- Centro de Biología Molecular “Severo Ochoa” (CSIC-UAM), 28049 Madrid, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd) del Instituto de Salud Carlos III, 28029 Madrid, Spain
| | - Emma Hodcroft
- European Virus Bioinformatics Center, 07743 Jena, Germany; (E.G.-T.); (A.J.G.); (J.G.); (D.J.); (K.K.); (A.L.); (M.R.); (G.L.L.); (L.N.); (E.D.); (E.H.); (P.L.); (M.B.S.); (F.W.); (F.G.-C.); (A.P.-C.); (W.R.); (G.S.)
- Institute of Social and Preventive Medicine, University of Bern, 3012 Bern, Switzerland
- Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Philippe Lemey
- European Virus Bioinformatics Center, 07743 Jena, Germany; (E.G.-T.); (A.J.G.); (J.G.); (D.J.); (K.K.); (A.L.); (M.R.); (G.L.L.); (L.N.); (E.D.); (E.H.); (P.L.); (M.B.S.); (F.W.); (F.G.-C.); (A.P.-C.); (W.R.); (G.S.)
- Department of Microbiology, Immunology and Transplantation, Rega Institute, KU Leuven, 3000 Leuven, Belgium
| | - Matthew B. Sullivan
- European Virus Bioinformatics Center, 07743 Jena, Germany; (E.G.-T.); (A.J.G.); (J.G.); (D.J.); (K.K.); (A.L.); (M.R.); (G.L.L.); (L.N.); (E.D.); (E.H.); (P.L.); (M.B.S.); (F.W.); (F.G.-C.); (A.P.-C.); (W.R.); (G.S.)
- Departments of Microbiology, and Civil, Environmental, and Geodetic Engineering, Ohio State University, Columbus, OH 43210, USA
| | - Friedemann Weber
- European Virus Bioinformatics Center, 07743 Jena, Germany; (E.G.-T.); (A.J.G.); (J.G.); (D.J.); (K.K.); (A.L.); (M.R.); (G.L.L.); (L.N.); (E.D.); (E.H.); (P.L.); (M.B.S.); (F.W.); (F.G.-C.); (A.P.-C.); (W.R.); (G.S.)
- Institute for Virology, Veterinary Medicine, Justus-Liebig University, 35390 Gießen, Germany
| | - Fernando González-Candelas
- European Virus Bioinformatics Center, 07743 Jena, Germany; (E.G.-T.); (A.J.G.); (J.G.); (D.J.); (K.K.); (A.L.); (M.R.); (G.L.L.); (L.N.); (E.D.); (E.H.); (P.L.); (M.B.S.); (F.W.); (F.G.-C.); (A.P.-C.); (W.R.); (G.S.)
- Joint Research Unit “Infection and Public Health” FISABIO, University of Valencia, 46010 Valencia, Spain
- Institute for Integrative Systems Biology (I2SysBio), CSIC, University of Valencia, 46010 Valencia, Spain
| | - Sarah Krautwurst
- RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University Jena, 07743 Jena, Germany;
| | - Alba Pérez-Cataluña
- European Virus Bioinformatics Center, 07743 Jena, Germany; (E.G.-T.); (A.J.G.); (J.G.); (D.J.); (K.K.); (A.L.); (M.R.); (G.L.L.); (L.N.); (E.D.); (E.H.); (P.L.); (M.B.S.); (F.W.); (F.G.-C.); (A.P.-C.); (W.R.); (G.S.)
- VISAFELab, Department of Preservation and Food Safety Technologies, Institute of Agrochemistry and Food Technology, IATA-CSIC, 46980 Valencia, Spain
| | - Walter Randazzo
- European Virus Bioinformatics Center, 07743 Jena, Germany; (E.G.-T.); (A.J.G.); (J.G.); (D.J.); (K.K.); (A.L.); (M.R.); (G.L.L.); (L.N.); (E.D.); (E.H.); (P.L.); (M.B.S.); (F.W.); (F.G.-C.); (A.P.-C.); (W.R.); (G.S.)
- VISAFELab, Department of Preservation and Food Safety Technologies, Institute of Agrochemistry and Food Technology, IATA-CSIC, 46980 Valencia, Spain
| | - Gloria Sánchez
- European Virus Bioinformatics Center, 07743 Jena, Germany; (E.G.-T.); (A.J.G.); (J.G.); (D.J.); (K.K.); (A.L.); (M.R.); (G.L.L.); (L.N.); (E.D.); (E.H.); (P.L.); (M.B.S.); (F.W.); (F.G.-C.); (A.P.-C.); (W.R.); (G.S.)
- VISAFELab, Department of Preservation and Food Safety Technologies, Institute of Agrochemistry and Food Technology, IATA-CSIC, 46980 Valencia, Spain
| | - Manja Marz
- European Virus Bioinformatics Center, 07743 Jena, Germany; (E.G.-T.); (A.J.G.); (J.G.); (D.J.); (K.K.); (A.L.); (M.R.); (G.L.L.); (L.N.); (E.D.); (E.H.); (P.L.); (M.B.S.); (F.W.); (F.G.-C.); (A.P.-C.); (W.R.); (G.S.)
- RNA Bioinformatics and High-Throughput Analysis, Friedrich Schiller University Jena, 07743 Jena, Germany;
| |
Collapse
|