1
Herazo-Álvarez J, Mora M, Cuadros-Orellana S, Vilches-Ponce K, Hernández-García R. A review of neural networks for metagenomic binning. Brief Bioinform 2025; 26:bbaf065. [PMID: 40131312 PMCID: PMC11934572 DOI: 10.1093/bib/bbaf065] [Received: 06/17/2024] [Revised: 01/02/2025] [Accepted: 03/07/2025] [Indexed: 03/26/2025] Open
Abstract
One of the main goals of metagenomic studies is to describe the taxonomic diversity of microbial communities. A crucial step in metagenomic analysis is metagenomic binning, which involves the (supervised) classification or (unsupervised) clustering of metagenomic sequences. Various machine learning models have been applied to address this task. In this review, the contributions of artificial neural networks (ANN) in the context of metagenomic binning are detailed, addressing supervised, unsupervised, and semi-supervised approaches. Thirty-four ANN-based binning tools are systematically compared, detailing their architectures, input features, datasets, advantages, disadvantages, and other relevant aspects. The findings reveal that deep learning approaches, such as convolutional neural networks and autoencoders, achieve higher accuracy and scalability than traditional methods. Gaps in benchmarking practices are highlighted, and future directions are proposed, including standardized datasets and optimization of architectures for third-generation sequencing. This review provides support to researchers in identifying trends and selecting suitable tools for the metagenomic binning problem.
Affiliation(s)
- Jair Herazo-Álvarez
  - Doctorado en Modelamiento Matemático Aplicado, Universidad Católica del Maule, Talca, Maule 3480564, Chile
  - Laboratory of Technological Research in Pattern Recognition (LITRP), Universidad Católica del Maule, Talca, Maule 3480564, Chile
- Marco Mora
  - Laboratory of Technological Research in Pattern Recognition (LITRP), Universidad Católica del Maule, Talca, Maule 3480564, Chile
  - Departamento de Computación e Industrias, Facultad de Ciencias de la Ingeniería, Universidad Católica del Maule, Talca, Maule 3480564, Chile
- Sara Cuadros-Orellana
  - Laboratory of Technological Research in Pattern Recognition (LITRP), Universidad Católica del Maule, Talca, Maule 3480564, Chile
  - Centro de Biotecnología de los Recursos Naturales (CENBio), Universidad Católica del Maule, Talca, Maule 3480564, Chile
- Karina Vilches-Ponce
  - Laboratory of Technological Research in Pattern Recognition (LITRP), Universidad Católica del Maule, Talca, Maule 3480564, Chile
- Ruber Hernández-García
  - Laboratory of Technological Research in Pattern Recognition (LITRP), Universidad Católica del Maule, Talca, Maule 3480564, Chile
  - Departamento de Computación e Industrias, Facultad de Ciencias de la Ingeniería, Universidad Católica del Maule, Talca, Maule 3480564, Chile
2
Biller SJ, Ryan MG, Li J, Burger A, Eppley JM, Hackl T, DeLong EF. Distinct horizontal gene transfer potential of extracellular vesicles versus viral-like particles in marine habitats. Nat Commun 2025; 16:2126. [PMID: 40032822 PMCID: PMC11876622 DOI: 10.1038/s41467-025-57276-w] [Received: 06/27/2024] [Accepted: 02/13/2025] [Indexed: 03/05/2025] Open
Abstract
Horizontal gene transfer (HGT) is enabled in part through the movement of DNA within two broad groups of small (<0.2 µm), diffusible nanoparticles: extracellular vesicles (EVs) and virus-like particles (VLPs; including viruses, gene transfer agents, and phage satellites). The information enclosed within these structures represents a substantial portion of the HGT potential available in planktonic ecosystems, but whether some genes might be preferentially transported through one type of nanoparticle versus another is unknown. Here we use long-read sequencing to compare the genetic content of EVs and VLPs from the oligotrophic North Pacific. Fractionated EV-enriched and VLP-enriched subpopulations contain diverse DNA from the surrounding microbial community, but differ in their capacity and encoded functions. The sequences carried by both particle types are enriched in mobile genetic elements (MGEs) as compared with other cellular chromosomal regions, and we highlight how this property enables novel MGE discovery. Examining the Pelagibacter mobilome reveals >7200 distinct chromosomal fragments and MGEs, many differentially partitioned between EVs and VLPs. Together these results suggest that distinctions in nanoparticle contents contribute to the mode and trajectory of microbial HGT networks and evolutionary dynamics in natural habitats.
Affiliation(s)
- Steven J Biller
  - Department of Biological Sciences, Wellesley College, Wellesley, MA, USA
- M Gray Ryan
  - Department of Biological Sciences, Wellesley College, Wellesley, MA, USA
- Jasmine Li
  - Department of Biological Sciences, Wellesley College, Wellesley, MA, USA
- Andrew Burger
  - Department of Oceanography, Daniel K. Inouye Center for Microbial Oceanography: Research and Education (C-MORE), University of Hawai'i at Manoa, Honolulu, HI, USA
- John M Eppley
  - Department of Oceanography, Daniel K. Inouye Center for Microbial Oceanography: Research and Education (C-MORE), University of Hawai'i at Manoa, Honolulu, HI, USA
- Thomas Hackl
  - Groningen Institute for Evolutionary Life Sciences, University of Groningen, Groningen, the Netherlands
- Edward F DeLong
  - Department of Oceanography, Daniel K. Inouye Center for Microbial Oceanography: Research and Education (C-MORE), University of Hawai'i at Manoa, Honolulu, HI, USA
3
Weinheimer AR, Ha AD, Aylward FO. Towards a unifying phylogenomic framework for tailed phages. PLoS Genet 2025; 21:e1011595. [PMID: 39908317 PMCID: PMC11835377 DOI: 10.1371/journal.pgen.1011595] [Received: 12/02/2024] [Revised: 02/18/2025] [Accepted: 01/28/2025] [Indexed: 02/07/2025] Open
Abstract
Classifying viruses systematically has remained a key challenge of virology due to the absence of universal genes and the vast genetic diversity of viruses. In particular, the most dominant and diverse group of viruses, the tailed double-stranded DNA viruses of prokaryotes belonging to the class Caudoviricetes, lacks sufficient similarity in the shared genetic machinery to reconstruct an inclusive, stable phylogeny from these genes. While previous approaches to organizing tailed phage diversity have managed to distinguish various taxonomic levels, these methods are limited in scalability and reproducibility, and the inclusion of modes of evolution, such as gene gains and losses, remains a key challenge. Here, we present a novel, comprehensive, and reproducible framework for examining evolutionary relationships of tailed phages. In this framework, we compare phage genomes based on the presence and absence of a fixed set of gene families, which are used as binary trait data for input into maximum likelihood models. Our resulting phylogeny stably recovers known taxonomic families of tailed phages, with and without the inclusion of metagenome-derived phages. We also quantify the mosaicism of replication and structural genes among known families, and our results suggest that these exchanges likely underpin the emergence of new families. Additionally, we apply this framework to large phages (>100 kilobases) to map emergences of traits associated with genome expansion. Taken together, this evolutionary framework for charting and organizing tailed phage diversity improves the systematization of phage taxonomy, which can unify phage studies and advance our understanding of their evolution.
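The presence/absence encoding mentioned in this abstract can be illustrated with a small sketch. It is not the authors' code: the genome names, gene-family labels, and the helpers `presence_absence_matrix` and `to_binary_strings` are hypothetical, and the maximum likelihood step that would consume the resulting binary traits is not shown.

```python
from typing import Dict, List, Set, Tuple


def presence_absence_matrix(
    genomes: Dict[str, Set[str]], families: List[str]
) -> Tuple[List[str], List[List[int]]]:
    """Encode each genome as a 0/1 vector over a fixed, ordered list of gene families."""
    names = sorted(genomes)  # deterministic row order
    matrix = [
        [1 if family in genomes[name] else 0 for family in families]
        for name in names
    ]
    return names, matrix


def to_binary_strings(names: List[str], matrix: List[List[int]]) -> Dict[str, str]:
    """Render each row as a string of 0s and 1s, one character per gene family."""
    return {name: "".join(str(v) for v in row) for name, row in zip(names, matrix)}


# Hypothetical toy data: which gene families were detected in which genome.
FAMILIES = ["terminase", "portal", "major_capsid", "tail_fiber"]
GENOMES = {
    "phage_A": {"terminase", "portal", "major_capsid"},
    "phage_B": {"terminase", "tail_fiber"},
    "phage_C": {"portal", "major_capsid", "tail_fiber"},
}

names, matrix = presence_absence_matrix(GENOMES, FAMILIES)
traits = to_binary_strings(names, matrix)
for name, bits in traits.items():
    print(name, bits)
```

Keeping the family list fixed and ordered is what makes the columns comparable across genomes, so every genome yields a trait vector of the same length.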
Affiliation(s)
- Alaina R. Weinheimer
  - Department of Biological Sciences, Virginia Tech, Blacksburg, Virginia, United States of America
  - Bigelow Laboratory for Ocean Sciences, East Boothbay, Maine, United States of America
- Anh D. Ha
  - Department of Biological Sciences, Virginia Tech, Blacksburg, Virginia, United States of America
- Frank O. Aylward
  - Department of Biological Sciences, Virginia Tech, Blacksburg, Virginia, United States of America
  - Center for Emerging, Zoonotic, and Arthropod-Borne Infectious Disease, Virginia Tech, Blacksburg, Virginia, United States of America
4
Song H, Tithi SS, Brown C, Aylward FO, Jensen R, Zhang L. Virseqimprover: an integrated pipeline for viral contig error correction, extension, and annotation. PeerJ 2025; 13:e18515. [PMID: 39807156 PMCID: PMC11727651 DOI: 10.7717/peerj.18515] [Received: 06/17/2024] [Accepted: 10/21/2024] [Indexed: 01/16/2025] Open
Abstract
Despite the recent surge of viral metagenomic studies, it remains a significant challenge to recover complete virus genomes from metagenomic data. The majority of viral contigs generated by de novo assembly programs are highly fragmented, presenting significant challenges to downstream analysis and inference. To address this issue, we have developed Virseqimprover, a computational pipeline that can extend assembled contigs to complete or nearly complete genomes while maintaining extension quality. Virseqimprover first examines whether there is any chimeric sequence based on read coverage, breaks the sequence into segments if there is, then extends the longest segment with uniform depth of coverage, and repeats these procedures until the sequence cannot be extended. Finally, Virseqimprover annotates the gene content of the resulting sequence. Results show that Virseqimprover performs well in correcting and extending viral contigs to their full lengths, and can therefore be a useful tool for improving the completeness and minimizing the assembly errors of viral contigs. Both a web server and a conda package for Virseqimprover are provided to the research community free of charge.
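The coverage-based splitting step described in this abstract can be sketched as follows. This is an illustrative simplification, not Virseqimprover's implementation: the function names, the per-base depth list, and the `max_ratio` threshold are assumptions, and the read-based extension and annotation steps are omitted.

```python
from typing import List, Tuple


def split_by_coverage(depth: List[int], max_ratio: float = 3.0) -> List[Tuple[int, int]]:
    """Split a contig into half-open (start, end) segments at abrupt depth changes.

    A breakpoint is placed between two adjacent positions when the larger depth
    exceeds `max_ratio` times the smaller one (hypothetical rule for illustration).
    """
    if not depth:
        return []
    segments = []
    start = 0
    for i in range(1, len(depth)):
        low, high = sorted((depth[i - 1], depth[i]))
        if high > max_ratio * max(low, 1):  # max(low, 1) avoids a zero threshold
            segments.append((start, i))
            start = i
    segments.append((start, len(depth)))
    return segments


def longest_segment(segments: List[Tuple[int, int]]) -> Tuple[int, int]:
    """Return the longest segment; ties go to the leftmost one."""
    return max(segments, key=lambda s: s[1] - s[0])


# Hypothetical per-base depth with a high-coverage insert in the middle.
depth = [10] * 6 + [50] * 3 + [12] * 4
segments = split_by_coverage(depth)
best = longest_segment(segments)
print(segments, best)
```

In a pipeline of the kind the abstract describes, the selected segment would then be extended with reads and the check repeated until no further extension is possible.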
Affiliation(s)
- Haoqiu Song
  - Department of Computer Science, Virginia Polytechnic Institute and State University (Virginia Tech), Blacksburg, VA, United States of America
- Saima Sultana Tithi
  - Department of Cell & Molecular Biology, St. Jude Children’s Research Hospital, Memphis, TN, United States of America
- Connor Brown
  - Department of Civil and Environmental Engineering, Virginia Polytechnic Institute and State University (Virginia Tech), Blacksburg, VA, United States of America
- Frank O. Aylward
  - Department of Biological Sciences, Virginia Polytechnic Institute and State University (Virginia Tech), Blacksburg, VA, United States of America
- Roderick Jensen
  - Department of Biological Sciences, Virginia Polytechnic Institute and State University (Virginia Tech), Blacksburg, VA, United States of America
- Liqing Zhang
  - Department of Computer Science, Virginia Polytechnic Institute and State University (Virginia Tech), Blacksburg, VA, United States of America
5
Wang H, Sun C, Li Y, Chen J, Zhao XM, Chen WH. Complementary insights into gut viral genomes: a comparative benchmark of short- and long-read metagenomes using diverse assemblers and binners. Microbiome 2024; 12:260. [PMID: 39707560 DOI: 10.1186/s40168-024-01981-z] [Received: 09/14/2024] [Accepted: 11/17/2024] [Indexed: 12/23/2024]
Abstract
BACKGROUND: Metagenome-assembled viral genomes have significantly advanced the discovery and characterization of the human gut virome. However, we lack a comparative assessment of assembly tools on the efficacy of viral genome identification, particularly across next-generation sequencing (NGS) and third-generation sequencing (TGS) data.
RESULTS: We evaluated the efficiency of NGS, TGS, and hybrid assemblers for viral genome discovery using 95 viral-like particle (VLP)-enriched fecal samples sequenced on both Illumina and PacBio platforms. MEGAHIT, metaFlye, and hybridSPAdes emerged as the optimal choices for NGS, TGS, and hybrid datasets, respectively. Notably, these assemblers recovered distinct viral genomes, demonstrating a remarkable degree of complementarity. By combining individual assembler results, we expanded the total number of nonredundant high-quality viral genomes by 4.83- to 21.7-fold compared to individual assemblers. Among them, viral genomes from NGS and TGS data have the least overlap, indicating the impact of data type on viral genome recovery. We also evaluated four binning methods, finding that CONCOCT incorporated more unrelated contigs into the same bins, while MetaBAT2, AVAMB, and vRhyme balanced inclusiveness and taxonomic consistency within bins.
CONCLUSIONS: Our findings highlight the challenges in metagenome-driven viral discovery, underscoring tool limitations. We advocate for combined use of multiple assemblers and sequencing technologies when feasible and highlight the urgent need for specialized tools tailored to gut virome assembly. This study contributes essential insights for advancing viral genome research in the context of gut metagenomics.
Affiliation(s)
- Huarui Wang
  - Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular Imaging, Department of Bioinformatics and Systems Biology, Center for Artificial Intelligence Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
- Chuqing Sun
  - Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular Imaging, Department of Bioinformatics and Systems Biology, Center for Artificial Intelligence Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
- Yun Li
  - Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular Imaging, Department of Bioinformatics and Systems Biology, Center for Artificial Intelligence Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
- Jingchao Chen
  - Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular Imaging, Department of Bioinformatics and Systems Biology, Center for Artificial Intelligence Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
- Xing-Ming Zhao
  - Department of Neurology, Institute of Science and Technology for Brain-Inspired Intelligence, Zhongshan Hospital, Fudan University, Shanghai, 200433, China
  - Lingang Laboratory, Shanghai, 200031, China
  - State Key Laboratory of Medical Neurobiology, Institutes of Brain Science, Fudan University, Shanghai, 200032, China
  - MOE Frontiers Center for Brain Science, Fudan University, Shanghai, 200433, China
  - Huzhou Central Hospital, Affiliated Central Hospital Huzhou University, Huzhou, Zhejiang, 313000, China
- Wei-Hua Chen
  - Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular Imaging, Department of Bioinformatics and Systems Biology, Center for Artificial Intelligence Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
  - School of Biological Science, Jining Medical University, Rizhao, 276800, China
6
Zhang Y, Zheng X, Yan W, Wang D, Chen X, Wang Y, Zhang T. Method evaluation for viruses in activated sludge: Concentration, sequencing, and identification. Sci Total Environ 2024; 955:176886. [PMID: 39419205 DOI: 10.1016/j.scitotenv.2024.176886] [Received: 07/19/2024] [Revised: 10/09/2024] [Accepted: 10/10/2024] [Indexed: 10/19/2024]
Abstract
Activated sludge (AS) in wastewater treatment plants is one of the largest artificial microbial ecosystems on Earth and makes enormous contributions to human societies. Viruses are an important and highly abundant component of AS. However, their communities and functions have not been as widely explored as those of other microorganisms, such as bacteria. This gap is mainly due to technical challenges in effective viral concentration, extraction, and sequencing. In this study, we compared four concentration methods, two sequencing approaches, and four bioinformatic identification tools to evaluate the whole analysis workflow for viruses in AS. Results showed that flocculation, filtration, and resuspension (FFR) yielded the longest DNA fragments and ultracentrifugation obtained the highest DNA yields for viruses in AS. Based on the results of the present study, FFR and tangential flow filtration with a membrane pore size of 100 kDa are most recommended for concentrating viruses in large-volume AS samples. In addition, different concentration methods produced different viral catalogs, so multiple methods should be combined to capture the whole picture of viruses in the system. geNomad was the most recommended identification tool for viruses in the present study, and long-read sequencing improved the assembly statistics of viruses compared with short-read sequencing. Of the 8192 viral operational taxonomic units in this study, 95.1% were phages and belonged to the same lineage at the order level, Caudovirales. Virulent phages dominated the AS system and Pseudomonadota were the main hosts. Taken together, this study provides new insights into method selection for virus research in AS.
Affiliation(s)
- Yulin Zhang
  - Environmental Microbiome Engineering and Biotechnology Lab, Department of Civil Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong, China
- Xiawan Zheng
  - Environmental Microbiome Engineering and Biotechnology Lab, Department of Civil Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong, China
- Weifu Yan
  - Environmental Microbiome Engineering and Biotechnology Lab, Department of Civil Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong, China
- Dou Wang
  - Environmental Microbiome Engineering and Biotechnology Lab, Department of Civil Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong, China
- Xi Chen
  - Environmental Microbiome Engineering and Biotechnology Lab, Department of Civil Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong, China
- Yulin Wang
  - Environmental Microbiome Engineering and Biotechnology Lab, Department of Civil Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong, China
- Tong Zhang
  - Environmental Microbiome Engineering and Biotechnology Lab, Department of Civil Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong, China
  - School of Public Health, The University of Hong Kong, Pokfulam Road, Hong Kong, China
  - Macau Institute of Applied Research in Medicine and Health, Macau University of Science and Technology, Macao
7
Zsichla L, Zeeb M, Fazekas D, Áy É, Müller D, Metzner KJ, Kouyos RD, Müller V. Comparative Evaluation of Open-Source Bioinformatics Pipelines for Full-Length Viral Genome Assembly. Viruses 2024; 16:1824. [PMID: 39772134 PMCID: PMC11680378 DOI: 10.3390/v16121824] [Received: 10/20/2024] [Revised: 11/19/2024] [Accepted: 11/22/2024] [Indexed: 01/11/2025] Open
Abstract
The increasingly widespread application of next-generation sequencing (NGS) in clinical diagnostics and epidemiological research has generated a demand for robust, fast, automated, and user-friendly bioinformatics workflows. To guide the choice of tools for the assembly of full-length viral genomes from NGS datasets, we assessed the performance and applicability of four open-source bioinformatics pipelines (shiver, for which we created a user-friendly Dockerized version referred to as dshiver; SmaltAlign; viral-ngs; and V-pipe) using both simulated and real-world HIV-1 paired-end short-read datasets and default settings. All four pipelines produced consensus genome assemblies with high quality metrics (genome fraction recovery, mismatch and indel rates, variant calling F1 scores) when the reference sequence used for assembly had high similarity to the analyzed sample. The shiver and SmaltAlign pipelines (but not viral-ngs and V-pipe) also showed robust performance with more divergent samples (non-matching subtypes). With empirical datasets, SmaltAlign and viral-ngs exhibited an order of magnitude shorter runtime compared to V-pipe and shiver. In terms of applicability, V-pipe provides the broadest functionalities, SmaltAlign and dshiver combine user-friendliness with robustness, while viral-ngs requires fewer computational resources than the other pipelines. In conclusion, if a closely matched reference sequence is available, all pipelines can reliably reconstruct viral consensus genomes; therefore, differences in user-friendliness and runtime may guide the choice of the pipeline in a particular setting. If a matched reference sequence cannot be selected, we recommend shiver or SmaltAlign for robust performance. The new Dockerized version of shiver offers ease of use in addition to the accuracy and robustness of the original pipeline.
Affiliation(s)
- Levente Zsichla
  - Institute of Biology, ELTE Eötvös Loránd University, 1117 Budapest, Hungary
  - National Laboratory for Health Security, ELTE Eötvös Loránd University, 1117 Budapest, Hungary
- Marius Zeeb
  - Department of Infectious Diseases and Hospital Epidemiology, University Hospital of Zurich, University of Zurich, 8091 Zurich, Switzerland
  - Institute of Medical Virology, University of Zurich, 8057 Zurich, Switzerland
- Dávid Fazekas
  - Institute of Biology, ELTE Eötvös Loránd University, 1117 Budapest, Hungary
  - Earlham Institute, Norwich NR4 7UZ, UK
- Éva Áy
  - National Laboratory for Health Security, ELTE Eötvös Loránd University, 1117 Budapest, Hungary
  - National Reference Laboratory for Retroviruses, Department of Virology, National Center for Public Health and Pharmacy, 1097 Budapest, Hungary
- Dalma Müller
  - Institute of Biology, ELTE Eötvös Loránd University, 1117 Budapest, Hungary
  - National Laboratory for Health Security, ELTE Eötvös Loránd University, 1117 Budapest, Hungary
  - Department of Bioinformatics, Semmelweis University, 1094 Budapest, Hungary
- Karin J. Metzner
  - Department of Infectious Diseases and Hospital Epidemiology, University Hospital of Zurich, University of Zurich, 8091 Zurich, Switzerland
  - Institute of Medical Virology, University of Zurich, 8057 Zurich, Switzerland
- Roger D. Kouyos
  - Department of Infectious Diseases and Hospital Epidemiology, University Hospital of Zurich, University of Zurich, 8091 Zurich, Switzerland
  - Institute of Medical Virology, University of Zurich, 8057 Zurich, Switzerland
- Viktor Müller
  - Institute of Biology, ELTE Eötvös Loránd University, 1117 Budapest, Hungary
  - National Laboratory for Health Security, ELTE Eötvös Loránd University, 1117 Budapest, Hungary
8
Kazantseva E, Donmez A, Frolova M, Pop M, Kolmogorov M. Strainy: phasing and assembly of strain haplotypes from long-read metagenome sequencing. Nat Methods 2024; 21:2034-2043. [PMID: 39327484 DOI: 10.1038/s41592-024-02424-1] [Received: 11/22/2023] [Accepted: 08/22/2024] [Indexed: 09/28/2024]
Abstract
Bacterial species in microbial communities are often represented by mixtures of strains, distinguished by small variations in their genomes. Short-read approaches can be used to detect small-scale variation between strains but fail to phase these variants into contiguous haplotypes. Long-read metagenome assemblers can generate contiguous bacterial chromosomes but often suppress strain-level variation in favor of species-level consensus. Here we present Strainy, an algorithm for strain-level metagenome assembly and phasing from Nanopore and PacBio reads. Strainy takes a de novo metagenomic assembly as input and identifies strain variants, which are then phased and assembled into contiguous haplotypes. Using simulated and mock Nanopore and PacBio metagenome data, we show that Strainy assembles accurate and complete strain haplotypes, outperforming current Nanopore-based methods and comparable with PacBio-based algorithms in completeness and accuracy. We then use Strainy to assemble strain haplotypes of a complex environmental metagenome, revealing distinct strain distribution and mutational patterns in bacterial species.
Affiliation(s)
- Ekaterina Kazantseva
  - Bioinformatics and Systems Biology Program, ITMO University, St. Petersburg, Russia
- Ataberk Donmez
  - Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
  - Department of Computer Science, University of Maryland, College Park, MD, USA
- Maria Frolova
  - Functional Genomics of Prokaryotes Laboratory, Institute of Cell Biophysics, RAS, Pushchino, Russia
- Mihai Pop
  - Department of Computer Science, University of Maryland, College Park, MD, USA
- Mikhail Kolmogorov
  - Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
9
Martin-Cuadrado AB, Rubio-Portillo E, Rosselló F, Antón J. The coral Oculina patagonica holobiont and its response to confinement, temperature, and Vibrio infections. Microbiome 2024; 12:222. [PMID: 39472959 PMCID: PMC11520598 DOI: 10.1186/s40168-024-01921-x] [Received: 01/24/2024] [Accepted: 08/28/2024] [Indexed: 11/02/2024]
Abstract
BACKGROUND: Extensive research on the diversity and functional roles of the microorganisms associated with reef-building corals has been promoted as a consequence of the rapid global decline of coral reefs attributed to climate change. Several studies have highlighted the importance of coral-associated algae (Symbiodinium) and bacteria and their potential roles in promoting coral host fitness and survival. However, the complex coral holobiont extends beyond these components to encompass other entities such as protists, fungi, and viruses. While each constituent has been individually investigated in corals, a comprehensive understanding of their collective roles is imperative for a holistic comprehension of coral health and resilience.
RESULTS: The metagenomic analysis of the microbiome of the coral Oculina patagonica has revealed that fungi of the genera Aspergillus, Fusarium, and Rhizophagus together with the prokaryotic genera Streptomyces, Pseudomonas, and Bacillus were abundant members of the coral holobiont. This study also assessed changes in microeukaryotic, prokaryotic, and viral communities under three stress conditions: aquaria confinement, heat stress, and Vibrio infections. In general, stress conditions led to an increase in Rhodobacteraceae, Flavobacteraceae, and Vibrionaceae families, accompanied by a decrease in Streptomycetaceae. Concurrently, there was a significant decline in both the abundance and richness of microeukaryotic species and a reduction in genes associated with antimicrobial compound production by the coral itself, as well as by Symbiodinium and fungi.
CONCLUSION: Our findings suggest that the interplay between microeukaryotic and prokaryotic components of the coral holobiont may be disrupted by stress conditions, such as confinement, increase of seawater temperature, or Vibrio infection, leading to a dysbiosis in the global microbial community that may increase coral susceptibility to diseases. Further, the microeukaryotic community seems to exert influence on the prokaryotic community dynamics, possibly through predation or the production of secondary metabolites with anti-bacterial activity.
Affiliation(s)
- Esther Rubio-Portillo
  - Dpt. Fisiología, Genética y Microbiología, University of Alicante, San Vicente del Raspeig, Spain
- Francesc Rosselló
  - Mathematics and Computer Science Dept, University of the Balearic Islands, Palma, Spain
  - Balearic Islands Health Research Institute (IdISBa), Palma, Spain
- Josefa Antón
  - Dpt. Fisiología, Genética y Microbiología, University of Alicante, San Vicente del Raspeig, Spain
10
Lui LM, Nielsen TN. Decomposing a San Francisco estuary microbiome using long-read metagenomics reveals species- and strain-level dominance from picoeukaryotes to viruses. mSystems 2024; 9:e0024224. [PMID: 39158287 PMCID: PMC11406994 DOI: 10.1128/msystems.00242-24] [Received: 02/22/2024] [Accepted: 07/11/2024] [Indexed: 08/20/2024] Open
Abstract
Although long-read sequencing has enabled obtaining high-quality and complete genomes from metagenomes, many challenges still remain to completely decompose a metagenome into its constituent prokaryotic and viral genomes. This study focuses on decomposing an estuarine metagenome to obtain a more accurate estimate of microbial diversity. To achieve this, we developed a new bead-based DNA extraction method, a novel bin refinement method, and obtained 150 Gbp of Nanopore sequencing. We estimate that there are ~500 bacterial and archaeal species in our sample and obtained 68 high-quality bins (>90% complete, <5% contamination, ≤5 contigs, contig length of >100 kbp, and all ribosomal and tRNA genes). We also obtained many contigs of picoeukaryotes, environmental DNA of larger eukaryotes such as mammals, and complete mitochondrial and chloroplast genomes and detected ~40,000 viral populations. Our analysis indicates that there are only a few strains that comprise most of the species abundances.
IMPORTANCE: Ocean and estuarine microbiomes play critical roles in global element cycling and ecosystem function. Despite the importance of these microbial communities, many species still have not been cultured in the lab. Environmental sequencing is the primary way the function and population dynamics of these communities can be studied. Long-read sequencing provides an avenue to overcome limitations of short-read technologies to obtain complete microbial genomes but comes with its own technical challenges, such as needed sequencing depth and obtaining high-quality DNA. We present here new sampling and bioinformatics methods to attempt decomposing an estuarine microbiome into its constituent genomes. Our results suggest there are only a few strains that comprise most of the species abundances from viruses to picoeukaryotes, and to fully decompose a metagenome of this diversity requires 1 Tbp of long-read sequencing. We anticipate that as long-read sequencing technologies continue to improve, less sequencing will be needed.
Affiliation(s)
- Lauren M Lui
  - Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
- Torben N Nielsen
  - Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
11
Chen L, Chen A, Zhang XD, Saenz Robles MT, Han HS, Xiao Y, Xiao G, Pipas JM, Weitz DA. Targeted whole-genome recovery of single viral species in a complex environmental sample. Proc Natl Acad Sci U S A 2024; 121:e2404727121. [PMID: 39052829 PMCID: PMC11295033 DOI: 10.1073/pnas.2404727121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/06/2024] [Accepted: 06/07/2024] [Indexed: 07/27/2024]
Abstract
Characterizing unknown viruses is essential for understanding viral ecology and preparing against viral outbreaks. Recovering complete genome sequences from environmental samples remains computationally challenging using metagenomics, especially for low-abundance species with uneven coverage. We present an experimental method for reliably recovering complete viral genomes from complex environmental samples. Individual genomes are encapsulated into droplets and amplified using multiple displacement amplification. A unique gene detection assay, which employs an RNA-based probe and an exonuclease, selectively identifies droplets containing the target viral genome. Labeled droplets are sorted using a microfluidic sorter, and genomes are extracted for sequencing. We demonstrate this method's efficacy by spiking two known viral genomes, Simian virus 40 (SV40, 5,243 bp) and Human Adenovirus 5 (HAd5, 35,938 bp), into a sewage sample with a final abundance in the droplets of around 0.1% and 0.015%, respectively. We achieve 100% recovery of the complete sequence of the spiked-in SV40 genome with uniform coverage distribution. For the larger HAd5 genome, we cover approximately 99.4% of its sequence. Notably, genome recovery is achieved with as few as one sorted droplet, which enables the recovery of any desired genomes in complex environmental samples, regardless of their abundance. This method enables single-genome whole-genome amplification and targeting characterizations of rare viral species and will facilitate our ability to access the mutational profile in single-virus genomes and contribute to an improved understanding of viral ecology.
Affiliation(s)
- Liyin Chen
  - John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA 02138
- Anqi Chen
  - John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA 02138
- Xinge Diana Zhang
  - John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA 02138
- Hee-Sun Han
  - Department of Chemistry, University of Illinois Urbana-Champaign, Urbana, IL 61801
  - Carl R. Woese Institute for Genomic Biology, University of Illinois Urbana-Champaign, Urbana, IL 61801
- Yi Xiao
  - John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA 02138
- Gao Xiao
  - John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA 02138
- James M. Pipas
  - Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15260
- David A. Weitz
  - John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA 02138
  - Department of Physics, Harvard University, Cambridge, MA 02138
12
Farrall T, Brawner J, Dinsdale A, Kehoe M. A Review of Probe-Based Enrichment Methods to Inform Plant Virus Diagnostics. Int J Mol Sci 2024; 25:8348. [PMID: 39125919 PMCID: PMC11312432 DOI: 10.3390/ijms25158348] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/06/2024] [Revised: 07/20/2024] [Accepted: 07/28/2024] [Indexed: 08/12/2024]
Abstract
Modern diagnostic techniques based on DNA sequence similarity are currently the gold standard for the detection of existing and emerging pathogens. Whilst individual assays are inexpensive to use, assay development is costly and carries risks of not being sensitive or specific enough to capture an increasingly diverse range of targets. Sequencing can provide the entire nucleic acid content of a sample and may be used to identify all pathogens present in the sample when the depth of coverage is sufficient. Targeted enrichment techniques have been used to increase sequence coverage and improve the sensitivity of detection within virus samples, specifically, to capture sequences for a range of different viruses or increase the number of reads from low-titre virus infections. Vertebrate viruses have been well characterised using in-solution hybridisation capture to target diverse virus families. The use of probes for genotyping and strain identification has been limited in plants, and uncertainty around sensitivity is an impediment to the development of a large-scale virus panel to use within regulatory settings and diagnostic pipelines. This review aims to compare significant studies that have used targeted enrichment of viruses to identify approaches to probe design and potential for use in plant virus detection and characterisation.
Affiliation(s)
- Thomas Farrall
  - Plant Innovation Centre, Australian Government, Department of Agriculture, Fisheries and Forestry (DAFF), Canberra, ACT 2601, Australia
  - Forest Research Institute, School of Science, Technology and Engineering, University of the Sunshine Coast, Sippy Downs, QLD 4556, Australia
- Jeremy Brawner
  - Forest Research Institute, School of Science, Technology and Engineering, University of the Sunshine Coast, Sippy Downs, QLD 4556, Australia
  - Plant Pathology Department, University of Florida, Gainesville, FL 32611, USA
- Adrian Dinsdale
  - Plant Innovation Centre, Australian Government, Department of Agriculture, Fisheries and Forestry (DAFF), Canberra, ACT 2601, Australia
- Monica Kehoe
  - Diagnostic Laboratory Services, Biosecurity and Sustainability, Department of Primary Industries and Regional Development (DPIRD), Perth, WA 6151, Australia
13
Wurtzer S, Duvivier M, Accrombessi H, Levert M, Richard E, Moulin L. Assessing RNA integrity by digital RT-PCR: Influence of extraction, storage, and matrices. Biol Methods Protoc 2024; 9:bpae053. [PMID: 39450241 PMCID: PMC11500190 DOI: 10.1093/biomethods/bpae053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/19/2024] [Revised: 07/15/2024] [Accepted: 07/26/2024] [Indexed: 10/26/2024]
Abstract
The development of high-throughput sequencing has greatly improved our knowledge of microbial diversity in aquatic environments and its evolution in highly diverse ecosystems. Relevant microbial diversity description based on high-throughput sequencing relies on the good quality of the nucleic acid recovered. Indeed, long genetic fragments are more informative for identifying mutation combinations that characterize variants or species in complex samples. This study describes a new analytical method based on digital Polymerase Chain Reaction (PCR) partitioning technology for assessing the fragmentation of nucleic acid and more specifically viral RNA. This method allows us to overcome limits associated with hydrolysis probe-based assay by focusing on the distance between different amplicons, and not, as usual, on the size of amplicons. RNA integrity can thus be determined as a new fragmentation index, the so-called Fragment size 50. The application of this method has provided information on issues that are inherent in environmental analyses, such as the storage impact of raw samples or extracted RNA, extraction methods, and the nature of the sample on the integrity of viral RNA. Finally, the estimation of fragment size by digital PCR (dPCR) showed a very strong similarity with the fragment size sequenced using Oxford Nanopore Technology. In addition to enabling objective improvements in analytical methods, this approach could become a systematic quality control prior to any long-read sequencing, avoiding insufficiently productive sequencing runs or biases in the representativeness of sequenced fragments.
Affiliation(s)
- Sebastien Wurtzer
  - Research & Development Department, Eau de Paris. DRDQE, FR-9400, France
- Mathilde Duvivier
  - Research & Development Department, Eau de Paris. DRDQE, FR-9400, France
- Morgane Levert
  - Research & Development Department, Eau de Paris. DRDQE, FR-9400, France
  - Paris Sorbonne Universite, CNRS, EPHE, UMR 7619 Metis, e-LTER Zone Atelier Seine, F-75005, Paris, France
- Elise Richard
  - Research & Development Department, Eau de Paris. DRDQE, FR-9400, France
- Laurent Moulin
  - Research & Development Department, Eau de Paris. DRDQE, FR-9400, France
  - Obepine SIG, Paris, FR-75000, France
14
Schulze TT, Neville AJ, Watson GF, Sanford AG, Won HI, Conrin ME, Eastman CG, Lui LM, Alizai MY, Walters MJ, Davis PH, Tapprich WE. Complete genome sequence of a Pseudomonas fluorescens bacteriophage UNO-G1W1 isolated from freshwater ice in Nebraska. Microbiol Resour Announc 2024; 13:e0038424. [PMID: 38847506 PMCID: PMC11256812 DOI: 10.1128/mra.00384-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/12/2024] [Accepted: 05/10/2024] [Indexed: 07/19/2024]
Abstract
We provide the complete genome sequence for a novel Pseudomonas fluorescens bacteriophage named UNO-G1W1. This phage was isolated from a single ice cover sampling. The genome was sequenced on the Nanopore MinION, generated with the direct terminal repeat-phage-pipeline and polished with Illumina short reads. Sequence identity classifies the phage as an otagovirus.
Affiliation(s)
- Thomas T. Schulze
  - Department of Biology, University of Nebraska at Omaha, Omaha, Nebraska, USA
  - Department of Pathology, Microbiology, and Immunology, University of Nebraska Medical Center, Omaha, Nebraska, USA
- Andrew J. Neville
  - Department of Biology, University of Nebraska at Omaha, Omaha, Nebraska, USA
- Gabrielle F. Watson
  - Department of Biology, University of Nebraska at Omaha, Omaha, Nebraska, USA
  - Department of Pathology, Microbiology, and Immunology, University of Nebraska Medical Center, Omaha, Nebraska, USA
- Austin G. Sanford
  - Department of Biology, University of Nebraska at Omaha, Omaha, Nebraska, USA
- Harim I. Won
  - Department of Biology, University of Nebraska at Omaha, Omaha, Nebraska, USA
- Mackenzie E. Conrin
  - Department of Biology, University of Nebraska at Omaha, Omaha, Nebraska, USA
- Connor G. Eastman
  - Department of Biology, University of Nebraska at Omaha, Omaha, Nebraska, USA
- LeeAnna M. Lui
  - Department of Biology, University of Nebraska at Omaha, Omaha, Nebraska, USA
- M. Yunos Alizai
  - Department of Biology, University of Nebraska at Omaha, Omaha, Nebraska, USA
- Matthias J. Walters
  - Department of Biology, University of Nebraska at Omaha, Omaha, Nebraska, USA
- Paul H. Davis
  - Department of Biology, University of Nebraska at Omaha, Omaha, Nebraska, USA
- William E. Tapprich
  - Department of Biology, University of Nebraska at Omaha, Omaha, Nebraska, USA
15
Krause GR, Shands W, Wheeler TJ. Sensitive and error-tolerant annotation of protein-coding DNA with BATH. Bioinformatics Advances 2024; 4:vbae088. [PMID: 38966592 PMCID: PMC11223822 DOI: 10.1093/bioadv/vbae088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/04/2024] [Revised: 05/03/2024] [Accepted: 06/10/2024] [Indexed: 07/06/2024]
Abstract
Summary: We present BATH, a tool for highly sensitive annotation of protein-coding DNA based on direct alignment of that DNA to a database of protein sequences or profile hidden Markov models (pHMMs). BATH is built on top of the HMMER3 code base, and simplifies the annotation workflow for pHMM-based translated sequence annotation by providing a straightforward input interface and easy-to-interpret output. BATH also introduces novel frameshift-aware algorithms to detect frameshift-inducing nucleotide insertions and deletions (indels). BATH matches the accuracy of HMMER3 for annotation of sequences containing no errors, and produces superior accuracy to all tested tools for annotation of sequences containing nucleotide indels. These results suggest that BATH should be used when high annotation sensitivity is required, particularly when frameshift errors are expected to interrupt protein-coding regions, as is true with long-read sequencing data and in the context of pseudogenes.
Availability and implementation: The software is available at https://github.com/TravisWheelerLab/BATH.
Affiliation(s)
- Genevieve R Krause
  - R. Ken Coit College of Pharmacy, University of Arizona, Tucson, AZ 85721, United States
  - Department of Computer Science, University of Montana, Missoula, MT 59812, United States
- Walt Shands
  - Department of Computer Science, University of Montana, Missoula, MT 59812, United States
  - Genomics Institute, UC Santa Cruz, Santa Cruz, CA 95060, United States
- Travis J Wheeler
  - R. Ken Coit College of Pharmacy, University of Arizona, Tucson, AZ 85721, United States
  - Department of Computer Science, University of Montana, Missoula, MT 59812, United States
16
Cook R, Telatin A, Hsieh SY, Newberry F, Tariq MA, Baker DJ, Carding SR, Adriaenssens EM. Nanopore and Illumina sequencing reveal different viral populations from human gut samples. Microb Genom 2024; 10:001236. [PMID: 38683195 PMCID: PMC11092197 DOI: 10.1099/mgen.0.001236] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/23/2023] [Accepted: 03/18/2024] [Indexed: 05/01/2024]
Abstract
The advent of viral metagenomics, or viromics, has improved our knowledge and understanding of global viral diversity. High-throughput sequencing technologies enable explorations of the ecological roles, contributions to host metabolism, and the influence of viruses in various environments, including the human intestinal microbiome. However, bacterial metagenomic studies frequently have the advantage. The adoption of advanced technologies like long-read sequencing has the potential to be transformative in refining viromics and metagenomics. Here, we examined the effectiveness of long-read and hybrid sequencing by comparing Illumina short-read and Oxford Nanopore Technology (ONT) long-read sequencing technologies and different assembly strategies on recovering viral genomes from human faecal samples. Our findings showed that if a single sequencing technology is to be chosen for virome analysis, Illumina is preferable due to its superior ability to recover fully resolved viral genomes and minimise erroneous genomes. While ONT assemblies were effective in recovering viral diversity, the challenges related to input requirements and the necessity for amplification made it less ideal as a standalone solution. However, using a combined, hybrid approach enabled a more authentic representation of viral diversity to be obtained within samples.
Affiliation(s)
- Ryan Cook
  - Quadram Institute Bioscience, Norwich, NR4 7UQ, UK
- Fiona Newberry
  - Department of Biosciences, Nottingham Trent University, Nottingham, NG11 8NS, UK
- Mohammad A. Tariq
  - Faculty of Health and Life Sciences, University of Northumbria, Newcastle upon Tyne, NE1 8ST, UK
- Simon R. Carding
  - Quadram Institute Bioscience, Norwich, NR4 7UQ, UK
  - Norwich Medical School, University of East Anglia, Norwich, NR4 7TJ, UK
17
Faith DR, Kinnersley M, Brooks DM, Drecktrah D, Hall LS, Luo E, Santiago-Frangos A, Wachter J, Samuels DS, Secor PR. Characterization and genomic analysis of the Lyme disease spirochete bacteriophage ϕBB-1. PLoS Pathog 2024; 20:e1012122. [PMID: 38558079 PMCID: PMC11008901 DOI: 10.1371/journal.ppat.1012122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/17/2024] [Revised: 04/11/2024] [Accepted: 03/13/2024] [Indexed: 04/04/2024]
Abstract
Lyme disease is a tick-borne infection caused by the spirochete Borrelia (Borreliella) burgdorferi. Borrelia species have highly fragmented genomes composed of a linear chromosome and a constellation of linear and circular plasmids, some of which are required throughout the enzootic cycle. Included in this plasmid repertoire by almost all Lyme disease spirochetes are the 32-kb circular plasmid cp32 prophages that are capable of lytic replication to produce infectious virions called ϕBB-1. While the B. burgdorferi genome contains evidence of horizontal transfer, the mechanisms of gene transfer between strains remain unclear. While we know that ϕBB-1 transduces cp32 and shuttle vector DNA during in vitro cultivation, the extent of ϕBB-1 DNA transfer is not clear. Herein, we use proteomics and long-read sequencing to further characterize ϕBB-1 virions. Our studies identified the cp32 pac region and revealed that ϕBB-1 packages linear cp32s via a headful mechanism with preferential packaging of plasmids containing the cp32 pac region. Additionally, we find ϕBB-1 packages fragments of the linear chromosome and full-length plasmids including lp54, cp26, and others. Furthermore, sequencing of ϕBB-1 packaged DNA allowed us to resolve the covalently closed hairpin telomeres for the linear B. burgdorferi chromosome and most linear plasmids in strain CA-11.2A. Collectively, our results shed light on the biology of the ubiquitous ϕBB-1 phage and further implicate ϕBB-1 in the generalized transduction of diverse genes and the maintenance of genetic diversity in Lyme disease spirochetes.
Affiliation(s)
- Dominick R. Faith
  - Division of Biological Sciences, University of Montana, Missoula, Montana, United States of America
- Margie Kinnersley
  - Division of Biological Sciences, University of Montana, Missoula, Montana, United States of America
- Diane M. Brooks
  - Division of Biological Sciences, University of Montana, Missoula, Montana, United States of America
- Dan Drecktrah
  - Division of Biological Sciences, University of Montana, Missoula, Montana, United States of America
- Laura S. Hall
  - Division of Biological Sciences, University of Montana, Missoula, Montana, United States of America
- Eric Luo
  - Vaccine and Infectious Disease Organization, Saskatoon, Canada
- Andrew Santiago-Frangos
  - Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
- Jenny Wachter
  - Vaccine and Infectious Disease Organization, Saskatoon, Canada
- D. Scott Samuels
  - Division of Biological Sciences, University of Montana, Missoula, Montana, United States of America
- Patrick R. Secor
  - Division of Biological Sciences, University of Montana, Missoula, Montana, United States of America
18
Du S, Wu Y, Ying H, Wu Z, Yang M, Chen F, Shao J, Liu H, Zhang Z, Zhao Y. Genome sequences of the first Autographiviridae phages infecting marine Roseobacter. Microb Genom 2024; 10. [PMID: 38630615 DOI: 10.1099/mgen.0.001240] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/19/2024]
Abstract
The ubiquitous and abundant marine phages play critical roles in shaping the composition and function of bacterial communities, impacting biogeochemical cycling in marine ecosystems. Autographiviridae is among the most abundant and ubiquitous phage families in the ocean. However, studies on the diversity and ecology of Autographiviridae phages in marine environments are restricted to isolates that infect SAR11 bacteria and cyanobacteria. In this study, ten new roseophages that infect marine Roseobacter strains were isolated from coastal waters. These new roseophages have a genome size ranging from 38 917 to 42 634 bp and G+C content of 44.6-50 %. Comparative genomics showed that they are similar to known Autographiviridae phages regarding gene content and architecture, thus representing the first Autographiviridae roseophages. Phylogenomic analysis based on concatenated conserved genes showed that the ten roseophages form three distinct subgroups within the Autographiviridae, and sequence analysis revealed that they belong to eight new genera. Finally, viromic read-mapping showed that these new Autographiviridae phages are widely distributed in global oceans, mostly inhabiting polar and estuarine locations. This study has expanded the current understanding of the genomic diversity, evolution and ecology of Autographiviridae phages and roseophages. We suggest that Autographiviridae phages play important roles in the mortality and community structure of roseobacters, and have broad ecological applications.
Affiliation(s)
- Sen Du
  - College of Juncao Science and Ecology, Fujian Agriculture and Forestry University, Fuzhou, PR China
- Ying Wu
  - College of Juncao Science and Ecology, Fujian Agriculture and Forestry University, Fuzhou, PR China
- Hanqi Ying
  - College of Juncao Science and Ecology, Fujian Agriculture and Forestry University, Fuzhou, PR China
- Zuqing Wu
  - College of Juncao Science and Ecology, Fujian Agriculture and Forestry University, Fuzhou, PR China
- Mingyu Yang
  - College of Juncao Science and Ecology, Fujian Agriculture and Forestry University, Fuzhou, PR China
- Feng Chen
  - Institute of Marine and Environmental Technology, University of Maryland Center for Environmental Science, Baltimore, Maryland, USA
- Jiabing Shao
  - College of Juncao Science and Ecology, Fujian Agriculture and Forestry University, Fuzhou, PR China
- He Liu
  - College of Juncao Science and Ecology, Fujian Agriculture and Forestry University, Fuzhou, PR China
- Zefeng Zhang
  - College of Juncao Science and Ecology, Fujian Agriculture and Forestry University, Fuzhou, PR China
- Yanlin Zhao
  - College of Juncao Science and Ecology, Fujian Agriculture and Forestry University, Fuzhou, PR China
  - Key Laboratory of Marine Biotechnology of Fujian Province, Institute of Oceanology, Fujian Agriculture and Forestry University, Fuzhou, PR China
19
Chen L, Banfield JF. COBRA improves the completeness and contiguity of viral genomes assembled from metagenomes. Nat Microbiol 2024; 9:737-750. [PMID: 38321183 PMCID: PMC10914622 DOI: 10.1038/s41564-023-01598-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/06/2023] [Accepted: 12/19/2023] [Indexed: 02/08/2024]
Abstract
Viruses are often studied using metagenome-assembled sequences, but genome incompleteness hampers comprehensive and accurate analyses. Contig Overlap Based Re-Assembly (COBRA) resolves assembly breakpoints based on the de Bruijn graph and joins contigs. Here we benchmarked COBRA using ocean and soil viral datasets. COBRA accurately joined the assembled sequences and achieved notably higher genome accuracy than binning tools. From 231 published freshwater metagenomes, we obtained 7,334 bacteriophage clusters, ~83% of which represent new phage species. Notably, ~70% of these were circular, compared with 34% before COBRA analyses. We expanded sampling of huge phages (≥200 kbp), the largest of which was curated to completion (717 kbp). Improved phage genomes from Rotsee Lake provided context for metatranscriptomic data and indicated the in situ activity of huge phages, whiB-encoding phages and cysC- and cysH-encoding phages. COBRA improves viral genome assembly contiguity and completeness, thus the accuracy and reliability of analyses of gene content, diversity and evolution.
Affiliation(s)
- LinXing Chen
  - Department of Earth and Planetary Sciences, University of California, Berkeley, Berkeley, CA, USA.
  - Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA, USA.
- Jillian F Banfield
  - Department of Earth and Planetary Sciences, University of California, Berkeley, Berkeley, CA, USA.
  - Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA, USA.
  - Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA, USA.
  - Department of Environmental Science, Policy, and Management, University of California, Berkeley, Berkeley, CA, USA.
  - Earth and Environmental Sciences, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
20
Cook R, Brown N, Rihtman B, Michniewski S, Redgwell T, Clokie M, Stekel DJ, Chen Y, Scanlan DJ, Hobman JL, Nelson A, Jones MA, Smith D, Millard A. The long and short of it: benchmarking viromics using Illumina, Nanopore and PacBio sequencing technologies. Microb Genom 2024; 10:001198. [PMID: 38376377 PMCID: PMC10926689 DOI: 10.1099/mgen.0.001198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/25/2023] [Accepted: 01/25/2024] [Indexed: 02/21/2024]
Abstract
Viral metagenomics has fuelled a rapid change in our understanding of global viral diversity and ecology. Long-read sequencing and hybrid assembly approaches that combine long- and short-read technologies are now being widely implemented in bacterial genomics and metagenomics. However, the use of long-read sequencing to investigate viral communities is still in its infancy. While Nanopore and PacBio technologies have been applied to viral metagenomics, it is not known to what extent different technologies will impact the reconstruction of the viral community. Thus, we constructed a mock bacteriophage community of previously sequenced phage genomes and sequenced them using Illumina, Nanopore and PacBio sequencing technologies and tested a number of different assembly approaches. When using a single sequencing technology, Illumina assemblies were the best at recovering phage genomes. Nanopore- and PacBio-only assemblies performed poorly in comparison to Illumina in both genome recovery and error rates, which both varied with the assembler used. The best Nanopore assembly had errors that manifested as SNPs and INDELs at frequencies 41 and 157 % higher than found in Illumina-only assemblies, respectively, while the best PacBio assemblies had SNPs and INDELs at frequencies 12 and 78 % higher than found in Illumina-only assemblies, respectively. Despite high read coverage, long-read-only assemblies recovered a maximum of one complete genome from any assembly, unless reads were down-sampled prior to assembly. Overall, the best approach was assembly by a combination of Illumina and Nanopore reads, which reduced error rates to levels comparable with short-read-only assemblies. The differences in genome recovery and error rates between technology and assembler had downstream impacts on gene prediction, viral prediction, and subsequent estimates of diversity within a sample. These findings will provide a starting point for others in the choice of reads and assembly algorithms for the analysis of viromes.
Affiliation(s)
- Ryan Cook
  - School of Veterinary Medicine and Science, University of Nottingham, Sutton Bonington Campus, College Road, Loughborough, Leicestershire, LE12 5RD, UK
- Nathan Brown
  - Centre for Phage Research, Dept Genetics and Genome Biology, University of Leicester, University Road, Leicester, Leicestershire, LE1 7RH, UK
- Branko Rihtman
  - School of Life Sciences, University of Warwick, Gibbet Hill Road, Coventry, CV4 7AL, UK
- Slawomir Michniewski
  - Warwick Medical School, University of Warwick, Gibbet Hill Road, Coventry, CV4 7AL, UK
- Tamsin Redgwell
  - COPSAC, Copenhagen Prospective Studies on Asthma in Childhood, Herlev and Gentofte Hospital, University of Copenhagen, Ledreborg Alle 34, 2820, Gentofte, Denmark
- Martha Clokie
  - Centre for Phage Research, Dept Genetics and Genome Biology, University of Leicester, University Road, Leicester, Leicestershire, LE1 7RH, UK
- Dov J. Stekel
  - School of Biosciences, University of Nottingham, Sutton Bonington Campus, College Road, Loughborough, Leicestershire, LE12 5RD, UK
  - Department of Mathematics and Applied Mathematics, University of Johannesburg, Rossmore 2029, South Africa
- Yin Chen
  - School of Life Sciences, University of Warwick, Gibbet Hill Road, Coventry, CV4 7AL, UK
- David J. Scanlan
  - School of Life Sciences, University of Warwick, Gibbet Hill Road, Coventry, CV4 7AL, UK
- Jon L. Hobman
  - School of Biosciences, University of Nottingham, Sutton Bonington Campus, College Road, Loughborough, Leicestershire, LE12 5RD, UK
- Andrew Nelson
  - Faculty of Health and Life Sciences, University of Northumbria, Newcastle upon Tyne, NE1 8ST, UK
- Michael A. Jones
  - School of Veterinary Medicine and Science, University of Nottingham, Sutton Bonington Campus, College Road, Loughborough, Leicestershire, LE12 5RD, UK
- Darren Smith
  - Faculty of Health and Life Sciences, University of Northumbria, Newcastle upon Tyne, NE1 8ST, UK
- Andrew Millard
  - Centre for Phage Research, Dept Genetics and Genome Biology, University of Leicester, University Road, Leicester, Leicestershire, LE1 7RH, UK
21
Faith DR, Kinnersley M, Brooks DM, Drecktrah D, Hall LS, Luo E, Santiago-Frangos A, Wachter J, Samuels DS, Secor PR. Characterization and genomic analysis of the Lyme disease spirochete bacteriophage ϕBB-1. bioRxiv 2024:2024.01.08.574763. [PMID: 38260690 PMCID: PMC10802411 DOI: 10.1101/2024.01.08.574763] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/24/2024]
Abstract
Lyme disease is a tick-borne infection caused by the spirochete Borrelia (Borreliella) burgdorferi. Borrelia species have highly fragmented genomes composed of a linear chromosome and a constellation of linear and circular plasmids, some of which are required throughout the enzootic cycle. Included in this plasmid repertoire by almost all Lyme disease spirochetes are the 32-kb circular plasmid cp32 prophages that are capable of lytic replication to produce infectious virions called ϕBB-1. While the B. burgdorferi genome contains evidence of horizontal transfer, the mechanisms of gene transfer between strains remain unclear. While we know that ϕBB-1 transduces cp32 and shuttle vector DNA during in vitro cultivation, the extent of ϕBB-1 DNA transfer is not clear. Herein, we use proteomics and long-read sequencing to further characterize ϕBB-1 virions. Our studies identified the cp32 pac region and revealed that ϕBB-1 packages linear cp32s via a headful mechanism with preferential packaging of plasmids containing the cp32 pac region. Additionally, we find ϕBB-1 packages fragments of the linear chromosome and full-length plasmids including lp54, cp26, and others. Furthermore, sequencing of ϕBB-1 packaged DNA allowed us to resolve the covalently closed hairpin telomeres for the linear B. burgdorferi chromosome and most linear plasmids in strain CA-11.2A. Collectively, our results shed light on the biology of the ubiquitous ϕBB-1 phage and further implicate ϕBB-1 in the generalized transduction of diverse genes and the maintenance of genetic diversity in Lyme disease spirochetes.
Affiliation(s)
- Dominick R. Faith
  - Division of Biological Sciences, University of Montana, Missoula, MT, USA
- Margie Kinnersley
  - Division of Biological Sciences, University of Montana, Missoula, MT, USA
- Diane M. Brooks
  - Division of Biological Sciences, University of Montana, Missoula, MT, USA
- Dan Drecktrah
  - Division of Biological Sciences, University of Montana, Missoula, MT, USA
- Laura S. Hall
  - Division of Biological Sciences, University of Montana, Missoula, MT, USA
- Eric Luo
  - Vaccine and Infectious Disease Organization, Saskatoon, SK, Canada
- Jenny Wachter
  - Vaccine and Infectious Disease Organization, Saskatoon, SK, Canada
- D. Scott Samuels
  - Division of Biological Sciences, University of Montana, Missoula, MT, USA
- Patrick R. Secor
  - Division of Biological Sciences, University of Montana, Missoula, MT, USA

22
Krause GR, Shands W, Wheeler TJ. Sensitive and error-tolerant annotation of protein-coding DNA with BATH. bioRxiv 2024:2023.12.31.573773. [PMID: 38260252 PMCID: PMC10802276 DOI: 10.1101/2023.12.31.573773] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Indexed: 01/24/2024]
Abstract
We present BATH, a tool for highly sensitive annotation of protein-coding DNA based on direct alignment of that DNA to a database of protein sequences or profile hidden Markov models (pHMMs). BATH is built on top of the HMMER3 code base, and simplifies the annotation workflow for pHMM-based annotation by providing a straightforward input interface and easy-to-interpret output. BATH also introduces novel frameshift-aware algorithms to detect frameshift-inducing nucleotide insertions and deletions (indels). BATH matches the accuracy of HMMER3 for annotation of sequences containing no errors, and produces superior accuracy to all tested tools for annotation of sequences containing nucleotide indels. These results suggest that BATH should be used when high annotation sensitivity is required, particularly when frameshift errors are expected to interrupt protein-coding regions, as is true with long read sequencing data and in the context of pseudogenes.
Affiliation(s)
- Genevieve R Krause
  - R. Ken Coit College of Pharmacy, University of Arizona, Tucson, Arizona, USA
  - Department of Computer Science, University of Montana, Missoula, Montana, USA
- Walt Shands
  - Department of Computer Science, University of Montana, Missoula, Montana, USA
  - UC Santa Cruz Genomics Institute, Santa Cruz, California, USA
- Travis J Wheeler
  - R. Ken Coit College of Pharmacy, University of Arizona, Tucson, Arizona, USA
  - Department of Computer Science, University of Montana, Missoula, Montana, USA

23
Fu P, Wu Y, Zhang Z, Qiu Y, Wang Y, Peng Y. VIGA: a one-stop tool for eukaryotic virus identification and genome assembly from next-generation-sequencing data. Brief Bioinform 2023; 25:bbad444. [PMID: 38048079 PMCID: PMC10753531 DOI: 10.1093/bib/bbad444] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/27/2023] [Revised: 10/26/2023] [Accepted: 11/11/2023] [Indexed: 12/05/2023] Open
Abstract
Identification of viruses and subsequent assembly of viral genomes from next-generation-sequencing (NGS) data are essential steps in virome studies. This study presents a one-stop tool named VIGA (available at https://github.com/viralInformatics/VIGA) for eukaryotic virus identification and genome assembly from NGS data. It is composed of four modules, namely identification, taxonomic annotation, assembly and novel virus discovery, which integrate several third-party tools such as BLAST, Trinity, MetaCompass and RagTag. Evaluation on multiple simulated and real virome datasets showed that VIGA assembled more complete virus genomes than its competitors on both metatranscriptomic and metagenomic data and performed well in assembling virus genomes at the strain level. Finally, VIGA was used to investigate the virome in metatranscriptomic data from the Human Microbiome Project and revealed differences in virome composition and positive rate among prediabetes, Crohn's disease and ulcerative colitis. Overall, VIGA should aid the identification and characterization of viromes, especially of known viruses, in future studies.
Affiliation(s)
- Ping Fu
  - Bioinformatics Center, College of Biology, Hunan Provincial Key Laboratory of Medical Virology, Hunan University, Changsha 410082, China
- Yifan Wu
  - Bioinformatics Center, College of Biology, Hunan Provincial Key Laboratory of Medical Virology, Hunan University, Changsha 410082, China
- Zhiyuan Zhang
  - Bioinformatics Center, College of Biology, Hunan Provincial Key Laboratory of Medical Virology, Hunan University, Changsha 410082, China
- Ye Qiu
  - Bioinformatics Center, College of Biology, Hunan Provincial Key Laboratory of Medical Virology, Hunan University, Changsha 410082, China
- Yirong Wang
  - Bioinformatics Center, College of Biology, Hunan Provincial Key Laboratory of Medical Virology, Hunan University, Changsha 410082, China
- Yousong Peng
  - Bioinformatics Center, College of Biology, Hunan Provincial Key Laboratory of Medical Virology, Hunan University, Changsha 410082, China

24
Rosani U, Corinaldesi C, Luongo G, Sollitto M, Dal Monego S, Licastro D, Bongiorni L, Venier P, Pallavicini A, Dell’Anno A. Viral Diversity in Benthic Abyssal Ecosystems: Ecological and Methodological Considerations. Viruses 2023; 15:2282. [PMID: 38140524 PMCID: PMC10747316 DOI: 10.3390/v15122282] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/10/2023] [Revised: 11/13/2023] [Accepted: 11/18/2023] [Indexed: 12/24/2023] Open
Abstract
Viruses are the most abundant 'biological entities' in the world's oceans. However, technical and methodological constraints limit our understanding of their diversity, particularly in benthic abyssal ecosystems (>4000 m depth). To verify advantages and limitations of analyzing virome DNA subjected either to random amplification or unamplified, we applied shotgun sequencing-by-synthesis to two sample pairs obtained from benthic abyssal sites located in the North-eastern Atlantic Ocean at ca. 4700 m depth. One amplified DNA sample was also subjected to single-molecule long-read sequencing for comparative purposes. Overall, we identified 24,828 viral Operational Taxonomic Units (vOTUs), belonging to 22 viral families. Viral reads were more abundant in the amplified DNA samples (38.5-49.9%) compared to the unamplified ones (4.4-5.8%), with the latter showing a greater viral diversity and 11-16% of dsDNA viruses almost undetectable in the amplified samples. From a procedural point of view, the viromes obtained by direct sequencing (without amplification step) provided a broader overview of both ss and dsDNA viral diversity. Nevertheless, our results suggest that the contextual use of random amplification of the same sample and long-read technology can improve the assessment of viral assemblages by reducing off-target reads.
Affiliation(s)
- Umberto Rosani
  - Department of Biology, University of Padova, Via U. Bassi 58/b, 35121 Padova, Italy
- Cinzia Corinaldesi
  - Department of Materials, Environmental Sciences and Urban Planning, Polytechnic University of Marche, Via Brecce Bianche, 60131 Ancona, Italy
- Gabriella Luongo
  - Department of Life and Environmental Sciences, Polytechnic University of Marche, Via Brecce Bianche, 60131 Ancona, Italy
- Marco Sollitto
  - Department of Life Sciences, University of Trieste, Via Licio Giorgeri 5, 34127 Trieste, Italy
  - Faculty of Mathematics, Natural Sciences and Information Technologies, University of Primorska, 6000 Koper, Slovenia
- Simeone Dal Monego
  - Laboratorio di Genomica ed Epigenomica, AREA Scienze Park, Padriciano 99, 34149 Trieste, Italy
- Danilo Licastro
  - Laboratorio di Genomica ed Epigenomica, AREA Scienze Park, Padriciano 99, 34149 Trieste, Italy
- Lucia Bongiorni
  - Consiglio Nazionale delle Ricerche, Istituto di Scienze Marine, Tesa 104–Arsenale, Castello 2737/F, 30122 Venezia, Italy
- Paola Venier
  - Department of Biology, University of Padova, Via U. Bassi 58/b, 35121 Padova, Italy
- Alberto Pallavicini
  - Department of Life Sciences, University of Trieste, Via Licio Giorgeri 5, 34127 Trieste, Italy
- Antonio Dell’Anno
  - Department of Life and Environmental Sciences, Polytechnic University of Marche, Via Brecce Bianche, 60131 Ancona, Italy

25
Deng WK, He JL, Chen JY, Wu RT, Xing SC, Liao XD. Effects of microplastics on functional genes related to CH4 and N2O metabolism in bacteriophages during manure composting and its planting applications. J Hazard Mater 2023; 460:132288. [PMID: 37611393 DOI: 10.1016/j.jhazmat.2023.132288] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Received: 06/04/2023] [Revised: 08/08/2023] [Accepted: 08/11/2023] [Indexed: 08/25/2023]
Abstract
Microplastics (MPs), a new type of pollutant, are widespread in livestock and poultry breeding and in agricultural soils. However, research on the effects of MP pollution on greenhouse gas emissions in combined planting and breeding systems is lacking, especially from the perspective of phage horizontal gene transfer. Therefore, this paper explores the effects of MPs on functional genes related to CH4 and N2O metabolism in bacteriophages during manure composting and its planting applications. The results indicated that the addition of MPs affected both the physicochemical properties and the microbial community structure of manure during composting and of the compost-applied rhizosphere soil of lettuce (Lactuca sativa). Specifically, on day 7 of composting, mcrA/pmoA and (nirS+nirK) levels in bacteria in the MP group significantly increased. Additionally, the MP group had higher average temperatures during the high-temperature period of composting, which led to a rapid reduction in phages. However, phage levels quickly recovered during the cooling period. Furthermore, the addition of MPs to the rhizosphere soil resulted in higher levels of nirK. These changes may affect greenhouse gas emissions.
Affiliation(s)
- Wei-Kang Deng
  - College of Animal Science, South China Agricultural University, Guangzhou 510642, Guangdong, China
- Jun-Liang He
  - College of Animal Science, South China Agricultural University, Guangzhou 510642, Guangdong, China
- Jing-Yuan Chen
  - College of Animal Science, South China Agricultural University, Guangzhou 510642, Guangdong, China
- Rui-Ting Wu
  - College of Animal Science, South China Agricultural University, Guangzhou 510642, Guangdong, China
- Si-Cheng Xing
  - Integrative Microbiology Research Centre, South China Agricultural University, Guangzhou 510642, China
  - Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry Agriculture, Guangzhou 510642, Guangdong, China
  - National-Local Joint Engineering Research Center for Livestock Breeding, Guangzhou 510642, Guangdong, China
- Xin-Di Liao
  - College of Animal Science, South China Agricultural University, Guangzhou 510642, Guangdong, China
  - Guangdong Provincial Key Lab of Agro-Animal Genomics and Molecular Breeding, and Key Laboratory of Chicken Genetics, Breeding and Reproduction, Ministry Agriculture, Guangzhou 510642, Guangdong, China
  - National-Local Joint Engineering Research Center for Livestock Breeding, Guangzhou 510642, Guangdong, China
  - State Key Laboratory of Swine and Poultry Breeding Industry, Guangzhou 510642, Guangdong, China

26
Yang M, Du S, Zhang Z, Xia Q, Liu H, Qin F, Wu Z, Ying H, Wu Y, Shao J, Zhao Y. Genomic diversity and biogeographic distributions of a novel lineage of bacteriophages that infect marine OM43 bacteria. Microbiol Spectr 2023; 11:e0494222. [PMID: 37607063 PMCID: PMC10580990 DOI: 10.1128/spectrum.04942-22] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 12/03/2022] [Accepted: 07/07/2023] [Indexed: 08/24/2023] Open
Abstract
The marine methylotrophic OM43 clade is considered an important bacterial group in coastal microbial communities. OM43 bacteria, which are closely related to phytoplankton blooms, have small cell sizes and streamlined genomes. Bacteriophages profoundly shape the evolutionary trajectories, population dynamics, and physiology of microbes. The prevalence and diversity of several phages that infect OM43 bacteria have been reported. In this study, we isolated and sequenced two novel OM43 phages, MEP401 and MEP402. These phages share 90% of their open reading frames (ORFs) and are distinct from other known phage isolates. Furthermore, a total of 99 metagenomic viral genomes (MVGs) closely related to MEP401 and MEP402 were identified. Phylogenomic analyses suggest that MEP401, MEP402, and these identified MVGs belong to a novel subfamily in the family Zobellviridae and that they can be separated into two groups. Group I MVGs show conserved whole-genome synteny with MEP401, while group II MVGs possess the MEP401-type DNA replication module and a distinct type of morphogenesis and packaging module, suggesting that genomic recombination occurred between phages. Most members in these two groups were predicted to infect OM43 bacteria. Metagenomic read-mapping analysis revealed that the phages in these two groups are globally ubiquitous and display distinct biogeographic distributions, with some phages being predominant in cold regions, some exclusively detected in estuarine stations, and others displaying wider distributions. This study expands our knowledge of the diversity and ecology of a novel phage lineage that infects OM43 bacteria by describing their genomic diversity and global distribution patterns.
IMPORTANCE: OM43 phages that infect marine OM43 bacteria are important for host mortality, community structure, and physiological functions. In this study, two OM43 phages were isolated and characterized. Metagenomic viral genome (MVG) retrieval using these two OM43 phages as baits led to the identification of two phage groups of a new subfamily in the family Zobellviridae. We found that group I MVGs share similar genomic content and arrangement with MEP401 and MEP402, whereas group II MVGs only possess the MEP401-type DNA replication module. Metagenomic mapping analysis suggests that members in these two groups are globally ubiquitous with distinct distribution patterns. This study provides important insights into the genomic diversity and biogeography of the OM43 phages in the global ocean.
Affiliation(s)
- Mingyu Yang
  - Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
- Sen Du
  - Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
- Zefeng Zhang
  - Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
- Qian Xia
  - Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
- He Liu
  - Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
- Fang Qin
  - Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
- Zuqing Wu
  - Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
- Hanqi Ying
  - Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
- Yin Wu
  - Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
- Jiabing Shao
  - Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
- Yanlin Zhao
  - Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
  - Key Laboratory of Marine Biotechnology of Fujian Province, Institute of Oceanology, Fujian Agriculture and Forestry University, Fuzhou, China

27
Cheng Z, Li X, Palomo A, Yang Q, Han L, Wu Z, Li Z, Zhang M, Chen L, Zhao B, Yu K, Zhang C, Hou S, Zheng Y, Xia Y. Virus impacted community adaptation in oligotrophic groundwater environment revealed by Hi-C coupled metagenomic and viromic study. J Hazard Mater 2023; 458:131944. [PMID: 37390685 DOI: 10.1016/j.jhazmat.2023.131944] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 01/24/2023] [Revised: 06/21/2023] [Accepted: 06/24/2023] [Indexed: 07/02/2023]
Abstract
Viruses play a crucial role in microbial mortality, diversity and biogeochemical cycles. Groundwater is the largest global freshwater reservoir and one of the most oligotrophic aquatic systems on Earth, but how microbial and viral communities are shaped in this special habitat is largely unexplored. In this study, we collected groundwater samples from aquifers at depths of 23 to 60 m at Yinchuan Plain, China. In total, 1920 non-redundant viral contigs were retrieved from metagenomes and viromes constructed by Illumina and Nanopore hybrid sequencing. Only 3% of them could be clustered with known viruses, most of which were Caudoviricetes. Coupling 1.2 Tb of Hi-C sequencing with CRISPR matching and homology search, we connected 469 viruses with their hosts, while some viral clusters presented a broad-host-range trait. Meanwhile, a large proportion of biosynthesis-related auxiliary metabolism genes were identified. These characteristics might help viruses survive in this special oligotrophic environment. Additionally, the groundwater virome showed genomic features distinct from those of the open ocean and wastewater treatment facilities in GC distribution and unannotated gene compositions. This paper expands the current knowledge of the global viromic records and serves as a foundation for a more thorough understanding of viruses in groundwater.
Affiliation(s)
- Zhanwen Cheng
  - School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
  - State Environmental Protection Key Laboratory of Integrated Surface Water-Groundwater Pollution Control, School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
- Xiang Li
  - School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
  - State Environmental Protection Key Laboratory of Integrated Surface Water-Groundwater Pollution Control, School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
  - Guangdong Provincial Key Laboratory of Soil and Groundwater Pollution Control, School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
- Alejandro Palomo
  - School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
  - State Environmental Protection Key Laboratory of Integrated Surface Water-Groundwater Pollution Control, School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
  - Guangdong Provincial Key Laboratory of Soil and Groundwater Pollution Control, School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
- Qing Yang
  - School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
  - State Environmental Protection Key Laboratory of Integrated Surface Water-Groundwater Pollution Control, School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
- Long Han
  - School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
  - State Environmental Protection Key Laboratory of Integrated Surface Water-Groundwater Pollution Control, School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
- Ziqi Wu
  - School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
  - State Environmental Protection Key Laboratory of Integrated Surface Water-Groundwater Pollution Control, School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
- Zengyi Li
  - School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
  - State Environmental Protection Key Laboratory of Integrated Surface Water-Groundwater Pollution Control, School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
- Miao Zhang
  - School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
  - State Environmental Protection Key Laboratory of Integrated Surface Water-Groundwater Pollution Control, School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
- Liming Chen
  - School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
  - State Environmental Protection Key Laboratory of Integrated Surface Water-Groundwater Pollution Control, School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
- Bixi Zhao
  - School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
  - State Environmental Protection Key Laboratory of Integrated Surface Water-Groundwater Pollution Control, School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
- Kaiqiang Yu
  - School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
  - State Environmental Protection Key Laboratory of Integrated Surface Water-Groundwater Pollution Control, School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
- Chuanlun Zhang
  - Shenzhen Key Laboratory of Marine Archaea Geo-Omics, Department of Ocean Science and Department of Ocean Science & Engineering, Southern University of Science and Technology, Shenzhen 518055, China
  - Southern Marine Science and Engineering Guangdong Laboratory, Guangzhou 510000, China
- Shengwei Hou
  - Shenzhen Key Laboratory of Marine Archaea Geo-Omics, Department of Ocean Science and Department of Ocean Science & Engineering, Southern University of Science and Technology, Shenzhen 518055, China
  - Southern Marine Science and Engineering Guangdong Laboratory, Guangzhou 510000, China
- Yan Zheng
  - School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
  - State Environmental Protection Key Laboratory of Integrated Surface Water-Groundwater Pollution Control, School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
  - Guangdong Provincial Key Laboratory of Soil and Groundwater Pollution Control, School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
  - Shenzhen Key Laboratory of Marine Archaea Geo-Omics, Department of Ocean Science and Department of Ocean Science & Engineering, Southern University of Science and Technology, Shenzhen 518055, China
- Yu Xia
  - School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
  - State Environmental Protection Key Laboratory of Integrated Surface Water-Groundwater Pollution Control, School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
  - Guangdong Provincial Key Laboratory of Soil and Groundwater Pollution Control, School of Environmental Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China

28
Bass D, Christison KW, Stentiford GD, Cook LSJ, Hartikainen H. Environmental DNA/RNA for pathogen and parasite detection, surveillance, and ecology. Trends Parasitol 2023; 39:285-304. [PMID: 36759269 DOI: 10.1016/j.pt.2022.12.010] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Received: 10/27/2022] [Revised: 12/20/2022] [Accepted: 12/26/2022] [Indexed: 02/11/2023]
Abstract
Detection of pathogens, parasites, and other symbionts in environmental samples via eDNA/eRNA (collectively eNA) is an increasingly important source of information about their occurrence and activity. There is great potential for using such detections as a proxy for infection of host organisms in connected habitats, for pathogen monitoring and surveillance, and for early warning systems for disease. However, many factors require consideration, and appropriate methods developed and verified, in order that eNA detections can be reliably interpreted and adopted for surveillance and assessment of disease risk, and potentially inclusion in international standards, such as the World Organisation for Animal Health guidelines. Disease manifestation results from host-symbiont-environment interactions between hosts, demanding a multifactorial approach to interpretation of eNA signals.
Affiliation(s)
- David Bass
  - International Centre of Excellence for Aquatic Animal Health, The Centre for Environment, Fisheries and Aquaculture Science, Weymouth, UK
  - Sustainable Aquaculture Futures, Biosciences, College of Life and Environmental Sciences, University of Exeter, Stocker Road, Exeter, UK
- Kevin W Christison
  - Department of Biodiversity and Conservation Biology, University of the Western Cape, Private Bag X17, Bellville, 7535, South Africa
  - Department of Forestry, Fisheries and the Environment, Private Bag X2, Vlaeberg, 8012, South Africa
- Grant D Stentiford
  - International Centre of Excellence for Aquatic Animal Health, The Centre for Environment, Fisheries and Aquaculture Science, Weymouth, UK
  - Sustainable Aquaculture Futures, Biosciences, College of Life and Environmental Sciences, University of Exeter, Stocker Road, Exeter, UK
- Lauren S J Cook
  - International Centre of Excellence for Aquatic Animal Health, The Centre for Environment, Fisheries and Aquaculture Science, Weymouth, UK
  - Royal Holloway, University of London, Egham Hill, Egham TW20 0EX, UK
- Hanna Hartikainen
  - University of Nottingham, School of Life Sciences, University Park, NG7 2RD, Nottingham, UK

29
Walsh AM, Leech J, Huttenhower C, Delhomme-Nguyen H, Crispie F, Chervaux C, Cotter P. Integrated molecular approaches for fermented food microbiome research. FEMS Microbiol Rev 2023; 47:fuad001. [PMID: 36725208 PMCID: PMC10002906 DOI: 10.1093/femsre/fuad001] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Received: 02/08/2022] [Revised: 12/28/2022] [Accepted: 01/09/2023] [Indexed: 02/03/2023] Open
Abstract
Molecular technologies, including high-throughput sequencing, have expanded our perception of the microbial world. Unprecedented insights into the composition and function of microbial communities have generated large interest, with numerous landmark studies published in recent years relating the important roles of microbiomes and the environment (especially diet and nutrition) in human, animal, and global health. As such, food microbiomes represent an important cross-over between the environment and host. This is especially true of fermented food microbiomes, which actively introduce microbial metabolites and, to a lesser extent, live microbes into the human gut. Here, we discuss the history of fermented foods, and examine how molecular approaches have advanced research of these fermented foods over the past decade. We highlight how various molecular approaches have helped us to understand the ways in which microbes shape the qualities of these products, and we summarize the impacts of consuming fermented foods on the gut. Finally, we explore how advances in bioinformatics could be leveraged to enhance our understanding of fermented foods. This review highlights how integrated molecular approaches are changing our understanding of the microbial communities associated with food fermentation, the creation of unique food products, and their influences on the human microbiome and health.
Affiliation(s)
- Aaron M Walsh
  - Teagasc Food Research Centre, Moorepark, Fermoy, Cork and APC Microbiome Ireland, P61 C996, Ireland
  - Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
  - Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA
- John Leech
  - Teagasc Food Research Centre, Moorepark, Fermoy, Cork and APC Microbiome Ireland, P61 C996, Ireland
- Curtis Huttenhower
  - Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
  - Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA
- Fiona Crispie
  - Teagasc Food Research Centre, Moorepark, Fermoy, Cork and APC Microbiome Ireland, P61 C996, Ireland
- Christian Chervaux
  - Danone Nutricia Research, Centre Daniel Carasso, Palaiseau 91120, France
- Paul D Cotter
  - Teagasc Food Research Centre, Moorepark, Fermoy, Cork and APC Microbiome Ireland, P61 C996, Ireland

30
Tithi SS, Aylward FO, Jensen RV, Zhang L. FastViromeExplorer-Novel: Recovering Draft Genomes of Novel Viruses and Phages in Metagenomic Data. J Comput Biol 2023; 30:391-408. [PMID: 36607772 DOI: 10.1089/cmb.2022.0397] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/07/2023]
Abstract
Despite the recent surge of viral metagenomic studies, recovering complete virus/phage genomes from metagenomic data is still extremely difficult, and most viral contigs generated by de novo assembly programs are highly fragmented, posing serious challenges to downstream analysis and inference. In this study, we develop FastViromeExplorer (FVE)-novel, a computational pipeline for reconstructing complete or near-complete viral draft genomes from metagenomic data. FVE-novel deploys FVE to efficiently map metagenomic reads to viral reference genomes, performs de novo assembly of the mapped reads to generate contigs, and extends the contigs through iterative assembly to produce final viral scaffolds. We applied FVE-novel to an ocean metagenomic sample and obtained 268 viral scaffolds that potentially come from novel viruses. Through manual examination and validation of the 10 longest scaffolds, we successfully recovered 4 complete viral genomes: 2 are novel, as they cannot be found in the existing databases, and the other 2 are related to known phages. The hybrid reference-based and de novo assembly strategy used by FVE-novel represents a powerful new approach for uncovering near-complete viral genomes in metagenomic data.
Affiliation(s)
- Frank O Aylward
- Department of Biological Sciences, Virginia Tech, Blacksburg, Virginia, USA
- Roderick V Jensen
- Department of Biological Sciences, Virginia Tech, Blacksburg, Virginia, USA
- Liqing Zhang
- Department of Computer Science, Virginia Tech, Blacksburg, Virginia, USA
|
31
|
Hackl T, Laurenceau R, Ankenbrand MJ, Bliem C, Cariani Z, Thomas E, Dooley KD, Arellano AA, Hogle SL, Berube P, Leventhal GE, Luo E, Eppley JM, Zayed AA, Beaulaurier J, Stepanauskas R, Sullivan MB, DeLong EF, Biller SJ, Chisholm SW. Novel integrative elements and genomic plasticity in ocean ecosystems. Cell 2023; 186:47-62.e16. [PMID: 36608657 DOI: 10.1016/j.cell.2022.12.006] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Received: 03/29/2021] [Revised: 09/16/2022] [Accepted: 12/05/2022] [Indexed: 01/07/2023]
Abstract
Horizontal gene transfer accelerates microbial evolution. The marine picocyanobacterium Prochlorococcus exhibits high genomic plasticity, yet the underlying mechanisms are elusive. Here, we report a novel family of DNA transposons, "tycheposons," some of which are viral satellites while others carry cargo, such as nutrient-acquisition genes, which shape the genetic variability in this globally abundant genus. Tycheposons share distinctive mobile-lifecycle-linked hallmark genes, including a deep-branching site-specific tyrosine recombinase. Their excision and integration at tRNA genes appear to drive the remodeling of genomic islands, which are key reservoirs for flexible genes in bacteria. In a selection experiment, tycheposons harboring a nitrate assimilation cassette were dynamically gained and lost, thereby promoting chromosomal rearrangements and host adaptation. Vesicles and phage particles harvested from seawater are enriched in tycheposons, providing a means for their dispersal in the wild. Similar elements are found in microbes co-occurring with Prochlorococcus, suggesting a common mechanism for microbial diversification in the vast oligotrophic oceans.
Affiliation(s)
- Thomas Hackl
- Massachusetts Institute of Technology, Department of Civil and Environmental Engineering, Cambridge, MA 02139, USA; Groningen Institute for Evolutionary Life Sciences, University of Groningen, 9700CC Groningen, the Netherlands.
- Raphaël Laurenceau
- Massachusetts Institute of Technology, Department of Civil and Environmental Engineering, Cambridge, MA 02139, USA
- Markus J Ankenbrand
- Massachusetts Institute of Technology, Department of Civil and Environmental Engineering, Cambridge, MA 02139, USA; University of Würzburg, Center for Computational and Theoretical Biology, 97070 Würzburg, Germany
- Christina Bliem
- Massachusetts Institute of Technology, Department of Civil and Environmental Engineering, Cambridge, MA 02139, USA
- Zev Cariani
- Massachusetts Institute of Technology, Department of Civil and Environmental Engineering, Cambridge, MA 02139, USA
- Elaina Thomas
- Massachusetts Institute of Technology, Department of Civil and Environmental Engineering, Cambridge, MA 02139, USA
- Keven D Dooley
- Massachusetts Institute of Technology, Department of Civil and Environmental Engineering, Cambridge, MA 02139, USA
- Aldo A Arellano
- Massachusetts Institute of Technology, Department of Civil and Environmental Engineering, Cambridge, MA 02139, USA
- Shane L Hogle
- Massachusetts Institute of Technology, Department of Civil and Environmental Engineering, Cambridge, MA 02139, USA
- Paul Berube
- Massachusetts Institute of Technology, Department of Civil and Environmental Engineering, Cambridge, MA 02139, USA
- Gabriel E Leventhal
- Massachusetts Institute of Technology, Department of Civil and Environmental Engineering, Cambridge, MA 02139, USA
- Elaine Luo
- Daniel K. Inouye Center for Microbial Oceanography, Research and Education, University of Hawai'i Manoa, Honolulu, HI 96822, USA
- John M Eppley
- Daniel K. Inouye Center for Microbial Oceanography, Research and Education, University of Hawai'i Manoa, Honolulu, HI 96822, USA
- Ahmed A Zayed
- EMERGE Biology Integration Institute, Ohio State University, Columbus, OH 43210, USA; Center of Microbiome Science, Ohio State University, Columbus, OH 43210, USA
- Matthew B Sullivan
- Department of Microbiology & Department of Civil, Environmental, and Geodetic Engineering, Ohio State University, Columbus, OH 43210, USA; EMERGE Biology Integration Institute, Ohio State University, Columbus, OH 43210, USA; Center of Microbiome Science, Ohio State University, Columbus, OH 43210, USA
- Edward F DeLong
- Daniel K. Inouye Center for Microbial Oceanography, Research and Education, University of Hawai'i Manoa, Honolulu, HI 96822, USA
- Steven J Biller
- Wellesley College, Department of Biological Sciences, Wellesley, MA 02481, USA
- Sallie W Chisholm
- Massachusetts Institute of Technology, Department of Civil and Environmental Engineering, Cambridge, MA 02139, USA; Massachusetts Institute of Technology, Department of Biology, Cambridge, MA 02139, USA.
|
32
|
Zhang Z, Wu Z, Liu H, Yang M, Wang R, Zhao Y, Chen F. Genomic analysis and characterization of phages infecting the marine Roseobacter CHAB-I-5 lineage reveal a globally distributed and abundant phage genus. Front Microbiol 2023; 14:1164101. [PMID: 37138617 PMCID: PMC10149686 DOI: 10.3389/fmicb.2023.1164101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/12/2023] [Accepted: 03/27/2023] [Indexed: 05/05/2023] Open
Abstract
Marine phages play an important role in marine biogeochemical cycles by regulating the death, physiological metabolism, and evolutionary trajectory of bacteria. The Roseobacter group is an abundant and important heterotrophic bacterial group in the ocean, and plays an important role in carbon, nitrogen, sulfur and phosphorus cycling. The CHAB-I-5 lineage is one of the most dominant Roseobacter lineages, but remains largely uncultured. Phages infecting CHAB-I-5 bacteria have not yet been investigated due to the lack of culturable CHAB-I-5 strains. In this study, we isolated and sequenced two new phages (CRP-901 and CRP-902) infecting the CHAB-I-5 strain FZCC0083. We applied metagenomic data mining, comparative genomics, phylogenetic analysis, and metagenomic read-mapping to investigate the diversity, evolution, taxonomy, and biogeography of the phage group represented by the two phages. The two phages are highly similar, with an average nucleotide identity of 89.17%, and sharing 77% of their open reading frames. We identified several genes involved in DNA replication and metabolism, virion structure, DNA packing, and host lysis from their genomes. Metagenomic mining identified 24 metagenomic viral genomes closely related to CRP-901 and CRP-902. Genomic comparison and phylogenetic analysis demonstrated that these phages are distinct from other known viruses, representing a novel genus-level phage group (CRP-901-type). The CRP-901-type phages do not contain DNA primase and DNA polymerase genes, but possess a novel bifunctional DNA primase-polymerase gene with both primase and polymerase activities. Read-mapping analysis showed that the CRP-901-type phages are widespread across the world's oceans and are most abundant in estuarine and polar waters. Their abundance is generally higher than other known roseophages and even higher than most pelagiphages in the polar region. 
In summary, this study has greatly expanded our understanding of the genetic diversity, evolution, and distribution of roseophages. Our analysis suggests that the CRP-901-type phage is an important and novel marine phage group that plays important roles in the physiology and ecology of roseobacters.
Affiliation(s)
- Zefeng Zhang
- Institute of Marine Science and Technology, Shandong University, Qingdao, China
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
- Zuqing Wu
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
- He Liu
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
- Mingyu Yang
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
- Rui Wang
- Institute of Marine Science and Technology, Shandong University, Qingdao, China
- Yanlin Zhao
- Fujian Provincial Key Laboratory of Agroecological Processing and Safety Monitoring, College of Life Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
- Correspondence: Yanlin Zhao
- Feng Chen
- Institute of Marine and Environmental Technology, University of Maryland Center for Environmental Science, Baltimore, MD, United States
- Correspondence: Feng Chen
|
33
|
Hackl T, Laurenceau R, Ankenbrand MJ, Bliem C, Cariani Z, Thomas E, Dooley KD, Arellano AA, Hogle SL, Berube P, Leventhal GE, Luo E, Eppley JM, Zayed AA, Beaulaurier J, Stepanauskas R, Sullivan MB, DeLong EF, Biller SJ, Chisholm SW. Novel integrative elements and genomic plasticity in ocean ecosystems. Cell 2023. [DOI: 10.1016/j.cell.2022.12.006] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 01/09/2023]
|
34
|
Marquet M, Hölzer M, Pletz MW, Viehweger A, Makarewicz O, Ehricht R, Brandt C. What the Phage: a scalable workflow for the identification and analysis of phage sequences. Gigascience 2022; 11:giac110. [PMID: 36399058 PMCID: PMC9673492 DOI: 10.1093/gigascience/giac110] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Received: 05/25/2022] [Revised: 08/24/2022] [Accepted: 10/17/2022] [Indexed: 11/19/2022] Open
Abstract
Phages are among the most abundant and diverse biological entities on earth. Phage prediction from sequence data is a crucial first step to understanding their impact on the environment. A variety of bacteriophage prediction tools have been developed over the years. They differ in algorithmic approach, results, and ease of use. We, therefore, developed "What the Phage" (WtP), an easy-to-use and parallel multitool approach for phage prediction combined with an annotation and classification downstream strategy, thus supporting the user's decision-making process by summarizing the results of the different prediction tools in charts and tables. WtP is reproducible and scales to thousands of datasets through a workflow manager (Nextflow). WtP is freely available under a GPL-3.0 license (https://github.com/replikation/What_the_Phage).
Affiliation(s)
- Mike Marquet
- Institute of Infectious Diseases and Infection Control, Jena-University Hospital/Friedrich Schiller University, Jena 07747, Germany
- Center of Sepsis Control and Care (CSCC), Jena 07747, Germany
- Leibniz Center for Photonics in Infection Research (LPI), Jena 07747, Germany
- Martin Hölzer
- Bioinformatics and Systems Biology, Robert Koch Institute, Berlin 13353, Germany
- Mathias W Pletz
- Institute of Infectious Diseases and Infection Control, Jena-University Hospital/Friedrich Schiller University, Jena 07747, Germany
- Center of Sepsis Control and Care (CSCC), Jena 07747, Germany
- Leibniz Center for Photonics in Infection Research (LPI), Jena 07747, Germany
- InfectoGnostics Research Campus, Jena 07747, Germany
- Adrian Viehweger
- Medical Microbiology and Virology, University Hospital Leipzig, Leipzig 04103, Germany
- Oliwia Makarewicz
- Institute of Infectious Diseases and Infection Control, Jena-University Hospital/Friedrich Schiller University, Jena 07747, Germany
- Center of Sepsis Control and Care (CSCC), Jena 07747, Germany
- Leibniz Center for Photonics in Infection Research (LPI), Jena 07747, Germany
- InfectoGnostics Research Campus, Jena 07747, Germany
- Ralf Ehricht
- InfectoGnostics Research Campus, Jena 07747, Germany
- Optisch-molekulare Diagnostik und Systemtechnologie, Leibniz Institute of Photonic Technology (Leibniz-IPHT), Jena 07747, Germany
- Institute of Physical Chemistry, Friedrich-Schiller-University Jena, Jena 07747, Germany
- Christian Brandt
- Institute of Infectious Diseases and Infection Control, Jena-University Hospital/Friedrich Schiller University, Jena 07747, Germany
- Leibniz Center for Photonics in Infection Research (LPI), Jena 07747, Germany
- InfectoGnostics Research Campus, Jena 07747, Germany
|
35
|
Dotto-Maurel A, Pelletier C, Morga B, Jacquot M, Faury N, Dégremont L, Bereszczynki M, Delmotte J, Escoubas JM, Chevignon G. Evaluation of tangential flow filtration coupled to long-read sequencing for ostreid herpesvirus type 1 genome assembly. Microb Genom 2022; 8:mgen000895. [PMID: 36355418 PMCID: PMC9836095 DOI: 10.1099/mgen.0.000895] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/12/2022] Open
Abstract
Whole-genome sequencing is widely used to better understand the transmission dynamics, the evolution and the emergence of new variants of viral pathogens. This can bring crucial information to stakeholders for disease management. Unfortunately, aquatic virus genomes are usually difficult to characterize because most of these viruses cannot be easily propagated in vitro. Developing methodologies for routine genome sequencing of aquatic viruses is timely given the ongoing threat of disease emergence. This is particularly true for pathogenic viruses infecting species of commercial interest that are widely exchanged between production basins or countries. For example, the ostreid herpesvirus type 1 (OsHV-1) is a herpesvirus widely associated with mass mortality events of juvenile Pacific oyster Crassostrea gigas. Genomes of herpesviruses are large and complex, with long direct and inverted terminal repeats. In addition, OsHV-1 is unculturable. It therefore accumulates several features that make its genome sequencing and assembly challenging. To overcome these difficulties, we developed a tangential flow filtration (TFF) method to enrich OsHV-1 infective particles from infected host tissues. This virus purification allowed us to extract high-molecular-weight, high-quality viral DNA that was subjected to Illumina short-read and Nanopore long-read sequencing. Dedicated bioinformatic pipelines were developed to assemble complete OsHV-1 genomes with reads from both sequencing technologies. Nanopore sequencing allowed characterization of new structural variations and major viral isomers while having 99.98% nucleotide identity with the Illumina-assembled genome. Our study shows that the TFF-based purification method, coupled with Nanopore sequencing, is a promising approach to enable in-field sequencing of unculturable aquatic DNA viruses.
Affiliation(s)
- Jean Delmotte
- IHPE, Univ. Montpellier, CNRS, Ifremer, UPVD, F-34095 Montpellier, France
- Jean-Michel Escoubas
- IHPE, Univ. Montpellier, CNRS, Ifremer, UPVD, F-34095 Montpellier, France
- Correspondence: Jean-Michel Escoubas
- Germain Chevignon
- Ifremer, ASIM, F-17390 La Tremblade, France
- Correspondence: Germain Chevignon
|
36
|
Eppley JM, Biller SJ, Luo E, Burger A, DeLong EF. Marine viral particles reveal an expansive repertoire of phage-parasitizing mobile elements. Proc Natl Acad Sci U S A 2022; 119:e2212722119. [PMID: 36256808 PMCID: PMC9618062 DOI: 10.1073/pnas.2212722119] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Received: 07/26/2022] [Accepted: 09/22/2022] [Indexed: 11/19/2022] Open
Abstract
Phage satellites are mobile genetic elements that propagate by parasitizing bacteriophage replication. We report here the discovery of abundant and diverse phage satellites that were packaged as concatemeric repeats within naturally occurring bacteriophage particles in seawater. These same phage-parasitizing mobile elements were found integrated in the genomes of dominant co-occurring bacterioplankton species. Like known phage satellites, many marine phage satellites encoded genes for integration, DNA replication, phage interference, and capsid assembly. Many also contained distinctive gene suites indicative of unique virus hijacking, phage immunity, and mobilization mechanisms. Marine phage satellite sequences were widespread in local and global oceanic virioplankton populations, reflecting their ubiquity, abundance, and temporal persistence in marine planktonic communities worldwide. Their gene content and putative life cycles suggest they may impact host-cell phage immunity and defense, lateral gene transfer, bacteriophage-induced cell mortality and cellular host and virus productivity. Given that marine phage satellites cannot be distinguished from bona fide viral particles via commonly used microscopic techniques, their predicted numbers (∼3.2 × 10^26 in the ocean) may influence current estimates of virus densities, production, and virus-induced mortality. In total, the data suggest that marine phage satellites have potential to significantly impact the ecology and evolution of bacteria and their viruses throughout the oceans. We predict that any habitat that harbors bacteriophage will also harbor similar phage satellites, making them a ubiquitous feature of most microbiomes on Earth.
Affiliation(s)
- John M. Eppley
- Daniel K. Inouye Center for Microbial Oceanography: Research and Education, University of Hawaii, Honolulu, HI 96822
- Steven J. Biller
- Department of Biological Sciences, Wellesley College, Wellesley, MA 02481
- Elaine Luo
- Daniel K. Inouye Center for Microbial Oceanography: Research and Education, University of Hawaii, Honolulu, HI 96822
- Andrew Burger
- Daniel K. Inouye Center for Microbial Oceanography: Research and Education, University of Hawaii, Honolulu, HI 96822
- Edward F. DeLong
- Daniel K. Inouye Center for Microbial Oceanography: Research and Education, University of Hawaii, Honolulu, HI 96822
|
37
|
Locke H, Bidle KD, Thamatrakoln K, Johns CT, Bonachela JA, Ferrell BD, Wommack KE. Marine viruses and climate change: Virioplankton, the carbon cycle, and our future ocean. Adv Virus Res 2022; 114:67-146. [PMID: 39492214 DOI: 10.1016/bs.aivir.2022.09.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/04/2023]
Abstract
Interactions between marine viruses and microbes are a critical part of the oceanic carbon cycle. The impacts of virus-host interactions range from short-term disruptions in the mobility of microbial biomass carbon to higher trophic levels through cell lysis (i.e., the viral shunt) to long-term reallocation of microbial biomass carbon to the deep sea through accelerating the biological pump (i.e., the viral shuttle). The biogeochemical backdrop of the ocean (the physical, chemical, and biological landscape) influences the likelihood of both virus-host interactions and particle formation, and the fate and flow of carbon. As climate change reshapes the oceanic landscape through large-scale shifts in temperature, circulation, stratification, and acidification, virus-mediated carbon flux is likely to shift in response. Dynamics in the directionality and magnitude of changes in how, where, and when viruses mediate the recycling or storage of microbial biomass carbon are largely unknown. Integrating viral infection dynamics data obtained from experimental models and field systems, with particle motion microphysics and global observations of oceanic biogeochemistry, into improved ecosystem models will enable viral oceanographers to better predict the role of viruses in marine carbon cycling in the future ocean.
Affiliation(s)
- Hannah Locke
- Univ. of Delaware, Delaware Biotechnology Inst., Newark, DE, United States
- Kay D Bidle
- Rutgers Univ., Dept. of Marine & Coastal Sciences, New Brunswick, NJ, United States
- Christopher T Johns
- Rutgers Univ., Dept. of Marine & Coastal Sciences, New Brunswick, NJ, United States
- Juan A Bonachela
- Rutgers Univ., Dept. of Ecology, Evolution & Natural Resources, New Brunswick, NJ, United States
- Barbra D Ferrell
- Univ. of Delaware, Delaware Biotechnology Inst., Newark, DE, United States
- K Eric Wommack
- Univ. of Delaware, Delaware Biotechnology Inst., Newark, DE, United States.
|
38
|
Abstract
The recovery of DNA from viromes is a major obstacle in the use of long-read sequencing to study their genomes. For this reason, the use of cellular metagenomes (>0.2-μm size range) emerges as an interesting complementary tool, since they contain large amounts of naturally amplified viral genomes from prelytic replication. We have applied second-generation (Illumina NextSeq; short reads) and third-generation (PacBio Sequel II; long reads) sequencing to compare the diversity and features of the viral community in a marine sample obtained from offshore waters of the western Mediterranean. We found that a major wedge of the expected marine viral diversity was directly recovered by the raw PacBio circular consensus sequencing (CCS) reads. More than 30,000 sequences were detected only in this data set, with no homologues in the long- and short-read assembly, and ca. 26,000 had no homologues in the large data set of the Global Ocean Virome 2 (GOV2), highlighting the information gap created by the assembly bias. At the level of complete viral genomes, the performance was similar in both approaches. However, the hybrid long- and short-read assembly provided the longest average length of the sequences and improved the host assignment. Although no novel major clades of viruses were found, there was an increase in the intraclade genomic diversity recovered by long reads that produced an enriched assessment of the real diversity and allowed the discovery of novel genes with biotechnological potential (e.g., endolysin genes). IMPORTANCE We explored the vast genetic diversity of environmental viruses by using a combination of cellular metagenome (as opposed to virome) sequencing using high-fidelity long-read sequences (in this case, PacBio CCS). This approach resulted in the recovery of a representative sample of the viral population, and it performed better (more phage contigs, larger average contig size) than Illumina sequencing applied to the same sample. 
By this approach, the many biases of assembly are avoided, as the CCS reads (typically around 5 kb) recover complete genes and even operons, resulting in a better discovery of the viral gene diversity based on viral marker proteins. Thus, biotechnologically promising genes, such as endolysin genes, can be very efficiently searched with this approach. In addition, hybrid assembly produces more complete and longer contigs, which is particularly important for studying little-known viral groups such as the nucleocytoplasmic large DNA viruses (NCLDV).
|
39
|
Smith SE, Huang W, Tiamani K, Unterer M, Khan Mirzaei M, Deng L. Emerging technologies in the study of the virome. Curr Opin Virol 2022; 54:101231. [DOI: 10.1016/j.coviro.2022.101231] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 02/18/2022] [Revised: 04/16/2022] [Accepted: 04/19/2022] [Indexed: 11/03/2022]
|
40
|
Luo E, Leu AO, Eppley JM, Karl DM, DeLong EF. Diversity and origins of bacterial and archaeal viruses on sinking particles reaching the abyssal ocean. ISME J 2022; 16:1627-1635. [PMID: 35236926 PMCID: PMC9122931 DOI: 10.1038/s41396-022-01202-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Received: 08/30/2021] [Revised: 01/09/2022] [Accepted: 01/25/2022] [Indexed: 11/15/2022]
Abstract
Sinking particles and particle-associated microbes influence global biogeochemistry through particulate matter export from the surface to the deep ocean. Despite ongoing studies of particle-associated microbes, viruses in these habitats remain largely unexplored. Whether, where, and which viruses might contribute to particle production and export remain open to investigation. In this study, we analyzed 857 virus population genomes associated with sinking particles collected over three years in sediment traps moored at 4000 m in the North Pacific Subtropical Gyre. Particle-associated viruses here were linked to cellular hosts through matches to bacterial and archaeal metagenome-assembled genome (MAG)-encoded prophages or CRISPR spacers, identifying novel viruses infecting presumptive deep-sea bacteria such as Colwellia, Moritella, and Shewanella. We also identified lytic viruses whose abundances correlated with particulate carbon flux and/or were exported from the photic to abyssal ocean, including cyanophages. Our data are consistent with some of the predicted outcomes of the viral shuttle hypothesis, and further suggest that viral lysis of both autotrophic and heterotrophic prokaryotes may play a role in carbon export. Our analyses revealed the diversity and origins of prevalent viruses found on deep-sea sinking particles and identified prospective viral groups for future investigation into processes that govern particle export in the open ocean.
Affiliation(s)
- Elaine Luo
- Daniel K. Inouye Center for Microbial Oceanography: Research and Education (C-MORE), University of Hawai'i at Manoa, Honolulu, HI, 96822, USA.
- Woods Hole Oceanographic Institution, 266 Woods Hole Road, MS 51, Woods Hole MA, 02543, Falmouth, USA.
- Andy O Leu
- Daniel K. Inouye Center for Microbial Oceanography: Research and Education (C-MORE), University of Hawai'i at Manoa, Honolulu, HI, 96822, USA
- Australia Center for Ecogenomics, University of Queensland, St. Lucia QLD, 4072, Australia
- John M Eppley
- Daniel K. Inouye Center for Microbial Oceanography: Research and Education (C-MORE), University of Hawai'i at Manoa, Honolulu, HI, 96822, USA
- David M Karl
- Daniel K. Inouye Center for Microbial Oceanography: Research and Education (C-MORE), University of Hawai'i at Manoa, Honolulu, HI, 96822, USA
- Edward F DeLong
- Daniel K. Inouye Center for Microbial Oceanography: Research and Education (C-MORE), University of Hawai'i at Manoa, Honolulu, HI, 96822, USA.
|
41
|
Spang A, Mahendrarajah TA, Offre P, Stairs CW. Evolving Perspective on the Origin and Diversification of Cellular Life and the Virosphere. Genome Biol Evol 2022; 14:evac034. [PMID: 35218347 PMCID: PMC9169541 DOI: 10.1093/gbe/evac034] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Accepted: 02/18/2022] [Indexed: 11/14/2022] Open
Abstract
The tree of life (TOL) is a powerful framework to depict the evolutionary history of cellular organisms through time, from our microbial origins to the diversification of multicellular eukaryotes that shape the visible biosphere today. During the past decades, our perception of the TOL has fundamentally changed, in part due to profound methodological advances, which allowed a more objective approach to studying organismal and viral diversity and led to the discovery of major new branches in the TOL as well as viral lineages. Phylogenetic and comparative genomics analyses of these data have, among other things, revolutionized our understanding of the deep roots and diversity of microbial life, the origin of the eukaryotic cell, eukaryotic diversity, and the origin and diversification of viruses. In this review, we provide an overview of some of the recent discoveries on the evolutionary history of cellular organisms and their viruses and discuss a variety of complementary techniques that we consider crucial for making further progress in our understanding of the TOL and its interconnection with the virosphere.
Affiliation(s)
- Anja Spang
- Department of Marine Microbiology and Biogeochemistry, NIOZ, Royal Netherlands Institute for Sea Research, Utrecht University, Den Burg, The Netherlands
- Department of Cell and Molecular Biology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
- Tara A Mahendrarajah
- Department of Marine Microbiology and Biogeochemistry, NIOZ, Royal Netherlands Institute for Sea Research, Utrecht University, Den Burg, The Netherlands
- Pierre Offre
- Department of Marine Microbiology and Biogeochemistry, NIOZ, Royal Netherlands Institute for Sea Research, Utrecht University, Den Burg, The Netherlands
- Courtney W Stairs
- Department of Biology, Microbiology research group, Lund University, Lund, Sweden
|
42
|
Newly identified HMO-2011-type phages reveal genomic diversity and biogeographic distributions of this marine viral group. ISME J 2022; 16:1363-1375. [PMID: 35022515 PMCID: PMC9038755 DOI: 10.1038/s41396-021-01183-7] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Received: 08/30/2021] [Revised: 12/17/2021] [Accepted: 12/23/2021] [Indexed: 12/13/2022]
Abstract
Viruses play critical roles in influencing biogeochemical cycles and adjusting host mortality, population structure, physiology, and evolution in the ocean. Marine viral communities are composed of numerous genetically distinct subfamily/genus-level viral groups. Among currently identified viral groups, the HMO-2011-type group is known to be dominant and broadly distributed. However, only four HMO-2011-type cultivated representatives that infect marine SAR116 and Roseobacter strains have been reported to date, and the genetic diversity, potential hosts, and ecology of this group remain poorly elucidated. Here, we present the genomes of seven HMO-2011-type phages that were isolated using four Roseobacter strains and one SAR11 strain, as well as an additional 207 HMO-2011-type metagenomic viral genomes (MVGs) identified from various marine viromes. Phylogenomic and shared-gene analyses revealed that the HMO-2011-type group is a subfamily-level group comprising at least 10 discernible genus-level subgroups. Moreover, >2000 HMO-2011-type DNA polymerase sequences were identified, and the DNA polymerase phylogeny also revealed that the HMO-2011-type group contains diverse subgroups and is globally distributed. Metagenomic read-mapping results further showed that most HMO-2011-type phages are prevalent in global oceans and display distinct geographic distributions, with the distribution of most HMO-2011-type phages being associated with temperature. Lastly, we found that members of subgroup IX, represented by pelagiphage HTVC033P, were among the most abundant HMO-2011-type phages, which implies that SAR11 bacteria are crucial hosts for this viral group. In summary, our findings substantially expand current knowledge regarding the phylogenetic diversity, evolution, and distribution of HMO-2011-type phages, highlighting HMO-2011-type phages as major ecological agents that can infect certain key bacterial groups.
43
Whole-genome sequencing and genetic characteristics of representative porcine reproductive and respiratory syndrome virus (PRRSV) isolates in Korea. Virol J 2022; 19:66. [PMID: 35410421 PMCID: PMC8996673 DOI: 10.1186/s12985-022-01790-6] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Received: 09/28/2021] [Accepted: 03/23/2022] [Indexed: 12/04/2022] Open
Abstract
Background: Porcine reproductive and respiratory syndrome virus (PRRSV) is a macrophage-tropic arterivirus with extremely high genetic and pathogenic heterogeneity that causes significant economic losses in the swine industry worldwide. PRRSV can be divided into two species [PRRSV1 (European) and PRRSV2 (North American)] and is usually diagnosed and genetically differentiated into several lineages based on the ORF5 gene, which constitutes only 5% of the whole genome. This study was conducted to achieve nonselective amplification and whole-genome sequencing (WGS) based on a simplified sequence-independent, single-primer amplification (SISPA) technique with next-generation sequencing (NGS), and to genetically characterize Korean PRRSV field isolates at the whole-genome level.
Methods: The SISPA-NGS method coupled with a bioinformatics pipeline was utilized to retrieve full-length PRRSV genomes of 19 representative Korean PRRSV strains by de novo assembly. Phylogenetic analysis, analysis of the insertion and deletion (INDEL) pattern of nonstructural protein 2 (NSP2), and recombination analysis were conducted.
Results: Nineteen complete PRRSV genomes were obtained with a high depth of coverage by the SISPA-NGS method. Korean PRRSV1 belonged to the Korean-specific subtype 1A and vaccine-related subtype 1C lineages, showing no evidence of recombination and divergent genetic heterogeneity with conserved NSP2 deletion patterns. Among Korean PRRSV2 isolates, modified live vaccine (MLV)-related lineage 5 viruses, lineage 1 viruses, and nation-specific Korean lineages (KOR A, B and C) could be identified. The NSP2 deletion pattern of the Korean lineages was consistent with that of the MN-184 strain (lineage 1), which indicates a common ancestor and independent evolution of the Korean lineages. Multiple recombination signals were detected from Korean-lineage strains isolated in the 2010s, suggesting natural interlineage recombination between circulating KOR C and MLV strains. Interestingly, the Korean strain GGYC45 was identified as a recombinant KOR C and MLV strain harboring the KOR B ORF5 gene and might be the ancestor of currently circulating KOR B strains. Additionally, two novel lineage 1 recombinants of NADC30-like and NADC34-like viruses were detected.
Conclusion: Genome-wide analysis of Korean PRRSV isolates retrieved by the SISPA-NGS method and de novo assembly revealed complex evolution and recombination in the field. Therefore, continuous surveillance of PRRSV at the whole-genome level should be conducted, and new vaccine strategies for more efficient control of the virus are needed.
Supplementary Information: The online version contains supplementary material available at 10.1186/s12985-022-01790-6.
44
Kieft K, Anantharaman K. Virus genomics: what is being overlooked? Curr Opin Virol 2022; 53:101200. [DOI: 10.1016/j.coviro.2022.101200] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 09/17/2021] [Revised: 12/21/2021] [Accepted: 01/03/2022] [Indexed: 01/05/2023]
45
Martinez-Hernandez F, Diop A, Garcia-Heredia I, Bobay LM, Martinez-Garcia M. Unexpected myriad of co-occurring viral strains and species in one of the most abundant and microdiverse viruses on Earth. THE ISME JOURNAL 2022; 16:1025-1035. [PMID: 34775488 PMCID: PMC8940918 DOI: 10.1038/s41396-021-01150-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 04/29/2021] [Revised: 10/15/2021] [Accepted: 10/28/2021] [Indexed: 11/09/2022]
Abstract
Viral genetic microdiversity drives adaptation, pathogenicity, and speciation and has critical consequences for the viral-host arms race occurring at the strain and species levels, which ultimately impact microbial community structure and biogeochemical cycles. Because most efforts have focused on viral macrodiversity, little is known about the microdiversity of ecologically important viruses on Earth. Recently, single-virus genomics discovered the putatively most abundant ocean virus in temperate and tropical waters: the uncultured dsDNA virus vSAG 37-F6 infecting Pelagibacter, the most abundant marine bacteria. In this study, we report the co-occurrence of up to ≈1,500 different viral strains (>95% nucleotide identity) and ≈30 related species (80-95% nucleotide identity) in a single oceanic sample. Viral microdiversity was maintained over space and time, and most alleles were the result of synonymous mutations without any apparent adaptive benefits to cope with host translation codon bias and efficiency. Gene flow analysis used to delimit species according to the biological species concept (BSC) revealed the impact of recombination in shaping vSAG 37-F6 virus and Pelagibacter speciation. Data demonstrated that this large viral microdiversity somehow mirrors the host species diversity since ≈50% of the 926 analyzed Pelagibacter genomes were found to belong to independent BSC species that do not significantly engage in gene flow with one another. The host range of this evolutionarily successful virus revealed that a single viral species can infect multiple Pelagibacter BSC species, indicating that this virus crosses not only formal BSC barriers but also biomes since viral ancestors are found in freshwater.
Affiliation(s)
- Awa Diop
  - Department of Biology, University of North Carolina at Greensboro, Greensboro, USA
- Louis-Marie Bobay
  - Department of Biology, University of North Carolina at Greensboro, Greensboro, USA
- Manuel Martinez-Garcia
  - Department of Physiology, Genetics, and Microbiology, University of Alicante, Alicante, Spain
46
Kolundžija S, Cheng DQ, Lauro FM. RNA Viruses in Aquatic Ecosystems through the Lens of Ecological Genomics and Transcriptomics. Viruses 2022; 14:702. [PMID: 35458432 PMCID: PMC9029791 DOI: 10.3390/v14040702] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 03/01/2022] [Revised: 03/19/2022] [Accepted: 03/23/2022] [Indexed: 02/04/2023] Open
Abstract
Massive amounts of data from nucleic acid sequencing have changed our perspective about diversity and dynamics of marine viral communities. Here, we summarize recent metatranscriptomic and metaviromic studies targeting predominantly RNA viral communities. The analysis of RNA viromes reaffirms the abundance of lytic (+) ssRNA viruses of the order Picornavirales, but also reveals other (+) ssRNA viruses, including RNA bacteriophages, as important constituents of extracellular RNA viral communities. Sequencing of dsRNA suggests unknown diversity of dsRNA viruses. Environmental metatranscriptomes capture the dynamics of ssDNA, dsDNA, ssRNA, and dsRNA viruses simultaneously, unravelling the full complexity of viral dynamics in the marine environment. RNA viruses are prevalent in large size fractions of environmental metatranscriptomes, actively infect marine unicellular eukaryotes larger than 3 µm, and can outnumber bacteriophages during phytoplankton blooms. DNA and RNA viruses change abundance on hourly timescales, implying viral control on a daily temporal basis. Metatranscriptomes of cultured protists host a diverse community of ssRNA and dsRNA viruses, often with multipartite genomes and possibly persistent intracellular lifestyles. We posit that RNA viral communities might be more diverse and complex than formerly anticipated and that the influence they exert on community composition and global carbon flows in aquatic ecosystems may be underestimated.
Affiliation(s)
- Sandra Kolundžija
  - Asian School of the Environment, Nanyang Technological University, 50 Nanyang Avenue, Singapore 639798, Singapore
- Dong-Qiang Cheng
  - Singapore Centre for Environmental Life Sciences Engineering, Nanyang Technological University, 60 Nanyang Drive, Singapore 637551, Singapore
- Federico M. Lauro
  - Asian School of the Environment, Nanyang Technological University, 50 Nanyang Avenue, Singapore 639798, Singapore
  - Singapore Centre for Environmental Life Sciences Engineering, Nanyang Technological University, 60 Nanyang Drive, Singapore 637551, Singapore
47
Antipov D, Rayko M, Kolmogorov M, Pevzner PA. viralFlye: assembling viruses and identifying their hosts from long-read metagenomics data. Genome Biol 2022; 23:57. [PMID: 35189932 PMCID: PMC8862349 DOI: 10.1186/s13059-021-02566-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 04/29/2021] [Accepted: 12/03/2021] [Indexed: 11/10/2022] Open
Abstract
Although long-read sequencing improves the contiguity of assembled viral genomes compared to short-read methods, assembling complex viral communities remains an open problem. We describe the viralFlye tool for identification and analysis of metagenome-assembled viruses in long-read assemblies. We show that it significantly improves viral assemblies and demonstrate that long reads yield a much larger array of predicted virus-host associations than short-read assemblies. We demonstrate that the identification of novel CRISPR arrays in bacterial genomes from a newly assembled metagenomic sample provides information for predicting novel hosts for novel viruses.
Affiliation(s)
- Dmitry Antipov
  - Center for Algorithmic Biotechnology, Saint Petersburg State University, Saint Petersburg, Russia
- Mikhail Rayko
  - Center for Algorithmic Biotechnology, Saint Petersburg State University, Saint Petersburg, Russia
- Mikhail Kolmogorov
  - Department of Computer Science and Engineering, University of California at San Diego, La Jolla, USA
- Pavel A Pevzner
  - Department of Computer Science and Engineering, University of California at San Diego, La Jolla, USA
48
Yang S, Zhao Q, Tang L, Chen Z, Wu Z, Li K, Lin R, Chen Y, Ou D, Zhou L, Xu J, Qin Q. Whole Genome Assembly of Human Papillomavirus by Nanopore Long-Read Sequencing. Front Genet 2022; 12:798608. [PMID: 35058971 PMCID: PMC8764290 DOI: 10.3389/fgene.2021.798608] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 10/20/2021] [Accepted: 12/01/2021] [Indexed: 02/05/2023] Open
Abstract
Human papillomavirus (HPV) is a causal agent for most cervical cancers. The physical status of the HPV genome in these cancers could be episomal, integrated, or both. HPV integration could serve as a biomarker for clinical diagnosis, treatment, and prognosis. Although whole-genome sequencing by next-generation sequencing (NGS) technologies, such as the Illumina sequencing platform, has been used for detecting the integrated HPV genome in cervical cancer, it faces challenges in analyzing long repeats and translocated sequences. In contrast, Oxford Nanopore sequencing technology can generate ultra-long reads, which could be a very useful tool for determining the HPV genome sequence and its physical status in cervical cancer. As a proof of concept, in this study, we completed whole-genome sequencing of a cervical cancer tissue and a CaSki cell line with Oxford Nanopore Technologies. From the cervical cancer tissue, a 7,894 bp-long HPV35 genomic sequence was assembled from 678 reads at 97-fold coverage of the HPV genome, sharing 99.96% identity with the HPV sequence obtained by Sanger sequencing. A 7,904 bp-long HPV16 genomic sequence was assembled from data generated from the CaSki cell line at 3,857-fold coverage, sharing 99.99% identity with the reference genome (NCBI: U89348). Intriguingly, long reads generated by nanopore sequencing directly revealed chimeric cellular-viral sequences and concatemeric genomic sequences, leading to the discovery of 448 unique integration breakpoints in the CaSki cell line and 60 breakpoints in the cervical cancer sample. Taken together, nanopore sequencing is a unique tool to identify HPV sequences and would shed light on the physical status of the HPV genome in its associated cancers.
Affiliation(s)
- Shuaibing Yang
  - Laboratory of Human Virology and Oncology, Shantou University Medical College, Shantou, China
- Qianqian Zhao
  - Computational Systems Biology Lab, Department of Bioinformatics, Shantou University Medical College, Shantou, China
- Lihua Tang
  - Department of Gynecologic Oncology, Cancer Hospital of Shantou University Medical College, Shantou, China
- Zejia Chen
  - Department of Gynecologic Oncology, Cancer Hospital of Shantou University Medical College, Shantou, China
- Zhaoting Wu
  - Department of Gynecologic Oncology, Cancer Hospital of Shantou University Medical College, Shantou, China
- Kaixin Li
  - Undergraduate Program of Innovation and Entrepreneurship, Shantou University Medical College, Shantou, China
- Ruoru Lin
  - Undergraduate Program of Innovation and Entrepreneurship, Shantou University Medical College, Shantou, China
- Yang Chen
  - Undergraduate Program of Innovation and Entrepreneurship, Shantou University Medical College, Shantou, China
- Danlin Ou
  - Undergraduate Program of Innovation and Entrepreneurship, Shantou University Medical College, Shantou, China
- Li Zhou
  - Department of Gynecologic Oncology, Cancer Hospital of Shantou University Medical College, Shantou, China
- Jianzhen Xu
  - Computational Systems Biology Lab, Department of Bioinformatics, Shantou University Medical College, Shantou, China
- Qingsong Qin
  - Laboratory of Human Virology and Oncology, Shantou University Medical College, Shantou, China
  - Guangdong Provincial Key Laboratory of Infectious Diseases and Molecular Immunopathology, Shantou, China
  - Guangdong Provincial Key Laboratory for Diagnosis and Treatment of Breast Cancer, Shantou, China
49
Abstract
The transcriptomes of Pseudomonas aeruginosa clone C isolates NN2 and SG17M during the mid-exponential and early stationary phase of planktonic growth were evaluated by direct RNA sequencing on the nanopore platform and compared with established short-read cDNA sequencing on the Illumina platform. Fifty to ninety percent of the sense RNAs turned out to be rRNA molecules followed by similar proportions of mRNA transcripts and non-coding RNAs. Both platforms detected similar proportions of uncharged tRNAs and 29 yet undescribed antisense tRNAs. For example, the rarest arginine codon was paired with the most abundant tRNAArg, and the tRNAArg gene is missing for the most frequent arginine codon. More than 90% of the antisense RNA molecules were complementary to a coding sequence. The antisense RNAs were evenly distributed in the genomes. Direct RNA sequencing identified more than 4,000 distinct non-overlapping antisense RNAs during exponential and stationary growth. Besides highly expressed small antisense RNAs less than 200 bases in size, a population of longer antisense RNAs was sequenced that covered a broad range of a few hundred to thousands of bases and could be complementary to a contig of several genes. In summary, direct RNA sequencing identified yet undescribed RNA molecules and an unexpected composition of the pools of tRNAs, sense and antisense RNAs. IMPORTANCE Genome-wide gene expression of bacteria is commonly studied by high-throughput sequencing of size-selected cDNA fragment libraries of reverse-transcribed RNA preparations. However, the depletion of ribosomal RNAs, enzymatic reverse transcription and the fragmentation, size selection and amplification during library preparation lead to inevitable losses of information about the initial composition of the RNA pool. We demonstrate that direct RNA sequencing on the nanopore platform can overcome these limitations. 
Nanopore sequencing of total RNA yielded novel insights into the Pseudomonas aeruginosa transcriptome that, if replicated in other species, will change our view of the bacterial RNA world. The discovery of sense-antisense pairs of tmRNA, tRNAs and mRNAs indicates a further and unknown level of gene regulation in bacteria.
50
Lin P, Jin T, Yu X, Liang L, Liu G, Jovic D, Sun Z, Yu Z, Pan J, Fan G. Composition and Dynamics of H1N1 and H7N9 Influenza A Virus Quasispecies in a Co-infected Patient Analyzed by Single Molecule Sequencing Technology. Front Genet 2021; 12:754445. [PMID: 34804122 PMCID: PMC8595946 DOI: 10.3389/fgene.2021.754445] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/06/2021] [Accepted: 09/10/2021] [Indexed: 11/22/2022] Open
Abstract
Co-infection of a human with the H1N1 and H7N9 subtypes of influenza A virus (IAV) causes a complex infectious disease. Identifying molecular-level variations in the composition and dynamics of IAV quasispecies will help to understand the pathogenesis and provide guidance for precision medicine treatment. In this study, using single-molecule real-time (SMRT) sequencing technology, we successfully acquired full-length IAV genomic sequences and quantified their genotype abundance in serial samples from an 81-year-old male co-infected with H1N1 and H7N9 subtype IAV. A total of 26 high-diversity nucleotide loci were detected, among which the A-G base transition was the most abundant substitution type (67% and 64% in H1N1 and H7N9, respectively). Seven significant amino acid variations were detected, such as NA:H275Y and HA:R222K in H1N1 as well as PB2:E627K and NA:K432E in H7N9, which are related to viral drug resistance or mammalian adaptation. Furthermore, we retrieved 25 H1N1 and 22 H7N9 genomic segment haplotypes from the eight samples by combining high-diversity nucleotide loci, which provided a more concise overview of viral quasispecies composition and dynamics. Our approach promotes wider use of viral quasispecies analysis in complex infectious diseases, which will improve the understanding of viral infection, pathogenesis, evolution, and precision medicine.
Affiliation(s)
- Peng Lin
  - College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
  - BGI-Qingdao, BGI-Shenzhen, Qingdao, China
- Tao Jin
  - BGI-Qingdao, BGI-Shenzhen, Qingdao, China
  - BGI-Shenzhen, Shenzhen, China
- Xinfen Yu
  - Hangzhou Center for Disease Control and Prevention, Hangzhou, China
- Guang Liu
  - BGI-Qingdao, BGI-Shenzhen, Qingdao, China
- Zhou Sun
  - Hangzhou Center for Disease Control and Prevention, Hangzhou, China
- Zhe Yu
  - BGI-Shenzhen, Shenzhen, China
- Jingcao Pan
  - Hangzhou Center for Disease Control and Prevention, Hangzhou, China
- Guangyi Fan
  - BGI-Qingdao, BGI-Shenzhen, Qingdao, China
  - BGI-Shenzhen, Shenzhen, China