1
|
Plewka J, Alibrandi A, Bornemann TLV, Esser SP, Stach TL, Sures K, Becker J, Moraru C, Soares A, di Primio R, Kallmeyer J, Probst AJ. Metagenomic analysis of pristine oil sheds new light on the global distribution of microbial genetic repertoire in hydrocarbon-associated ecosystems. MICROLIFE 2025; 6:uqae027. [PMID: 39877152 PMCID: PMC11774207 DOI: 10.1093/femsml/uqae027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/21/2024] [Revised: 10/23/2024] [Accepted: 01/22/2025] [Indexed: 01/31/2025]
Abstract
Oil reservoirs are society's primary source of hydrocarbons. While microbial communities in industrially exploited oil reservoirs have been investigated in the past, pristine microbial communities in untapped oil reservoirs are little explored, as are distribution patterns of respective genetic signatures. Here, we show that a pristine oil sample contains a complex community consisting of bacteria and fungi for the degradation of hydrocarbons. We identified microorganisms and their pathways for the degradation of methane, n-alkanes, mono-aromatic, and polycyclic aromatic compounds in a metagenome retrieved from biodegraded petroleum encountered in a subsurface reservoir in the Barents Sea. Capitalizing on marker genes from metagenomes and public data mining, we compared the prokaryotes, putative viruses, and putative plasmids of the sampled site to those from 10 other hydrocarbon-associated sites, revealing a shared network of species and genetic elements across the globe. To test for the potential dispersal of the microbes and predicted elements via seawater, we compared our findings to the Tara Ocean dataset, resulting in a broad distribution of prokaryotic and viral signatures. Although frequently shared between hydrocarbon-associated sites, putative plasmids, however, showed little coverage in the Tara Oceans dataset, suggesting an undiscovered mode of transfer between hydrocarbon-affected ecosystems. Based on our analyses, genetic information is globally shared between oil reservoirs and hydrocarbon-associated sites, and we propose that currents and other physical occurrences within the ocean along with deep aquifers are major distributors of prokaryotes and viruses into these subsurface ecosystems.
Collapse
Affiliation(s)
- Julia Plewka
- Environmental Metagenomics, Research Center One Health Ruhr of the University Alliance Ruhr, Faculty of Chemistry, University of Duisburg-Essen, 45141 Essen, Germany
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Cyclotron Road, Berkeley, CA 94720, United States of America
| | - Armando Alibrandi
- GFZ German Research Centre for Geoscience, Telegrafenberg, 14473 Potsdam, Germany
| | - Till L V Bornemann
- Environmental Metagenomics, Research Center One Health Ruhr of the University Alliance Ruhr, Faculty of Chemistry, University of Duisburg-Essen, 45141 Essen, Germany
| | - Sarah P Esser
- Environmental Metagenomics, Research Center One Health Ruhr of the University Alliance Ruhr, Faculty of Chemistry, University of Duisburg-Essen, 45141 Essen, Germany
| | - Tom L Stach
- Environmental Metagenomics, Research Center One Health Ruhr of the University Alliance Ruhr, Faculty of Chemistry, University of Duisburg-Essen, 45141 Essen, Germany
- Centre of Water and Environmental Research (ZWU), University of Duisburg-Essen, 45141 Essen, Germany
| | - Katharina Sures
- Environmental Metagenomics, Research Center One Health Ruhr of the University Alliance Ruhr, Faculty of Chemistry, University of Duisburg-Essen, 45141 Essen, Germany
| | - Jannis Becker
- Environmental Metagenomics, Research Center One Health Ruhr of the University Alliance Ruhr, Faculty of Chemistry, University of Duisburg-Essen, 45141 Essen, Germany
| | - Cristina Moraru
- Environmental Metagenomics, Research Center One Health Ruhr of the University Alliance Ruhr, Faculty of Chemistry, University of Duisburg-Essen, 45141 Essen, Germany
| | - André Soares
- Environmental Metagenomics, Research Center One Health Ruhr of the University Alliance Ruhr, Faculty of Chemistry, University of Duisburg-Essen, 45141 Essen, Germany
| | | | - Jens Kallmeyer
- GFZ German Research Centre for Geoscience, Telegrafenberg, 14473 Potsdam, Germany
| | - Alexander J Probst
- Environmental Metagenomics, Research Center One Health Ruhr of the University Alliance Ruhr, Faculty of Chemistry, University of Duisburg-Essen, 45141 Essen, Germany
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Cyclotron Road, Berkeley, CA 94720, United States of America
- Centre of Water and Environmental Research (ZWU), University of Duisburg-Essen, 45141 Essen, Germany
- Centre of Medical Biotechnology (ZMB), University of Duisburg-Essen, 45141 Essen, Germany
| |
Collapse
|
2
|
Ren M, Hu A, Zhang L, Yao X, Zhao Z, Kimirei IA, Wang J. Acidic proteomes are linked to microbial alkaline preference in African lakes. WATER RESEARCH 2024; 266:122393. [PMID: 39243463 DOI: 10.1016/j.watres.2024.122393] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/11/2024] [Revised: 08/28/2024] [Accepted: 09/03/2024] [Indexed: 09/09/2024]
Abstract
Microbial amino acid composition (AA) reflects adaptive strategies of cellular and molecular regulations such as a high proportion of acidic AAs, including glutamic and aspartic acids in alkaliphiles. It remains understudied how microbial AA content is linked to their pH adaptation especially in natural environments. Here we examined prokaryotic communities and their AA composition of genes with metagenomics for 39 water and sediments of East African lakes along a gradient of pH spanning from 7.2 to 10.1. We found that Shannon diversity declined with the increasing pH and that species abundance were either positively or negatively associated with pH, indicating their distinct habitat preference in lakes. Microbial communities showed higher acidic proteomes in alkaline than neutral lakes. Species acidic proteomes were also positively correlated with their pH preference, which was consistent across major bacterial lineages. These results suggest selective pressure associated with high pH likely shape microbial amino acid composition both at the species and community levels. Comparative genome analyses further revealed that alkaliphilic microbes contained more functional genes with higher acidic AAs when compared to those in neutral conditions. These traits included genes encoding diverse classes of cation transmembrane transporters, antiporters, and compatible solute transporters, which are involved in cytoplasmic pH homeostasis and osmotic stress defense under high pH conditions. Our results provide the field evidence for the strong relationship between prokaryotic AA composition and their habitat preference and highlight amino acid optimization as strategies for environmental adaptation.
Collapse
Affiliation(s)
- Minglei Ren
- Key Laboratory of Lake and Watershed Science for Water Security, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, Nanjing 210008, China; State Key Laboratory of Lake Science and Environment, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, Nanjing 210008, China
| | - Ang Hu
- Key Laboratory of Lake and Watershed Science for Water Security, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, Nanjing 210008, China; State Key Laboratory of Lake Science and Environment, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, Nanjing 210008, China
| | - Lu Zhang
- Key Laboratory of Lake and Watershed Science for Water Security, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, Nanjing 210008, China; State Key Laboratory of Lake Science and Environment, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, Nanjing 210008, China
| | - Xiaolong Yao
- Key Laboratory of Lake and Watershed Science for Water Security, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, Nanjing 210008, China; State Key Laboratory of Lake Science and Environment, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, Nanjing 210008, China
| | - Zhonghua Zhao
- Key Laboratory of Lake and Watershed Science for Water Security, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, Nanjing 210008, China; State Key Laboratory of Lake Science and Environment, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, Nanjing 210008, China
| | - Ismael Aaron Kimirei
- Tanzania Fisheries Research Institute-Headquarter, Dar Es Salaam P.O. Box 9750, Tanzania
| | - Jianjun Wang
- Key Laboratory of Lake and Watershed Science for Water Security, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, Nanjing 210008, China; State Key Laboratory of Lake Science and Environment, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, Nanjing 210008, China.
| |
Collapse
|
3
|
Lui LM, Nielsen TN. Decomposing a San Francisco estuary microbiome using long-read metagenomics reveals species- and strain-level dominance from picoeukaryotes to viruses. mSystems 2024; 9:e0024224. [PMID: 39158287 PMCID: PMC11406994 DOI: 10.1128/msystems.00242-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2024] [Accepted: 07/11/2024] [Indexed: 08/20/2024] Open
Abstract
Although long-read sequencing has enabled obtaining high-quality and complete genomes from metagenomes, many challenges still remain to completely decompose a metagenome into its constituent prokaryotic and viral genomes. This study focuses on decomposing an estuarine metagenome to obtain a more accurate estimate of microbial diversity. To achieve this, we developed a new bead-based DNA extraction method, a novel bin refinement method, and obtained 150 Gbp of Nanopore sequencing. We estimate that there are ~500 bacterial and archaeal species in our sample and obtained 68 high-quality bins (>90% complete, <5% contamination, ≤5 contigs, contig length of >100 kbp, and all ribosomal and tRNA genes). We also obtained many contigs of picoeukaryotes, environmental DNA of larger eukaryotes such as mammals, and complete mitochondrial and chloroplast genomes and detected ~40,000 viral populations. Our analysis indicates that there are only a few strains that comprise most of the species abundances. IMPORTANCE Ocean and estuarine microbiomes play critical roles in global element cycling and ecosystem function. Despite the importance of these microbial communities, many species still have not been cultured in the lab. Environmental sequencing is the primary way the function and population dynamics of these communities can be studied. Long-read sequencing provides an avenue to overcome limitations of short-read technologies to obtain complete microbial genomes but comes with its own technical challenges, such as needed sequencing depth and obtaining high-quality DNA. We present here new sampling and bioinformatics methods to attempt decomposing an estuarine microbiome into its constituent genomes. Our results suggest there are only a few strains that comprise most of the species abundances from viruses to picoeukaryotes, and to fully decompose a metagenome of this diversity requires 1 Tbp of long-read sequencing. We anticipate that as long-read sequencing technologies continue to improve, less sequencing will be needed.
Collapse
Affiliation(s)
- Lauren M Lui
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Torben N Nielsen
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| |
Collapse
|
4
|
Hiralal A, Geelhoed JS, Hidalgo-Martinez S, Smets B, van Dijk JR, Meysman FJR. Closing the genome of unculturable cable bacteria using a combined metagenomic assembly of long and short sequencing reads. Microb Genom 2024; 10:001197. [PMID: 38376381 PMCID: PMC10926707 DOI: 10.1099/mgen.0.001197] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Accepted: 01/23/2024] [Indexed: 02/21/2024] Open
Abstract
Many environmentally relevant micro-organisms cannot be cultured, and even with the latest metagenomic approaches, achieving complete genomes for specific target organisms of interest remains a challenge. Cable bacteria provide a prominent example of a microbial ecosystem engineer that is currently unculturable. They occur in low abundance in natural sediments, but due to their capability for long-distance electron transport, they exert a disproportionately large impact on the biogeochemistry of their environment. Current available genomes of marine cable bacteria are highly fragmented and incomplete, hampering the elucidation of their unique electrogenic physiology. Here, we present a metagenomic pipeline that combines Nanopore long-read and Illumina short-read shotgun sequencing. Starting from a clonal enrichment of a cable bacterium, we recovered a circular metagenome-assembled genome (5.09 Mbp in size), which represents a novel cable bacterium species with the proposed name Candidatus Electrothrix scaldis. The closed genome contains 1109 novel identified genes, including key metabolic enzymes not previously described in incomplete genomes of cable bacteria. We examined in detail the factors leading to genome closure. Foremost, native, non-amplified long reads are crucial to resolve the many repetitive regions within the genome of cable bacteria, and by analysing the whole metagenomic assembly, we found that low strain diversity is key for achieving genome closure. The insights and approaches presented here could help achieve genome closure for other keystone micro-organisms present in complex environmental samples at low abundance.
Collapse
Affiliation(s)
- Anwar Hiralal
- Geobiology Research Group, University of Antwerp, Antwerp, Belgium
| | | | | | - Bent Smets
- Geobiology Research Group, University of Antwerp, Antwerp, Belgium
| | | | - Filip J. R. Meysman
- Geobiology Research Group, University of Antwerp, Antwerp, Belgium
- Department of Biotechnology, Delft University of Technology, Delft, Netherlands
| |
Collapse
|
5
|
Booker AE, D'Angelo T, Adams-Beyea A, Brown JM, Nigro O, Rappé MS, Stepanauskas R, Orcutt BN. Life strategies for Aminicenantia in subseafloor oceanic crust. THE ISME JOURNAL 2023; 17:1406-1415. [PMID: 37328571 PMCID: PMC10432499 DOI: 10.1038/s41396-023-01454-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Revised: 04/11/2023] [Accepted: 04/17/2023] [Indexed: 06/18/2023]
Abstract
After decades studying the microbial "deep biosphere" in subseafloor oceanic crust, the growth and life strategies in this anoxic, low energy habitat remain poorly described. Using both single cell genomics and metagenomics, we reveal the life strategies of two distinct lineages of uncultivated Aminicenantia bacteria from the basaltic subseafloor oceanic crust of the eastern flank of the Juan de Fuca Ridge. Both lineages appear adapted to scavenge organic carbon, as each have genetic potential to catabolize amino acids and fatty acids, aligning with previous Aminicenantia reports. Given the organic carbon limitation in this habitat, seawater recharge and necromass may be important carbon sources for heterotrophic microorganisms inhabiting the ocean crust. Both lineages generate ATP via several mechanisms including substrate-level phosphorylation, anaerobic respiration, and electron bifurcation driving an Rnf ion translocation membrane complex. Genomic comparisons suggest these Aminicenantia transfer electrons extracellularly, perhaps to iron or sulfur oxides consistent with mineralogy of this site. One lineage, called JdFR-78, has small genomes that are basal to the Aminicenantia class and potentially use "primordial" siroheme biosynthetic intermediates for heme synthesis, suggesting this lineage retain characteristics of early evolved life. Lineage JdFR-78 contains CRISPR-Cas defenses to evade viruses, while other lineages contain prophage that may help prevent super-infection or no detectable viral defenses. Overall, genomic evidence points to Aminicenantia being well adapted to oceanic crust environments by taking advantage of simple organic molecules and extracellular electron transport.
Collapse
Affiliation(s)
- Anne E Booker
- Bigelow Laboratory for Ocean Sciences, East Boothbay, ME, USA
| | | | - Annabelle Adams-Beyea
- Bigelow Laboratory for Ocean Sciences, East Boothbay, ME, USA
- Eugene Lang College of Liberal Arts at The New School, New York City, NY, USA
| | - Julia M Brown
- Bigelow Laboratory for Ocean Sciences, East Boothbay, ME, USA
| | - Olivia Nigro
- Department of Natural Science, Hawai'i Pacific University, Honolulu, HI, USA
| | - Michael S Rappé
- Hawai'i Institute of Marine Biology, SOEST, University of Hawai'i at Mānoa, Kāne'ohe, HI, USA
| | | | - Beth N Orcutt
- Bigelow Laboratory for Ocean Sciences, East Boothbay, ME, USA.
| |
Collapse
|
6
|
DeSantis TZ, Cardona C, Narayan NR, Viswanatham S, Ravichandar D, Wee B, Chow CE, Iwai S. StrainSelect: A novel microbiome reference database that disambiguates all bacterial strains, genome assemblies and extant cultures worldwide. Heliyon 2023; 9:e13314. [PMID: 36814618 PMCID: PMC9939595 DOI: 10.1016/j.heliyon.2023.e13314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Revised: 12/01/2022] [Accepted: 01/26/2023] [Indexed: 02/05/2023] Open
Abstract
Motivation: Microbial metagenomic profiling software and databases are advancing rapidly for development of novel disease biomarkers and therapeutics yet three problems impede analyses: 1) the conflation of "genome assembly" and "strain" in reference databases; 2) difficulty connecting DNA biomarkers to a procurable strain for laboratory experimentation; and 3) absence of a comprehensive and unified strain-resolved reference database for integrating both shotgun metagenomics and 16S rRNA gene data. Results: We demarcated 681,087 strains, the largest collection of its kind, by filtering public data into a knowledge graph of vertices representing contiguous DNA sequences, genome assemblies, strain monikers and bio-resource center (BRC) catalog numbers then adding inter-vertex edges only for synonyms or direct derivatives. Surprisingly, for 10,043 important strains, we found replicate RefSeq genome assemblies obstructing interpretation of database searches. We organized each strain into eight taxonomic ranks with bootstrap confidence inversely correlated with genome assembly contamination. The StrainSelect database is suited for applications where a taxonomic, functional or procurement reference is needed for shotgun or amplicon metagenomics since 636,568 strains have at least one 16S rRNA gene, 245,005 have at least one annotated genome assembly, and 36,671 are procurable from at least one BRC. The database overcomes all three aforementioned problems since it disambiguates strains from assemblies, locates strains at BRCs, and unifies a taxonomic reference for both 16S rRNA and shotgun metagenomics. Availability: The StrainSelect database is available in igraph and tabular vertex-edge formats compatible with Neo4J. Dereplicated MinHash and fasta databases are distributed for sourmash and usearch pipelines at http://strainselect.secondgenome.com. Contact:todd.desantis@gmail.com. Supplementary information: Supplementary data are available online.
Collapse
Affiliation(s)
- Todd Z. DeSantis
- Second Genome, Inc., 1000 Marina Blvd, Suite 500, Brisbane, CA, 94005, USA,Environmental Metagenomics, Research Center One Health Ruhr of the University Alliance Ruhr, Faculty of Chemistry, University of Duisburg-Essen, Germany,Corresponding author at: Second Genome, Inc., 1000 Marina Blvd, Suite 500, Brisbane, CA, 94005, USA.
| | - Cesar Cardona
- Second Genome, Inc., 1000 Marina Blvd, Suite 500, Brisbane, CA, 94005, USA
| | - Nicole R. Narayan
- Second Genome, Inc., 1000 Marina Blvd, Suite 500, Brisbane, CA, 94005, USA
| | - Satish Viswanatham
- Second Genome, Inc., 1000 Marina Blvd, Suite 500, Brisbane, CA, 94005, USA
| | - Divya Ravichandar
- Second Genome, Inc., 1000 Marina Blvd, Suite 500, Brisbane, CA, 94005, USA
| | - Brendan Wee
- Second Genome, Inc., 1000 Marina Blvd, Suite 500, Brisbane, CA, 94005, USA
| | | | - Shoko Iwai
- Second Genome, Inc., 1000 Marina Blvd, Suite 500, Brisbane, CA, 94005, USA
| |
Collapse
|
7
|
Ajiboye TT, Ajiboye TO, Babalola OO. Impacts of Binary Oxide Nanoparticles on the Soybean Plant and Its Rhizosphere, Associated Phytohormones, and Enzymes. Molecules 2023; 28:1326. [PMID: 36770994 PMCID: PMC9919940 DOI: 10.3390/molecules28031326] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Revised: 01/25/2023] [Accepted: 01/25/2023] [Indexed: 01/31/2023] Open
Abstract
The utilization of binary oxide nanoparticles is geometrically increasing due to their numerous applications. Their intentional or accidental release after usage has led to their omnipresence in the environment. The usage of sludge or fertilizer containing binary oxide nanoparticles is likely to increase the chance of the plants being exposed to these binary oxide nanoparticles. The aim of the present review is to assess the detailed positive and negative impacts of these oxide nanoparticles on the soybean plants and its rhizosphere. In this study, methods of synthesizing binary oxide nanoparticles, as well as the merits and demerits of these methods, are discussed. Furthermore, various methods of characterizing the binary oxide nanoparticles in the tissues of soybean are highlighted. These characterization techniques help to track the nanoparticles inside the soybean plant. In addition, the assessment of rhizosphere microbial communities of soybean that have been exposed to these binary oxide nanoparticles is discussed. The impacts of binary oxide nanoparticles on the leaf, stem, root, seeds, and rhizosphere of soybean plant are comprehensively discussed. The impacts of binary oxides on the bioactive compounds such as phytohormones are also highlighted. Overall, it was observed that the impacts of the oxide nanoparticles on the soybean, rhizosphere, and bioactive compounds were dose-dependent. Lastly, the way forward on research involving the interactions of binary oxide nanoparticles and soybean plants is suggested.
Collapse
Affiliation(s)
- Titilope Tinu Ajiboye
- Food Security and Safety Niche Area, Faculty of Natural and Agricultural Sciences, North-West University, Private Bag X2046, Mmabatho 2735, South Africa
| | - Timothy Oladiran Ajiboye
- Chemistry Department, Nelson Mandela University, University Way, Summerstrand, Gqeberha 6019, South Africa
| | - Olubukola Oluranti Babalola
- Food Security and Safety Niche Area, Faculty of Natural and Agricultural Sciences, North-West University, Private Bag X2046, Mmabatho 2735, South Africa
| |
Collapse
|
8
|
Debroas D, Hochart C, Galand PE. Seasonal microbial dynamics in the ocean inferred from assembled and unassembled data: a view on the unknown biosphere. ISME COMMUNICATIONS 2022; 2:87. [PMID: 37938749 PMCID: PMC9723795 DOI: 10.1038/s43705-022-00167-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Revised: 08/23/2022] [Accepted: 09/02/2022] [Indexed: 11/09/2023]
Abstract
In environmental metagenomic experiments, a very high proportion of the microbial sequencing data (> 70%) remains largely unexploited because rare and closely related genomes are missed in short-read assemblies. The identity and the potential metabolisms of a large fraction of natural microbial communities thus remain inaccessible to researchers. The purpose of this study was to explore the genomic content of unassembled metagenomic data and test their level of novelty. We used data from a three-year microbial metagenomic time series of the NW Mediterranean Sea, and conducted reference-free and database-guided analysis. The results revealed a significant genomic difference between the assembled and unassembled reads. The unassembled reads had a lower mean identity against public databases, and fewer metabolic pathways could be reconstructed. In addition, the unassembled fraction presented a clear temporal pattern, unlike the assembled ones, and a specific community composition that was similar to the rare communities defined by metabarcoding using the 16S rRNA gene. The rare gene pool was characterised by keystone bacterial taxa, and the presence of viruses, suggesting that viral lysis could maintain some taxa in a state of rarity. Our study demonstrates that unassembled metagenomic data can provide important information on the structure and functioning of microbial communities.
Collapse
Affiliation(s)
- Didier Debroas
- Université Clermont Auvergne, CNRS, Laboratoire Microorganismes: Genome et Environnement, 63000, Clermont-Ferrand, France.
| | - Corentin Hochart
- Sorbonne Universités, CNRS, Laboratoire d'Ecogéochimie des Environnements Benthiques (LECOB), Observatoire Océanologique de Banyuls, Banyuls sur Mer, France
| | - Pierre E Galand
- Sorbonne Universités, CNRS, Laboratoire d'Ecogéochimie des Environnements Benthiques (LECOB), Observatoire Océanologique de Banyuls, Banyuls sur Mer, France
| |
Collapse
|
9
|
Sereika M, Kirkegaard RH, Karst SM, Michaelsen TY, Sørensen EA, Wollenberg RD, Albertsen M. Oxford Nanopore R10.4 long-read sequencing enables the generation of near-finished bacterial genomes from pure cultures and metagenomes without short-read or reference polishing. Nat Methods 2022; 19:823-826. [PMID: 35789207 PMCID: PMC9262707 DOI: 10.1038/s41592-022-01539-7] [Citation(s) in RCA: 211] [Impact Index Per Article: 70.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Accepted: 05/24/2022] [Indexed: 12/26/2022]
Abstract
Long-read Oxford Nanopore sequencing has democratized microbial genome sequencing and enables the recovery of highly contiguous microbial genomes from isolates or metagenomes. However, to obtain near-finished genomes it has been necessary to include short-read polishing to correct insertions and deletions derived from homopolymer regions. Here, we show that Oxford Nanopore R10.4 can be used to generate near-finished microbial genomes from isolates or metagenomes without short-read or reference polishing.
Collapse
Affiliation(s)
- Mantas Sereika
- Center for Microbial Communities, Aalborg University, Aalborg, Denmark
| | - Rasmus Hansen Kirkegaard
- Center for Microbial Communities, Aalborg University, Aalborg, Denmark.,Joint Microbiome Facility, University of Vienna, Vienna, Austria
| | | | | | | | | | - Mads Albertsen
- Center for Microbial Communities, Aalborg University, Aalborg, Denmark.
| |
Collapse
|
10
|
Czech L, Stamatakis A, Dunthorn M, Barbera P. Metagenomic Analysis Using Phylogenetic Placement-A Review of the First Decade. FRONTIERS IN BIOINFORMATICS 2022; 2:871393. [PMID: 36304302 PMCID: PMC9580882 DOI: 10.3389/fbinf.2022.871393] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2022] [Accepted: 04/11/2022] [Indexed: 12/20/2022] Open
Abstract
Phylogenetic placement refers to a family of tools and methods to analyze, visualize, and interpret the tsunami of metagenomic sequencing data generated by high-throughput sequencing. Compared to alternative (e. g., similarity-based) methods, it puts metabarcoding sequences into a phylogenetic context using a set of known reference sequences and taking evolutionary history into account. Thereby, one can increase the accuracy of metagenomic surveys and eliminate the requirement for having exact or close matches with existing sequence databases. Phylogenetic placement constitutes a valuable analysis tool per se, but also entails a plethora of downstream tools to interpret its results. A common use case is to analyze species communities obtained from metagenomic sequencing, for example via taxonomic assignment, diversity quantification, sample comparison, and identification of correlations with environmental variables. In this review, we provide an overview over the methods developed during the first 10 years. In particular, the goals of this review are 1) to motivate the usage of phylogenetic placement and illustrate some of its use cases, 2) to outline the full workflow, from raw sequences to publishable figures, including best practices, 3) to introduce the most common tools and methods and their capabilities, 4) to point out common placement pitfalls and misconceptions, 5) to showcase typical placement-based analyses, and how they can help to analyze, visualize, and interpret phylogenetic placement data.
Collapse
Affiliation(s)
- Lucas Czech
- Department of Plant Biology, Carnegie Institution for Science, Stanford, CA, United States
| | - Alexandros Stamatakis
- Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
- Institute for Theoretical Informatics, Karlsruhe Institute of Technology, Karlsruhe, Germany
| | - Micah Dunthorn
- Natural History Museum, University of Oslo, Oslo, Norway
| | | |
Collapse
|
11
|
Jin H, You L, Zhao F, Li S, Ma T, Kwok LY, Xu H, Sun Z. Hybrid, ultra-deep metagenomic sequencing enables genomic and functional characterization of low-abundance species in the human gut microbiome. Gut Microbes 2022; 14:2021790. [PMID: 35067170 PMCID: PMC8786330 DOI: 10.1080/19490976.2021.2021790] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Revised: 12/03/2021] [Accepted: 12/13/2021] [Indexed: 02/04/2023] Open
Abstract
A large number of microbial genomes have already been identified from the human gut microbiome, but the understanding of the role of the low-abundance species at the individual level remains challenging, largely due to the relatively shallow sequencing depth used in most studies. To improve genome assembling performance, a HiSeq-PacBio hybrid, ultra-deep metagenomic sequencing approach was used to reconstruct metagenomic-assembled genomes (MAGs) from 12 fecal samples. Such approach combined third-generation sequencing with ultra-deep second-generation sequencing to improve the sequencing coverage of the low-abundance subpopulation in the gut microbiome. Our study generated a total of 44 megabase-scale scaffolds, achieving four single-scaffolds of complete (circularized, no gaps) MAGs (CMAGs) that were the first circular genomes of their species. Moreover, 475 high-quality MAGs were assembled across all samples. Among them, 234 MAGs were currently uncultured, including 24 MAGs that were not found in any public genome database. Additionally, 287 and 77 MAGs were classified as low-abundance (0.1-1%) and extra-low-abundance (<0.1%) gut species in each individual, respectively. Our results also revealed individual-specific genomic features in the MAG profiles, including microbial genome growth rate, selective pressure, and frequency of chromosomal mobile genetic elements. Finally, thousands of extrachromosomal mobile genetic elements were identified from the metagenomic data, including 5097 bacteriophages and 79 novel plasmid genomes. Overall, our strategy represents an important step toward comprehensive genomic and functional characterization of the human gut microbiome at an individual level.
Collapse
Affiliation(s)
- Hao Jin
- Key Laboratory of Dairy Biotechnology and Engineering, Ministry of Education, Key Laboratory of Dairy Products Processing, Ministry of Agriculture and Rural Affairs, Inner Mongolia Key Laboratory of Dairy Biotechnology and Engineering, Inner Mongolia Agricultural University, Hohhot, China
| | - Lijun You
- Key Laboratory of Dairy Biotechnology and Engineering, Ministry of Education, Key Laboratory of Dairy Products Processing, Ministry of Agriculture and Rural Affairs, Inner Mongolia Key Laboratory of Dairy Biotechnology and Engineering, Inner Mongolia Agricultural University, Hohhot, China
| | - Feiyan Zhao
- Key Laboratory of Dairy Biotechnology and Engineering, Ministry of Education, Key Laboratory of Dairy Products Processing, Ministry of Agriculture and Rural Affairs, Inner Mongolia Key Laboratory of Dairy Biotechnology and Engineering, Inner Mongolia Agricultural University, Hohhot, China
| | - Shenghui Li
- Key Laboratory of Dairy Biotechnology and Engineering, Ministry of Education, Key Laboratory of Dairy Products Processing, Ministry of Agriculture and Rural Affairs, Inner Mongolia Key Laboratory of Dairy Biotechnology and Engineering, Inner Mongolia Agricultural University, Hohhot, China
| | - Teng Ma
- Key Laboratory of Dairy Biotechnology and Engineering, Ministry of Education, Key Laboratory of Dairy Products Processing, Ministry of Agriculture and Rural Affairs, Inner Mongolia Key Laboratory of Dairy Biotechnology and Engineering, Inner Mongolia Agricultural University, Hohhot, China
| | - Lai-Yu Kwok
- Key Laboratory of Dairy Biotechnology and Engineering, Ministry of Education, Key Laboratory of Dairy Products Processing, Ministry of Agriculture and Rural Affairs, Inner Mongolia Key Laboratory of Dairy Biotechnology and Engineering, Inner Mongolia Agricultural University, Hohhot, China
| | - Haiyan Xu
- Key Laboratory of Dairy Biotechnology and Engineering, Ministry of Education, Key Laboratory of Dairy Products Processing, Ministry of Agriculture and Rural Affairs, Inner Mongolia Key Laboratory of Dairy Biotechnology and Engineering, Inner Mongolia Agricultural University, Hohhot, China
| | - Zhihong Sun
- Key Laboratory of Dairy Biotechnology and Engineering, Ministry of Education, Key Laboratory of Dairy Products Processing, Ministry of Agriculture and Rural Affairs, Inner Mongolia Key Laboratory of Dairy Biotechnology and Engineering, Inner Mongolia Agricultural University, Hohhot, China
| |
Collapse
|
12
|
Wu C, Yin Y, Zhu L, Zhang Y, Li YZ. Metagenomic sequencing-driven multidisciplinary approaches to shed light on the untapped microbial natural products. Drug Discov Today 2021; 27:730-742. [PMID: 34775105 DOI: 10.1016/j.drudis.2021.11.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2020] [Revised: 10/07/2021] [Accepted: 11/08/2021] [Indexed: 11/17/2022]
Abstract
The advantage of metagenomics over the culture-based natural product (NP) discovery pipeline is the ability to access the biosynthetic potential of uncultivable microbes. Advances in DNA sequencing are revolutionizing conventional metagenomics approaches for microbial NP discovery. The genomes of (in)cultivable bugs can be resolved straightforwardly from environmental samples, enabling in situ prediction of biosynthetic gene clusters (BGCs). The predicted chemical diversities could be realized not only by heterologous expression of gene clusters originating from DNA synthesis or direct cloning, but also potentially by bioinformatic-directed organic synthesis or chemoenzymatic total synthesis. In this review, we suggest that metagenomic sequencing in tandem with multidisciplinary approaches will form a versatile platform to shed light on a plethora of microbial 'dark matter'.
Collapse
Affiliation(s)
- Changsheng Wu
- State Key Laboratory of Microbial Technology, Institute of Microbial Technology, Shandong University, Qingdao 266237, China.
| | - Yizhen Yin
- State Key Laboratory of Microbial Technology, Institute of Microbial Technology, Shandong University, Qingdao 266237, China
| | - Lele Zhu
- State Key Laboratory of Microbial Technology, Institute of Microbial Technology, Shandong University, Qingdao 266237, China
| | - Youming Zhang
- State Key Laboratory of Microbial Technology, Institute of Microbial Technology, Shandong University, Qingdao 266237, China
| | - Yue-Zhong Li
- State Key Laboratory of Microbial Technology, Institute of Microbial Technology, Shandong University, Qingdao 266237, China.
| |
Collapse
|
13
|
Distinct methane-dependent biogeochemical states in Arctic seafloor gas hydrate mounds. Nat Commun 2021; 12:6296. [PMID: 34728618 PMCID: PMC8563959 DOI: 10.1038/s41467-021-26549-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2018] [Accepted: 09/27/2021] [Indexed: 01/04/2023] Open
Abstract
Archaea mediating anaerobic methane oxidation are key in preventing methane produced in marine sediments from reaching the hydrosphere; however, a complete understanding of how microbial communities in natural settings respond to changes in the flux of methane remains largely uncharacterized. We investigate microbial communities in gas hydrate-bearing seafloor mounds at Storfjordrenna, offshore Svalbard in the high Arctic, where we identify distinct methane concentration profiles that include steady-state, recently-increasing subsurface diffusive flux, and active gas seepage. Populations of anaerobic methanotrophs and sulfate-reducing bacteria were highest at the seep site, while decreased community diversity was associated with a recent increase in methane influx. Despite high methane fluxes and methanotroph doubling times estimated at 5-9 months, microbial community responses were largely synchronous with the advancement of methane into shallower sediment horizons. Together, these provide a framework for interpreting subseafloor microbial responses to methane escape in a warming Arctic Ocean.
Collapse
|
14
|
Stable-Isotope-Informed, Genome-Resolved Metagenomics Uncovers Potential Cross-Kingdom Interactions in Rhizosphere Soil. mSphere 2021; 6:e0008521. [PMID: 34468166 PMCID: PMC8550312 DOI: 10.1128/msphere.00085-21] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
The functioning, health, and productivity of soil are intimately tied to a complex network of interactions, particularly in plant root-associated rhizosphere soil. We conducted a stable-isotope-informed, genome-resolved metagenomic study to trace carbon from Avena fatua grown in a 13CO2 atmosphere into soil. We collected paired rhizosphere and nonrhizosphere soil at 6 and 9 weeks of plant growth and extracted DNA that was then separated by density using ultracentrifugation. Thirty-two fractions from each of five samples were grouped by density, sequenced, assembled, and binned to generate 55 unique bacterial genomes that were ≥70% complete. We also identified complete 18S rRNA sequences of several 13C-enriched microeukaryotic bacterivores and fungi. We generated 10 circularized bacteriophage (phage) genomes, some of which were the most labeled entities in the rhizosphere, suggesting that phage may be important agents of turnover of plant-derived C in soil. CRISPR locus targeting connected one of these phage to a Burkholderiales host predicted to be a plant pathogen. Another highly labeled phage is predicted to replicate in a Catenulispora sp., a possible plant growth-promoting bacterium. We searched the genome bins for traits known to be used in interactions involving bacteria, microeukaryotes, and plant roots and found DNA from heavily 13C-labeled bacterial genes thought to be involved in modulating plant signaling hormones, plant pathogenicity, and defense against microeukaryote grazing. Stable-isotope-informed, genome-resolved metagenomics indicated that phage can be important agents of turnover of plant-derived carbon in soil. IMPORTANCE Plants grow in intimate association with soil microbial communities; these microbes can facilitate the availability of essential resources to plants. Thus, plant productivity commonly depends on interactions with rhizosphere bacteria, viruses, and eukaryotes. Our work is significant because we identified the organisms that took up plant-derived organic C in rhizosphere soil and determined that many of the active bacteria are plant pathogens or can impact plant growth via hormone modulation. Further, by showing that bacteriophage accumulate CO2-derived carbon, we demonstrated their vital roles in redistribution of plant-derived C into the soil environment through bacterial cell lysis. The use of stable-isotope probing (SIP) to identify consumption (or lack thereof) of root-derived C by key microbial community members within highly complex microbial communities opens the way for assessing manipulations of bacteria and phage with potentially beneficial and detrimental traits, ultimately providing a path to improved plant health and soil carbon storage.
Collapse
|
15
|
Emerson JB, Varner RK, Wik M, Parks DH, Neumann RB, Johnson JE, Singleton CM, Woodcroft BJ, Tollerson R, Owusu-Dommey A, Binder M, Freitas NL, Crill PM, Saleska SR, Tyson GW, Rich VI. Diverse sediment microbiota shape methane emission temperature sensitivity in Arctic lakes. Nat Commun 2021; 12:5815. [PMID: 34611153 PMCID: PMC8492752 DOI: 10.1038/s41467-021-25983-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Accepted: 07/07/2021] [Indexed: 11/23/2022] Open
Abstract
Northern post-glacial lakes are significant, increasing sources of atmospheric carbon through ebullition (bubbling) of microbially-produced methane (CH4) from sediments. Ebullitive CH4 flux correlates strongly with temperature, reflecting that solar radiation drives emissions. However, here we show that the slope of the temperature-CH4 flux relationship differs spatially across two post-glacial lakes in Sweden. We compared these CH4 emission patterns with sediment microbial (metagenomic and amplicon), isotopic, and geochemical data. The temperature-associated increase in CH4 emissions was greater in lake middles—where methanogens were more abundant—than edges, and sediment communities were distinct between edges and middles. Microbial abundances, including those of CH4-cycling microorganisms and syntrophs, were predictive of porewater CH4 concentrations. Results suggest that deeper lake regions, which currently emit less CH4 than shallower edges, could add substantially to CH4 emissions in a warmer Arctic and that CH4 emission predictions may be improved by accounting for spatial variations in sediment microbiota. Arctic lakes are strong and increasing sources of atmospheric methane, but extreme conditions and limited observations hinder robust understanding. Here the authors show that microbes in the middle of Arctic lakes have elevated methane producing potential, and are poised to release even more in the future.
Collapse
Affiliation(s)
- Joanne B Emerson
- Department of Microbiology, The Ohio State University, 496W 12th Ave, Columbus, OH, 43210, USA. .,Department of Plant Pathology, University of California, Davis, One Shields Ave, Davis, CA, 95616, USA.
| | - Ruth K Varner
- Department of Earth Sciences, University of New Hampshire, 56 College Road, Durham, NH, 03824, USA. .,Earth Systems Research Center, Institute for the Study of Earth, Oceans and Space, University of New Hampshire, 8 College Road, Durham, NH, 03824, USA.
| | - Martin Wik
- Department of Geological Sciences, Stockholm University, Stockholm, 106 91, Sweden
| | - Donovan H Parks
- Australian Centre for Ecogenomics, School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane, 4072, Australia
| | - Rebecca B Neumann
- Civil & Environmental Engineering, University of Washington, 201 More Hall, Box 352700, Seattle, WA, 98195-2700, USA
| | - Joel E Johnson
- Department of Earth Sciences, University of New Hampshire, 56 College Road, Durham, NH, 03824, USA
| | - Caitlin M Singleton
- Australian Centre for Ecogenomics, School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane, 4072, Australia.,Center for Microbial Communities, Department of Chemistry and Bioscience, Aalborg University, Aalborg, 9220, Denmark
| | - Ben J Woodcroft
- Australian Centre for Ecogenomics, School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane, 4072, Australia
| | - Rodney Tollerson
- Department of Microbiology, The Ohio State University, 496W 12th Ave, Columbus, OH, 43210, USA.,Department of Geological and Planetary Sciences, California Institute of Technology, Pasadena, CA, 91106, USA
| | - Akosua Owusu-Dommey
- Department of Environmental Science, University of Arizona, Tucson, AZ, 85721, USA.,Parkland Hospital, 5200 Harry Hines Blvd., Dallas, TX, 75235, USA
| | - Morgan Binder
- Department of Environmental Science, University of Arizona, Tucson, AZ, 85721, USA.,John C. Lincoln Health Network, 34975N North Valley Pkwy Ste 100, Phoenix, AZ, 85086, USA
| | - Nancy L Freitas
- Department of Environmental Science, University of Arizona, Tucson, AZ, 85721, USA.,Energy and Resources Group, University of California, Berkeley, USA
| | - Patrick M Crill
- Department of Geological Sciences, Stockholm University, Stockholm, 106 91, Sweden
| | - Scott R Saleska
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, 85721, USA
| | - Gene W Tyson
- Australian Centre for Ecogenomics, School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane, 4072, Australia.,Centre for Microbiome Research, Queensland University of Technology, 37 Kent St, Woolloongabba, QLD, 4102, Australia
| | - Virginia I Rich
- Department of Microbiology, The Ohio State University, 496W 12th Ave, Columbus, OH, 43210, USA.
| |
Collapse
|
16
|
Haro-Moreno JM, López-Pérez M, Rodriguez-Valera F. Enhanced Recovery of Microbial Genes and Genomes From a Marine Water Column Using Long-Read Metagenomics. Front Microbiol 2021; 12:708782. [PMID: 34512586 PMCID: PMC8430335 DOI: 10.3389/fmicb.2021.708782] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2021] [Accepted: 07/30/2021] [Indexed: 12/12/2022] Open
Abstract
Third-generation sequencing has penetrated little in metagenomics due to the high error rate and dependence for assembly on short-read designed bioinformatics. However, second-generation sequencing metagenomics (mostly Illumina) suffers from limitations, particularly in the assembly of microbes with high microdiversity and retrieval of the flexible (adaptive) fraction of prokaryotic genomes. Here, we have used a third-generation technique to study the metagenome of a well-known marine sample from the mixed epipelagic water column of the winter Mediterranean. We have compared PacBio Sequel II with the classical approach using Illumina Nextseq short reads followed by assembly to study the metagenome. Long reads allow for efficient direct retrieval of complete genes avoiding the bias of the assembly step. Besides, the application of long reads on metagenomic assembly allows for the reconstruction of much more complete metagenome-assembled genomes (MAGs), particularly from microbes with high microdiversity such as Pelagibacterales. The flexible genome of reconstructed MAGs was much more complete containing many adaptive genes (some with biotechnological potential). PacBio Sequel II CCS appears particularly suitable for cellular metagenomics due to its low error rate. For most applications of metagenomics, from community structure analysis to ecosystem functioning, long reads should be applied whenever possible. Specifically, for in silico screening of biotechnologically useful genes, or population genomics, long-read metagenomics appears presently as a very fruitful approach and can be analyzed from raw reads before a computationally demanding (and potentially artifactual) assembly step.
Collapse
Affiliation(s)
- Jose M. Haro-Moreno
- Evolutionary Genomics Group, División de Microbiología, Universidad Miguel Hernández, Alicante, Spain
| | - Mario López-Pérez
- Evolutionary Genomics Group, División de Microbiología, Universidad Miguel Hernández, Alicante, Spain
| | - Francisco Rodriguez-Valera
- Evolutionary Genomics Group, División de Microbiología, Universidad Miguel Hernández, Alicante, Spain
- Research Center for Molecular Mechanisms of Aging and Age-Related Diseases, Moscow Institute of Physics and Technology, Dolgoprudny, Russia
| |
Collapse
|
17
|
Tedersoo L, Albertsen M, Anslan S, Callahan B. Perspectives and Benefits of High-Throughput Long-Read Sequencing in Microbial Ecology. Appl Environ Microbiol 2021; 87:e0062621. [PMID: 34132589 PMCID: PMC8357291 DOI: 10.1128/aem.00626-21] [Citation(s) in RCA: 74] [Impact Index Per Article: 18.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Short-read, high-throughput sequencing (HTS) methods have yielded numerous important insights into microbial ecology and function. Yet, in many instances short-read HTS techniques are suboptimal, for example, by providing insufficient phylogenetic resolution or low integrity of assembled genomes. Single-molecule and synthetic long-read (SLR) HTS methods have successfully ameliorated these limitations. In addition, nanopore sequencing has generated a number of unique analysis opportunities, such as rapid molecular diagnostics and direct RNA sequencing, and both Pacific Biosciences (PacBio) and nanopore sequencing support detection of epigenetic modifications. Although initially suffering from relatively low sequence quality, recent advances have greatly improved the accuracy of long-read sequencing technologies. In spite of great technological progress in recent years, the long-read HTS methods (PacBio and nanopore sequencing) are still relatively costly, require large amounts of high-quality starting material, and commonly need specific solutions in various analysis steps. Despite these challenges, long-read sequencing technologies offer high-quality, cutting-edge alternatives for testing hypotheses about microbiome structure and functioning as well as assembly of eukaryote genomes from complex environmental DNA samples.
Collapse
Affiliation(s)
- Leho Tedersoo
- Mycology and Microbiology Center, University of Tartu, Tartu, Estonia
| | - Mads Albertsen
- Department of Chemistry and Bioscience, Aalborg University, Aalborg, Denmark
| | - Sten Anslan
- Mycology and Microbiology Center, University of Tartu, Tartu, Estonia
- Braunschweig University of Technology, Zoological Institute, Braunschweig, Germany
| | - Benjamin Callahan
- Department of Population Health and Pathobiology, College of Veterinary Medicine and Bioinformatics Research Center, North Carolina State University, Raleigh, North Carolina, USA
| |
Collapse
|
18
|
Lapidus AL, Korobeynikov AI. Metagenomic Data Assembly - The Way of Decoding Unknown Microorganisms. Front Microbiol 2021; 12:613791. [PMID: 33833738 PMCID: PMC8021871 DOI: 10.3389/fmicb.2021.613791] [Citation(s) in RCA: 56] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2020] [Accepted: 03/03/2021] [Indexed: 01/08/2023] Open
Abstract
Metagenomics is a segment of conventional microbial genomics dedicated to the sequencing and analysis of combined genomic DNA of entire environmental samples. The most critical step of the metagenomic data analysis is the reconstruction of individual genes and genomes of the microorganisms in the communities using metagenomic assemblers - computational programs that put together small fragments of sequenced DNA generated by sequencing instruments. Here, we describe the challenges of metagenomic assembly, a wide spectrum of applications in which metagenomic assemblies were used to better understand the ecology and evolution of microbial ecosystems, and present one of the most efficient microbial assemblers, SPAdes that was upgraded to become applicable for metagenomics.
Collapse
Affiliation(s)
- Alla L. Lapidus
- Center for Algorithmic Biotechnology, St. Petersburg State University, Saint Petersburg, Russia
| | | |
Collapse
|
19
|
Dvorkina T, Antipov D, Korobeynikov A, Nurk S. SPAligner: alignment of long diverged molecular sequences to assembly graphs. BMC Bioinformatics 2020; 21:306. [PMID: 32703258 PMCID: PMC7379835 DOI: 10.1186/s12859-020-03590-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Accepted: 06/08/2020] [Indexed: 12/27/2022] Open
Abstract
BACKGROUND Graph-based representation of genome assemblies has been recently used in different contexts - from improved reconstruction of plasmid sequences and refined analysis of metagenomic data to read error correction and reference-free haplotype reconstruction. While many of these applications heavily utilize the alignment of long nucleotide sequences to assembly graphs, first general-purpose software tools for finding such alignments have been released only recently and their deficiencies and limitations are yet to be discovered. Moreover, existing tools can not perform alignment of amino acid sequences, which could prove useful in various contexts - in particular the analysis of metagenomic sequencing data. RESULTS In this work we present a novel SPAligner (Saint-Petersburg Aligner) tool for aligning long diverged nucleotide and amino acid sequences to assembly graphs. We demonstrate that SPAligner is an efficient solution for mapping third generation sequencing reads onto assembly graphs of various complexity and also show how it can facilitate the identification of known genes in complex metagenomic datasets. CONCLUSIONS Our work will facilitate accelerating the development of graph-based approaches in solving sequence to genome assembly alignment problem. SPAligner is implemented as a part of SPAdes tools library and is available on Github.
Collapse
Affiliation(s)
- Tatiana Dvorkina
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, St. Petersburg State University, St. Petersburg, Russia
| | - Dmitry Antipov
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, St. Petersburg State University, St. Petersburg, Russia
| | - Anton Korobeynikov
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, St. Petersburg State University, St. Petersburg, Russia
- Department of Statistical Modelling, St. Petersburg State University, St. Petersburg, Russia
| | - Sergey Nurk
- Genome Informatics Section, NHGRI, National Institutes of Health, Bethesda MD, USA
| |
Collapse
|
20
|
Brown CT, Moritz D, O'Brien MP, Reidl F, Reiter T, Sullivan BD. Exploring neighborhoods in large metagenome assembly graphs using spacegraphcats reveals hidden sequence diversity. Genome Biol 2020; 21:164. [PMID: 32631445 PMCID: PMC7336657 DOI: 10.1186/s13059-020-02066-4] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2019] [Accepted: 05/29/2020] [Indexed: 11/10/2022] Open
Abstract
Genomes computationally inferred from large metagenomic data sets are often incomplete and may be missing functionally important content and strain variation. We introduce an information retrieval system for large metagenomic data sets that exploits the sparsity of DNA assembly graphs to efficiently extract subgraphs surrounding an inferred genome. We apply this system to recover missing content from genome bins and show that substantial genomic sequence variation is present in a real metagenome. Our software implementation is available at https://github.com/spacegraphcats/spacegraphcats under the 3-Clause BSD License.
Collapse
Affiliation(s)
- C Titus Brown
- Department of Population Health and Reproduction, University of California Davis, Davis, USA.
| | - Dominik Moritz
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, USA
| | | | - Felix Reidl
- Department of Computer Science, NC State University, Raleigh, USA
| | - Taylor Reiter
- Department of Population Health and Reproduction, University of California Davis, Davis, USA
| | - Blair D Sullivan
- Department of Computer Science, NC State University, Raleigh, USA.
| |
Collapse
|
21
|
Tolstoganov I, Bankevich A, Chen Z, Pevzner PA. cloudSPAdes: assembly of synthetic long reads using de Bruijn graphs. Bioinformatics 2020; 35:i61-i70. [PMID: 31510642 PMCID: PMC6612831 DOI: 10.1093/bioinformatics/btz349] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Motivation The recently developed barcoding-based synthetic long read (SLR) technologies have already found many applications in genome assembly and analysis. However, although some new barcoding protocols are emerging and the range of SLR applications is being expanded, the existing SLR assemblers are optimized for a narrow range of parameters and are not easily extendable to new barcoding technologies and new applications such as metagenomics or hybrid assembly. Results We describe the algorithmic challenge of the SLR assembly and present a cloudSPAdes algorithm for SLR assembly that is based on analyzing the de Bruijn graph of SLRs. We benchmarked cloudSPAdes across various barcoding technologies/applications and demonstrated that it improves on the state-of-the-art SLR assemblers in accuracy and speed. Availability and implementation Source code and installation manual for cloudSPAdes are available at https://github.com/ablab/spades/releases/tag/cloudspades-paper. Supplementary Information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Ivan Tolstoganov
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, St. Petersburg State University, St. Petersburg, Russia
| | - Anton Bankevich
- Department of Computer Science and Engineering, University of California at San Diego, La Jolla, CA, USA
| | - Zhoutao Chen
- Universal Sequencing Technology Corporation, Carlsbad, CA, USA
| | - Pavel A Pevzner
- Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, St. Petersburg State University, St. Petersburg, Russia.,Department of Computer Science and Engineering, University of California at San Diego, La Jolla, CA, USA
| |
Collapse
|
22
|
Probst AJ, Elling FJ, Castelle CJ, Zhu Q, Elvert M, Birarda G, Holman HYN, Lane KR, Ladd B, Ryan MC, Woyke T, Hinrichs KU, Banfield JF. Lipid analysis of CO 2-rich subsurface aquifers suggests an autotrophy-based deep biosphere with lysolipids enriched in CPR bacteria. ISME JOURNAL 2020; 14:1547-1560. [PMID: 32203118 PMCID: PMC7242380 DOI: 10.1038/s41396-020-0624-4] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/20/2018] [Revised: 02/05/2020] [Accepted: 02/25/2020] [Indexed: 11/09/2022]
Abstract
Sediment-hosted CO2-rich aquifers deep below the Colorado Plateau (USA) contain a remarkable diversity of uncultivated microorganisms, including Candidate Phyla Radiation (CPR) bacteria that are putative symbionts unable to synthesize membrane lipids. The origin of organic carbon in these ecosystems is unknown and the source of CPR membrane lipids remains elusive. We collected cells from deep groundwater brought to the surface by eruptions of Crystal Geyser, sequenced the community, and analyzed the whole community lipidome over time. Characteristic stable carbon isotopic compositions of microbial lipids suggest that bacterial and archaeal CO2 fixation ongoing in the deep subsurface provides organic carbon for the complex communities that reside there. Coupled lipidomic-metagenomic analysis indicates that CPR bacteria lack complete lipid biosynthesis pathways but still possess regular lipid membranes. These lipids may therefore originate from other community members, which also adapt to high in situ pressure by increasing fatty acid unsaturation. An unusually high abundance of lysolipids attributed to CPR bacteria may represent an adaptation to membrane curvature stress induced by their small cell sizes. Our findings provide new insights into the carbon cycle in the deep subsurface and suggest the redistribution of lipids into putative symbionts within this community.
Collapse
Affiliation(s)
- Alexander J Probst
- Department of Earth and Planetary Science, University of California, Berkeley, CA, 94720, USA. .,Institute for Environmental Microbiology and Biotechnology, Department of Chemistry, University of Duisburg-Essen, Essen, Germany.
| | - Felix J Elling
- MARUM Center for Marine Environmental Sciences, University of Bremen, Bremen, Germany. .,Department of Earth and Planetary Sciences, Harvard University, Cambridge, MA, 02138, USA.
| | - Cindy J Castelle
- Department of Earth and Planetary Science, University of California, Berkeley, CA, 94720, USA.,MARUM Center for Marine Environmental Sciences, University of Bremen, Bremen, Germany
| | - Qingzeng Zhu
- MARUM Center for Marine Environmental Sciences, University of Bremen, Bremen, Germany
| | - Marcus Elvert
- MARUM Center for Marine Environmental Sciences, University of Bremen, Bremen, Germany
| | - Giovanni Birarda
- Elettra-Sincrotrone Trieste, Strada Statale 14-km 163,5 Basovizza, 34149, Trieste, Italy.,Molecular Biophysics and Integrated Bioimaging, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Hoi-Ying N Holman
- Molecular Biophysics and Integrated Bioimaging, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Katherine R Lane
- Department of Earth and Planetary Science, University of California, Berkeley, CA, 94720, USA
| | - Bethany Ladd
- Department of Geoscience, University of Calgary, Calgary, AB, T2N 1N4, Canada.,Department of Earth, Ocean, and Atmospheric Sciences, University of British Columbia, Vancouver, Canada
| | - M Cathryn Ryan
- Department of Geoscience, University of Calgary, Calgary, AB, T2N 1N4, Canada
| | - Tanja Woyke
- DOE Joint Genome Institute, Walnut Creek, MA, USA
| | - Kai-Uwe Hinrichs
- MARUM Center for Marine Environmental Sciences, University of Bremen, Bremen, Germany.
| | - Jillian F Banfield
- Department of Earth and Planetary Science, University of California, Berkeley, CA, 94720, USA.
| |
Collapse
|
23
|
Starr EP, Nuccio EE, Pett-Ridge J, Banfield JF, Firestone MK. Metatranscriptomic reconstruction reveals RNA viruses with the potential to shape carbon cycling in soil. Proc Natl Acad Sci U S A 2019; 116:25900-25908. [PMID: 31772013 PMCID: PMC6926006 DOI: 10.1073/pnas.1908291116] [Citation(s) in RCA: 123] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Viruses impact nearly all organisms on Earth, with ripples of influence in agriculture, health, and biogeochemical processes. However, very little is known about RNA viruses in an environmental context, and even less is known about their diversity and ecology in soil, 1 of the most complex microbial systems. Here, we assembled 48 individual metatranscriptomes from 4 habitats within a planted soil sampled over a 22-d time series: Rhizosphere alone, detritosphere alone, rhizosphere with added root detritus, and unamended soil (4 time points and 3 biological replicates). We resolved the RNA viral community, uncovering a high diversity of viral sequences. We also investigated possible host organisms by analyzing metatranscriptome marker genes. Based on viral phylogeny, much of the diversity was Narnaviridae that may parasitize fungi or Leviviridae, which may infect Proteobacteria. Both host and viral communities appear to be highly dynamic, and rapidly diverged depending on experimental conditions. The viral and host communities were structured based on the presence of root litter. Clear temporal dynamics by Leviviridae and their hosts indicated that viruses were replicating. With this time-resolved analysis, we show that RNA viruses are diverse, abundant, and active in soil. When viral infection causes host cell death, it may mobilize cell carbon in a process that may represent an overlooked component of soil carbon cycling.
Collapse
Affiliation(s)
- Evan P Starr
- Department of Plant and Microbial Biology, University of California, Berkeley, CA 94720
| | - Erin E Nuccio
- Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, CA 94550
| | - Jennifer Pett-Ridge
- Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, CA 94550
| | - Jillian F Banfield
- Department of Earth and Planetary Science, University of California, Berkeley, CA 94720;
- Earth Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720
- Department of Environmental Science, Policy, and Management, University of California, Berkeley, CA 94720
- Chan Zuckerberg Biohub, San Francisco, CA 94158
- Innovative Genomics Institute, Berkeley, CA 94720
| | - Mary K Firestone
- Earth Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720;
- Department of Environmental Science, Policy, and Management, University of California, Berkeley, CA 94720
| |
Collapse
|
24
|
Royo-Llonch M, Sánchez P, González JM, Pedrós-Alió C, Acinas SG. Ecological and functional capabilities of an uncultured Kordia sp. Syst Appl Microbiol 2019; 43:126045. [PMID: 31831198 DOI: 10.1016/j.syapm.2019.126045] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2019] [Revised: 10/28/2019] [Accepted: 11/12/2019] [Indexed: 01/07/2023]
Abstract
Cultivable bacteria represent only a fraction of the diversity in microbial communities. However, the official procedures for classification and characterization of a novel prokaryotic species still rely on isolates. Nevertheless, due to single cell genomics, it is possible to retrieve genomes from environmental samples by sequencing them individually, and to assign specific genes to a specific taxon, regardless of their ability to grow in culture. In this study, a complete description was performed for uncultured Kordia sp. TARA_039_SRF, a proposed novel species within the genus Kordia, using culture-independent techniques. The type material was a high-quality draft genome (94.97% complete, 4.65% gene redundancy) co-assembled using ten nearly identical single amplified genomes (SAGs) from surface seawater in the North Indian Ocean during the Tara Oceans Expedition. The assembly process was optimized to obtain the best possible assembly metrics and a less fragmented genome. The closest relative of the species was Kordia periserrulae, which shared 97.56% similarity of the 16S rRNA gene, 75% orthologs and 89.13% average nucleotide identity. The functional potential of the proposed novel species included proteorhodopsin, the ability to incorporate nitrate, cytochrome oxidases with high affinity for oxygen, and CAZymes that were unique features within the genus. Its abundance at different depths and size fractions was also evaluated together with its functional annotation, revealing that its putative ecological niche could be particles of phytoplanktonic origin. It could putatively attach to these particles and consume them while sinking to the deeper and oxygen depleted layers of the North Indian Ocean.
Collapse
Affiliation(s)
- M Royo-Llonch
- Department of Marine Biology and Oceanography, Institut de Ciències del Mar (ICM), CSIC, Barcelona, Spain
| | - P Sánchez
- Department of Marine Biology and Oceanography, Institut de Ciències del Mar (ICM), CSIC, Barcelona, Spain
| | - J M González
- Department of Microbiology, University of La Laguna, La Laguna, Spain
| | - C Pedrós-Alió
- Systems Biology Program, Centro Nacional de Biotecnología (CNB), CSIC, Madrid, Spain
| | - S G Acinas
- Department of Marine Biology and Oceanography, Institut de Ciències del Mar (ICM), CSIC, Barcelona, Spain.
| |
Collapse
|
25
|
Unlinked rRNA genes are widespread among bacteria and archaea. ISME JOURNAL 2019; 14:597-608. [PMID: 31712737 DOI: 10.1038/s41396-019-0552-3] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/19/2019] [Revised: 10/23/2019] [Accepted: 10/29/2019] [Indexed: 02/06/2023]
Abstract
Ribosomes are essential to cellular life and the genes for their RNA components are the most conserved and transcribed genes in bacteria and archaea. Ribosomal RNA genes are typically organized into a single operon, an arrangement thought to facilitate gene regulation. In reality, some bacteria and archaea do not share this canonical rRNA arrangement-their 16S and 23S rRNA genes are separated across the genome and referred to as "unlinked". This rearrangement has previously been treated as an anomaly or a byproduct of genome degradation in intracellular bacteria. Here, we leverage complete genome and long-read metagenomic data to show that unlinked 16S and 23S rRNA genes are more common than previously thought. Unlinked rRNA genes occur in many phyla, most significantly within Deinococcus-Thermus, Chloroflexi, and Planctomycetes, and occur in differential frequencies across natural environments. We found that up to 41% of rRNA genes in soil were unlinked, in contrast to the human gut, where all sequenced rRNA genes were linked. The frequency of unlinked rRNA genes may reflect meaningful life history traits, as they tend to be associated with a mix of slow-growing free-living species and intracellular species. We speculate that unlinked rRNA genes may confer selective advantages in some environments, though the specific nature of these advantages remains undetermined and worthy of further investigation. More generally, the prevalence of unlinked rRNA genes in poorly-studied taxa serves as a reminder that paradigms derived from model organisms do not necessarily extend to the broader diversity of bacteria and archaea.
Collapse
|
26
|
Ramírez GA, Garber AI, Lecoeuvre A, D’Angelo T, Wheat CG, Orcutt BN. Ecology of Subseafloor Crustal Biofilms. Front Microbiol 2019; 10:1983. [PMID: 31551949 PMCID: PMC6736579 DOI: 10.3389/fmicb.2019.01983] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2019] [Accepted: 08/13/2019] [Indexed: 11/26/2022] Open
Abstract
The crustal subseafloor is the least explored and largest biome on Earth. Interrogating crustal life is difficult due to habitat inaccessibility, low-biomass and contamination challenges. Subseafloor observatories have facilitated the study of planktonic life in crustal aquifers, however, studies of life in crust-attached biofilms are rare. Here, we investigate biofilms grown on various minerals at different temperatures over 1-6 years at subseafloor observatories in the Eastern Pacific. To mitigate potential sequence contamination, we developed a new bioinformatics tool - TaxonSluice. We explore ecological factors driving community structure and potential function of biofilms by comparing our sequence data to previous amplicon and metagenomic surveys of this habitat. We reveal that biofilm community structure is driven by temperature rather than minerology, and that rare planktonic lineages colonize the crustal biofilms. Based on 16S rRNA gene overlap, we partition metagenome assembled genomes into planktonic and biofilm fractions and suggest that there are functional differences between these community types, emphasizing the need to separately examine each to accurately describe subseafloor microbe-rock-fluid processes. Lastly, we report that some rare lineages present in our warm and anoxic study site are also found in cold and oxic crustal fluids in the Mid-Atlantic Ridge, suggesting global crustal biogeography patterns.
Collapse
Affiliation(s)
- Gustavo A. Ramírez
- Graduate School of Oceanography, University of Rhode Island, Narragansett, RI, United States
| | - Arkadiy I. Garber
- Division of Biological Sciences, University of Montana, Missoula, MT, United States
| | - Aurélien Lecoeuvre
- Bigelow Laboratory for Ocean Sciences, East Boothbay, ME, United States
- Université de Bretagne Occidentale, UFR Sciences et Techniques, Brest, France
| | - Timothy D’Angelo
- Bigelow Laboratory for Ocean Sciences, East Boothbay, ME, United States
| | - C. Geoffrey Wheat
- Institute of Marine Science, University of Alaska Fairbanks, Fairbanks, AK, United States
| | - Beth N. Orcutt
- Bigelow Laboratory for Ocean Sciences, East Boothbay, ME, United States
| |
Collapse
|
27
|
Suzuki Y, Nishijima S, Furuta Y, Yoshimura J, Suda W, Oshima K, Hattori M, Morishita S. Long-read metagenomic exploration of extrachromosomal mobile genetic elements in the human gut. MICROBIOME 2019; 7:119. [PMID: 31455406 PMCID: PMC6712665 DOI: 10.1186/s40168-019-0737-z] [Citation(s) in RCA: 51] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/18/2019] [Accepted: 08/16/2019] [Indexed: 05/13/2023]
Abstract
BACKGROUND Elucidating the ecological and biological identity of extrachromosomal mobile genetic elements (eMGEs), such as plasmids and bacteriophages, in the human gut remains challenging due to their high complexity and diversity. RESULTS Here, we show efficient identification of eMGEs as complete circular or linear contigs from PacBio long-read metagenomic data. De novo assembly of PacBio long reads from 12 faecal samples generated 82 eMGE contigs (2.5~666.7-kb), which were classified as 71 plasmids and 11 bacteriophages, including 58 novel plasmids and six bacteriophages, and complete genomes of five diverse crAssphages with terminal direct repeats. In a dataset of 413 gut metagenomes from five countries, many of the identified plasmids were highly abundant and prevalent. The ratio of gut plasmids by our plasmid data is more than twice that in the public database. Plasmids outnumbered bacterial chromosomes three to one on average in this metagenomic dataset. Host prediction suggested that Bacteroidetes-associated plasmids predominated, regardless of microbial abundance. The analysis found several plasmid-enriched functions, such as inorganic ion transport, while antibiotic resistance genes were harboured mostly in low-abundance Proteobacteria-associated plasmids. CONCLUSIONS Overall, long-read metagenomics provided an efficient approach for unravelling the complete structure of human gut eMGEs, particularly plasmids.
Collapse
Affiliation(s)
- Yoshihiko Suzuki
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, 277-8568 Japan
| | - Suguru Nishijima
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, 277-8568 Japan
- AIST-Waseda University Computational Bio Big-Data Open Innovation Laboratory, Tokyo, 169-8555 Japan
- Graduate School of Advanced Science and Engineering, Waseda University, Tokyo, 169-8555 Japan
| | - Yoshikazu Furuta
- Division of Infection and Immunity, Research Center for Zoonosis Control, Hokkaido University, Sapporo, 001-0020 Japan
| | - Jun Yoshimura
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, 277-8568 Japan
| | - Wataru Suda
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, 277-8568 Japan
- Laboratory for Microbiome Sciences, RIKEN Center for Integrative Medical Sciences, Yokohama, 230-0045 Japan
| | - Kenshiro Oshima
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, 277-8568 Japan
| | - Masahira Hattori
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, 277-8568 Japan
- Graduate School of Advanced Science and Engineering, Waseda University, Tokyo, 169-8555 Japan
- Laboratory for Microbiome Sciences, RIKEN Center for Integrative Medical Sciences, Yokohama, 230-0045 Japan
| | - Shinichi Morishita
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, 277-8568 Japan
| |
Collapse
|
28
|
Li S, Tang H, Ye Y. A Meta-proteogenomic Approach to Peptide Identification Incorporating Assembly Uncertainty and Genomic Variation. Mol Cell Proteomics 2019; 18:S183-S192. [PMID: 31142575 PMCID: PMC6692780 DOI: 10.1074/mcp.tir118.001233] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2018] [Revised: 04/25/2019] [Indexed: 01/07/2023] Open
Abstract
Matching metagenomic and/or metatranscriptomic data, currently often under-used, can be useful reference for metaproteomic tandem mass spectra (MS/MS) data analysis. Here we developed a software pipeline for identification of peptides and proteins from metaproteomic MS/MS data using proteins derived from matching metagenomic (and metatranscriptomic) data as the search database, based on two novel approaches Graph2Pro (published) and Var2Pep (new). Graph2Pro retains and uses uncertainties of metagenome assembly for reference-based MS/MS data analysis. Var2Pep considers the variations found in metagenomic/metatranscriptomic sequencing reads that are not retained in the assemblies (contigs). The new software pipeline provides one stop application of both tools, and it supports the use of metagenome assembly from commonly used assemblers including MegaHit and metaSPAdes. When tested on two collections of multi-omic microbiome data sets, our pipeline significantly improved the identification rate of the metaproteomic MS/MS spectra by about two folds, comparing to conventional contig- or read-based approaches (the Var2Pep alone identified 5.6% to 24.1% more unique peptides, depending on the data set). We also showed that identified variant peptides are important for functional profiling of microbiomes. All results suggested that it is important to take into consideration of the assembly uncertainties and genomic variants to facilitate metaproteomic MS/MS data interpretation.
Collapse
Affiliation(s)
- Sujun Li
- School of Informatics, Computing and Engineering, Indiana University, Bloomington, IN
| | - Haixu Tang
- School of Informatics, Computing and Engineering, Indiana University, Bloomington, IN
| | - Yuzhen Ye
- School of Informatics, Computing and Engineering, Indiana University, Bloomington, IN.
| |
Collapse
|
29
|
Diamond S, Andeer PF, Li Z, Crits-Christoph A, Burstein D, Anantharaman K, Lane KR, Thomas BC, Pan C, Northen TR, Banfield JF. Mediterranean grassland soil C-N compound turnover is dependent on rainfall and depth, and is mediated by genomically divergent microorganisms. Nat Microbiol 2019; 4:1356-1367. [PMID: 31110364 PMCID: PMC6784897 DOI: 10.1038/s41564-019-0449-y] [Citation(s) in RCA: 102] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2018] [Accepted: 04/03/2019] [Indexed: 12/15/2022]
Abstract
Soil microbial activity drives the carbon and nitrogen cycles and is an important determinant of atmospheric trace gas turnover, yet most soils are dominated by microorganisms with unknown metabolic capacities. Even Acidobacteria, among the most abundant bacteria in soil, remain poorly characterized, and functions across groups such as Verrucomicrobia, Gemmatimonadetes, Chloroflexi and Rokubacteria are understudied. Here, we have resolved 60 metagenomic and 20 proteomic data sets from a Mediterranean grassland soil ecosystem and recovered 793 near-complete microbial genomes from 18 phyla, representing around one-third of all microorganisms detected. Importantly, this enabled extensive genomics-based metabolic predictions for these communities. Acidobacteria from multiple previously unstudied classes have genomes that encode large enzyme complements for complex carbohydrate degradation. Alternatively, most microorganisms encode carbohydrate esterases that strip readily accessible methyl and acetyl groups from polymers like pectin and xylan, forming methanol and acetate, the availability of which could explain the high prevalence of C1 metabolism and acetate utilization in genomes. Microorganism abundances among samples collected at three soil depths and under natural and amended rainfall regimes indicate statistically higher associations of inorganic nitrogen metabolism and carbon degradation in deep and shallow soils, respectively. This partitioning decreased in samples under extended spring rainfall, indicating that long-term climate alteration can affect both carbon and nitrogen cycling. Overall, by leveraging natural and experimental gradients with genome-resolved metabolic profiles, we link microorganisms lacking prior genomic characterization to specific roles in complex carbon, C1, nitrate and ammonia transformations, and constrain factors that impact their distributions in soil.
Collapse
Affiliation(s)
- Spencer Diamond
- Department of Earth and Planetary Science, University of California, Berkeley, Berkeley, CA, USA
| | - Peter F Andeer
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Zhou Li
- Oak Ridge National Laboratory, Oak Ridge, TN, USA
| | | | - David Burstein
- Department of Earth and Planetary Science, University of California, Berkeley, Berkeley, CA, USA
- School of Molecular Cell Biology and Biotechnology, Tel Aviv University, Tel Aviv, Israel
| | - Karthik Anantharaman
- Department of Earth and Planetary Science, University of California, Berkeley, Berkeley, CA, USA
- Department of Bacteriology, University of Wisconsin, Madison, WI, USA
| | - Katherine R Lane
- Department of Earth and Planetary Science, University of California, Berkeley, Berkeley, CA, USA
| | - Brian C Thomas
- Department of Earth and Planetary Science, University of California, Berkeley, Berkeley, CA, USA
| | - Chongle Pan
- Oak Ridge National Laboratory, Oak Ridge, TN, USA
- School of Computer Science and Department of Microbiology and Plant Biology, University of Oklahoma, Norman, OK, USA
| | - Trent R Northen
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Joint Genome Institute, Lawrence Berkeley National Laboratory, Walnut Creek, CA, USA
| | - Jillian F Banfield
- Department of Earth and Planetary Science, University of California, Berkeley, Berkeley, CA, USA.
- Department of Environmental Science, Policy, and Management, University of California, Berkeley, Berkeley, CA, USA.
| |
Collapse
|
30
|
Hug LA. Subsampled Assemblies and Hybrid Nucleotide Composition/Differential Coverage Binning for Genome-Resolved Metagenomics. Methods Mol Biol 2019; 1849:215-225. [PMID: 30298257 DOI: 10.1007/978-1-4939-8728-3_14] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Metagenomic analyses for reconstruction of genomes from mixed microbial community datasets now routinely allow rapid, accurate genome recovery for tens to hundreds of organisms from environmental samples. This chapter provides a step-by-step protocol for reconstructing genomes from metagenomic datasets, with a focus on the most abundant community members. Subsampling assembly approaches are implemented to improve assembly of abundant genome sequences, an iterative process that targets progressively less abundant populations and improves total community representation in the final merged assembly. A hybrid approach to genome binning is described, combining differential coverage information from a series of metagenomic samples with nucleotide composition information. This approach strengthens binning through application of multiple independent variables for contig clustering. Genome curation through error correction and gap closure leads to high-quality draft genomes, and, for some community members, closed and complete genome sequences reconstructed directly from environmental samples.
Collapse
Affiliation(s)
- Laura A Hug
- Department of Biology, University of Waterloo, Waterloo, ON, Canada.
| |
Collapse
|
31
|
Somerville V, Lutz S, Schmid M, Frei D, Moser A, Irmler S, Frey JE, Ahrens CH. Long-read based de novo assembly of low-complexity metagenome samples results in finished genomes and reveals insights into strain diversity and an active phage system. BMC Microbiol 2019; 19:143. [PMID: 31238873 PMCID: PMC6593500 DOI: 10.1186/s12866-019-1500-0] [Citation(s) in RCA: 81] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2018] [Accepted: 05/31/2019] [Indexed: 01/18/2023] Open
Abstract
BACKGROUND Complete and contiguous genome assemblies greatly improve the quality of subsequent systems-wide functional profiling studies and the ability to gain novel biological insights. While a de novo genome assembly of an isolated bacterial strain is in most cases straightforward, more informative data about co-existing bacteria as well as synergistic and antagonistic effects can be obtained from a direct analysis of microbial communities. However, the complexity of metagenomic samples represents a major challenge. While third generation sequencing technologies have been suggested to enable finished metagenome-assembled genomes, to our knowledge, the complete genome assembly of all dominant strains in a microbiome sample has not been demonstrated. Natural whey starter cultures (NWCs) are used in cheese production and represent low-complexity microbiomes. Previous studies of Swiss Gruyère and selected Italian hard cheeses, mostly based on amplicon metagenomics, concurred that three species generally pre-dominate: Streptococcus thermophilus, Lactobacillus helveticus and Lactobacillus delbrueckii. RESULTS Two NWCs from Swiss Gruyère producers were subjected to whole metagenome shotgun sequencing using the Pacific Biosciences Sequel and Illumina MiSeq platforms. In addition, longer Oxford Nanopore Technologies MinION reads had to be generated for one to resolve repeat regions. Thereby, we achieved the complete assembly of all dominant bacterial genomes from these low-complexity NWCs, which was corroborated by a 16S rRNA amplicon survey. Moreover, two distinct L. helveticus strains were successfully co-assembled from the same sample. Besides bacterial chromosomes, we could also assemble several bacterial plasmids and phages and a corresponding prophage. Biologically relevant insights were uncovered by linking the plasmids and phages to their respective host genomes using DNA methylation motifs on the plasmids and by matching prokaryotic CRISPR spacers with the corresponding protospacers on the phages. These results could only be achieved by employing long-read sequencing data able to span intragenomic as well as intergenomic repeats. CONCLUSIONS Here, we demonstrate the feasibility of complete de novo genome assembly of all dominant strains from low-complexity NWCs based on whole metagenomics shotgun sequencing data. This allowed to gain novel biological insights and is a fundamental basis for subsequent systems-wide omics analyses, functional profiling and phenotype to genotype analysis of specific microbial communities.
Collapse
Affiliation(s)
- Vincent Somerville
- Agroscope, Research Group Molecular Diagnostics, Genomics & Bioinformatics, Schloss 1, CH-8820 Wädenswil, Switzerland
- SIB Swiss Institute of Bioinformatics, CH-8820 Wädenswil, Switzerland
| | - Stefanie Lutz
- Agroscope, Research Group Molecular Diagnostics, Genomics & Bioinformatics, Schloss 1, CH-8820 Wädenswil, Switzerland
- SIB Swiss Institute of Bioinformatics, CH-8820 Wädenswil, Switzerland
| | - Michael Schmid
- Agroscope, Research Group Molecular Diagnostics, Genomics & Bioinformatics, Schloss 1, CH-8820 Wädenswil, Switzerland
- SIB Swiss Institute of Bioinformatics, CH-8820 Wädenswil, Switzerland
| | - Daniel Frei
- Agroscope, Research Group Molecular Diagnostics, Genomics & Bioinformatics, Schloss 1, CH-8820 Wädenswil, Switzerland
| | - Aline Moser
- Agroscope, Research Group Biochemistry of Milk and Microorganisms, CH-3003 Bern, Switzerland
| | - Stefan Irmler
- Agroscope, Research Group Biochemistry of Milk and Microorganisms, CH-3003 Bern, Switzerland
| | - Jürg E. Frey
- Agroscope, Research Group Molecular Diagnostics, Genomics & Bioinformatics, Schloss 1, CH-8820 Wädenswil, Switzerland
| | - Christian H. Ahrens
- Agroscope, Research Group Molecular Diagnostics, Genomics & Bioinformatics, Schloss 1, CH-8820 Wädenswil, Switzerland
- SIB Swiss Institute of Bioinformatics, CH-8820 Wädenswil, Switzerland
| |
Collapse
|
32
|
Vuillemin A, Ariztegui D, Horn F, Kallmeyer J, Orsi WD. Microbial community composition along a 50 000-year lacustrine sediment sequence. FEMS Microbiol Ecol 2019; 94:4880442. [PMID: 29471361 PMCID: PMC5905624 DOI: 10.1093/femsec/fiy029] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2017] [Accepted: 02/19/2018] [Indexed: 02/01/2023] Open
Abstract
For decades, microbial community composition in subseafloor sediments has been the focus of extensive studies. In deep lacustrine sediments, however, the taxonomic composition of microbial communities remains undercharacterized. Greater knowledge on microbial diversity in lacustrine sediments would improve our understanding of how environmental factors, and resulting selective pressures, shape subsurface biospheres in marine and freshwater sediments. Using high-throughput sequencing of 16S rRNA genes across high-resolution climate intervals covering the last 50 000 years in Laguna Potrok Aike, Argentina, we identified changes in microbial populations in response to both past environmental conditions and geochemical changes of the sediment during burial. Microbial communities in Holocene sediments were most diverse, reflecting a layering of taxa linked to electron acceptors availability. In deeper intervals, the data show that salinity, organic matter and the depositional conditions over the Last Glacial-interglacial cycle were all selective pressures in the deep lacustrine assemblage resulting in a genetically distinct biosphere from the surface dominated primarily by Bathyarchaeota and Atribacteria groups. However, similar to marine sediments, some dominant taxa in the shallow subsurface persisted into the subsurface as minor fraction of the community. The subsequent establishment of a deep subsurface community likely results from a combination of paleoenvironmental factors that have shaped the pool of available substrates, together with substrate depletion and/or reworking of organic matter with depth.
Collapse
Affiliation(s)
- Aurèle Vuillemin
- Department of Earth & Environmental Science, Paleontology & Geobiology, Ludwig-Maximilians-Universität München, Richard-Wagner-Str. 10, 80333 Munich, Germany.,Section of Earth & Environmental Sciences, University of Geneva, rue des Maraichers 13, 1205 Geneva, Switzerland.,GFZ German Research Centre for Geosciences, Helmholtz Centre Potsdam, Section 5.3: Geomicrobiology, Telegrafenberg, 14473 Potsdam, Germany
| | - Daniel Ariztegui
- Section of Earth & Environmental Sciences, University of Geneva, rue des Maraichers 13, 1205 Geneva, Switzerland
| | - Fabian Horn
- GFZ German Research Centre for Geosciences, Helmholtz Centre Potsdam, Section 5.3: Geomicrobiology, Telegrafenberg, 14473 Potsdam, Germany
| | - Jens Kallmeyer
- GFZ German Research Centre for Geosciences, Helmholtz Centre Potsdam, Section 5.3: Geomicrobiology, Telegrafenberg, 14473 Potsdam, Germany
| | - William D Orsi
- Department of Earth & Environmental Science, Paleontology & Geobiology, Ludwig-Maximilians-Universität München, Richard-Wagner-Str. 10, 80333 Munich, Germany.,Geobio-Center, Ludwig-Maximilians-Universität München, Richard-Wagner-Str. 10, 80333 Munich, Germany
| | | |
Collapse
|
33
|
Genome of the candidate phylum Aminicenantes bacterium from a deep subsurface thermal aquifer revealed its fermentative saccharolytic lifestyle. Extremophiles 2019; 23:189-200. [DOI: 10.1007/s00792-018-01073-5] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2018] [Accepted: 12/17/2018] [Indexed: 10/27/2022]
|
34
|
Bishara A, Moss EL, Kolmogorov M, Parada AE, Weng Z, Sidow A, Dekas AE, Batzoglou S, Bhatt AS. High-quality genome sequences of uncultured microbes by assembly of read clouds. Nat Biotechnol 2018; 36:nbt.4266. [PMID: 30320765 PMCID: PMC6465186 DOI: 10.1038/nbt.4266] [Citation(s) in RCA: 70] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2018] [Accepted: 08/28/2018] [Indexed: 01/08/2023]
Abstract
Although shotgun metagenomic sequencing of microbiome samples enables partial reconstruction of strain-level community structure, obtaining high-quality microbial genome drafts without isolation and culture remains difficult. Here, we present an application of read clouds, short-read sequences tagged with long-range information, to microbiome samples. We present Athena, a de novo assembler that uses read clouds to improve metagenomic assemblies. We applied this approach to sequence stool samples from two healthy individuals and compared it with existing short-read and synthetic long-read metagenomic sequencing techniques. Read-cloud metagenomic sequencing and Athena assembly produced the most comprehensive individual genome drafts with high contiguity (>200-kb N50, fewer than ten contigs), even for bacteria with relatively low (20×) raw short-read-sequence coverage. We also sequenced a complex marine-sediment sample and generated 24 intermediate-quality genome drafts (>70% complete, <10% contaminated), nine of which were complete (>90% complete, <5% contaminated). Our approach allows for culture-free generation of high-quality microbial genome drafts by using a single shotgun experiment.
Collapse
Affiliation(s)
- Alex Bishara
- Department of Computer Science, Stanford University, Stanford, California, USA
- Department of Medicine (Hematology, Blood and Marrow Transplantation) and Department of Genetics, Stanford University, Stanford, California, USA
| | - Eli L. Moss
- Department of Medicine (Hematology, Blood and Marrow Transplantation) and Department of Genetics, Stanford University, Stanford, California, USA
| | - Mikhail Kolmogorov
- Department of Computer Science and Engineering, University of California San Diego, La Jolla, California, USA
| | - Alma E. Parada
- Department of Earth System Science, Stanford University, Stanford, CA, USA
| | - Ziming Weng
- Department of Pathology, Stanford University School of Medicine, Stanford, California, USA
| | - Arend Sidow
- Department of Medicine (Hematology, Blood and Marrow Transplantation) and Department of Genetics, Stanford University, Stanford, California, USA
- Department of Pathology, Stanford University School of Medicine, Stanford, California, USA
| | - Anne E. Dekas
- Department of Earth System Science, Stanford University, Stanford, CA, USA
| | - Serafim Batzoglou
- Department of Computer Science, Stanford University, Stanford, California, USA
| | - Ami S. Bhatt
- Department of Medicine (Hematology, Blood and Marrow Transplantation) and Department of Genetics, Stanford University, Stanford, California, USA
| |
Collapse
|
35
|
Personalized Gut Mucosal Colonization Resistance to Empiric Probiotics Is Associated with Unique Host and Microbiome Features. Cell 2018; 174:1388-1405.e21. [DOI: 10.1016/j.cell.2018.08.041] [Citation(s) in RCA: 725] [Impact Index Per Article: 103.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2016] [Revised: 06/05/2018] [Accepted: 08/20/2018] [Indexed: 12/17/2022]
|
36
|
Structure and function of the global topsoil microbiome. Nature 2018; 560:233-237. [PMID: 30069051 DOI: 10.1038/s41586-018-0386-6] [Citation(s) in RCA: 971] [Impact Index Per Article: 138.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2017] [Accepted: 06/13/2018] [Indexed: 01/25/2023]
Abstract
Soils harbour some of the most diverse microbiomes on Earth and are essential for both nutrient cycling and carbon storage. To understand soil functioning, it is necessary to model the global distribution patterns and functional gene repertoires of soil microorganisms, as well as the biotic and environmental associations between the diversity and structure of both bacterial and fungal soil communities1-4. Here we show, by leveraging metagenomics and metabarcoding of global topsoil samples (189 sites, 7,560 subsamples), that bacterial, but not fungal, genetic diversity is highest in temperate habitats and that microbial gene composition varies more strongly with environmental variables than with geographic distance. We demonstrate that fungi and bacteria show global niche differentiation that is associated with contrasting diversity responses to precipitation and soil pH. Furthermore, we provide evidence for strong bacterial-fungal antagonism, inferred from antibiotic-resistance genes, in topsoil and ocean habitats, indicating the substantial role of biotic interactions in shaping microbial communities. Our results suggest that both competition and environmental filtering affect the abundance, composition and encoded gene functions of bacterial and fungal communities, indicating that the relative contributions of these microorganisms to global nutrient cycling varies spatially.
Collapse
|
37
|
Joint Analysis of Long and Short Reads Enables Accurate Estimates of Microbiome Complexity. Cell Syst 2018; 7:192-200.e3. [PMID: 30056005 DOI: 10.1016/j.cels.2018.06.009] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2018] [Revised: 05/05/2018] [Accepted: 06/15/2018] [Indexed: 01/09/2023]
Abstract
Reduced microbiome diversity has been linked to several diseases. However, estimating the diversity of bacterial communities-the number and the total length of distinct genomes within a metagenome-remains an open problem in microbial ecology. Here, we describe an algorithm for estimating the microbial diversity in a metagenomic sample based on a joint analysis of short and long reads. Unlike previous approaches, the algorithm does not make any assumptions on the distribution of the frequencies of genomes within a metagenome (as in parametric methods) and does not require a large database that covers the total diversity (as in non-parametric methods). We estimate that genomes comprising a human gut metagenome have total length varying from 1.3 to 3.5 billion nucleotides, with genomes responsible for 50% of total abundance having total length varying from only 25 to 61 million nucleotides. In contrast, genomes comprising an aquifer sediment metagenome have more than two orders of magnitude larger total length (≈840 billion nucleotides).
Collapse
|
38
|
Young ND, Gasser RB. Opisthorchis viverrini Draft Genome - Biomedical Implications and Future Avenues. ADVANCES IN PARASITOLOGY 2018; 101:125-148. [PMID: 29907252 DOI: 10.1016/bs.apar.2018.05.005] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Opisthorchiasis is a neglected tropical disease of major proportion, caused by the carcinogenic, Asian liver fluke, Opisthorchis viverrini. This hepatobiliary disease is known to be associated with malignant cancer (cholangiocarcinoma, CCA) and affects millions of people in Southeast Asia. No vaccine is available, and only one drug (praziquantel) is routinely employed against the parasite. Despite technological advances, little is known about the molecular biology of the fluke itself and the disease complex that it causes in humans. The advent of high-throughput nucleic acid sequencing and bioinformatic technologies is enabling researchers to gain global insights into the molecular pathways and processes in parasites. The principal aims of this chapter are to (1) review molecular research of O. viverrini and opisthorchiasis; (2) provide an account of recent advances in the sequencing and characterization of the genome and transcriptomes of O. viverrini; (3) describe the complex life of this worm in the biliary system of the definitive (human) host and how the fluke interacts with this host and causes disease at the molecular level; (4) discuss the implications of systems biological research and (5) consider how progress in genomics and informatics might enable explorations of O. viverrini and related worms and the discovery of new interventions against opisthorchiasis and CCA.
Collapse
Affiliation(s)
- Neil D Young
- The University of Melbourne, Parkville, VIC, Australia
| | | |
Collapse
|
39
|
Identification of Major Rhizobacterial Taxa Affected by a Glyphosate-Tolerant Soybean Line via Shotgun Metagenomic Approach. Genes (Basel) 2018; 9:genes9040214. [PMID: 29659545 PMCID: PMC5924556 DOI: 10.3390/genes9040214] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2018] [Revised: 03/19/2018] [Accepted: 04/13/2018] [Indexed: 01/08/2023] Open
Abstract
The worldwide commercial cultivation of transgenic crops, including glyphosate-tolerant (GT) soybeans, has increased widely during the past 20 years. However, it is accompanied with a growing concern about potential effects of transgenic crops on the soil microbial communities, especially on rhizosphere bacterial communities. Our previous study found that the GT soybean line NZL06-698 (N698) significantly affected rhizosphere bacteria, including some unidentified taxa, through 16S rRNA gene (16S rDNA) V4 region amplicon deep sequencing via Illumina MiSeq. In this study, we performed 16S rDNA V5–V7 region amplicon deep sequencing via Illumina MiSeq and shotgun metagenomic approaches to identify those major taxa. Results of these processes revealed that the species richness and evenness increased in the rhizosphere bacterial communities of N698, the beta diversity of the rhizosphere bacterial communities of N698 was affected, and that certain dominant bacterial phyla and genera were related to N698 compared with its control cultivar Mengdou12. Consistent with our previous findings, this study showed that N698 affects the rhizosphere bacterial communities. In specific, N698 negatively affects Rahnella, Janthinobacterium, Stenotrophomonas, Sphingomonas and Luteibacter while positively affecting Arthrobacter, Bradyrhizobium, Ramlibacter and Nitrospira.
Collapse
|
40
|
Karpinets TV, Gopalakrishnan V, Wargo J, Futreal AP, Schadt CW, Zhang J. Linking Associations of Rare Low-Abundance Species to Their Environments by Association Networks. Front Microbiol 2018; 9:297. [PMID: 29563898 PMCID: PMC5850922 DOI: 10.3389/fmicb.2018.00297] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2017] [Accepted: 02/08/2018] [Indexed: 01/07/2023] Open
Abstract
Studies of microbial communities by targeted sequencing of rRNA genes lead to recovering numerous rare low-abundance taxa with unknown biological roles. We propose to study associations of such rare organisms with their environments by a computational framework based on transformation of the data into qualitative variables. Namely, we analyze the sparse table of putative species or OTUs (operational taxonomic units) and samples generated in such studies, also known as an OTU table, by collecting statistics on co-occurrences of the species and on shared species richness across samples. Based on the statistics we built two association networks, of the rare putative species and of the samples respectively, using a known computational technique, Association networks (Anets) developed for analysis of qualitative data. Clusters of samples and clusters of OTUs are then integrated and combined with metadata of the study to produce a map of associated putative species in their environments. We tested and validated the framework on two types of microbiomes, of human body sites and that of the Populus tree root systems. We show that in both studies the associations of OTUs can separate samples according to environmental or physiological characteristics of the studied systems.
Collapse
Affiliation(s)
- Tatiana V Karpinets
- Department of Genomic Medicine, The University of Texas MD Anderson Cancer Center, Houston, TX, United States.,Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, United States
| | - Vancheswaran Gopalakrishnan
- Department of Surgical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, United States.,Department of Epidemiology, Human Genetics and Environmental Sciences, University of Texas School of Public Health, Dallas, TX, United States
| | - Jennifer Wargo
- Department of Genomic Medicine, The University of Texas MD Anderson Cancer Center, Houston, TX, United States.,Department of Surgical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, United States
| | - Andrew P Futreal
- Department of Genomic Medicine, The University of Texas MD Anderson Cancer Center, Houston, TX, United States
| | - Christopher W Schadt
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, United States.,Department of Microbiology, University of Tennessee, Knoxville, Knoxville, TN, United States
| | - Jianhua Zhang
- Department of Genomic Medicine, The University of Texas MD Anderson Cancer Center, Houston, TX, United States
| |
Collapse
|
41
|
Tracanna V, de Jong A, Medema MH, Kuipers OP. Mining prokaryotes for antimicrobial compounds: from diversity to function. FEMS Microbiol Rev 2018; 41:417-429. [PMID: 28402441 DOI: 10.1093/femsre/fux014] [Citation(s) in RCA: 72] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2016] [Accepted: 03/02/2017] [Indexed: 01/03/2023] Open
Abstract
The bacterial kingdom provides a major source of antimicrobials that can either be directly applied or used as scaffolds to further improve their functionality in the host. The rapidly increasing amount of bacterial genomic, metabolomic and transcriptomic data offers unique opportunities to apply a variety of approaches to mine for existing and novel antimicrobials. Here, we discuss several powerful mining approaches to identify novel molecules with antimicrobial activity across structurally diverse natural products, including ribosomally synthesized and posttranslationally modified peptides, nonribosomal peptides and polyketides. We not only discuss the direct mining of genomes based on identification of biosynthetic gene clusters, but also describe more advanced and integrative approaches in ecology-based mining, functionality-based mining and mode-of-action-based mining. These efforts are likely to accelerate the discovery and development of novel antimicrobial drugs.
Collapse
Affiliation(s)
- Vittorio Tracanna
- Bioinformatics Group, Wageningen University, Droevendaalsesteeg 1, Radix West, Building 107, 6708 PB Wageningen, The Netherlands
| | - Anne de Jong
- Molecular Genetics, University of Groningen, Nijenborgh 7, 9726AG Groningen, The Netherlands
| | - Marnix H Medema
- Bioinformatics Group, Wageningen University, Droevendaalsesteeg 1, Radix West, Building 107, 6708 PB Wageningen, The Netherlands
| | - Oscar P Kuipers
- Molecular Genetics, University of Groningen, Nijenborgh 7, 9726AG Groningen, The Netherlands
| |
Collapse
|
42
|
Swenson TL, Karaoz U, Swenson JM, Bowen BP, Northen TR. Linking soil biology and chemistry in biological soil crust using isolate exometabolomics. Nat Commun 2018; 9:19. [PMID: 29296020 PMCID: PMC5750228 DOI: 10.1038/s41467-017-02356-9] [Citation(s) in RCA: 85] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2017] [Accepted: 11/21/2017] [Indexed: 11/15/2022] Open
Abstract
Metagenomic sequencing provides a window into microbial community structure and metabolic potential; however, linking these data to exogenous metabolites that microorganisms process and produce (the exometabolome) remains challenging. Previously, we observed strong exometabolite niche partitioning among bacterial isolates from biological soil crust (biocrust). Here we examine native biocrust to determine if these patterns are reproduced in the environment. Overall, most soil metabolites display the expected relationship (positive or negative correlation) with four dominant bacteria following a wetting event and across biocrust developmental stages. For metabolites that were previously found to be consumed by an isolate, 70% are negatively correlated with the abundance of the isolate’s closest matching environmental relative in situ, whereas for released metabolites, 67% were positively correlated. Our results demonstrate that metabolite profiling, shotgun sequencing and exometabolomics may be successfully integrated to functionally link microbial community structure with environmental chemistry in biocrust. Metagenomic sequencing provides a window into microbial community structure and metabolic potential. Here, Swenson et al. integrate metabolomics and shotgun sequencing to functionally link microbial community structure with environmental chemistry in biological soil crust (biocrust).
Collapse
Affiliation(s)
- Tami L Swenson
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA
| | - Ulas Karaoz
- Climate and Ecosystems Sciences Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA
| | - Joel M Swenson
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA
| | - Benjamin P Bowen
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA.,DOE Joint Genome Institute, 2800 Mitchell Dr., Walnut Creek, CA, 94598, USA
| | - Trent R Northen
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, Berkeley, CA, 94720, USA. .,DOE Joint Genome Institute, 2800 Mitchell Dr., Walnut Creek, CA, 94598, USA.
| |
Collapse
|
43
|
Peering into the Genetic Makeup of Natural Microbial Populations Using Metagenomics. POPULATION GENOMICS: MICROORGANISMS 2018. [DOI: 10.1007/13836_2018_14] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
|
44
|
Dudek NK, Sun CL, Burstein D, Kantor RS, Aliaga Goltsman DS, Bik EM, Thomas BC, Banfield JF, Relman DA. Novel Microbial Diversity and Functional Potential in the Marine Mammal Oral Microbiome. Curr Biol 2017; 27:3752-3762.e6. [PMID: 29153320 DOI: 10.1016/j.cub.2017.10.040] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2017] [Revised: 09/13/2017] [Accepted: 10/13/2017] [Indexed: 12/28/2022]
Abstract
The vast majority of bacterial diversity lies within phylum-level lineages called "candidate phyla," which lack isolated representatives and are poorly understood. These bacteria are surprisingly abundant in the oral cavity of marine mammals. We employed a genome-resolved metagenomic approach to recover and characterize genomes and functional potential from microbes in the oral gingival sulcus of two bottlenose dolphins (Tursiops truncatus). We detected organisms from 24 known bacterial phyla and one archaeal phylum. We also recovered genomes from two deep-branching, previously uncharacterized phylum-level lineages (here named "Candidatus Delphibacteria" and "Candidatus Fertabacteria"). The Delphibacteria lineage is found in both managed and wild dolphins; its metabolic profile suggests a capacity for denitrification and a possible role in dolphin health. We uncovered a rich diversity of predicted Cas9 proteins, including the two longest predicted Cas9 proteins to date. Notably, we identified the first type II CRISPR-Cas systems encoded by members of the Candidate Phyla Radiation. Using their spacer sequences, we subsequently identified and assembled a complete Saccharibacteria phage genome. These findings underscore the immense microbial diversity and functional potential that await discovery in previously unexplored environments.
Collapse
Affiliation(s)
- Natasha K Dudek
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Christine L Sun
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA 94305, USA; Department of Medicine, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - David Burstein
- Department of Earth and Planetary Science, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Rose S Kantor
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Daniela S Aliaga Goltsman
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA 94305, USA; Department of Medicine, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Elisabeth M Bik
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Brian C Thomas
- Department of Earth and Planetary Science, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Jillian F Banfield
- Department of Earth and Planetary Science, University of California, Berkeley, Berkeley, CA 94720, USA; Earth and Environmental Science, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - David A Relman
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA 94305, USA; Department of Medicine, Stanford University School of Medicine, Stanford, CA 94305, USA; Veterans Affairs Palo Alto Health Care System, Palo Alto, CA 94304, USA.
| |
Collapse
|
45
|
The trajectory of microbial single-cell sequencing. Nat Methods 2017; 14:1045-1054. [DOI: 10.1038/nmeth.4469] [Citation(s) in RCA: 88] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2016] [Accepted: 08/04/2017] [Indexed: 12/21/2022]
|
46
|
Shotgun metagenomics, from sampling to analysis. Nat Biotechnol 2017; 35:833-844. [PMID: 28898207 DOI: 10.1038/nbt.3935] [Citation(s) in RCA: 949] [Impact Index Per Article: 118.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2015] [Accepted: 07/12/2017] [Indexed: 02/06/2023]
Abstract
Diverse microbial communities of bacteria, archaea, viruses and single-celled eukaryotes have crucial roles in the environment and in human health. However, microbes are frequently difficult to culture in the laboratory, which can confound cataloging of members and understanding of how communities function. High-throughput sequencing technologies and a suite of computational pipelines have been combined into shotgun metagenomics methods that have transformed microbiology. Still, computational approaches to overcome the challenges that affect both assembly-based and mapping-based metagenomic profiling, particularly of high-complexity samples or environments containing organisms with limited similarity to sequenced genomes, are needed. Understanding the functions and characterizing specific strains of these communities offers biotechnological promise in therapeutic discovery and innovative ways to synthesize products using microbial factories and can pinpoint the contributions of microorganisms to planetary, animal and human health.
Collapse
|
47
|
Roux S, Emerson JB, Eloe-Fadrosh EA, Sullivan MB. Benchmarking viromics: an in silico evaluation of metagenome-enabled estimates of viral community composition and diversity. PeerJ 2017; 5:e3817. [PMID: 28948103 PMCID: PMC5610896 DOI: 10.7717/peerj.3817] [Citation(s) in RCA: 185] [Impact Index Per Article: 23.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2017] [Accepted: 08/26/2017] [Indexed: 12/20/2022] Open
Abstract
Background Viral metagenomics (viromics) is increasingly used to obtain uncultivated viral genomes, evaluate community diversity, and assess ecological hypotheses. While viromic experimental methods are relatively mature and widely accepted by the research community, robust bioinformatics standards remain to be established. Here we used in silico mock viral communities to evaluate the viromic sequence-to-ecological-inference pipeline, including (i) read pre-processing and metagenome assembly, (ii) thresholds applied to estimate viral relative abundances based on read mapping to assembled contigs, and (iii) normalization methods applied to the matrix of viral relative abundances for alpha and beta diversity estimates. Results Tools specifically designed for metagenomes, specifically metaSPAdes, MEGAHIT, and IDBA-UD, were the most effective at assembling viromes. Read pre-processing, such as partitioning, had virtually no impact on assembly output, but may be useful when hardware is limited. Viral populations with 2–5 × coverage typically assembled well, whereas lesser coverage led to fragmented assembly. Strain heterogeneity within populations hampered assembly, especially when strains were closely related (average nucleotide identity, or ANI ≥97%) and when the most abundant strain represented <50% of the population. Viral community composition assessments based on read recruitment were generally accurate when the following thresholds for detection were applied: (i) ≥10 kb contig lengths to define populations, (ii) coverage defined from reads mapping at ≥90% identity, and (iii) ≥75% of contig length with ≥1 × coverage. Finally, although data are limited to the most abundant viruses in a community, alpha and beta diversity patterns were robustly estimated (±10%) when comparing samples of similar sequencing depth, but more divergent (up to 80%) when sequencing depth was uneven across the dataset. In the latter cases, the use of normalization methods specifically developed for metagenomes provided the best estimates. Conclusions These simulations provide benchmarks for selecting analysis cut-offs and establish that an optimized sample-to-ecological-inference viromics pipeline is robust for making ecological inferences from natural viral communities. Continued development to better accessing RNA, rare, and/or diverse viral populations and improved reference viral genome availability will alleviate many of viromics remaining limitations.
Collapse
Affiliation(s)
- Simon Roux
- Department of Microbiology, Ohio State University, Columbus, OH, United States of America
| | - Joanne B Emerson
- Department of Microbiology, Ohio State University, Columbus, OH, United States of America
| | - Emiley A Eloe-Fadrosh
- Joint Genome Institute, Department of Energy, Walnut Creek, CA, United States of America
| | - Matthew B Sullivan
- Department of Microbiology, Ohio State University, Columbus, OH, United States of America.,Department of Civil, Environmental and Geodetic Engineering, Ohio State University, Columbus, OH, United States of America
| |
Collapse
|
48
|
Abstract
MOTIVATION Despite rapid progress in sequencing technology, assembling de novo the genomes of new species as well as reconstructing complex metagenomes remains major technological challenges. New synthetic long read (SLR) technologies promise significant advances towards these goals; however, their applicability is limited by high sequencing requirements and the inability of current assembly paradigms to cope with combinations of short and long reads. RESULTS Here, we introduce Architect, a new de novo scaffolder aimed at SLR technologies. Unlike previous assembly strategies, Architect does not require a costly subassembly step; instead it assembles genomes directly from the SLR's underlying short reads, which we refer to as read clouds This enables a 4- to 20-fold reduction in sequencing requirements and a 5-fold increase in assembly contiguity on both genomic and metagenomic datasets relative to state-of-the-art assembly strategies aimed directly at fully subassembled long reads. AVAILABILITY AND IMPLEMENTATION Our source code is freely available at https://github.com/kuleshov/architect CONTACT kuleshov@stanford.edu.
Collapse
Affiliation(s)
- Volodymyr Kuleshov
- Department of Computer Science, Stanford University Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Michael P Snyder
- Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA
| | | |
Collapse
|
49
|
|
50
|
Interpreting Microbial Biosynthesis in the Genomic Age: Biological and Practical Considerations. Mar Drugs 2017; 15:md15060165. [PMID: 28587290 PMCID: PMC5484115 DOI: 10.3390/md15060165] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2017] [Revised: 05/22/2017] [Accepted: 05/31/2017] [Indexed: 02/06/2023] Open
Abstract
Genome mining has become an increasingly powerful, scalable, and economically accessible tool for the study of natural product biosynthesis and drug discovery. However, there remain important biological and practical problems that can complicate or obscure biosynthetic analysis in genomic and metagenomic sequencing projects. Here, we focus on limitations of available technology as well as computational and experimental strategies to overcome them. We review the unique challenges and approaches in the study of symbiotic and uncultured systems, as well as those associated with biosynthetic gene cluster (BGC) assembly and product prediction. Finally, to explore sequencing parameters that affect the recovery and contiguity of large and repetitive BGCs assembled de novo, we simulate Illumina and PacBio sequencing of the Salinispora tropica genome focusing on assembly of the salinilactam (slm) BGC.
Collapse
|