1
|
Miliotis G, Sengupta P, Hameed A, Chuvochina M, McDonagh F, Simpson AC, Parker CW, Singh NK, Rekha PD, Morris D, Raman K, Kyrpides NC, Hugenholtz P, Venkateswaran K. Novel spore-forming species exhibiting intrinsic resistance to third- and fourth-generation cephalosporins and description of Tigheibacillus jepli gen. nov., sp. nov. mBio 2024; 15:e0018124. [PMID: 38477597 PMCID: PMC11005411 DOI: 10.1128/mbio.00181-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Accepted: 01/29/2024] [Indexed: 03/14/2024] Open
Abstract
A comprehensive microbial surveillance was conducted at NASA's Mars 2020 spacecraft assembly facility (SAF), where whole-genome sequencing (WGS) of 110 bacterial strains was performed. One isolate, designated 179-BFC-A-HST, exhibited less than 80% average nucleotide identity (ANI) to known species, suggesting a novel organism. This strain demonstrated high-level resistance [minimum inhibitory concentration (MIC) >256 mg/L] to third-generation cephalosporins, including ceftazidime, cefpodoxime, combination ceftazidime/avibactam, and the fourth-generation cephalosporin cefepime. The results of a comparative genomic analysis revealed that 179-BFC-A-HST is most closely related to Virgibacillus halophilus 5B73CT, sharing an ANI of 78.7% and a digital DNA-DNA hybridization (dDDH) value of 23.5%, while their 16S rRNA gene sequences shared 97.7% nucleotide identity. Based on these results and the recent recognition that the genus Virgibacillus is polyphyletic, strain 179-BFC-A-HST is proposed as a novel species of a novel genus, Tigheibacillus jepli gen. nov., sp. nov (type strain 179-BFC-A-HST = DSM 115946T = NRRL B-65666T), and its closest neighbor, V. halophilus, is proposed to be reassigned to this genus as Tigheibacillus halophilus comb. nov. (type strain 5B73CT = DSM 21623T = JCM 21758T = KCTC 13935T). It was also necessary to reclassify its second closest neighbor Virgibacillus soli, as a member of a novel genus Paracerasibacillus, reflecting its phylogenetic position relative to the genus Cerasibacillus, for which we propose Paracerasibacillus soli comb. nov. (type strain CC-YMP-6T = DSM 22952T = CCM 7714T). Within Amphibacillaceae (n = 64), P. soli exhibited 11 antibiotic resistance genes (ARG), while T. jepli encoded for 3, lacking any known β-lactamases, suggesting resistance from variant penicillin-binding proteins, disrupting cephalosporin efficacy. P. soli was highly resistant to azithromycin (MIC >64 mg/L) yet susceptible to cephalosporins and penicillins. IMPORTANCE The significance of this research extends to understanding microbial survival and adaptation in oligotrophic environments, such as those found in SAF. Whole-genome sequencing of several strains isolated from Mars 2020 mission assembly cleanroom facilities, including the discovery of the novel species Tigheibacillus jepli, highlights the resilience and antimicrobial resistance (AMR) in clinically relevant antibiotic classes of microbes in nutrient-scarce settings. The study also redefines the taxonomic classifications within the Amphibacillaceae family, aligning genetic identities with phylogenetic data. Investigating ARG and virulence factors (VF) across these strains illuminates the microbial capability for resistance under resource-limited conditions while emphasizing the role of human-associated VF in microbial survival, informing sterilization practices and microbial management in similar oligotrophic settings beyond spacecraft assembly cleanrooms such as pharmaceutical and medical industry cleanrooms.
Collapse
Affiliation(s)
- Georgios Miliotis
- Antimicrobial Resistance and Microbial Ecology Group, School of Medicine, University of Galway, Galway, Ireland
- Centre for One Health, Ryan Institute, University of Galway, Galway, Ireland
| | - Pratyay Sengupta
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, Tamil Nadu, India
- Center for Integrative Biology and Systems mEdicine (IBSE), Indian Institute of Technology Madras, Chennai, Tamil Nadu, India
- Robert Bosch Centre for Data Science and Artificial Intelligence (RBCDSAI), Indian Institute of Technology Madras, Chennai, Tamil Nadu, India
| | - Asif Hameed
- Division of Microbiology and Biotechnology, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, Karnataka, India
| | - Maria Chuvochina
- The University of Queensland, School of Chemistry and Molecular Biosciences, Australian Centre for Ecogenomics, Brisbane, Australia
| | - Francesca McDonagh
- Antimicrobial Resistance and Microbial Ecology Group, School of Medicine, University of Galway, Galway, Ireland
| | - Anna C. Simpson
- Biotechnology and Planetary Protection Group, Jet Propulsion Laboratory, California Institute of Technology, Pasadena, California, USA
| | - Ceth W. Parker
- Biotechnology and Planetary Protection Group, Jet Propulsion Laboratory, California Institute of Technology, Pasadena, California, USA
| | - Nitin K. Singh
- Biotechnology and Planetary Protection Group, Jet Propulsion Laboratory, California Institute of Technology, Pasadena, California, USA
| | - Punchappady D. Rekha
- Division of Microbiology and Biotechnology, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, Karnataka, India
| | - Dearbháile Morris
- Antimicrobial Resistance and Microbial Ecology Group, School of Medicine, University of Galway, Galway, Ireland
- Centre for One Health, Ryan Institute, University of Galway, Galway, Ireland
| | - Karthik Raman
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, Tamil Nadu, India
- Center for Integrative Biology and Systems mEdicine (IBSE), Indian Institute of Technology Madras, Chennai, Tamil Nadu, India
- Robert Bosch Centre for Data Science and Artificial Intelligence (RBCDSAI), Indian Institute of Technology Madras, Chennai, Tamil Nadu, India
| | - Nikos C. Kyrpides
- US Department of Energy Joint Genome Institute, Berkeley, California, USA
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Philip Hugenholtz
- The University of Queensland, School of Chemistry and Molecular Biosciences, Australian Centre for Ecogenomics, Brisbane, Australia
| | - Kasthuri Venkateswaran
- Biotechnology and Planetary Protection Group, Jet Propulsion Laboratory, California Institute of Technology, Pasadena, California, USA
| |
Collapse
|
2
|
Nguyen TM, Pombubpa N, Huntemann M, Clum A, Foster B, Foster B, Roux S, Palaniappan K, Varghese N, Mukherjee S, Reddy TBK, Daum C, Copeland A, Chen IMA, Ivanova NN, Kyrpides NC, Harmon-Smith M, Eloe-Fadrosh EA, Pietrasiak N, Stajich JE, Hom EFY. Whole community shotgun metagenomes of two biological soil crust types from the Mojave Desert. Microbiol Resour Announc 2024; 13:e0098023. [PMID: 38329355 DOI: 10.1128/mra.00980-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Accepted: 01/23/2024] [Indexed: 02/09/2024] Open
Abstract
We present six whole community shotgun metagenomic sequencing data sets of two types of biological soil crusts sampled at the ecotone of the Mojave Desert and Colorado Desert in California. These data will help us understand the diversity and function of biocrust microbial communities, which are essential for desert ecosystems.
Collapse
Affiliation(s)
- Thuy M Nguyen
- Department of Biology and Center for Biodiversity and Conservation Research, University of Mississippi, University, Mississippi, USA
| | - Nuttapon Pombubpa
- Department of Microbiology and Plant Pathology, University of California, Riverside, California, USA
| | - Marcel Huntemann
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Alicia Clum
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Brian Foster
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Bryce Foster
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Simon Roux
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Krishnaveni Palaniappan
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Neha Varghese
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Supratim Mukherjee
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - T B K Reddy
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Chris Daum
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Alex Copeland
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - I-Min A Chen
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Natalia N Ivanova
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Nikos C Kyrpides
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Miranda Harmon-Smith
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Emiley A Eloe-Fadrosh
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Nicole Pietrasiak
- School of Life Sciences, University of Nevada-Las Vegas, Las Vegas, Nevada, USA
| | - Jason E Stajich
- Department of Microbiology and Plant Pathology, University of California, Riverside, California, USA
| | - Erik F Y Hom
- Department of Biology and Center for Biodiversity and Conservation Research, University of Mississippi, University, Mississippi, USA
| |
Collapse
|
3
|
Nguyen TM, Pombubpa N, Huntemann M, Clum A, Foster B, Foster B, Roux S, Palaniappan K, Varghese N, Mukherjee S, Reddy TBK, Daum C, Copeland A, Chen IMA, Ivanova NN, Kyrpides NC, Harmon-Smith M, Eloe-Fadrosh EA, Pietrasiak N, Stajich JE, Hom EFY. Metatranscriptomes of two biological soil crust types from the Mojave desert in response to wetting. Microbiol Resour Announc 2024; 13:e0108023. [PMID: 38189307 PMCID: PMC10868201 DOI: 10.1128/mra.01080-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Accepted: 12/13/2023] [Indexed: 01/09/2024] Open
Abstract
We present eight metatranscriptomic datasets of light algal and cyanolichen biological soil crusts from the Mojave Desert in response to wetting. These data will help us understand gene expression patterns in desert biocrust microbial communities after they have been reactivated by the addition of water.
Collapse
Affiliation(s)
- Thuy M. Nguyen
- Department of Biology and Center for Biodiversity and Conservation Research, University, University of Mississippi, Mississippi, USA
| | - Nuttapon Pombubpa
- Department of Microbiology and Plant Pathology, University of California, Riverside, California, USA
| | - Marcel Huntemann
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Alicia Clum
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Brian Foster
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Bryce Foster
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Simon Roux
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Krishnaveni Palaniappan
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Neha Varghese
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Supratim Mukherjee
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - T. B. K. Reddy
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Chris Daum
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Alex Copeland
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - I-Min A. Chen
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Natalia N. Ivanova
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Nikos C. Kyrpides
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Miranda Harmon-Smith
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Emiley A. Eloe-Fadrosh
- US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Nicole Pietrasiak
- School of Life Sciences, University of Nevada-Las Vegas, Las Vegas, Nevada, USA
| | - Jason E. Stajich
- Department of Microbiology and Plant Pathology, University of California, Riverside, California, USA
| | - Erik F. Y. Hom
- Department of Biology and Center for Biodiversity and Conservation Research, University, University of Mississippi, Mississippi, USA
| |
Collapse
|
4
|
Toumi M, Whitman WB, Kyrpides NC, Woyke T, Wolf J, Neumann-Schaal M, Abbaszade G, Károly B, Tóth E. Antiquaquibacter oligotrophicus gen. nov., sp. nov., a novel oligotrophic bacterium from groundwater. Int J Syst Evol Microbiol 2023; 73. [PMID: 38108591 DOI: 10.1099/ijsem.0.006205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2023] Open
Abstract
In this study, a Gram-stain-positive, non-motile, oxidase- and catalase-negative, rod-shaped, bacterial strain (SG_E_30_P1T) that formed light yellow colonies was isolated from a groundwater sample of Sztaravoda spring, Hungary. Based on 16S rRNA phylogenetic and phylogenomic analyses, the strain was found to form a distinct linage within the family Microbacteriaceae. Its closest relatives in terms of near full-length 16S rRNA gene sequences are Salinibacterium hongtaonis MH299814 (97.72 % sequence similarity) and Leifsonia psychrotolerans GQ406810 (97.57 %). The novel strain grows optimally at 20-28 °C, at neutral pH and in the presence of NaCl (1-2 w/v%). Strain SG_E_30_P1T contains MK-7 and B-type peptidoglycan with diaminobutyrate as the diagnostic amino acid. The major cellular fatty acids are anteiso-C15 : 0, iso-C16 : 0 and iso-C14 : 0, and the polar lipid profile is composed of diphosphatidylglycerol and phosphatidylglycerol, as well as an unidentified aminoglycolipid, aminophospholipid and some unidentified phospholipids. The assembled draft genome is a contig with a total length of 2 897 968 bp and a DNA G+C content of 65.5 mol%. Amino acid identity values with it closest relatives with sequenced genomes of <62.54 %, as well as other genome distance results, indicate that this bacterium represents a novel genus within the family Microbacteriaceae. We suggest that SG_E_30_P1T (=DSM 111415T=NCAIM B.02656T) represents the type strain of a novel genus and species for which the name Antiquaquibacter oligotrophicus gen. nov., sp. nov. is proposed.
Collapse
Affiliation(s)
- Marwene Toumi
- Department of Microbiology, Faculty of Science, Eötvös Loránd University, Pázmány Péter stny, 1/C, H-1117 Budapest, Hungary
| | - William B Whitman
- Department of Microbiology, University of Georgia, Athens, GA 30602, USA
| | - Nikos C Kyrpides
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Tanja Woyke
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Jacqueline Wolf
- Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures, Inhoffenstrasse 7B, D-38124, Braunschweig, Germany
| | - Meina Neumann-Schaal
- Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures, Inhoffenstrasse 7B, D-38124, Braunschweig, Germany
| | - Gorkhmaz Abbaszade
- Department of Microbiology, Faculty of Science, Eötvös Loránd University, Pázmány Péter stny, 1/C, H-1117 Budapest, Hungary
| | - Bóka Károly
- Department of plant anatomy, Faculty of Science, Eötvös Loránd University, Pázmány Péter stny, 1/C, H-1117 Budapest, Hungary
| | - Erika Tóth
- Department of Microbiology, Faculty of Science, Eötvös Loránd University, Pázmány Péter stny, 1/C, H-1117 Budapest, Hungary
| |
Collapse
|
5
|
Fremin BJ, Bhatt AS, Kyrpides NC. Identification of over ten thousand candidate structured RNAs in viruses and phages. Comput Struct Biotechnol J 2023; 21:5630-5639. [PMID: 38047235 PMCID: PMC10690425 DOI: 10.1016/j.csbj.2023.11.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Revised: 11/03/2023] [Accepted: 11/03/2023] [Indexed: 12/05/2023] Open
Abstract
Structured RNAs play crucial roles in viruses, exerting influence over both viral and host gene expression. However, the extensive diversity of structured RNAs and their ability to act in cis or trans positions pose challenges for predicting and assigning their functions. While comparative genomics approaches have successfully predicted candidate structured RNAs in microbes on a large scale, similar efforts for viruses have been lacking. In this study, we screened over 5 million DNA and RNA viral sequences, resulting in the prediction of 10,006 novel candidate structured RNAs. These predictions are widely distributed across taxonomy and ecosystem. We found transcriptional evidence for 206 of these candidate structured RNAs in the human fecal microbiome. These candidate RNAs exhibited evidence of nucleotide covariation, indicative of selective pressure maintaining the predicted secondary structures. Our analysis revealed a diverse repertoire of candidate structured RNAs, encompassing a substantial number of putative tRNAs or tRNA-like structures, Rho-independent transcription terminators, and potentially cis-regulatory structures consistently positioned upstream of genes. In summary, our findings shed light on the extensive diversity of structured RNAs in viruses, offering a valuable resource for further investigations into their functional roles and implications in viral gene expression and pave the way for a deeper understanding of the intricate interplay between viruses and their hosts at the molecular level.
Collapse
Affiliation(s)
- Brayon J. Fremin
- Department of Energy, Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Ami S. Bhatt
- Blood and Marrow Transplantation) and Genetics, Stanford University, Stanford, CA, USA
- Department of Medicine (Hematology, Stanford University, Stanford, CA, USA
- Department of Genetics, Stanford University, Stanford, CA, USA
| | - Nikos C. Kyrpides
- Department of Energy, Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Lead Contact, USA
| |
Collapse
|
6
|
Pavlopoulos GA, Baltoumas FA, Liu S, Selvitopi O, Camargo AP, Nayfach S, Azad A, Roux S, Call L, Ivanova NN, Chen IM, Paez-Espino D, Karatzas E, Iliopoulos I, Konstantinidis K, Tiedje JM, Pett-Ridge J, Baker D, Visel A, Ouzounis CA, Ovchinnikov S, Buluç A, Kyrpides NC. Unraveling the functional dark matter through global metagenomics. Nature 2023; 622:594-602. [PMID: 37821698 PMCID: PMC10584684 DOI: 10.1038/s41586-023-06583-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Accepted: 08/30/2023] [Indexed: 10/13/2023]
Abstract
Metagenomes encode an enormous diversity of proteins, reflecting a multiplicity of functions and activities1,2. Exploration of this vast sequence space has been limited to a comparative analysis against reference microbial genomes and protein families derived from those genomes. Here, to examine the scale of yet untapped functional diversity beyond what is currently possible through the lens of reference genomes, we develop a computational approach to generate reference-free protein families from the sequence space in metagenomes. We analyse 26,931 metagenomes and identify 1.17 billion protein sequences longer than 35 amino acids with no similarity to any sequences from 102,491 reference genomes or the Pfam database3. Using massively parallel graph-based clustering, we group these proteins into 106,198 novel sequence clusters with more than 100 members, doubling the number of protein families obtained from the reference genomes clustered using the same approach. We annotate these families on the basis of their taxonomic, habitat, geographical and gene neighbourhood distributions and, where sufficient sequence diversity is available, predict protein three-dimensional models, revealing novel structures. Overall, our results uncover an enormously diverse functional space, highlighting the importance of further exploring the microbial functional dark matter.
Collapse
Affiliation(s)
- Georgios A Pavlopoulos
- Institute for Fundamental Biomedical Research, Biomedical Science Research Center Alexander Fleming, Vari, Greece.
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
- Center for New Biotechnologies and Precision Medicine, School of Medicine, National and Kapodistrian University of Athens, Athens, Greece.
| | - Fotis A Baltoumas
- Institute for Fundamental Biomedical Research, Biomedical Science Research Center Alexander Fleming, Vari, Greece
| | - Sirui Liu
- John Harvard Distinguished Science Fellowship Program, Harvard University, Cambridge, MA, USA
| | - Oguz Selvitopi
- Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Antonio Pedro Camargo
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Stephen Nayfach
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Ariful Azad
- Luddy School of Informatics, Computing and Engineering, Indiana University Bloomington, Bloomington, IN, USA
| | - Simon Roux
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Lee Call
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Natalia N Ivanova
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - I Min Chen
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - David Paez-Espino
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Evangelos Karatzas
- Institute for Fundamental Biomedical Research, Biomedical Science Research Center Alexander Fleming, Vari, Greece
| | - Ioannis Iliopoulos
- Department of Basic Sciences, School of Medicine, University of Crete, Heraklion, Greece
| | | | - James M Tiedje
- Center for Microbial Ecology, Michigan State University, East Lansing, MI, USA
| | - Jennifer Pett-Ridge
- Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, CA, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Axel Visel
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Christos A Ouzounis
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Biological Computation & Process Laboratory, Chemical Process & Energy Resources Institute, Centre for Research & Technology Hellas, Thessalonica, Greece
- Biological Computation & Computational Biology Group, Artificial Intelligence & Information Analysis Lab, School of Informatics, Aristotle University of Thessalonica, Thessalonica, Greece
| | - Sergey Ovchinnikov
- John Harvard Distinguished Science Fellowship Program, Harvard University, Cambridge, MA, USA
| | - Aydin Buluç
- Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA, USA
| | - Nikos C Kyrpides
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| |
Collapse
|
7
|
Camargo AP, Roux S, Schulz F, Babinski M, Xu Y, Hu B, Chain PSG, Nayfach S, Kyrpides NC. Identification of mobile genetic elements with geNomad. Nat Biotechnol 2023:10.1038/s41587-023-01953-y. [PMID: 37735266 DOI: 10.1038/s41587-023-01953-y] [Citation(s) in RCA: 25] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Accepted: 08/17/2023] [Indexed: 09/23/2023]
Abstract
Identifying and characterizing mobile genetic elements in sequencing data is essential for understanding their diversity, ecology, biotechnological applications and impact on public health. Here we introduce geNomad, a classification and annotation framework that combines information from gene content and a deep neural network to identify sequences of plasmids and viruses. geNomad uses a dataset of more than 200,000 marker protein profiles to provide functional gene annotation and taxonomic assignment of viral genomes. Using a conditional random field model, geNomad also detects proviruses integrated into host genomes with high precision. In benchmarks, geNomad achieved high classification performance for diverse plasmids and viruses (Matthews correlation coefficient of 77.8% and 95.3%, respectively), substantially outperforming other tools. Leveraging geNomad's speed and scalability, we processed over 2.7 trillion base pairs of sequencing data, leading to the discovery of millions of viruses and plasmids that are available through the IMG/VR and IMG/PR databases. geNomad is available at https://portal.nersc.gov/genomad .
Collapse
Affiliation(s)
- Antonio Pedro Camargo
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| | - Simon Roux
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Frederik Schulz
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Michal Babinski
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Yan Xu
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Bin Hu
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Patrick S G Chain
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Stephen Nayfach
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Nikos C Kyrpides
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| |
Collapse
|
8
|
Mukherjee S, Ovchinnikova G, Stamatis D, Li CT, Chen IMA, Kyrpides NC, Reddy TBK. Standardized naming of microbiome samples in Genomes OnLine Database. Database (Oxford) 2023; 2023:7042581. [PMID: 36794865 PMCID: PMC9933444 DOI: 10.1093/database/baad001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Accepted: 01/24/2023] [Indexed: 02/17/2023]
Abstract
The power of next-generation sequencing has resulted in an explosive growth in the number of projects aiming to understand the metagenomic diversity of complex microbial environments. The interdisciplinary nature of this microbiome research community, along with the absence of reporting standards for microbiome data and samples, poses a significant challenge for follow-up studies. Commonly used names of metagenomes and metatranscriptomes in public databases currently lack the essential information necessary to accurately describe and classify the underlying samples, which makes a comparative analysis difficult to conduct and often results in misclassified sequences in data repositories. The Genomes OnLine Database (GOLD) (https:// gold.jgi.doe.gov/) at the Department of Energy Joint Genome Institute has been at the forefront of addressing this challenge by developing a standardized nomenclature system for naming microbiome samples. GOLD, currently in its twenty-fifth anniversary, continues to enrich the research community with hundreds of thousands of metagenomes and metatranscriptomes with well-curated and easy-to-understand names. Through this manuscript, we describe the overall naming process that can be easily adopted by researchers worldwide. Additionally, we propose the use of this naming system as a best practice for the scientific community to facilitate better interoperability and reusability of microbiome data.
Collapse
Affiliation(s)
- Supratim Mukherjee
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Galina Ovchinnikova
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Dimitri Stamatis
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Cindy Tianqing Li
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - I-Min A Chen
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Nikos C Kyrpides
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - T B K Reddy
- *Corresponding author: Tel: +1 408 505 8273;
| |
Collapse
|
9
|
Baltoumas FA, Karatzas E, Paez-Espino D, Venetsianou NK, Aplakidou E, Oulas A, Finn RD, Ovchinnikov S, Pafilis E, Kyrpides NC, Pavlopoulos GA. Exploring microbial functional biodiversity at the protein family level-From metagenomic sequence reads to annotated protein clusters. Front Bioinform 2023; 3:1157956. [PMID: 36959975 PMCID: PMC10029925 DOI: 10.3389/fbinf.2023.1157956] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Accepted: 02/21/2023] [Indexed: 03/06/2023] Open
Abstract
Metagenomics has enabled accessing the genetic repertoire of natural microbial communities. Metagenome shotgun sequencing has become the method of choice for studying and classifying microorganisms from various environments. To this end, several methods have been developed to process and analyze the sequence data from raw reads to end-products such as predicted protein sequences or families. In this article, we provide a thorough review to simplify such processes and discuss the alternative methodologies that can be followed in order to explore biodiversity at the protein family level. We provide details for analysis tools and we comment on their scalability as well as their advantages and disadvantages. Finally, we report the available data repositories and recommend various approaches for protein family annotation related to phylogenetic distribution, structure prediction and metadata enrichment.
Collapse
Affiliation(s)
- Fotis A. Baltoumas
- Institute for Fundamental Biomedical Research, BSRC “Alexander Fleming”, Vari, Greece
- *Correspondence: Fotis A. Baltoumas, ; Nikos C. Kyrpides, ; Georgios A. Pavlopoulos,
| | - Evangelos Karatzas
- Institute for Fundamental Biomedical Research, BSRC “Alexander Fleming”, Vari, Greece
| | - David Paez-Espino
- Lawrence Berkeley National Laboratory, DOE Joint Genome Institute, Berkeley, CA, United States
| | - Nefeli K. Venetsianou
- Institute for Fundamental Biomedical Research, BSRC “Alexander Fleming”, Vari, Greece
| | - Eleni Aplakidou
- Institute for Fundamental Biomedical Research, BSRC “Alexander Fleming”, Vari, Greece
| | - Anastasis Oulas
- The Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus
| | - Robert D. Finn
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridge, United Kingdom
| | - Sergey Ovchinnikov
- John Harvard Distinguished Science Fellowship Program, Harvard University, Cambridge, MA, United States
| | - Evangelos Pafilis
- Institute of Marine Biology, Biotechnology and Aquaculture (IMBBC), Hellenic Centre for Marine Research (HCMR), Heraklion, Greece
| | - Nikos C. Kyrpides
- Lawrence Berkeley National Laboratory, DOE Joint Genome Institute, Berkeley, CA, United States
- *Correspondence: Fotis A. Baltoumas, ; Nikos C. Kyrpides, ; Georgios A. Pavlopoulos,
| | - Georgios A. Pavlopoulos
- Institute for Fundamental Biomedical Research, BSRC “Alexander Fleming”, Vari, Greece
- Center of New Biotechnologies and Precision Medicine, Department of Medicine, School of Health Sciences, National and Kapodistrian University of Athens, Athens, Greece
- Hellenic Army Academy, Vari, Greece
- *Correspondence: Fotis A. Baltoumas, ; Nikos C. Kyrpides, ; Georgios A. Pavlopoulos,
| |
Collapse
|
10
|
Proença DN, Whitman WB, Shapiro N, Woyke T, Kyrpides NC, Morais PV. Faunimonas pinastri gen. nov., sp. nov., an endophyte from a pine tree of the family Pleomorphomonadaceae, class Alphaproteobacteria. Int J Syst Evol Microbiol 2022; 72. [PMID: 36748409 DOI: 10.1099/ijsem.0.005623] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Bacterial strain A52C2T was isolated from the endophytic microbial community of a Pinus pinaster tree trunk and characterized. Strain A52C2T stained Gram-negative and formed rod-shaped cells that grew optimally at 30 °C and at pH 6.0-7.0. The G+C content of the DNA was 65.1 mol %. The respiratory quinone was ubiquinone 10, and the major fatty acids were cyclo-C19:0 ω8c and C18:0, representing 70.1 % of the total fatty acids. Phylogenetic analyses based on the 16S rRNA gene sequences placed strain A52C2T in a distinct lineage within the order Hyphomicrobiales, family Pleomorphomonadaceae. The 16S rRNA gene sequence similarities of A52C2T to that of Mongoliimonas terrestris and Oharaeibacter diazotrophicus were 93.15 and 93.2 %, respectively. The draft genome sequence of strain A52C2T comprises 4 196 045 bases with a 195-fold mapped coverage of the genome. The assembled genome consists of 43 contigs of more than 1 000 bp (N50 contig size was 209 720 bp). The genome encodes 4033 putative coding sequences. The phylogenetic, phenotypic and chemotaxonomic data showed that strain A52C2T (=UCCCB 130T=CECT 8949T=LMG 29042T) represents the type of a novel species and genus, for which we propose the name Faunimonas pinastri gen. nov., sp. nov.
Collapse
Affiliation(s)
- Diogo N Proença
- University of Coimbra, Centre for Mechanical Engineering, Materials and Processes, Department of Life Sciences, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
| | - William B Whitman
- Department of Microbiology, 527 Biological Sciences Building, University of Georgia, Athens, GA 30602-2605, USA
| | - Nicole Shapiro
- DOE Joint Genome Institute, 1 Cyclotron Rd, Berkeley, CA 94720, USA
| | - Tanja Woyke
- DOE Joint Genome Institute, 1 Cyclotron Rd, Berkeley, CA 94720, USA
| | - Nikos C Kyrpides
- DOE Joint Genome Institute, 1 Cyclotron Rd, Berkeley, CA 94720, USA
| | - Paula V Morais
- University of Coimbra, Centre for Mechanical Engineering, Materials and Processes, Department of Life Sciences, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
| |
Collapse
|
11
|
Seshadri R, Roux S, Huber KJ, Wu D, Yu S, Udwary D, Call L, Nayfach S, Hahnke RL, Pukall R, White JR, Varghese NJ, Webb C, Palaniappan K, Reimer LC, Sardà J, Bertsch J, Mukherjee S, Reddy T, Hajek PP, Huntemann M, Chen IMA, Spunde A, Clum A, Shapiro N, Wu ZY, Zhao Z, Zhou Y, Evtushenko L, Thijs S, Stevens V, Eloe-Fadrosh EA, Mouncey NJ, Yoshikuni Y, Whitman WB, Klenk HP, Woyke T, Göker M, Kyrpides NC, Ivanova NN. Expanding the genomic encyclopedia of Actinobacteria with 824 isolate reference genomes. Cell Genom 2022; 2:100213. [PMID: 36778052 PMCID: PMC9903846 DOI: 10.1016/j.xgen.2022.100213] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Revised: 07/19/2022] [Accepted: 10/16/2022] [Indexed: 11/13/2022]
Abstract
The phylum Actinobacteria includes important human pathogens like Mycobacterium tuberculosis and Corynebacterium diphtheriae and renowned producers of secondary metabolites of commercial interest, yet only a small part of its diversity is represented by sequenced genomes. Here, we present 824 actinobacterial isolate genomes in the context of a phylum-wide analysis of 6,700 genomes including public isolates and metagenome-assembled genomes (MAGs). We estimate that only 30%-50% of projected actinobacterial phylogenetic diversity possesses genomic representation via isolates and MAGs. A comparison of gene functions reveals novel determinants of host-microbe interaction as well as environment-specific adaptations such as potential antimicrobial peptides. We identify plasmids and prophages across isolates and uncover extensive prophage diversity structured mainly by host taxonomy. Analysis of >80,000 biosynthetic gene clusters reveals that horizontal gene transfer and gene loss shape secondary metabolite repertoire across taxa. Our observations illustrate the essential role of and need for high-quality isolate genome sequences.
Collapse
Affiliation(s)
- Rekha Seshadri
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA,Corresponding author
| | - Simon Roux
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Katharina J. Huber
- Leibniz Institute DSMZ - German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
| | - Dongying Wu
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Sora Yu
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Dan Udwary
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Lee Call
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Stephen Nayfach
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Richard L. Hahnke
- Leibniz Institute DSMZ - German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
| | - Rüdiger Pukall
- Leibniz Institute DSMZ - German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
| | | | - Neha J. Varghese
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Cody Webb
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | | | - Lorenz C. Reimer
- Leibniz Institute DSMZ - German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
| | - Joaquim Sardà
- Leibniz Institute DSMZ - German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
| | - Jonathon Bertsch
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | | | - T.B.K. Reddy
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Patrick P. Hajek
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Marcel Huntemann
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - I-Min A. Chen
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Alex Spunde
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Alicia Clum
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Nicole Shapiro
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Zong-Yen Wu
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Zhiying Zhao
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Yuguang Zhou
- China General Microbiological Culture Collection Center, Beijing, China
| | - Lyudmila Evtushenko
- Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, All-Russian Collection of Microorganisms (VKM), Pushchino, Russia
| | - Sofie Thijs
- Center for Environmental Sciences, Environmental Biology, Hasselt University, Diepenbeek, Belgium
| | - Vincent Stevens
- Center for Environmental Sciences, Environmental Biology, Hasselt University, Diepenbeek, Belgium
| | - Emiley A. Eloe-Fadrosh
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Nigel J. Mouncey
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Yasuo Yoshikuni
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA,Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA,Center for Advanced Bioenergy and Bioproducts Innovation, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA,Global Institution for Collaborative Research and Education, Hokkaido University, Hokkaido 060-8589, Japan
| | | | - Hans-Peter Klenk
- School of Biology, Newcastle University, Newcastle upon Tyne, UK
| | - Tanja Woyke
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Markus Göker
- Leibniz Institute DSMZ - German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany,Corresponding author
| | - Nikos C. Kyrpides
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Natalia N. Ivanova
- US Department of Energy Joint Genome Institute, Berkeley, CA, USA,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA,Corresponding author
| |
Collapse
|
12
|
Neri U, Wolf YI, Roux S, Camargo AP, Lee B, Kazlauskas D, Chen IM, Ivanova N, Zeigler Allen L, Paez-Espino D, Bryant DA, Bhaya D, Krupovic M, Dolja VV, Kyrpides NC, Koonin EV, Gophna U. Expansion of the global RNA virome reveals diverse clades of bacteriophages. Cell 2022; 185:4023-4037.e18. [PMID: 36174579 DOI: 10.1016/j.cell.2022.08.023] [Citation(s) in RCA: 59] [Impact Index Per Article: 29.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Revised: 05/16/2022] [Accepted: 08/24/2022] [Indexed: 01/26/2023]
Abstract
High-throughput RNA sequencing offers broad opportunities to explore the Earth RNA virome. Mining 5,150 diverse metatranscriptomes uncovered >2.5 million RNA virus contigs. Analysis of >330,000 RNA-dependent RNA polymerases (RdRPs) shows that this expansion corresponds to a 5-fold increase of the known RNA virus diversity. Gene content analysis revealed multiple protein domains previously not found in RNA viruses and implicated in virus-host interactions. Extended RdRP phylogeny supports the monophyly of the five established phyla and reveals two putative additional bacteriophage phyla and numerous putative additional classes and orders. The dramatically expanded phylum Lenarviricota, consisting of bacterial and related eukaryotic viruses, now accounts for a third of the RNA virome. Identification of CRISPR spacer matches and bacteriolytic proteins suggests that subsets of picobirnaviruses and partitiviruses, previously associated with eukaryotes, infect prokaryotic hosts.
Collapse
Affiliation(s)
- Uri Neri
- The Shmunis School of Biomedicine and Cancer Research, Tel Aviv University, Tel Aviv 6997801, Israel.
| | - Yuri I Wolf
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Simon Roux
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Antonio Pedro Camargo
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Benjamin Lee
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; Nuffield Department of Medicine, University of Oxford, Oxford OX3 7BN, UK
| | - Darius Kazlauskas
- Institute of Biotechnology, Life Sciences Center, Vilnius University, Saulėtekio av. 7, Vilnius 10257, Lithuania
| | - I Min Chen
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Natalia Ivanova
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Lisa Zeigler Allen
- Microbial and Environmental Genomics, J. Craig Venter Institute, La Jolla, CA, USA; Marine Biology Research Division, Scripps Institution of Oceanography, La Jolla, CA, USA
| | - David Paez-Espino
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Donald A Bryant
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA 16802, USA
| | - Devaki Bhaya
- Department of Plant Biology, Carnegie Institution for Science, Stanford, CA 94305, USA
| | - Mart Krupovic
- Institut Pasteur, Université Paris Cité, CNRS UMR 6047, Archaeal Virology Unit, 75015 Paris, France
| | - Valerian V Dolja
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331, USA.
| | - Nikos C Kyrpides
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA.
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
| | - Uri Gophna
- The Shmunis School of Biomedicine and Cancer Research, Tel Aviv University, Tel Aviv 6997801, Israel.
| |
Collapse
|
13
|
Keller-Costa T, Kozma L, Silva SG, Toscan R, Gonçalves J, Lago-Lestón A, Kyrpides NC, Nunes da Rocha U, Costa R. Metagenomics-resolved genomics provides novel insights into chitin turnover, metabolic specialization, and niche partitioning in the octocoral microbiome. Microbiome 2022; 10:151. [PMID: 36138466 PMCID: PMC9502895 DOI: 10.1186/s40168-022-01343-7] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Accepted: 08/03/2022] [Indexed: 05/31/2023]
Abstract
BACKGROUND The role of bacterial symbionts that populate octocorals (Cnidaria, Octocorallia) is still poorly understood. To shed light on their metabolic capacities, we examined 66 high-quality metagenome-assembled genomes (MAGs) spanning 30 prokaryotic species, retrieved from microbial metagenomes of three octocoral species and seawater. RESULTS Symbionts of healthy octocorals were affiliated with the taxa Endozoicomonadaceae, Candidatus Thioglobaceae, Metamycoplasmataceae, unclassified Pseudomonadales, Rhodobacteraceae, unclassified Alphaproteobacteria and Ca. Rhabdochlamydiaceae. Phylogenomics inference revealed that the Endozoicomonadaceae symbionts uncovered here represent two species of a novel genus unique to temperate octocorals, here denoted Ca. Gorgonimonas eunicellae and Ca. Gorgonimonas leptogorgiae. Their genomes revealed metabolic capacities to thrive under suboxic conditions and high gene copy numbers of serine-threonine protein kinases, type 3-secretion system, type-4 pili, and ankyrin-repeat proteins, suggesting excellent capabilities to colonize, aggregate, and persist inside their host. Contrarily, MAGs obtained from seawater frequently lacked symbiosis-related genes. All Endozoicomonadaceae symbionts harbored endo-chitinase and chitin-binging protein-encoding genes, indicating that they can hydrolyze the most abundant polysaccharide in the oceans. Other symbionts, including Metamycoplasmataceae and Ca. Thioglobaceae, may assimilate the smaller chitin oligosaccharides resulting from chitin breakdown and engage in chitin deacetylation, respectively, suggesting possibilities for substrate cross-feeding and a role for the coral microbiome in overall chitin turnover. We also observed sharp differences in secondary metabolite production potential between symbiotic lineages. Specific Proteobacteria taxa may specialize in chemical defense and guard other symbionts, including Endozoicomonadaceae, which lack such capacity. CONCLUSION This is the first study to recover MAGs from dominant symbionts of octocorals, including those of so-far unculturable Endozoicomonadaceae, Ca. Thioglobaceae and Metamycoplasmataceae symbionts. We identify a thus-far unanticipated, global role for Endozoicomonadaceae symbionts of corals in the processing of chitin, the most abundant natural polysaccharide in the oceans and major component of the natural zoo- and phytoplankton feed of octocorals. We conclude that niche partitioning, metabolic specialization, and adaptation to low oxygen conditions among prokaryotic symbionts likely contribute to the plasticity and adaptability of the octocoral holobiont in changing marine environments. These findings bear implications not only for our understanding of symbiotic relationships in the marine realm but also for the functioning of benthic ecosystems at large. Video Abstract.
Collapse
Affiliation(s)
- Tina Keller-Costa
- Institute for Bioengineering and Biosciences, Instituto Superior Técnico, University of Lisbon, Lisbon, Portugal
- Associate Laboratory i4HB - Institute for Health and Bioeconomy, Instituto Superior Técnico, University of Lisbon, Lisbon, Portugal
| | - Lydia Kozma
- Institute for Bioengineering and Biosciences, Instituto Superior Técnico, University of Lisbon, Lisbon, Portugal
- École Polytechnique Fédérale de Lausanne, Écublens, Switzerland
| | - Sandra G. Silva
- Institute for Bioengineering and Biosciences, Instituto Superior Técnico, University of Lisbon, Lisbon, Portugal
- Associate Laboratory i4HB - Institute for Health and Bioeconomy, Instituto Superior Técnico, University of Lisbon, Lisbon, Portugal
| | - Rodolfo Toscan
- Helmholtz Centre for Environmental Research, Leipzig, Germany
| | - Jorge Gonçalves
- Centro de Ciências Do Mar, Universidade Do Algarve, Faro, Portugal
| | - Asunción Lago-Lestón
- Centro de Investigación Científica Y de Educación Superior de Ensenada, Ensenada, Mexico
| | - Nikos C. Kyrpides
- Department of Energy, Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA USA
| | | | - Rodrigo Costa
- Institute for Bioengineering and Biosciences, Instituto Superior Técnico, University of Lisbon, Lisbon, Portugal
- Associate Laboratory i4HB - Institute for Health and Bioeconomy, Instituto Superior Técnico, University of Lisbon, Lisbon, Portugal
- Centro de Ciências Do Mar, Universidade Do Algarve, Faro, Portugal
| |
Collapse
|
14
|
Chaput G, Ford J, DeDiego L, Narayanan A, Tam WY, Whalen M, Huntemann M, Clum A, Spunde A, Pillay M, Palaniappan K, Varghese N, Mikhailova N, Chen IM, Stamatis D, Reddy TBK, O’Malley R, Daum C, Shapiro N, Ivanova N, Kyrpides NC, Woyke T, Glavina del Rio T, DeAngelis KM. Sodalis ligni Strain 159R Isolated from an Anaerobic Lignin-Degrading Consortium. Microbiol Spectr 2022; 10:e0234621. [PMID: 35579457 PMCID: PMC9241852 DOI: 10.1128/spectrum.02346-21] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2021] [Accepted: 04/19/2022] [Indexed: 11/20/2022] Open
Abstract
Novel bacterial isolates with the capabilities of lignin depolymerization, catabolism, or both, could be pertinent to lignocellulosic biofuel applications. In this study, we aimed to identify anaerobic bacteria that could address the economic challenges faced with microbial-mediated biotechnologies, such as the need for aeration and mixing. Using a consortium seeded from temperate forest soil and enriched under anoxic conditions with organosolv lignin as the sole carbon source, we successfully isolated a novel bacterium, designated 159R. Based on the 16S rRNA gene, the isolate belongs to the genus Sodalis in the family Bruguierivoracaceae. Whole-genome sequencing revealed a genome size of 6.38 Mbp and a GC content of 55 mol%. To resolve the phylogenetic position of 159R, its phylogeny was reconstructed using (i) 16S rRNA genes of its closest relatives, (ii) multilocus sequence analysis (MLSA) of 100 genes, (iii) 49 clusters of orthologous groups (COG) domains, and (iv) 400 conserved proteins. Isolate 159R was closely related to the deadwood associated Sodalis guild rather than the tsetse fly and other insect endosymbiont guilds. Estimated genome-sequence-based digital DNA-DNA hybridization (dDDH), genome percentage of conserved proteins (POCP), and an alignment analysis between 159R and the Sodalis clade species further supported that isolate 159R was part of the Sodalis genus and a strain of Sodalis ligni. We proposed the name Sodalis ligni str. 159R (=DSM 110549 = ATCC TSD-177). IMPORTANCE Currently, in the paper industry, paper mill pulping relies on unsustainable and costly processes to remove lignin from lignocellulosic material. A greener approach is biopulping, which uses microbes and their enzymes to break down lignin. However, there are limitations to biopulping that prevent it from outcompeting other pulping processes, such as requiring constant aeration and mixing. Anaerobic bacteria are a promising alternative source for consolidated depolymerization of lignin and its conversion to valuable by-products. We presented Sodalis ligni str. 159R and its characteristics as another example of potential mechanisms that can be developed for lignocellulosic applications.
Collapse
Affiliation(s)
- Gina Chaput
- Department of Microbiology, University of Massachusetts–Amherst, Amherst, Massachusetts, USA
| | - Jacob Ford
- Department of Microbiology, University of Massachusetts–Amherst, Amherst, Massachusetts, USA
| | - Lani DeDiego
- Department of Microbiology, University of Massachusetts–Amherst, Amherst, Massachusetts, USA
| | - Achala Narayanan
- Department of Microbiology, University of Massachusetts–Amherst, Amherst, Massachusetts, USA
| | - Wing Yin Tam
- Department of Microbiology, University of Massachusetts–Amherst, Amherst, Massachusetts, USA
| | - Meghan Whalen
- Department of Microbiology, University of Massachusetts–Amherst, Amherst, Massachusetts, USA
| | - Marcel Huntemann
- United States Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Alicia Clum
- United States Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Alex Spunde
- United States Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Manoj Pillay
- United States Department of Energy Joint Genome Institute, Berkeley, California, USA
| | | | - Neha Varghese
- United States Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Natalia Mikhailova
- United States Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - I-Min Chen
- United States Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Dimitrios Stamatis
- United States Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - T. B. K Reddy
- United States Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Ronan O’Malley
- United States Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Chris Daum
- United States Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Nicole Shapiro
- United States Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Natalia Ivanova
- United States Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Nikos C. Kyrpides
- United States Department of Energy Joint Genome Institute, Berkeley, California, USA
| | - Tanja Woyke
- United States Department of Energy Joint Genome Institute, Berkeley, California, USA
| | | | - Kristen M. DeAngelis
- Department of Microbiology, University of Massachusetts–Amherst, Amherst, Massachusetts, USA
| |
Collapse
|
15
|
Abstract
Noncoding RNAs with secondary structures play important roles in CRISPR-Cas systems. Many of these structures likely remain undiscovered. We used a large-scale comparative genomics approach to predict 156 novel candidate structured RNAs from 36,111 CRISPR-Cas systems. A number of these were found to overlap with coding genes, including palindromic candidates that overlapped with a variety of Cas genes in type I and III systems. Among these 156 candidates, we identified 46 new models of CRISPR direct repeats and 1 tracrRNA. This tracrRNA model occasionally overlapped with predicted cas9 coding regions, emphasizing the importance of expanding our search windows for novel structure RNAs in coding regions. We also demonstrated that the antirepeat sequence in this tracrRNA model can be used to accurately assign thousands of predicted CRISPR arrays to type II-C systems. This study highlights the importance of unbiased identification of candidate structured RNAs across CRISPR-Cas systems.
Collapse
Affiliation(s)
- Brayon J. Fremin
- Department of Energy, Joint Genome Institute, Berkeley, CA, USA
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Nikos C. Kyrpides
- Department of Energy, Joint Genome Institute, Berkeley, CA, USA
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| |
Collapse
|
16
|
Bowers RM, Nayfach S, Schulz F, Jungbluth SP, Ruhl IA, Sheremet A, Lee J, Goudeau D, Eloe-Fadrosh EA, Stepanauskas R, Malmstrom RR, Kyrpides NC, Dunfield PF, Woyke T. Dissecting the dominant hot spring microbial populations based on community-wide sampling at single-cell genomic resolution. ISME J 2022; 16:1337-1347. [PMID: 34969995 PMCID: PMC9039060 DOI: 10.1038/s41396-021-01178-4] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/29/2021] [Revised: 11/29/2021] [Accepted: 12/10/2021] [Indexed: 02/07/2023]
Abstract
With advances in DNA sequencing and miniaturized molecular biology workflows, rapid and affordable sequencing of single-cell genomes has become a reality. Compared to 16S rRNA gene surveys and shotgun metagenomics, large-scale application of single-cell genomics to whole microbial communities provides an integrated snapshot of community composition and function, directly links mobile elements to their hosts, and enables analysis of population heterogeneity of the dominant community members. To that end, we sequenced nearly 500 single-cell genomes from a low diversity hot spring sediment sample from Dewar Creek, British Columbia, and compared this approach to 16S rRNA gene amplicon and shotgun metagenomics applied to the same sample. We found that the broad taxonomic profiles were similar across the three sequencing approaches, though several lineages were missing from the 16S rRNA gene amplicon dataset, likely the result of primer mismatches. At the functional level, we detected a large array of mobile genetic elements present in the single-cell genomes but absent from the corresponding same species metagenome-assembled genomes. Moreover, we performed a single-cell population genomic analysis of the three most abundant community members, revealing differences in population structure based on mutation and recombination profiles. While the average pairwise nucleotide identities were similar across the dominant species-level lineages, we observed differences in the extent of recombination between these dominant populations. Most intriguingly, the creek's Hydrogenobacter sp. population appeared to be so recombinogenic that it more closely resembled a sexual species than a clonally evolving microbe. Together, this work demonstrates that a randomized single-cell approach can be useful for the exploration of previously uncultivated microbes from community composition to population structure.
Collapse
Affiliation(s)
- Robert M. Bowers
- grid.451309.a0000 0004 0449 479XU.S. Department of Energy, Joint Genome Institute, Berkeley, CA USA
| | - Stephen Nayfach
- grid.451309.a0000 0004 0449 479XU.S. Department of Energy, Joint Genome Institute, Berkeley, CA USA
| | - Frederik Schulz
- grid.451309.a0000 0004 0449 479XU.S. Department of Energy, Joint Genome Institute, Berkeley, CA USA
| | - Sean P. Jungbluth
- grid.184769.50000 0001 2231 4551Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA USA
| | - Ilona A. Ruhl
- grid.22072.350000 0004 1936 7697Department of Biological Sciences, University of Calgary, 2500 University Dr. NW, Calgary, AB T2N 1N4 Canada ,grid.419357.d0000 0001 2199 3636National Bioenergy Center, National Renewable Energy Laboratory, Golden, CO USA
| | - Andriy Sheremet
- grid.22072.350000 0004 1936 7697Department of Biological Sciences, University of Calgary, 2500 University Dr. NW, Calgary, AB T2N 1N4 Canada
| | - Janey Lee
- grid.451309.a0000 0004 0449 479XU.S. Department of Energy, Joint Genome Institute, Berkeley, CA USA
| | - Danielle Goudeau
- grid.451309.a0000 0004 0449 479XU.S. Department of Energy, Joint Genome Institute, Berkeley, CA USA
| | - Emiley A. Eloe-Fadrosh
- grid.451309.a0000 0004 0449 479XU.S. Department of Energy, Joint Genome Institute, Berkeley, CA USA
| | - Ramunas Stepanauskas
- grid.296275.d0000 0000 9516 4913Bigelow Laboratory for Ocean Sciences, 60 Bigelow Drive, East Boothbay, ME USA
| | - Rex R. Malmstrom
- grid.451309.a0000 0004 0449 479XU.S. Department of Energy, Joint Genome Institute, Berkeley, CA USA
| | - Nikos C. Kyrpides
- grid.451309.a0000 0004 0449 479XU.S. Department of Energy, Joint Genome Institute, Berkeley, CA USA
| | - Peter F. Dunfield
- grid.22072.350000 0004 1936 7697Department of Biological Sciences, University of Calgary, 2500 University Dr. NW, Calgary, AB T2N 1N4 Canada
| | - Tanja Woyke
- U.S. Department of Energy, Joint Genome Institute, Berkeley, CA, USA.
| |
Collapse
|
17
|
Martin K, Schmidt K, Toseland A, Boulton CA, Barry K, Beszteri B, Brussaard CPD, Clum A, Daum CG, Eloe-Fadrosh E, Fong A, Foster B, Foster B, Ginzburg M, Huntemann M, Ivanova NN, Kyrpides NC, Lindquist E, Mukherjee S, Palaniappan K, Reddy TBK, Rizkallah MR, Roux S, Timmermans K, Tringe SG, van de Poll WH, Varghese N, Valentin KU, Lenton TM, Grigoriev IV, Leggett RM, Moulton V, Mock T. The biogeographic differentiation of algal microbiomes in the upper ocean from pole to pole. Nat Commun 2021; 12:5483. [PMID: 34531387 PMCID: PMC8446083 DOI: 10.1038/s41467-021-25646-9] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2021] [Accepted: 08/12/2021] [Indexed: 02/08/2023] Open
Abstract
Eukaryotic phytoplankton are responsible for at least 20% of annual global carbon fixation. Their diversity and activity are shaped by interactions with prokaryotes as part of complex microbiomes. Although differences in their local species diversity have been estimated, we still have a limited understanding of environmental conditions responsible for compositional differences between local species communities on a large scale from pole to pole. Here, we show, based on pole-to-pole phytoplankton metatranscriptomes and microbial rDNA sequencing, that environmental differences between polar and non-polar upper oceans most strongly impact the large-scale spatial pattern of biodiversity and gene activity in algal microbiomes. The geographic differentiation of co-occurring microbes in algal microbiomes can be well explained by the latitudinal temperature gradient and associated break points in their beta diversity, with an average breakpoint at 14 °C ± 4.3, separating cold and warm upper oceans. As global warming impacts upper ocean temperatures, we project that break points of beta diversity move markedly pole-wards. Hence, abrupt regime shifts in algal microbiomes could be caused by anthropogenic climate change.
Collapse
Affiliation(s)
- Kara Martin
- School of Computing Sciences, University of East Anglia, Norwich Research Park, Norwich, UK
- Earlham Institute, Norwich Research Park, Norwich, UK
| | - Katrin Schmidt
- School of Environmental Sciences, University of East Anglia, Norwich Research Park, Norwich, UK
| | - Andrew Toseland
- School of Environmental Sciences, University of East Anglia, Norwich Research Park, Norwich, UK
| | | | - Kerrie Barry
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Bánk Beszteri
- Department of Biology, University of Duisburg-Essen, Essen, Essen, Germany
| | | | - Alicia Clum
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Chris G Daum
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Emiley Eloe-Fadrosh
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Allison Fong
- Alfred Wegener Institute for Polar and Marine Research, Bremerhaven, Germany
| | - Brian Foster
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Bryce Foster
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Michael Ginzburg
- Alfred Wegener Institute for Polar and Marine Research, Bremerhaven, Germany
| | - Marcel Huntemann
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Natalia N Ivanova
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Nikos C Kyrpides
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Erika Lindquist
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Supratim Mukherjee
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Krishnaveni Palaniappan
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - T B K Reddy
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Mariam R Rizkallah
- Alfred Wegener Institute for Polar and Marine Research, Bremerhaven, Germany
| | - Simon Roux
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Klaas Timmermans
- Royal Netherlands Institute for Sea Research, Texel, The Netherlands
| | - Susannah G Tringe
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Willem H van de Poll
- Centre for Isotope Research - Oceans, Energy and Sustainability Research Institute Groningen, Faculty of Science and Engineering, University of Groningen, AG Groningen, The Netherlands
| | - Neha Varghese
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Klaus U Valentin
- Alfred Wegener Institute for Polar and Marine Research, Bremerhaven, Germany
| | | | - Igor V Grigoriev
- U.S. Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Plant and Microbial Biology Department, University of California, Berkeley, CA, USA
| | | | - Vincent Moulton
- School of Computing Sciences, University of East Anglia, Norwich Research Park, Norwich, UK
| | - Thomas Mock
- School of Environmental Sciences, University of East Anglia, Norwich Research Park, Norwich, UK.
| |
Collapse
|
18
|
Abstract
Viruses are the most abundant biological entity on Earth, infect cellular organisms from all domains of life, and are central players in the global biosphere. Over the last century, the discovery and characterization of viruses have progressed steadily alongside much of modern biology. In terms of outright numbers of novel viruses discovered, however, the last few years have been by far the most transformative for the field. Advances in methods for identifying viral sequences in genomic and metagenomic datasets, coupled to the exponential growth of environmental sequencing, have greatly expanded the catalog of known viruses and fueled the tremendous growth of viral sequence databases. Development and implementation of new standards, along with careful study of the newly discovered viruses, have transformed and will continue to transform our understanding of microbial evolution, ecology, and biogeochemical cycles, leading to new biotechnological innovations across many diverse fields, including environmental, agricultural, and biomedical sciences.
Collapse
Affiliation(s)
- Lee Call
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA; ,
| | - Stephen Nayfach
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA; ,
| | - Nikos C Kyrpides
- Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA; ,
| |
Collapse
|
19
|
Danko D, Bezdan D, Afshin EE, Ahsanuddin S, Bhattacharya C, Butler DJ, Chng KR, Donnellan D, Hecht J, Jackson K, Kuchin K, Karasikov M, Lyons A, Mak L, Meleshko D, Mustafa H, Mutai B, Neches RY, Ng A, Nikolayeva O, Nikolayeva T, Png E, Ryon KA, Sanchez JL, Shaaban H, Sierra MA, Thomas D, Young B, Abudayyeh OO, Alicea J, Bhattacharyya M, Blekhman R, Castro-Nallar E, Cañas AM, Chatziefthimiou AD, Crawford RW, De Filippis F, Deng Y, Desnues C, Dias-Neto E, Dybwad M, Elhaik E, Ercolini D, Frolova A, Gankin D, Gootenberg JS, Graf AB, Green DC, Hajirasouliha I, Hastings JJA, Hernandez M, Iraola G, Jang S, Kahles A, Kelly FJ, Knights K, Kyrpides NC, Łabaj PP, Lee PKH, Leung MHY, Ljungdahl PO, Mason-Buck G, McGrath K, Meydan C, Mongodin EF, Moraes MO, Nagarajan N, Nieto-Caballero M, Noushmehr H, Oliveira M, Ossowski S, Osuolale OO, Özcan O, Paez-Espino D, Rascovan N, Richard H, Rätsch G, Schriml LM, Semmler T, Sezerman OU, Shi L, Shi T, Siam R, Song LH, Suzuki H, Court DS, Tighe SW, Tong X, Udekwu KI, Ugalde JA, Valentine B, Vassilev DI, Vayndorf EM, Velavan TP, Wu J, Zambrano MM, Zhu J, Zhu S, Mason CE. A global metagenomic map of urban microbiomes and antimicrobial resistance. Cell 2021; 184:3376-3393.e17. [PMID: 34043940 PMCID: PMC8238498 DOI: 10.1016/j.cell.2021.05.002] [Citation(s) in RCA: 124] [Impact Index Per Article: 41.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2020] [Revised: 03/05/2021] [Accepted: 04/29/2021] [Indexed: 01/14/2023]
Abstract
We present a global atlas of 4,728 metagenomic samples from mass-transit systems in 60 cities over 3 years, representing the first systematic, worldwide catalog of the urban microbial ecosystem. This atlas provides an annotated, geospatial profile of microbial strains, functional characteristics, antimicrobial resistance (AMR) markers, and genetic elements, including 10,928 viruses, 1,302 bacteria, 2 archaea, and 838,532 CRISPR arrays not found in reference databases. We identified 4,246 known species of urban microorganisms and a consistent set of 31 species found in 97% of samples that were distinct from human commensal organisms. Profiles of AMR genes varied widely in type and density across cities. Cities showed distinct microbial taxonomic signatures that were driven by climate and geographic differences. These results constitute a high-resolution global metagenomic atlas that enables discovery of organisms and genes, highlights potential public health and forensic applications, and provides a culture-independent view of AMR burden in cities.
Collapse
Affiliation(s)
- David Danko
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Daniela Bezdan
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA; Institute of Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany; NGS Competence Center Tübingen (NCCT), University of Tübingen, Tübingen, Germany
| | - Evan E Afshin
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | | | - Chandrima Bhattacharya
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Daniel J Butler
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Kern Rei Chng
- Genome Institute of Singapore, A(∗)STAR, Singapore, Singapore
| | - Daisy Donnellan
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Jochen Hecht
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Katelyn Jackson
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Katerina Kuchin
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Mikhail Karasikov
- ETH Zurich, Department of Computer Science, Biomedical Informatics Group, Zurich, Switzerland; University Hospital Zurich, Biomedical Informatics Research, Zurich, Switzerland; Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Abigail Lyons
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Lauren Mak
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Dmitry Meleshko
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Harun Mustafa
- ETH Zurich, Department of Computer Science, Biomedical Informatics Group, Zurich, Switzerland; University Hospital Zurich, Biomedical Informatics Research, Zurich, Switzerland; Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Beth Mutai
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain; Kenya Medical Research Institute - Kisumu, Kisumu, Kenya
| | - Russell Y Neches
- Department of Energy, Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Amanda Ng
- Genome Institute of Singapore, A(∗)STAR, Singapore, Singapore
| | | | | | - Eileen Png
- Genome Institute of Singapore, A(∗)STAR, Singapore, Singapore
| | - Krista A Ryon
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Jorge L Sanchez
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Heba Shaaban
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Maria A Sierra
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Dominique Thomas
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Ben Young
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Omar O Abudayyeh
- Massachusetts Institute of Technology, McGovern Institute for Brain Research, Cambridge, MA, USA
| | - Josue Alicea
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Malay Bhattacharyya
- Machine Intelligence Unit, Indian Statistical Institute, Kolkata, India; Centre for Artificial Intelligence and Machine Learning, Indian Statistical Institute, Kolkata, India
| | | | - Eduardo Castro-Nallar
- Universidad Andres Bello, Center for Bioinformatics and Integrative Biology, Facultad de Ciencias de la Vida, Santiago, Chile
| | - Ana M Cañas
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Aspassia D Chatziefthimiou
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | | | - Francesca De Filippis
- Department of Agricultural Sciences, Division of Microbiology, University of Naples Federico II, Naples, Italy; Task Force on Microbiome Studies, University of Naples Federico II, Naples, Italy
| | - Youping Deng
- University of Hawaii John A. Burns School of Medicine, Honolulu, HI, USA
| | - Christelle Desnues
- Aix-Marseille Université, Mediterranean Institute of Oceanology, Université de Toulon, CNRS, IRD, UM 110, Marseille, France
| | - Emmanuel Dias-Neto
- Medical Genomics group, A.C.Camargo Cancer Center, São Paulo - SP, Brazil
| | - Marius Dybwad
- Norwegian Defence Research Establishment FFI, Kjeller, Norway
| | - Eran Elhaik
- Department of Biology, Lund University, Lund, Sweden
| | - Danilo Ercolini
- Department of Agricultural Sciences, Division of Microbiology, University of Naples Federico II, Naples, Italy; Task Force on Microbiome Studies, University of Naples Federico II, Naples, Italy
| | - Alina Frolova
- Institute of Molecular Biology and Genetics of National Academy of Sciences of Ukraine, Kyiv, Ukraine; Kyiv Academic University, Kyiv, Ukraine
| | - Dennis Gankin
- Massachusetts Institute of Technology, McGovern Institute for Brain Research, Cambridge, MA, USA
| | - Jonathan S Gootenberg
- Massachusetts Institute of Technology, McGovern Institute for Brain Research, Cambridge, MA, USA
| | | | - David C Green
- Department of Analytical, Environmental and Forensic Sciences, King's College London, London, UK
| | - Iman Hajirasouliha
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Jaden J A Hastings
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | | | - Gregorio Iraola
- Microbial Genomics Laboratory, Institut Pasteur de Montevideo, Montevideo, Uruguay; Center for Integrative Biology, Universidad Mayor, Santiago de Chile, Santiago, Chile; Wellcome Sanger Institute, Hinxton, UK
| | | | - Andre Kahles
- ETH Zurich, Department of Computer Science, Biomedical Informatics Group, Zurich, Switzerland; Kyiv Academic University, Kyiv, Ukraine; C+, Research Center in Technologies for Society, School of Engineering, Universidad del Desarrollo, Santiago, Chile
| | - Frank J Kelly
- Department of Analytical, Environmental and Forensic Sciences, King's College London, London, UK
| | - Kaymisha Knights
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Nikos C Kyrpides
- Department of Energy, Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Paweł P Łabaj
- State Key Laboratory of Genetic Engineering (SKLGE) and MOE Key Laboratory of Contemporary Anthropology, School of Life Sciences, Human Phenome Institute, Fudan University, Shanghai, China; Małopolska Centre of Biotechnology, Jagiellonian University, Kraków, Poland; Boku University Viennna, Vienna, Austria
| | - Patrick K H Lee
- School of Energy and Environment, City University of Hong Kong, Hong Kong SAR, China
| | - Marcus H Y Leung
- School of Energy and Environment, City University of Hong Kong, Hong Kong SAR, China
| | - Per O Ljungdahl
- Department of Molecular Biosciences, The Wenner-Gren Institute, Stockholm University, Stockholm, Sweden
| | - Gabriella Mason-Buck
- Department of Analytical, Environmental and Forensic Sciences, King's College London, London, UK
| | - Ken McGrath
- Microba, 388 Queen St, Brisbane City, QLD 4000, Australia
| | - Cem Meydan
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Emmanuel F Mongodin
- University of Maryland School of Medicine, Institute for Genome Sciences, Baltimore, MD, USA
| | | | | | | | - Houtan Noushmehr
- University of São Paulo, Ribeirão Preto Medical School, Ribeirão Preto - SP, Brazil
| | - Manuela Oliveira
- Instituto de Patologia e Imunologia Molecular da Universidade do Porto, Porto, Portugal
| | - Stephan Ossowski
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain; Institute of Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany; NGS Competence Center Tübingen (NCCT), University of Tübingen, Tübingen, Germany
| | - Olayinka O Osuolale
- Applied Environmental Metagenomics and Infectious Diseases Research (AEMIDR), Department of Biological Sciences, Elizade University, Ilara-Mokin, Nigeria
| | - Orhan Özcan
- Acibadem Mehmet Ali Aydınlar University, Istanbul, Turkey
| | - David Paez-Espino
- Department of Energy, Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Nicolás Rascovan
- Microbial Paleogenomics Unit, Institut Pasteur, CNRS UMR2000, Paris 75015, France
| | - Hugues Richard
- Sorbonne University, Faculty of Science, Institute of Biology Paris-Seine, Laboratory of Computational and Quantitative Biology, Paris, France; Robert Koch Institute, Berlin, Germany
| | - Gunnar Rätsch
- ETH Zurich, Department of Computer Science, Biomedical Informatics Group, Zurich, Switzerland; University Hospital Zurich, Biomedical Informatics Research, Zurich, Switzerland; Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Lynn M Schriml
- University of Maryland School of Medicine, Institute for Genome Sciences, Baltimore, MD, USA
| | | | | | - Leming Shi
- Center for Pharmacogenomics, School of Life Sciences and Shanghai Cancer Center, Fudan University, Shanghai, China; State Key Laboratory of Genetic Engineering (SKLGE) and MOE Key Laboratory of Contemporary Anthropology, School of Life Sciences, Human Phenome Institute, Fudan University, Shanghai, China
| | - Tieliu Shi
- The Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, China
| | - Rania Siam
- University of Medicine and Health Sciences, St. Kitts, West Indies and American University in Cairo, Cairo, Egypt
| | - Le Huu Song
- 108 Military Central Hospital, Hanoi, Vietnam; Vietnamese-German Center for Medical Research (VG-CARE), Hanoi, Vietnam
| | | | - Denise Syndercombe Court
- Department of Analytical, Environmental and Forensic Sciences, King's College London, London, UK
| | | | - Xinzhao Tong
- School of Energy and Environment, City University of Hong Kong, Hong Kong SAR, China
| | - Klas I Udekwu
- Department of Molecular Biosciences, The Wenner-Gren Institute, Stockholm University, Stockholm, Sweden; SciLife EVP, Department of Aquatic Sciences Assessment, Swedish University of Agricultural Sciences, Uppsala, Sweden
| | - Juan A Ugalde
- Millennium Initiative for Collaborative Research on Bacterial Resistance, Santiago, Chile; C+, Research Center in Technologies for Society, School of Engineering, Universidad del Desarrollo, Santiago, Chile
| | - Brandon Valentine
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Dimitar I Vassilev
- Faculty of Mathematics and Informatics, Sofia University "St. Kliment Ohridski," Sofia, Bulgaria
| | - Elena M Vayndorf
- Institute of Arctic Biology, University of Alaska, Fairbanks, Fairbanks, AK, USA
| | - Thirumalaisamy P Velavan
- Institute of Tropical Medicine, Univeristätsklinikum Tübingen, Tübingen, Germany; Faculty of Medicine, Duy Tan University, Da Nang, Vietnam
| | - Jun Wu
- The Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, China
| | | | - Jifeng Zhu
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA
| | - Sibo Zhu
- State Key Laboratory of Genetic Engineering (SKLGE) and MOE Key Laboratory of Contemporary Anthropology, School of Life Sciences, Human Phenome Institute, Fudan University, Shanghai, China; Department of Epidemiology, School of Public Health, Fudan University, Shanghai, China
| | - Christopher E Mason
- Weill Cornell Medicine, New York, NY, USA; The Bin Talal Bin Abdulaziz Alsaud Institute for Computational Biomedicine, New York, NY, USA; The WorldQuant Initiative for Quantitative Prediction, Weill Cornell Medicine, New York, NY, USA.
| |
Collapse
|
20
|
Thomas SC, Payne D, Tamadonfar KO, Seymour CO, Jiao JY, Murugapiran SK, Lai D, Lau R, Bowen BP, Silva LP, Louie KB, Huntemann M, Clum A, Spunde A, Pillay M, Palaniappan K, Varghese N, Mikhailova N, Chen IM, Stamatis D, Reddy TBK, O'Malley R, Daum C, Shapiro N, Ivanova N, Kyrpides NC, Woyke T, Eloe-Fadrosh E, Hamilton TL, Dijkstra P, Dodsworth JA, Northen TR, Li WJ, Hedlund BP. Genomics, Exometabolomics, and Metabolic Probing Reveal Conserved Proteolytic Metabolism of Thermoflexus hugenholtzii and Three Candidate Species From China and Japan. Front Microbiol 2021; 12:632731. [PMID: 34017316 PMCID: PMC8129789 DOI: 10.3389/fmicb.2021.632731] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2020] [Accepted: 03/02/2021] [Indexed: 01/21/2023] Open
Abstract
Thermoflexus hugenholtzii JAD2T, the only cultured representative of the Chloroflexota order Thermoflexales, is abundant in Great Boiling Spring (GBS), NV, United States, and close relatives inhabit geothermal systems globally. However, no defined medium exists for T. hugenholtzii JAD2T and no single carbon source is known to support its growth, leaving key knowledge gaps in its metabolism and nutritional needs. Here, we report comparative genomic analysis of the draft genome of T. hugenholtzii JAD2T and eight closely related metagenome-assembled genomes (MAGs) from geothermal sites in China, Japan, and the United States, representing “Candidatus Thermoflexus japonica,” “Candidatus Thermoflexus tengchongensis,” and “Candidatus Thermoflexus sinensis.” Genomics was integrated with targeted exometabolomics and 13C metabolic probing of T. hugenholtzii. The Thermoflexus genomes each code for complete central carbon metabolic pathways and an unusually high abundance and diversity of peptidases, particularly Metallo- and Serine peptidase families, along with ABC transporters for peptides and some amino acids. The T. hugenholtzii JAD2T exometabolome provided evidence of extracellular proteolytic activity based on the accumulation of free amino acids. However, several neutral and polar amino acids appear not to be utilized, based on their accumulation in the medium and the lack of annotated transporters. Adenine and adenosine were scavenged, and thymine and nicotinic acid were released, suggesting interdependency with other organisms in situ. Metabolic probing of T. hugenholtzii JAD2T using 13C-labeled compounds provided evidence of oxidation of glucose, pyruvate, cysteine, and citrate, and functioning glycolytic, tricarboxylic acid (TCA), and oxidative pentose-phosphate pathways (PPPs). However, differential use of position-specific 13C-labeled compounds showed that glycolysis and the TCA cycle were uncoupled. Thus, despite the high abundance of Thermoflexus in sediments of some geothermal systems, they appear to be highly focused on chemoorganotrophy, particularly protein degradation, and may interact extensively with other microorganisms in situ.
Collapse
Affiliation(s)
- Scott C Thomas
- School of Life Sciences, University of Nevada, Las Vegas, Las Vegas, NV, United States
| | - Devon Payne
- School of Life Sciences, University of Nevada, Las Vegas, Las Vegas, NV, United States
| | - Kevin O Tamadonfar
- School of Life Sciences, University of Nevada, Las Vegas, Las Vegas, NV, United States
| | - Cale O Seymour
- School of Life Sciences, University of Nevada, Las Vegas, Las Vegas, NV, United States
| | - Jian-Yu Jiao
- School of Life Sciences, Sun Yat-sen University, Guangzhou, China.,State Key Laboratory of Biocontrol, Guangdong Provincial Key Laboratory of Plant Resources and Southern Marine Science and Engineering Guangdong Laboratory, Zhuhai, China
| | - Senthil K Murugapiran
- School of Life Sciences, University of Nevada, Las Vegas, Las Vegas, NV, United States.,Department of Plant and Microbial Biology, The BioTechnology Institute, University of Minnesota, St. Paul, MN, United States
| | - Dengxun Lai
- School of Life Sciences, University of Nevada, Las Vegas, Las Vegas, NV, United States
| | - Rebecca Lau
- The Department of Energy Joint Genome Institute, Berkeley, CA, United States.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Benjamin P Bowen
- The Department of Energy Joint Genome Institute, Berkeley, CA, United States.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Leslie P Silva
- The Department of Energy Joint Genome Institute, Berkeley, CA, United States.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Katherine B Louie
- The Department of Energy Joint Genome Institute, Berkeley, CA, United States.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Marcel Huntemann
- The Department of Energy Joint Genome Institute, Berkeley, CA, United States.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Alicia Clum
- The Department of Energy Joint Genome Institute, Berkeley, CA, United States.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Alex Spunde
- The Department of Energy Joint Genome Institute, Berkeley, CA, United States.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Manoj Pillay
- The Department of Energy Joint Genome Institute, Berkeley, CA, United States.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Krishnaveni Palaniappan
- The Department of Energy Joint Genome Institute, Berkeley, CA, United States.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Neha Varghese
- The Department of Energy Joint Genome Institute, Berkeley, CA, United States.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Natalia Mikhailova
- The Department of Energy Joint Genome Institute, Berkeley, CA, United States.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - I-Min Chen
- The Department of Energy Joint Genome Institute, Berkeley, CA, United States.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Dimitrios Stamatis
- The Department of Energy Joint Genome Institute, Berkeley, CA, United States.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - T B K Reddy
- The Department of Energy Joint Genome Institute, Berkeley, CA, United States.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Ronan O'Malley
- The Department of Energy Joint Genome Institute, Berkeley, CA, United States.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Chris Daum
- The Department of Energy Joint Genome Institute, Berkeley, CA, United States.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Nicole Shapiro
- The Department of Energy Joint Genome Institute, Berkeley, CA, United States.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Natalia Ivanova
- The Department of Energy Joint Genome Institute, Berkeley, CA, United States.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Nikos C Kyrpides
- The Department of Energy Joint Genome Institute, Berkeley, CA, United States.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Tanja Woyke
- The Department of Energy Joint Genome Institute, Berkeley, CA, United States.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Emiley Eloe-Fadrosh
- The Department of Energy Joint Genome Institute, Berkeley, CA, United States.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Trinity L Hamilton
- Department of Plant and Microbial Biology, The BioTechnology Institute, University of Minnesota, St. Paul, MN, United States
| | - Paul Dijkstra
- Department of Biological Sciences, Center of Ecosystem Science and Society, Northern Arizona University, Flagstaff, AZ, United States
| | - Jeremy A Dodsworth
- Department of Biology, California State University, San Bernardino, CA, United States
| | - Trent R Northen
- The Department of Energy Joint Genome Institute, Berkeley, CA, United States.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Wen-Jun Li
- School of Life Sciences, Sun Yat-sen University, Guangzhou, China.,State Key Laboratory of Biocontrol, Guangdong Provincial Key Laboratory of Plant Resources and Southern Marine Science and Engineering Guangdong Laboratory, Zhuhai, China
| | - Brian P Hedlund
- School of Life Sciences, University of Nevada, Las Vegas, Las Vegas, NV, United States.,Nevada Institute of Personalized Medicine, University of Nevada, Las Vegas, Las Vegas, NV, United States
| |
Collapse
|
21
|
Nayfach S, Roux S, Seshadri R, Udwary D, Varghese N, Schulz F, Wu D, Paez-Espino D, Chen IM, Huntemann M, Palaniappan K, Ladau J, Mukherjee S, Reddy TBK, Nielsen T, Kirton E, Faria JP, Edirisinghe JN, Henry CS, Jungbluth SP, Chivian D, Dehal P, Wood-Charlson EM, Arkin AP, Tringe SG, Visel A, Woyke T, Mouncey NJ, Ivanova NN, Kyrpides NC, Eloe-Fadrosh EA. Author Correction: A genomic catalog of Earth's microbiomes. Nat Biotechnol 2021; 39:521. [PMID: 33795890 PMCID: PMC8041621 DOI: 10.1038/s41587-021-00898-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Affiliation(s)
| | - Simon Roux
- DOE Joint Genome Institute, Berkeley, CA, USA
| | | | | | | | | | - Dongying Wu
- DOE Joint Genome Institute, Berkeley, CA, USA
| | | | - I-Min Chen
- DOE Joint Genome Institute, Berkeley, CA, USA
| | | | | | | | | | - T B K Reddy
- DOE Joint Genome Institute, Berkeley, CA, USA
| | | | | | | | | | | | - Sean P Jungbluth
- DOE Joint Genome Institute, Berkeley, CA, USA.,Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Dylan Chivian
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Paramvir Dehal
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | | | - Adam P Arkin
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | | | - Axel Visel
- DOE Joint Genome Institute, Berkeley, CA, USA
| | | | - Tanja Woyke
- DOE Joint Genome Institute, Berkeley, CA, USA
| | | | | | | | | |
Collapse
|
22
|
Nayfach S, Roux S, Seshadri R, Udwary D, Varghese N, Schulz F, Wu D, Paez-Espino D, Chen IM, Huntemann M, Palaniappan K, Ladau J, Mukherjee S, Reddy TBK, Nielsen T, Kirton E, Faria JP, Edirisinghe JN, Henry CS, Jungbluth SP, Chivian D, Dehal P, Wood-Charlson EM, Arkin AP, Tringe SG, Visel A, Woyke T, Mouncey NJ, Ivanova NN, Kyrpides NC, Eloe-Fadrosh EA. A genomic catalog of Earth's microbiomes. Nat Biotechnol 2021; 39:499-509. [PMID: 33169036 PMCID: PMC8041624 DOI: 10.1038/s41587-020-0718-6] [Citation(s) in RCA: 307] [Impact Index Per Article: 102.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2019] [Accepted: 09/28/2020] [Indexed: 01/02/2023]
Abstract
The reconstruction of bacterial and archaeal genomes from shotgun metagenomes has enabled insights into the ecology and evolution of environmental and host-associated microbiomes. Here we applied this approach to >10,000 metagenomes collected from diverse habitats covering all of Earth's continents and oceans, including metagenomes from human and animal hosts, engineered environments, and natural and agricultural soils, to capture extant microbial, metabolic and functional potential. This comprehensive catalog includes 52,515 metagenome-assembled genomes representing 12,556 novel candidate species-level operational taxonomic units spanning 135 phyla. The catalog expands the known phylogenetic diversity of bacteria and archaea by 44% and is broadly available for streamlined comparative analyses, interactive exploration, metabolic modeling and bulk download. We demonstrate the utility of this collection for understanding secondary-metabolite biosynthetic potential and for resolving thousands of new host linkages to uncultivated viruses. This resource underscores the value of genome-centric approaches for revealing genomic properties of uncultivated microorganisms that affect ecosystem processes.
Collapse
Affiliation(s)
| | - Simon Roux
- DOE Joint Genome Institute, Berkeley, CA, USA
| | | | | | | | | | - Dongying Wu
- DOE Joint Genome Institute, Berkeley, CA, USA
| | | | - I-Min Chen
- DOE Joint Genome Institute, Berkeley, CA, USA
| | | | | | | | | | - T B K Reddy
- DOE Joint Genome Institute, Berkeley, CA, USA
| | | | | | | | | | | | - Sean P Jungbluth
- DOE Joint Genome Institute, Berkeley, CA, USA
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Dylan Chivian
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Paramvir Dehal
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | | | - Adam P Arkin
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | | | - Axel Visel
- DOE Joint Genome Institute, Berkeley, CA, USA
| | - Tanja Woyke
- DOE Joint Genome Institute, Berkeley, CA, USA
| | | | | | | | | |
Collapse
|
23
|
Pons JC, Paez-Espino D, Riera G, Ivanova N, Kyrpides NC, Llabrés M. VPF-Class: Taxonomic assignment and host prediction of uncultivated viruses based on viral protein families. Bioinformatics 2021; 37:1805-1813. [PMID: 33471063 PMCID: PMC8830756 DOI: 10.1093/bioinformatics/btab026] [Citation(s) in RCA: 42] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Revised: 12/11/2020] [Accepted: 01/13/2021] [Indexed: 12/03/2022] Open
Abstract
Motivation Two key steps in the analysis of uncultured viruses recovered from metagenomes are the taxonomic classification of the viral sequences and the identification of putative host(s). Both steps rely mainly on the assignment of viral proteins to orthologs in cultivated viruses. Viral Protein Families (VPFs) can be used for the robust identification of new viral sequences in large metagenomics datasets. Despite the importance of VPF information for viral discovery, VPFs have not yet been explored for determining viral taxonomy and host targets. Results In this work, we classified the set of VPFs from the IMG/VR database and developed VPF-Class. VPF-Class is a tool that automates the taxonomic classification and host prediction of viral contigs based on the assignment of their proteins to a set of classified VPFs. Applying VPF-Class on 731K uncultivated virus contigs from the IMG/VR database, we were able to classify 363K contigs at the genus level and predict the host of over 461K contigs. In the RefSeq database, VPF-class reported an accuracy of nearly 100% to classify dsDNA, ssDNA and retroviruses, at the genus level, considering a membership ratio and a confidence score of 0.2. The accuracy in host prediction was 86.4%, also at the genus level, considering a membership ratio of 0.3 and a confidence score of 0.5. And, in the prophages dataset, the accuracy in host prediction was 86% considering a membership ratio of 0.6 and a confidence score of 0.8. Moreover, from the Global Ocean Virome dataset, over 817K viral contigs out of 1 million were classified. Availability and implementation The implementation of VPF-Class can be downloaded from https://github.com/biocom-uib/vpf-tools. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Joan Carles Pons
- Department of Mathematics and Computer Science, University of the Balearic Islands, Palma, 07122, Spain
| | | | - Gabriel Riera
- Department of Mathematics and Computer Science, University of the Balearic Islands, Palma, 07122, Spain
| | - Natalia Ivanova
- Department of Energy Joint Genome Institute, Berkeley, 94720, USA
| | - Nikos C Kyrpides
- Department of Energy Joint Genome Institute, Berkeley, 94720, USA
| | - Mercè Llabrés
- Department of Mathematics and Computer Science, University of the Balearic Islands, Palma, 07122, Spain
| |
Collapse
|
24
|
Neches RY, Kyrpides NC, Ouzounis CA. Atypical Divergence of SARS-CoV-2 Orf8 from Orf7a within the Coronavirus Lineage Suggests Potential Stealthy Viral Strategies in Immune Evasion. mBio 2021; 12:e03014-20. [PMID: 33468697 PMCID: PMC7845636 DOI: 10.1128/mbio.03014-20] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2020] [Accepted: 12/11/2020] [Indexed: 01/19/2023] Open
Abstract
Orf8, one of the most puzzling genes in the SARS lineage of coronaviruses, marks a unique and striking difference in genome organization between SARS-CoV-2 and SARS-CoV-1. Here, using sequence comparisons, we unequivocally reveal the distant sequence similarities between SARS-CoV-2 Orf8 with its SARS-CoV-1 counterparts and the X4-like genes of coronaviruses, including its highly divergent "paralog" gene Orf7a, whose product is a potential immune antagonist of known structure. Supervised sequence space walks unravel identity levels that drop below 10% and yet exhibit subtle conservation patterns in this novel superfamily, characterized by an immunoglobulin-like beta sandwich topology. We document the high accuracy of the sequence space walk process in detail and characterize the subgroups of the superfamily in sequence space by systematic annotation of gene and taxon groups. While SARS-CoV-1 Orf7a and Orf8 genes are most similar to bat virus sequences, their SARS-CoV-2 counterparts are closer to pangolin virus homologs, reflecting the fine structure of conservation patterns within the SARS-CoV-2 genomes. The divergence between Orf7a and Orf8 is exceptionally idiosyncratic, since Orf7a is more constrained, whereas Orf8 is subject to rampant change, a peculiar feature that may be related to hitherto-unknown viral infection strategies. Despite their common origin, the Orf7a and Orf8 protein families exhibit different modes of evolutionary trajectories within the coronavirus lineage, which might be partly attributable to their complex interactions with the mammalian host cell, reflected by a multitude of functional associations of Orf8 in SARS-CoV-2 compared to a very small number of interactions discovered for Orf7a.IMPORTANCE Orf8 is one of the most puzzling genes in the SARS lineage of coronaviruses, including SARS-CoV-2. Using sophisticated sequence comparisons, we confirm its origins from Orf7a, another gene in the lineage that appears as more conserved, compared to Orf8. Orf7a is a potential immune antagonist of known structure, while a deletion of Orf8 was shown to decrease the severity of the infection in a cohort study. The subtle sequence similarities imply that Orf8 has the same immunoglobulin-like fold as Orf7a, confirmed by structure determination. We characterize the subgroups of this superfamily and demonstrate the highly idiosyncratic divergence patterns during the evolution of the virus.
Collapse
Affiliation(s)
- Russell Y Neches
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA
| | - Nikos C Kyrpides
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley California, USA
| | - Christos A Ouzounis
- Biological Computation and Process Laboratory, Chemical Process and Energy Resources Institute, Centre for Research and Technology Hellas, Thessalonica, Greece
| |
Collapse
|
25
|
Roux S, Páez-Espino D, Chen IMA, Palaniappan K, Ratner A, Chu K, Reddy TBK, Nayfach S, Schulz F, Call L, Neches RY, Woyke T, Ivanova NN, Eloe-Fadrosh EA, Kyrpides NC. IMG/VR v3: an integrated ecological and evolutionary framework for interrogating genomes of uncultivated viruses. Nucleic Acids Res 2021; 49:D764-D775. [PMID: 33137183 PMCID: PMC7778971 DOI: 10.1093/nar/gkaa946] [Citation(s) in RCA: 179] [Impact Index Per Article: 59.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Revised: 10/02/2020] [Accepted: 10/09/2020] [Indexed: 12/28/2022] Open
Abstract
Viruses are integral components of all ecosystems and microbiomes on Earth. Through pervasive infections of their cellular hosts, viruses can reshape microbial community structure and drive global nutrient cycling. Over the past decade, viral sequences identified from genomes and metagenomes have provided an unprecedented view of viral genome diversity in nature. Since 2016, the IMG/VR database has provided access to the largest collection of viral sequences obtained from (meta)genomes. Here, we present the third version of IMG/VR, composed of 18 373 cultivated and 2 314 329 uncultivated viral genomes (UViGs), nearly tripling the total number of sequences compared to the previous version. These clustered into 935 362 viral Operational Taxonomic Units (vOTUs), including 188 930 with two or more members. UViGs in IMG/VR are now reported as single viral contigs, integrated proviruses or genome bins, and are annotated with a new standardized pipeline including genome quality estimation using CheckV, taxonomic classification reflecting the latest ICTV update, and expanded host taxonomy prediction. The new IMG/VR interface enables users to efficiently browse, search, and select UViGs based on genome features and/or sequence similarity. IMG/VR v3 is available at https://img.jgi.doe.gov/vr, and the underlying data are available to download at https://genome.jgi.doe.gov/portal/IMG_VR.
Collapse
Affiliation(s)
- Simon Roux
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - David Páez-Espino
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - I-Min A Chen
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Krishna Palaniappan
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Anna Ratner
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Ken Chu
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - T B K Reddy
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Stephen Nayfach
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Frederik Schulz
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Lee Call
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Russell Y Neches
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Tanja Woyke
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Natalia N Ivanova
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Emiley A Eloe-Fadrosh
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Nikos C Kyrpides
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| |
Collapse
|
26
|
Mukherjee S, Stamatis D, Bertsch J, Ovchinnikova G, Sundaramurthi J, Lee J, Kandimalla M, Chen IMA, Kyrpides NC, Reddy TBK. Genomes OnLine Database (GOLD) v.8: overview and updates. Nucleic Acids Res 2021; 49:D723-D733. [PMID: 33152092 PMCID: PMC7778979 DOI: 10.1093/nar/gkaa983] [Citation(s) in RCA: 104] [Impact Index Per Article: 34.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Revised: 10/08/2020] [Accepted: 10/19/2020] [Indexed: 12/28/2022] Open
Abstract
The Genomes OnLine Database (GOLD) (https://gold.jgi.doe.gov/) is a manually curated, daily updated collection of genome projects and their metadata accumulated from around the world. The current version of the database includes over 1.17 million entries organized broadly into Studies (45 770), Organisms (387 382) or Biosamples (101 207), Sequencing Projects (355 364) and Analysis Projects (283 481). These four levels contain over 600 metadata fields, which includes 76 controlled vocabulary (CV) tables containing 3873 terms. GOLD provides an interactive web user interface for browsing and searching by a wide range of project and metadata fields. Users can enter details about their own projects in GOLD, which acts as a gatekeeper to ensure that metadata is accurately documented before submitting sequence information to the Integrated Microbial Genomes (IMG) system for analysis. In order to maintain a reference dataset for use by members of the scientific community, GOLD also imports projects from public repositories such as GenBank and SRA. The current status of the database, along with recent updates and improvements are described in this manuscript.
Collapse
Affiliation(s)
- Supratim Mukherjee
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Dimitri Stamatis
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Jon Bertsch
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Galina Ovchinnikova
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | | | - Janey Lee
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Mahathi Kandimalla
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - I-Min A Chen
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Nikos C Kyrpides
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - T B K Reddy
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| |
Collapse
|
27
|
Panwar P, Allen MA, Williams TJ, Hancock AM, Brazendale S, Bevington J, Roux S, Páez-Espino D, Nayfach S, Berg M, Schulz F, Chen IMA, Huntemann M, Shapiro N, Kyrpides NC, Woyke T, Eloe-Fadrosh EA, Cavicchioli R. Influence of the polar light cycle on seasonal dynamics of an Antarctic lake microbial community. Microbiome 2020; 8:116. [PMID: 32772914 PMCID: PMC7416419 DOI: 10.1186/s40168-020-00889-8] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/07/2020] [Accepted: 06/30/2020] [Indexed: 05/10/2023]
Abstract
BACKGROUND Cold environments dominate the Earth's biosphere and microbial activity drives ecosystem processes thereby contributing greatly to global biogeochemical cycles. Polar environments differ to all other cold environments by experiencing 24-h sunlight in summer and no sunlight in winter. The Vestfold Hills in East Antarctica contains hundreds of lakes that have evolved from a marine origin only 3000-7000 years ago. Ace Lake is a meromictic (stratified) lake from this region that has been intensively studied since the 1970s. Here, a total of 120 metagenomes representing a seasonal cycle and four summers spanning a 10-year period were analyzed to determine the effects of the polar light cycle on microbial-driven nutrient cycles. RESULTS The lake system is characterized by complex sulfur and hydrogen cycling, especially in the anoxic layers, with multiple mechanisms for the breakdown of biopolymers present throughout the water column. The two most abundant taxa are phototrophs (green sulfur bacteria and cyanobacteria) that are highly influenced by the seasonal availability of sunlight. The extent of the Chlorobium biomass thriving at the interface in summer was captured in underwater video footage. The Chlorobium abundance dropped from up to 83% in summer to 6% in winter and 1% in spring, before rebounding to high levels. Predicted Chlorobium viruses and cyanophage were also abundant, but their levels did not negatively correlate with their hosts. CONCLUSION Over-wintering expeditions in Antarctica are logistically challenging, meaning insight into winter processes has been inferred from limited data. Here, we found that in contrast to chemolithoautotrophic carbon fixation potential of Southern Ocean Thaumarchaeota, this marine-derived lake evolved a reliance on photosynthesis. While viruses associated with phototrophs also have high seasonal abundance, the negative impact of viral infection on host growth appeared to be limited. The microbial community as a whole appears to have developed a capacity to generate biomass and remineralize nutrients, sufficient to sustain itself between two rounds of sunlight-driven summer-activity. In addition, this unique metagenome dataset provides considerable opportunity for future interrogation of eukaryotes and their viruses, abundant uncharacterized taxa (i.e. dark matter), and for testing hypotheses about endemic species in polar aquatic ecosystems. Video Abstract.
Collapse
Affiliation(s)
- Pratibha Panwar
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney, New South Wales, 2052, Australia
| | - Michelle A Allen
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney, New South Wales, 2052, Australia
| | - Timothy J Williams
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney, New South Wales, 2052, Australia
| | - Alyce M Hancock
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney, New South Wales, 2052, Australia
- Institute for Marine and Antarctic Studies, University of Tasmania, 20 Castray Esplanade, Battery Point, Tasmania, Australia
| | - Sarah Brazendale
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney, New South Wales, 2052, Australia
- , 476 Lancaster Rd, Pegarah, Australia
| | - James Bevington
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney, New South Wales, 2052, Australia
| | - Simon Roux
- Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - David Páez-Espino
- Department of Energy Joint Genome Institute, Berkeley, CA, USA
- Mammoth BioSciences, 279 East Grand Ave, South San Francisco, CA, USA
| | - Stephen Nayfach
- Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Maureen Berg
- Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - Frederik Schulz
- Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | - I-Min A Chen
- Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | | | - Nicole Shapiro
- Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | | | - Tanja Woyke
- Department of Energy Joint Genome Institute, Berkeley, CA, USA
| | | | - Ricardo Cavicchioli
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Sydney, New South Wales, 2052, Australia.
| |
Collapse
|
28
|
Harrington LB, Ma E, Chen JS, Witte IP, Gertz D, Paez-Espino D, Al-Shayeb B, Kyrpides NC, Burstein D, Banfield JF, Doudna JA. A scoutRNA Is Required for Some Type V CRISPR-Cas Systems. Mol Cell 2020; 79:416-424.e5. [PMID: 32645367 PMCID: PMC8196889 DOI: 10.1016/j.molcel.2020.06.022] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2019] [Revised: 03/02/2020] [Accepted: 06/12/2020] [Indexed: 12/26/2022]
Abstract
CRISPR-Cas12c/d proteins share limited homology with Cas12a and Cas9 bacterial CRISPR RNA (crRNA)-guided nucleases used widely for genome editing and DNA detection. However, Cas12c (C2c3)- and Cas12d (CasY)-catalyzed DNA cleavage and genome editing activities have not been directly observed. We show here that a short-complementarity untranslated RNA (scoutRNA), together with crRNA, is required for Cas12d-catalyzed DNA cutting. The scoutRNA differs in secondary structure from previously described tracrRNAs used by CRISPR-Cas9 and some Cas12 enzymes, and in Cas12d-containing systems, scoutRNA includes a conserved five-nucleotide sequence that is essential for activity. In addition to supporting crRNA-directed DNA recognition, biochemical and cell-based experiments establish scoutRNA as an essential cofactor for Cas12c-catalyzed pre-crRNA maturation. These results define scoutRNA as a third type of transcript encoded by a subset of CRISPR-Cas genomic loci and explain how Cas12c/d systems avoid requirements for host factors including ribonuclease III for bacterial RNA-mediated adaptive immunity.
Collapse
MESH Headings
- Bacteria/classification
- Bacteria/genetics
- Bacteria/immunology
- Bacteria/metabolism
- Bacterial Proteins/genetics
- Bacterial Proteins/metabolism
- Base Sequence
- CRISPR-Cas Systems
- Clustered Regularly Interspaced Short Palindromic Repeats
- DNA, Bacterial/chemistry
- DNA, Bacterial/genetics
- DNA, Bacterial/metabolism
- Endodeoxyribonucleases/genetics
- Endodeoxyribonucleases/metabolism
- Escherichia coli/genetics
- Escherichia coli/immunology
- Escherichia coli/metabolism
- Genome, Bacterial/immunology
- Nucleic Acid Conformation
- Phylogeny
- RNA, Bacterial/chemistry
- RNA, Bacterial/genetics
- RNA, Bacterial/metabolism
- RNA, Guide, CRISPR-Cas Systems/genetics
- RNA, Guide, CRISPR-Cas Systems/metabolism
- RNA, Small Untranslated/chemistry
- RNA, Small Untranslated/genetics
- RNA, Small Untranslated/metabolism
- Sequence Alignment
- Sequence Homology, Nucleic Acid
Collapse
Affiliation(s)
- Lucas B Harrington
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Enbo Ma
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Janice S Chen
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Isaac P Witte
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Dov Gertz
- School of Molecular Cell Biology and Biotechnology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 6997801, Israel
| | - David Paez-Espino
- Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Basem Al-Shayeb
- Department of Plant and Microbial Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Nikos C Kyrpides
- Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA; Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - David Burstein
- Department of Earth and Planetary Science, University of California, Berkeley, Berkeley, CA 94720, USA; School of Molecular Cell Biology and Biotechnology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Jillian F Banfield
- Department of Earth and Planetary Science, University of California, Berkeley, Berkeley, CA 94720, USA; Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Jennifer A Doudna
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA; California Institute for Quantitative Biosciences, University of California, Berkeley, Berkeley, CA 94720, USA; Department of Chemistry, University of California, Berkeley, Berkeley, CA 94720, USA; Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA 94720, USA; Howard Hughes Medical Institute, University of California, Berkeley, Berkeley, CA 94720, USA; Molecular Biophysics & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA; Gladstone Institutes, University of California, San Francisco, San Francisco, CA 94114, USA.
| |
Collapse
|
29
|
Cooper CJ, Zheng K, Rush KW, Johs A, Sanders BC, Pavlopoulos GA, Kyrpides NC, Podar M, Ovchinnikov S, Ragsdale SW, Parks JM. Structure determination of the HgcAB complex using metagenome sequence data: insights into microbial mercury methylation. Commun Biol 2020; 3:320. [PMID: 32561885 PMCID: PMC7305189 DOI: 10.1038/s42003-020-1047-5] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2020] [Accepted: 05/27/2020] [Indexed: 11/09/2022] Open
Abstract
Bacteria and archaea possessing the hgcAB gene pair methylate inorganic mercury (Hg) to form highly toxic methylmercury. HgcA consists of a corrinoid binding domain and a transmembrane domain, and HgcB is a dicluster ferredoxin. However, their detailed structure and function have not been thoroughly characterized. We modeled the HgcAB complex by combining metagenome sequence data mining, coevolution analysis, and Rosetta structure calculations. In addition, we overexpressed HgcA and HgcB in Escherichia coli, confirmed spectroscopically that they bind cobalamin and [4Fe-4S] clusters, respectively, and incorporated these cofactors into the structural model. Surprisingly, the two domains of HgcA do not interact with each other, but HgcB forms extensive contacts with both domains. The model suggests that conserved cysteines in HgcB are involved in shuttling HgII, methylmercury, or both. These findings refine our understanding of the mechanism of Hg methylation and expand the known repertoire of corrinoid methyltransferases in nature. Connor J. Cooper et al. expressed HgcA and HgcB in Escherichia coli and modeled the structure of the HgcAB complex by combining metagenome sequence data, coevolution analysis, and ab initio structure calculations. This study provides insights into the biochemical mechanism of mercury (Hg) methylation.
Collapse
Affiliation(s)
- Connor J Cooper
- Graduate School of Genome Science and Technology, University of Tennessee, F225 Walters Life Science, Knoxville, TN, 37996, USA.,Biosciences Division, Oak Ridge National Laboratory, 1 Bethel Valley Road, Oak Ridge, TN, 37831-6038, USA
| | - Kaiyuan Zheng
- Department of Biological Chemistry, University of Michigan Medical School, 1150 West Medical Center Drive, Ann Arbor, MI, 48109-0606, USA
| | - Katherine W Rush
- Department of Biological Chemistry, University of Michigan Medical School, 1150 West Medical Center Drive, Ann Arbor, MI, 48109-0606, USA
| | - Alexander Johs
- Environmental Sciences Division, Oak Ridge National Laboratory, 1 Bethel Valley Road, Oak Ridge, TN, 37831-6038, USA
| | - Brian C Sanders
- Biosciences Division, Oak Ridge National Laboratory, 1 Bethel Valley Road, Oak Ridge, TN, 37831-6038, USA
| | - Georgios A Pavlopoulos
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA, 94720, USA.,Institute for Fundamental Biomedical Research, Biomedical Science Research Center "Alexander Fleming", 34 Fleming Street, 16672, Vari, Greece
| | - Nikos C Kyrpides
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA, 94720, USA.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory Berkeley, California, USA
| | - Mircea Podar
- Graduate School of Genome Science and Technology, University of Tennessee, F225 Walters Life Science, Knoxville, TN, 37996, USA.,Biosciences Division, Oak Ridge National Laboratory, 1 Bethel Valley Road, Oak Ridge, TN, 37831-6038, USA
| | - Sergey Ovchinnikov
- John Harvard Distinguished Science Fellowship Program, Harvard University, Cambridge, MA, 02138, USA
| | - Stephen W Ragsdale
- Department of Biological Chemistry, University of Michigan Medical School, 1150 West Medical Center Drive, Ann Arbor, MI, 48109-0606, USA
| | - Jerry M Parks
- Graduate School of Genome Science and Technology, University of Tennessee, F225 Walters Life Science, Knoxville, TN, 37996, USA. .,Biosciences Division, Oak Ridge National Laboratory, 1 Bethel Valley Road, Oak Ridge, TN, 37831-6038, USA.
| |
Collapse
|
30
|
Palaniappan K, Chen IMA, Chu K, Ratner A, Seshadri R, Kyrpides NC, Ivanova NN, Mouncey NJ. IMG-ABC v.5.0: an update to the IMG/Atlas of Biosynthetic Gene Clusters Knowledgebase. Nucleic Acids Res 2020; 48:D422-D430. [PMID: 31665416 DOI: 10.1093/nar/gkz932] [Citation(s) in RCA: 47] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2019] [Revised: 10/02/2019] [Accepted: 10/09/2019] [Indexed: 01/14/2023] Open
Abstract
Microbial secondary metabolism is a reservoir of bioactive compounds of immense biotechnological and biomedical potential. The biosynthetic machinery responsible for the production of these secondary metabolites (SMs) (also called natural products) is often encoded by collocated groups of genes called biosynthetic gene clusters (BGCs). High-throughput genome sequencing of both isolates and metagenomic samples combined with the development of specialized computational workflows is enabling systematic identification of BGCs and the discovery of novel SMs. In order to advance exploration of microbial secondary metabolism and its diversity, we developed the largest publicly available database of predicted BGCs combined with experimentally verified BGCs, the Integrated Microbial Genomes Atlas of Biosynthetic gene Clusters (IMG-ABC) (https://img.jgi.doe.gov/abc-public). Here we describe the first major content update of the IMG-ABC knowledgebase, since its initial release in 2015, refreshing the BGC prediction pipeline with the latest version of antiSMASH (v5) as well as presenting the data in the context of underlying environmental metadata sourced from GOLD (https://gold.jgi.doe.gov/). This update has greatly improved the quality and expanded the types of predicted BGCs compared to the previous version.
Collapse
Affiliation(s)
- Krishnaveni Palaniappan
- Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - I-Min A Chen
- Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Ken Chu
- Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Anna Ratner
- Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Rekha Seshadri
- Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Nikos C Kyrpides
- Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Natalia N Ivanova
- Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Nigel J Mouncey
- Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| |
Collapse
|
31
|
Mukherjee S, Stamatis D, Bertsch J, Ovchinnikova G, Katta HY, Mojica A, Chen IMA, Kyrpides NC, Reddy T. Genomes OnLine database (GOLD) v.7: updates and new features. Nucleic Acids Res 2020; 47:D649-D659. [PMID: 30357420 PMCID: PMC6323969 DOI: 10.1093/nar/gky977] [Citation(s) in RCA: 125] [Impact Index Per Article: 31.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2018] [Accepted: 10/08/2018] [Indexed: 12/11/2022] Open
Abstract
The Genomes Online Database (GOLD) (https://gold.jgi.doe.gov) is an open online resource, which maintains an up-to-date catalog of genome and metagenome projects in the context of a comprehensive list of associated metadata. Information in GOLD is organized into four levels: Study, Biosample/Organism, Sequencing Project and Analysis Project. Currently GOLD hosts information on 33 415 Studies, 49 826 Biosamples, 313 324 Organisms, 215 881 Sequencing Projects and 174 454 Analysis Projects with a total of 541 metadata fields, of which 80 are based on controlled vocabulary (CV) terms. GOLD provides a user-friendly web interface to browse sequencing projects and launch advanced search tools across four classification levels. Users submit metadata on a wide range of Sequencing and Analysis Projects in GOLD before depositing sequence data to the Integrated Microbial Genomes (IMG) system for analysis. GOLD conforms with and supports the rules set by the Genomic Standards Consortium (GSC) Minimum Information standards. The current version of GOLD (v.7) has seen the number of projects and associated metadata increase exponentially over the years. This paper provides an update on the current status of GOLD and highlights the new features added over the last two years.
Collapse
Affiliation(s)
- Supratim Mukherjee
- Prokaryotic Super Program, DOE Joint Genome Institute, Walnut Creek, CA 94598, USA
| | - Dimitri Stamatis
- Prokaryotic Super Program, DOE Joint Genome Institute, Walnut Creek, CA 94598, USA
| | - Jon Bertsch
- Prokaryotic Super Program, DOE Joint Genome Institute, Walnut Creek, CA 94598, USA
| | - Galina Ovchinnikova
- Prokaryotic Super Program, DOE Joint Genome Institute, Walnut Creek, CA 94598, USA
| | - Hema Y Katta
- Prokaryotic Super Program, DOE Joint Genome Institute, Walnut Creek, CA 94598, USA
| | - Alejandro Mojica
- Prokaryotic Super Program, DOE Joint Genome Institute, Walnut Creek, CA 94598, USA
| | - I-Min A Chen
- Prokaryotic Super Program, DOE Joint Genome Institute, Walnut Creek, CA 94598, USA
| | - Nikos C Kyrpides
- Prokaryotic Super Program, DOE Joint Genome Institute, Walnut Creek, CA 94598, USA
| | - Tbk Reddy
- Prokaryotic Super Program, DOE Joint Genome Institute, Walnut Creek, CA 94598, USA
| |
Collapse
|
32
|
Paez-Espino D, Roux S, Chen IMA, Palaniappan K, Ratner A, Chu K, Huntemann M, Reddy TBK, Pons JC, Llabrés M, Eloe-Fadrosh EA, Ivanova NN, Kyrpides NC. IMG/VR v.2.0: an integrated data management and analysis system for cultivated and environmental viral genomes. Nucleic Acids Res 2020; 47:D678-D686. [PMID: 30407573 PMCID: PMC6323928 DOI: 10.1093/nar/gky1127] [Citation(s) in RCA: 111] [Impact Index Per Article: 27.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2018] [Accepted: 10/31/2018] [Indexed: 01/06/2023] Open
Abstract
The Integrated Microbial Genome/Virus (IMG/VR) system v.2.0 (https://img.jgi.doe.gov/vr/) is the largest publicly available data management and analysis platform dedicated to viral genomics. Since the last report published in the 2016, NAR Database Issue, the data has tripled in size and currently contains genomes of 8389 cultivated reference viruses, 12 498 previously published curated prophages derived from cultivated microbial isolates, and 735 112 viral genomic fragments computationally predicted from assembled shotgun metagenomes. Nearly 60% of the viral genomes and genome fragments are clustered into 110 384 viral Operational Taxonomic Units (vOTUs) with two or more members. To improve data quality and predictions of host specificity, IMG/VR v.2.0 now separates prokaryotic and eukaryotic viruses, utilizes known prophage sequences to improve taxonomic assignments, and provides viral genome quality scores based on the estimated genome completeness. New features also include enhanced BLAST search capabilities for external queries. Finally, geographic map visualization to locate user-selected viral genomes or genome fragments has been implemented and download options have been extended. All of these features make IMG/VR v.2.0 a key resource for the study of viruses.
Collapse
Affiliation(s)
| | - Simon Roux
- Department of Energy, Joint Genome Institute, Walnut Creek, CA, USA
| | - I-Min A Chen
- Biological Data Management and Technology Center, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA, USA
| | - Krishna Palaniappan
- Biological Data Management and Technology Center, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA, USA
| | - Anna Ratner
- Biological Data Management and Technology Center, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA, USA
| | - Ken Chu
- Biological Data Management and Technology Center, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA, USA
| | - Marcel Huntemann
- Department of Energy, Joint Genome Institute, Walnut Creek, CA, USA
| | - T B K Reddy
- Department of Energy, Joint Genome Institute, Walnut Creek, CA, USA
| | - Joan Carles Pons
- Department of Mathematics and Computer Science, University of the Balearic Islands, Spain
| | - Mercè Llabrés
- Department of Mathematics and Computer Science, University of the Balearic Islands, Spain
| | | | | | - Nikos C Kyrpides
- Department of Energy, Joint Genome Institute, Walnut Creek, CA, USA
| |
Collapse
|
33
|
Chen IMA, Chu K, Palaniappan K, Pillay M, Ratner A, Huang J, Huntemann M, Varghese N, White JR, Seshadri R, Smirnova T, Kirton E, Jungbluth SP, Woyke T, Eloe-Fadrosh EA, Ivanova NN, Kyrpides NC. IMG/M v.5.0: an integrated data management and comparative analysis system for microbial genomes and microbiomes. Nucleic Acids Res 2020; 47:D666-D677. [PMID: 30289528 PMCID: PMC6323987 DOI: 10.1093/nar/gky901] [Citation(s) in RCA: 536] [Impact Index Per Article: 134.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2018] [Accepted: 09/24/2018] [Indexed: 11/12/2022] Open
Abstract
The Integrated Microbial Genomes & Microbiomes system v.5.0 (IMG/M: https://img.jgi.doe.gov/m/) contains annotated datasets categorized into: archaea, bacteria, eukarya, plasmids, viruses, genome fragments, metagenomes, cell enrichments, single particle sorts, and metatranscriptomes. Source datasets include those generated by the DOE's Joint Genome Institute (JGI), submitted by external scientists, or collected from public sequence data archives such as NCBI. All submissions are typically processed through the IMG annotation pipeline and then loaded into the IMG data warehouse. IMG's web user interface provides a variety of analytical and visualization tools for comparative analysis of isolate genomes and metagenomes in IMG. IMG/M allows open access to all public genomes in the IMG data warehouse, while its expert review (ER) system (IMG/MER: https://img.jgi.doe.gov/mer/) allows registered users to access their private genomes and to store their private datasets in workspace for sharing and for further analysis. IMG/M data content has grown by 60% since the last report published in the 2017 NAR Database Issue. IMG/M v.5.0 has a new and more powerful genome search feature, new statistical tools, and supports metagenome binning.
Collapse
Affiliation(s)
- I-Min A Chen
- Department of Energy, Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Ken Chu
- Department of Energy, Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Krishna Palaniappan
- Department of Energy, Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Manoj Pillay
- Department of Energy, Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Anna Ratner
- Department of Energy, Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Jinghua Huang
- Department of Energy, Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Marcel Huntemann
- Department of Energy, Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Neha Varghese
- Department of Energy, Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | | | - Rekha Seshadri
- Department of Energy, Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Tatyana Smirnova
- Department of Energy, Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Edward Kirton
- Department of Energy, Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Sean P Jungbluth
- Department of Energy, Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Tanja Woyke
- Department of Energy, Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Emiley A Eloe-Fadrosh
- Department of Energy, Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Natalia N Ivanova
- Department of Energy, Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Nikos C Kyrpides
- Department of Energy, Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| |
Collapse
|
34
|
Wiese J, Imhoff JF, Horn H, Borchert E, Kyrpides NC, Göker M, Klenk HP, Woyke T, Hentschel U. Genome analysis of the marine bacterium Kiloniella laminariae and first insights into comparative genomics with related Kiloniella species. Arch Microbiol 2020; 202:815-824. [PMID: 31844948 DOI: 10.1007/s00203-019-01791-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2019] [Revised: 11/27/2019] [Accepted: 12/03/2019] [Indexed: 10/25/2022]
Abstract
Kiloniella laminariae is a true marine bacterium and the first member of the family and order, the Kiloniellaceae and Kiloniellales. K. laminariae LD81T (= DSM 19542T) was isolated from the marine macroalga Saccharina latissima and is a mesophilic, typical marine chemoheterotrophic aerobic bacterium with antifungal activity. Phylogenetic analysis of 16S rRNA gene sequence revealed the similarity of K. laminariae LD81T not only with three validly described species of the genus Kiloniella, but also with undescribed isolates and clone sequences from marine samples in the range of 93.6-96.7%. We report on the analysis of the draft genome of this alphaproteobacterium and describe some selected features. The 4.4 Mb genome has a G + C content of 51.4%, contains 4213 coding sequences including 51 RNA genes as well as 4162 protein-coding genes, and is a part of the Genomic Encyclopaedia of Bacteria and Archaea (GEBA) project. The genome provides insights into a number of metabolic properties, such as carbon and sulfur metabolism, and indicates the potential for denitrification and the biosynthesis of secondary metabolites. Comparative genome analysis was performed with K. laminariae LD81T and the animal-associated species Kiloniella majae M56.1T from a spider crab, Kiloniella spongiae MEBiC09566T from a sponge as well as Kiloniella litopenai P1-1 from a white shrimp, which all inhabit quite different marine habitats. The analysis revealed that the K. laminariae LD81T contains 1397 unique genes, more than twice the amount of the other species. Unique among others is a mixed PKS/NRPS biosynthetic gene cluster with similarity to the biosynthetic gene cluster responsible for the production of syringomycin.
Collapse
Affiliation(s)
- Jutta Wiese
- GEOMAR Helmholtz Centre for Ocean Research Kiel, RD3 Marine Symbioses, Düsternbrooker Weg 20, 24105, Kiel, Germany.
| | - Johannes F Imhoff
- GEOMAR Helmholtz Centre for Ocean Research Kiel, RD3 Marine Symbioses, Düsternbrooker Weg 20, 24105, Kiel, Germany
| | - Hannes Horn
- GEOMAR Helmholtz Centre for Ocean Research Kiel, RD3 Marine Symbioses, Düsternbrooker Weg 20, 24105, Kiel, Germany
| | - Erik Borchert
- GEOMAR Helmholtz Centre for Ocean Research Kiel, RD3 Marine Symbioses, Düsternbrooker Weg 20, 24105, Kiel, Germany
| | - Nikos C Kyrpides
- DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA
| | - Markus Göker
- Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures, Inhoffenstraße 7B, 38124, Braunschweig, Germany
| | - Hans-Peter Klenk
- School of Natural and Environmental Sciences, Newcastle University, Ridley Building 2, Newcastle upon Tyne, NE1 7RU, UK
| | - Tanja Woyke
- DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA, 94598, USA
| | - Ute Hentschel
- GEOMAR Helmholtz Centre for Ocean Research Kiel, RD3 Marine Symbioses, Düsternbrooker Weg 20, 24105, Kiel, Germany
- Christian-Albrechts-University (CAU) of Kiel, Kiel, Germany
| |
Collapse
|
35
|
Hördt A, López MG, Meier-Kolthoff JP, Schleuning M, Weinhold LM, Tindall BJ, Gronow S, Kyrpides NC, Woyke T, Göker M. Analysis of 1,000+ Type-Strain Genomes Substantially Improves Taxonomic Classification of Alphaproteobacteria. Front Microbiol 2020; 11:468. [PMID: 32373076 PMCID: PMC7179689 DOI: 10.3389/fmicb.2020.00468] [Citation(s) in RCA: 223] [Impact Index Per Article: 55.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2019] [Accepted: 03/04/2020] [Indexed: 11/13/2022] Open
Abstract
The class Alphaproteobacteria is comprised of a diverse assemblage of Gram-negative bacteria that includes organisms of varying morphologies, physiologies and habitat preferences many of which are of clinical and ecological importance. Alphaproteobacteria classification has proved to be difficult, not least when taxonomic decisions rested heavily on a limited number of phenotypic features and interpretation of poorly resolved 16S rRNA gene trees. Despite progress in recent years regarding the classification of bacteria assigned to the class, there remains a need to further clarify taxonomic relationships. Here, draft genome sequences of a collection of genomes of more than 1000 Alphaproteobacteria and outgroup type strains were used to infer phylogenetic trees from genome-scale data using the principles drawn from phylogenetic systematics. The majority of taxa were found to be monophyletic but several orders, families and genera, including taxa recognized as problematic long ago but also quite recent taxa, as well as a few species were shown to be in need of revision. According proposals are made for the recognition of new orders, families and genera, as well as the transfer of a variety of species to other genera and of a variety of genera to other families. In addition, emended descriptions are given for many species mainly involving information on DNA G+C content and (approximate) genome size, both of which are confirmed as valuable taxonomic markers. Similarly, analysis of the gene content was shown to provide valuable taxonomic insights in the class. Significant incongruities between 16S rRNA gene and whole genome trees were not found in the class. The incongruities that became obvious when comparing the results of the present study with existing classifications appeared to be caused mainly by insufficiently resolved 16S rRNA gene trees or incomplete taxon sampling. Another probable cause of misclassifications in the past is the partially low overall fit of phenotypic characters to the sequence-based tree. Even though a significant degree of phylogenetic conservation was detected in all characters investigated, the overall fit to the tree varied considerably.
Collapse
Affiliation(s)
- Anton Hördt
- Department of Bioinformatics, Leibniz Institute DSMZ – German Collection of Microorganisms and Cell Cultures, Brunswick, Germany
| | - Marina García López
- Department of Bioinformatics, Leibniz Institute DSMZ – German Collection of Microorganisms and Cell Cultures, Brunswick, Germany
| | - Jan P. Meier-Kolthoff
- Department of Bioinformatics, Leibniz Institute DSMZ – German Collection of Microorganisms and Cell Cultures, Brunswick, Germany
| | - Marcel Schleuning
- Department of Bioinformatics, Leibniz Institute DSMZ – German Collection of Microorganisms and Cell Cultures, Brunswick, Germany
| | - Lisa-Maria Weinhold
- Department of Bioinformatics, Leibniz Institute DSMZ – German Collection of Microorganisms and Cell Cultures, Brunswick, Germany
- Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, Prague, Czechia
| | - Brian J. Tindall
- Department of Microorganisms, Leibniz Institute DSMZ – German Collection of Microorganisms and Cell Cultures, Brunswick, Germany
| | - Sabine Gronow
- Department of Microorganisms, Leibniz Institute DSMZ – German Collection of Microorganisms and Cell Cultures, Brunswick, Germany
| | - Nikos C. Kyrpides
- Department of Energy, Joint Genome Institute, Berkeley, CA, United States
| | - Tanja Woyke
- Department of Energy, Joint Genome Institute, Berkeley, CA, United States
| | - Markus Göker
- Department of Bioinformatics, Leibniz Institute DSMZ – German Collection of Microorganisms and Cell Cultures, Brunswick, Germany
| |
Collapse
|
36
|
Laviad-Shitrit S, Izhaki I, Whitman WB, Shapiro N, Woyke T, Kyrpides NC, Halpern M. Draft genome of Rosenbergiella nectarea strain 8N4 T provides insights into the potential role of this species in its plant host. PeerJ 2020; 8:e8822. [PMID: 32292647 PMCID: PMC7144588 DOI: 10.7717/peerj.8822] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2019] [Accepted: 02/28/2020] [Indexed: 11/20/2022] Open
Abstract
Background Rosenbergiella nectarea strain 8N4T, the type species of the genus Rosenbergiella, was isolated from Amygdalus communis (almond) floral nectar. Other strains of this species were isolated from the floral nectar of Citrus paradisi (grapefruit), Nicotiana glauca (tobacco tree) and from Asphodelus aestivus. R. nectarea strain 8N4T is a Gram-negative, oxidase-negative, facultatively anaerobic bacterium in the family Enterobacteriaceae. Results Here we describe features of this organism, together with its genome sequence and annotation. The DNA GC content is 47.38%, the assembly size is 3,294,717 bp, and the total number of genes are 3,346. The genome discloses the possible role that this species may play in the plant. The genome contains both virulence genes, like pectin lyase and hemolysin, that may harm plant cells and genes that are predicted to produce volatile compounds that may impact the visitation rates by nectar consumers, such as pollinators and nectar thieves. Conclusions The genome of R. nectarea strain 8N4T reveals a mutualistic interaction with the plant host and a possible effect on plant pollination and fitness.
Collapse
Affiliation(s)
- Sivan Laviad-Shitrit
- Department of Evolutionary and Environmental Biology, Faculty of Natural Sciences, University of Haifa, Haifa, Israel
| | - Ido Izhaki
- Department of Evolutionary and Environmental Biology, Faculty of Natural Sciences, University of Haifa, Haifa, Israel
| | | | - Nicole Shapiro
- Department of Energy Joint Genome Institute, DOE Joint Genome Institute, Berkeley, CA, USA
| | - Tanja Woyke
- Department of Energy Joint Genome Institute, DOE Joint Genome Institute, Berkeley, CA, USA
| | - Nikos C Kyrpides
- Department of Energy Joint Genome Institute, DOE Joint Genome Institute, Berkeley, CA, USA
| | - Malka Halpern
- Department of Biology and Environment, Faculty of Natural Sciences, University of Haifa, Oranim, Kiryat Tivon, Israel
| |
Collapse
|
37
|
Roux S, Krupovic M, Daly RA, Borges AL, Nayfach S, Schulz F, Sharrar A, Matheus Carnevali PB, Cheng JF, Ivanova NN, Bondy-Denomy J, Wrighton KC, Woyke T, Visel A, Kyrpides NC, Eloe-Fadrosh EA. Author Correction: Cryptic inoviruses revealed as pervasive in bacteria and archaea across Earth’s biomes. Nat Microbiol 2020; 5:527. [PMID: 32047285 PMCID: PMC7608142 DOI: 10.1038/s41564-020-0681-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
38
|
Schulz F, Roux S, Paez-Espino D, Jungbluth S, Walsh DA, Denef VJ, McMahon KD, Konstantinidis KT, Eloe-Fadrosh EA, Kyrpides NC, Woyke T. Giant virus diversity and host interactions through global metagenomics. Nature 2020; 578:432-436. [PMID: 31968354 PMCID: PMC7162819 DOI: 10.1038/s41586-020-1957-x] [Citation(s) in RCA: 140] [Impact Index Per Article: 35.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2019] [Accepted: 01/09/2020] [Indexed: 12/11/2022]
Abstract
Our current knowledge about nucleocytoplasmic large DNA viruses (NCLDVs) is largely derived from viral isolates that are co-cultivated with protists and algae. Here we reconstructed 2,074 NCLDV genomes from sampling sites across the globe by building on the rapidly increasing amount of publicly available metagenome data. This led to an 11-fold increase in phylogenetic diversity and a parallel 10-fold expansion in functional diversity. Analysis of 58,023 major capsid proteins from large and giant viruses using metagenomic data revealed the global distribution patterns and cosmopolitan nature of these viruses. The discovered viral genomes encoded a wide range of proteins with putative roles in photosynthesis and diverse substrate transport processes, indicating that host reprogramming is probably a common strategy in the NCLDVs. Furthermore, inferences of horizontal gene transfer connected viral lineages to diverse eukaryotic hosts. We anticipate that the global diversity of NCLDVs that we describe here will establish giant viruses-which are associated with most major eukaryotic lineages-as important players in ecosystems across Earth's biomes.
Collapse
Affiliation(s)
- Frederik Schulz
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| | - Simon Roux
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - David Paez-Espino
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Sean Jungbluth
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - David A Walsh
- Groupe de recherche interuniversitaire en limnologie, Department of Biology, Concordia University, Montréal, Québec, Canada
| | - Vincent J Denef
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA
| | - Katherine D McMahon
- Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, USA
- Department of Civil and Environmental Engineering, University of Wisconsin-Madison, Madison, WI, USA
| | | | - Emiley A Eloe-Fadrosh
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Nikos C Kyrpides
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Tanja Woyke
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| |
Collapse
|
39
|
Chasapi A, Aivaliotis M, Angelis L, Chanalaris A, Iliopoulos I, Kappas I, Karapiperis C, Kyrpides NC, Pafilis E, Panteris E, Topalis P, Tsiamis G, Vizirianakis IS, Vlassi M, Promponas VJ, Ouzounis CA. Establishment of computational biology in Greece and Cyprus: Past, present, and future. PLoS Comput Biol 2019; 15:e1007532. [PMID: 31856214 PMCID: PMC6922331 DOI: 10.1371/journal.pcbi.1007532] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Affiliation(s)
- Anastasia Chasapi
- Biological Computation & Process Lab, Chemical Process & Energy Resources Institute, Centre for Research & Technology Hellas, Thessalonica, Greece
| | - Michalis Aivaliotis
- School of Medicine, Faculty of Health Sciences, Aristotle University of Thessaloniki, Thessalonica, Greece
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology Hellas, Heraklion, Greece
| | - Lefteris Angelis
- School of Informatics, Aristotle University of Thessaloniki, Thessalonica, Greece
| | - Anastasios Chanalaris
- Botnar Research Centre, NDORMS, Medical Sciences Division, University of Oxford, Oxford, United Kingdom
| | - Ioannis Iliopoulos
- Division of Basic Sciences, School of Medicine, University of Crete, Heraklion, Greece
| | - Ilias Kappas
- School of Biology, Aristotle University of Thessaloniki, Thessalonica, Greece
| | - Christos Karapiperis
- School of Informatics, Aristotle University of Thessaloniki, Thessalonica, Greece
| | - Nikos C. Kyrpides
- Department of Energy, Joint Genome Institute, Walnut Creek, California, United States of America
| | - Evangelos Pafilis
- Institute of Marine Biology Biotechnology and Aquaculture, Hellenic Centre for Marine Research, Heraklion, Greece
| | - Eleftherios Panteris
- First Psychiatric Clinic, Papageorgiou General Hospital, Aristotle University of Thessaloniki, Thessalonica, Greece
| | - Pantelis Topalis
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology Hellas, Heraklion, Greece
| | - George Tsiamis
- Department of Environmental Engineering, School of Engineering, University of Patras, Patras, Greece
| | - Ioannis S. Vizirianakis
- Department of Pharmacology, School of Pharmacy, Aristotle University of Thessaloniki, Thessalonica, Greece
| | - Metaxia Vlassi
- Institute of Biosciences & Applications, National Centre for Scientific Research Demokritos, Athens, Greece
| | - Vasilis J. Promponas
- Bioinformatics Research Laboratory, Department of Biological Sciences, University of Cyprus, Nicosia, Cyprus
- * E-mail: (VJP); (CAO)
| | - Christos A. Ouzounis
- Biological Computation & Process Lab, Chemical Process & Energy Resources Institute, Centre for Research & Technology Hellas, Thessalonica, Greece
- * E-mail: (VJP); (CAO)
| |
Collapse
|
40
|
Paez-Espino D, Zhou J, Roux S, Nayfach S, Pavlopoulos GA, Schulz F, McMahon KD, Walsh D, Woyke T, Ivanova NN, Eloe-Fadrosh EA, Tringe SG, Kyrpides NC. Diversity, evolution, and classification of virophages uncovered through global metagenomics. Microbiome 2019; 7:157. [PMID: 31823797 PMCID: PMC6905037 DOI: 10.1186/s40168-019-0768-5] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/05/2019] [Accepted: 11/11/2019] [Indexed: 05/19/2023]
Abstract
BACKGROUND Virophages are small viruses with double-stranded DNA genomes that replicate along with giant viruses and co-infect eukaryotic cells. Due to the paucity of virophage reference genomes, a collective understanding of the global virophage diversity, distribution, and evolution is lacking. RESULTS Here we screened a public collection of over 14,000 metagenomes using the virophage-specific major capsid protein (MCP) as "bait." We identified 44,221 assembled virophage sequences, of which 328 represent high-quality (complete or near-complete) genomes from diverse habitats including the human gut, plant rhizosphere, and terrestrial subsurface. Comparative genomic analysis confirmed the presence of four core genes in a conserved block. We used these genes to establish a revised virophage classification including 27 clades with consistent genome length, gene content, and habitat distribution. Moreover, for eight high-quality virophage genomes, we computationally predicted putative eukaryotic virus hosts. CONCLUSION Overall, our approach has increased the number of known virophage genomes by 10-fold and revealed patterns of genome evolution and global virophage distribution. We anticipate that the expanded diversity presented here will provide the backbone for further virophage studies.
Collapse
Affiliation(s)
- David Paez-Espino
- Department of Energy, Joint Genome Institute, 2800 Mitchell Dr., Walnut Creek, 94598 USA
| | - Jinglie Zhou
- Department of Energy, Joint Genome Institute, 2800 Mitchell Dr., Walnut Creek, 94598 USA
| | - Simon Roux
- Department of Energy, Joint Genome Institute, 2800 Mitchell Dr., Walnut Creek, 94598 USA
| | - Stephen Nayfach
- Department of Energy, Joint Genome Institute, 2800 Mitchell Dr., Walnut Creek, 94598 USA
| | - Georgios A. Pavlopoulos
- Department of Energy, Joint Genome Institute, 2800 Mitchell Dr., Walnut Creek, 94598 USA
- BSRC “Alexander Fleming”, 34 Fleming Street, Vari, 16672 Athens, Greece
| | - Frederik Schulz
- Department of Energy, Joint Genome Institute, 2800 Mitchell Dr., Walnut Creek, 94598 USA
| | - Katherine D. McMahon
- Departments of Civil and Environmental Engineering and Bacteriology, University of Wisconsin Madison, 1550 Linden Drive, Madison, WI 53726 USA
| | - David Walsh
- Department of Biology, Concordia University, 7141 Sherbrooke St. West, Montreal, QC, H4B 1R6 Canada
| | - Tanja Woyke
- Department of Energy, Joint Genome Institute, 2800 Mitchell Dr., Walnut Creek, 94598 USA
| | - Natalia N. Ivanova
- Department of Energy, Joint Genome Institute, 2800 Mitchell Dr., Walnut Creek, 94598 USA
| | - Emiley A. Eloe-Fadrosh
- Department of Energy, Joint Genome Institute, 2800 Mitchell Dr., Walnut Creek, 94598 USA
| | - Susannah G. Tringe
- Department of Energy, Joint Genome Institute, 2800 Mitchell Dr., Walnut Creek, 94598 USA
| | - Nikos C. Kyrpides
- Department of Energy, Joint Genome Institute, 2800 Mitchell Dr., Walnut Creek, 94598 USA
| |
Collapse
|
41
|
Högfors-Rönnholm E, Lopez-Fernandez M, Christel S, Brambilla D, Huntemann M, Clum A, Foster B, Foster B, Roux S, Palaniappan K, Varghese N, Mukherjee S, Reddy TBK, Daum C, Copeland A, Chen IMA, Ivanova NN, Kyrpides NC, Harmon-Smith M, Eloe-Fadrosh EA, Lundin D, Engblom S, Dopson M. Metagenomes and metatranscriptomes from boreal potential and actual acid sulfate soil materials. Sci Data 2019; 6:207. [PMID: 31619684 PMCID: PMC6795848 DOI: 10.1038/s41597-019-0222-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2019] [Accepted: 08/27/2019] [Indexed: 11/09/2022] Open
Abstract
Natural sulfide rich deposits are common in coastal areas worldwide, including along the Baltic Sea coast. When artificial drainage exposes these deposits to atmospheric oxygen, iron sulfide minerals in the soils are rapidly oxidized. This process turns the potential acid sulfate soils into actual acid sulfate soils and mobilizes large quantities of acidity and leachable toxic metals that cause severe environmental problems. It is known that acidophilic microorganisms living in acid sulfate soils catalyze iron sulfide mineral oxidation. However, only a few studies regarding these communities have been published. In this study, we sampled the oxidized actual acid sulfate soil, the transition zone where oxidation is actively taking place, and the deepest un-oxidized potential acid sulfate soil. Nucleic acids were extracted and 16S rRNA gene amplicons, metagenomes, and metatranscriptomes generated to gain a detailed insight into the communities and their activities. The project will be of great use to microbiologists, environmental biologists, geochemists, and geologists as there is hydrological and geochemical monitoring from the site stretching back for many years.
Collapse
Affiliation(s)
- Eva Högfors-Rönnholm
- Research and Development, Novia University of Applied Sciences, Vaasa, 65200, Finland.
| | - Margarita Lopez-Fernandez
- Centre for Ecology and Evolution in Microbial Model Systems (EEMiS), Linnaeus University, Kalmar, 59231, Sweden
| | - Stephan Christel
- Centre for Ecology and Evolution in Microbial Model Systems (EEMiS), Linnaeus University, Kalmar, 59231, Sweden
| | - Diego Brambilla
- Centre for Ecology and Evolution in Microbial Model Systems (EEMiS), Linnaeus University, Kalmar, 59231, Sweden
| | - Marcel Huntemann
- Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Alicia Clum
- Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Brian Foster
- Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Bryce Foster
- Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Simon Roux
- Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | | | - Neha Varghese
- Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Supratim Mukherjee
- Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - T B K Reddy
- Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Chris Daum
- Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Alex Copeland
- Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - I-Min A Chen
- Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Natalia N Ivanova
- Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Nikos C Kyrpides
- Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | | | | | - Daniel Lundin
- Centre for Ecology and Evolution in Microbial Model Systems (EEMiS), Linnaeus University, Kalmar, 59231, Sweden
| | - Sten Engblom
- Research and Development, Novia University of Applied Sciences, Vaasa, 65200, Finland
| | - Mark Dopson
- Centre for Ecology and Evolution in Microbial Model Systems (EEMiS), Linnaeus University, Kalmar, 59231, Sweden
| |
Collapse
|
42
|
García-López M, Meier-Kolthoff JP, Tindall BJ, Gronow S, Woyke T, Kyrpides NC, Hahnke RL, Göker M. Analysis of 1,000 Type-Strain Genomes Improves Taxonomic Classification of Bacteroidetes. Front Microbiol 2019; 10:2083. [PMID: 31608019 PMCID: PMC6767994 DOI: 10.3389/fmicb.2019.02083] [Citation(s) in RCA: 163] [Impact Index Per Article: 32.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2018] [Accepted: 08/23/2019] [Indexed: 11/25/2022] Open
Abstract
Although considerable progress has been made in recent years regarding the classification of bacteria assigned to the phylum Bacteroidetes, there remains a need to further clarify taxonomic relationships within a diverse assemblage that includes organisms of clinical, piscicultural, and ecological importance. Bacteroidetes classification has proved to be difficult, not least when taxonomic decisions rested heavily on interpretation of poorly resolved 16S rRNA gene trees and a limited number of phenotypic features. Here, draft genome sequences of a greatly enlarged collection of genomes of more than 1,000 Bacteroidetes and outgroup type strains were used to infer phylogenetic trees from genome-scale data using the principles drawn from phylogenetic systematics. The majority of taxa were found to be monophyletic but several orders, families and genera, including taxa proposed long ago such as Bacteroides, Cytophaga, and Flavobacterium but also quite recent taxa, as well as a few species were shown to be in need of revision. According proposals are made for the recognition of new orders, families and genera, as well as the transfer of a variety of species to other genera. In addition, emended descriptions are given for many species mainly involving information on DNA G+C content and (approximate) genome size, both of which can be considered valuable taxonomic markers. We detected many incongruities when comparing the results of the present study with existing classifications, which appear to be caused by insufficiently resolved 16S rRNA gene trees or incomplete taxon sampling. The few significant incongruities found between 16S rRNA gene and whole genome trees underline the pitfalls inherent in phylogenies based upon single gene sequences and the impediment in using ordinary bootstrapping in phylogenomic studies, particularly when combined with too narrow gene selections. While a significant degree of phylogenetic conservation was detected in all phenotypic characters investigated, the overall fit to the tree varied considerably, which is one of the probable causes of misclassifications in the past, much like the use of plesiomorphic character states as diagnostic features.
Collapse
Affiliation(s)
- Marina García-López
- Department of Microorganisms, Leibniz Institute DSMZ – German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
| | - Jan P. Meier-Kolthoff
- Department of Microorganisms, Leibniz Institute DSMZ – German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
| | - Brian J. Tindall
- Department of Microorganisms, Leibniz Institute DSMZ – German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
| | - Sabine Gronow
- Department of Microorganisms, Leibniz Institute DSMZ – German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
| | - Tanja Woyke
- Department of Energy, Joint Genome Institute, Walnut Creek, CA, United States
| | - Nikos C. Kyrpides
- Department of Energy, Joint Genome Institute, Walnut Creek, CA, United States
| | - Richard L. Hahnke
- Department of Microorganisms, Leibniz Institute DSMZ – German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
| | - Markus Göker
- Department of Microorganisms, Leibniz Institute DSMZ – German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
| |
Collapse
|
43
|
Camargo AP, de Souza RSC, de Britto Costa P, Gerhardt IR, Dante RA, Teodoro GS, Abrahão A, Lambers H, Carazzolle MF, Huntemann M, Clum A, Foster B, Foster B, Roux S, Palaniappan K, Varghese N, Mukherjee S, Reddy TBK, Daum C, Copeland A, Chen IMA, Ivanova NN, Kyrpides NC, Pennacchio C, Eloe-Fadrosh EA, Arruda P, Oliveira RS. Microbiomes of Velloziaceae from phosphorus-impoverished soils of the campos rupestres, a biodiversity hotspot. Sci Data 2019; 6:140. [PMID: 31366912 PMCID: PMC6668480 DOI: 10.1038/s41597-019-0141-3] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2019] [Accepted: 06/25/2019] [Indexed: 12/22/2022] Open
Abstract
The rocky, seasonally-dry and nutrient-impoverished soils of the Brazilian campos rupestres impose severe growth-limiting conditions on plants. Species of a dominant plant family, Velloziaceae, are highly specialized to low-nutrient conditions and seasonal water availability of this environment, where phosphorus (P) is the key limiting nutrient. Despite plant-microbe associations playing critical roles in stressful ecosystems, the contribution of these interactions in the campos rupestres remains poorly studied. Here we present the first microbiome data of Velloziaceae spp. thriving in contrasting substrates of campos rupestres. We assessed the microbiomes of Vellozia epidendroides, which occupies shallow patches of soil, and Barbacenia macrantha, growing on exposed rocks. The prokaryotic and fungal profiles were assessed by rRNA barcode sequencing of epiphytic and endophytic compartments of roots, stems, leaves and surrounding soil/rocks. We also generated root and substrate (rock/soil)-associated metagenomes of each plant species. We foresee that these data will contribute to decipher how the microbiome contributes to plant functioning in the campos rupestres, and to unravel new strategies for improved crop productivity in stressful environments.
Collapse
Affiliation(s)
- Antonio Pedro Camargo
- Centro de Biologia Molecular e Engenharia Genética, Universidade Estadual de Campinas (UNICAMP), 13083-875, Campinas, SP, Brazil
- Departamento de Genética e Evolução, Instituto de Biologia, Universidade Estadual de Campinas (UNICAMP), 13083-875, Campinas, SP, Brazil
- Genomics for Climate Change Research Center, Universidade Estadual de Campinas (UNICAMP), 13083-875, Campinas, SP, Brazil
| | - Rafael Soares Correa de Souza
- Centro de Biologia Molecular e Engenharia Genética, Universidade Estadual de Campinas (UNICAMP), 13083-875, Campinas, SP, Brazil.
- Departamento de Genética e Evolução, Instituto de Biologia, Universidade Estadual de Campinas (UNICAMP), 13083-875, Campinas, SP, Brazil.
- Genomics for Climate Change Research Center, Universidade Estadual de Campinas (UNICAMP), 13083-875, Campinas, SP, Brazil.
| | - Patrícia de Britto Costa
- Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas (UNICAMP), 13083-862, Campinas, SP, Brazil
- School of Biological Sciences, University of Western Australia (UWA), Perth, WA, 6009, Australia
| | - Isabel Rodrigues Gerhardt
- Centro de Biologia Molecular e Engenharia Genética, Universidade Estadual de Campinas (UNICAMP), 13083-875, Campinas, SP, Brazil
- Genomics for Climate Change Research Center, Universidade Estadual de Campinas (UNICAMP), 13083-875, Campinas, SP, Brazil
- Embrapa Informática Agropecuária, 13083-886, Campinas, SP, Brazil
| | - Ricardo Augusto Dante
- Centro de Biologia Molecular e Engenharia Genética, Universidade Estadual de Campinas (UNICAMP), 13083-875, Campinas, SP, Brazil
- Genomics for Climate Change Research Center, Universidade Estadual de Campinas (UNICAMP), 13083-875, Campinas, SP, Brazil
- Embrapa Informática Agropecuária, 13083-886, Campinas, SP, Brazil
| | - Grazielle Sales Teodoro
- Instituto de Ciências Biológicas, Universidade Federal do Para (UFPA), 66075-750, Belem, PA, Brazil
| | - Anna Abrahão
- Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas (UNICAMP), 13083-862, Campinas, SP, Brazil
- School of Biological Sciences, University of Western Australia (UWA), Perth, WA, 6009, Australia
| | - Hans Lambers
- School of Biological Sciences, University of Western Australia (UWA), Perth, WA, 6009, Australia
| | - Marcelo Falsarella Carazzolle
- Departamento de Genética e Evolução, Instituto de Biologia, Universidade Estadual de Campinas (UNICAMP), 13083-875, Campinas, SP, Brazil
| | - Marcel Huntemann
- Department of Energy Joint Genome Institute, Walnut Creek, California, 94598, USA
| | - Alicia Clum
- Department of Energy Joint Genome Institute, Walnut Creek, California, 94598, USA
| | - Brian Foster
- Department of Energy Joint Genome Institute, Walnut Creek, California, 94598, USA
| | - Bryce Foster
- Department of Energy Joint Genome Institute, Walnut Creek, California, 94598, USA
| | - Simon Roux
- Department of Energy Joint Genome Institute, Walnut Creek, California, 94598, USA
| | | | - Neha Varghese
- Department of Energy Joint Genome Institute, Walnut Creek, California, 94598, USA
| | - Supratim Mukherjee
- Department of Energy Joint Genome Institute, Walnut Creek, California, 94598, USA
| | - T B K Reddy
- Department of Energy Joint Genome Institute, Walnut Creek, California, 94598, USA
| | - Chris Daum
- Department of Energy Joint Genome Institute, Walnut Creek, California, 94598, USA
| | - Alex Copeland
- Department of Energy Joint Genome Institute, Walnut Creek, California, 94598, USA
| | - I-Min A Chen
- Department of Energy Joint Genome Institute, Walnut Creek, California, 94598, USA
| | - Natalia N Ivanova
- Department of Energy Joint Genome Institute, Walnut Creek, California, 94598, USA
| | - Nikos C Kyrpides
- Department of Energy Joint Genome Institute, Walnut Creek, California, 94598, USA
| | - Christa Pennacchio
- Department of Energy Joint Genome Institute, Walnut Creek, California, 94598, USA
| | | | - Paulo Arruda
- Centro de Biologia Molecular e Engenharia Genética, Universidade Estadual de Campinas (UNICAMP), 13083-875, Campinas, SP, Brazil
- Departamento de Genética e Evolução, Instituto de Biologia, Universidade Estadual de Campinas (UNICAMP), 13083-875, Campinas, SP, Brazil
- Genomics for Climate Change Research Center, Universidade Estadual de Campinas (UNICAMP), 13083-875, Campinas, SP, Brazil
| | - Rafael Silva Oliveira
- Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas (UNICAMP), 13083-862, Campinas, SP, Brazil.
- School of Biological Sciences, University of Western Australia (UWA), Perth, WA, 6009, Australia.
| |
Collapse
|
44
|
Azad A, Pavlopoulos GA, Ouzounis CA, Kyrpides NC, Buluç A. HipMCL: a high-performance parallel implementation of the Markov clustering algorithm for large-scale networks. Nucleic Acids Res 2019; 46:e33. [PMID: 29315405 PMCID: PMC5888241 DOI: 10.1093/nar/gkx1313] [Citation(s) in RCA: 67] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2017] [Accepted: 01/02/2018] [Indexed: 11/13/2022] Open
Abstract
Biological networks capture structural or functional properties of relevant entities such as molecules, proteins or genes. Characteristic examples are gene expression networks or protein–protein interaction networks, which hold information about functional affinities or structural similarities. Such networks have been expanding in size due to increasing scale and abundance of biological data. While various clustering algorithms have been proposed to find highly connected regions, Markov Clustering (MCL) has been one of the most successful approaches to cluster sequence similarity or expression networks. Despite its popularity, MCL’s scalability to cluster large datasets still remains a bottleneck due to high running times and memory demands. Here, we present High-performance MCL (HipMCL), a parallel implementation of the original MCL algorithm that can run on distributed-memory computers. We show that HipMCL can efficiently utilize 2000 compute nodes and cluster a network of ∼70 million nodes with ∼68 billion edges in ∼2.4 h. By exploiting distributed-memory environments, HipMCL clusters large-scale networks several orders of magnitude faster than MCL and enables clustering of even bigger networks. HipMCL is based on MPI and OpenMP and is freely available under a modified BSD license.
Collapse
Affiliation(s)
- Ariful Azad
- Computational Research Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720-8150, USA
| | - Georgios A Pavlopoulos
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Christos A Ouzounis
- Biological Computation & Process Laboratory, Chemical Process & Energy Resources Institute, Centre for Research & Technology Hellas, Thessalonica 57001, Greece
| | - Nikos C Kyrpides
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Aydin Buluç
- Computational Research Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720-8150, USA.,Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA 94720, USA
| |
Collapse
|
45
|
Roux S, Krupovic M, Daly RA, Borges AL, Nayfach S, Schulz F, Sharrar A, Matheus Carnevali PB, Cheng JF, Ivanova NN, Bondy-Denomy J, Wrighton KC, Woyke T, Visel A, Kyrpides NC, Eloe-Fadrosh EA. Cryptic inoviruses revealed as pervasive in bacteria and archaea across Earth's biomes. Nat Microbiol 2019; 4:1895-1906. [PMID: 31332386 PMCID: PMC6813254 DOI: 10.1038/s41564-019-0510-x] [Citation(s) in RCA: 147] [Impact Index Per Article: 29.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2019] [Accepted: 06/05/2019] [Indexed: 01/02/2023]
Abstract
Bacteriophages from the Inoviridae family (inoviruses) are characterized by their unique morphology, genome content and infection cycle. One of the most striking features of inoviruses is their ability to establish a chronic infection whereby the viral genome resides within the cell in either an exclusively episomal state or integrated into the host chromosome and virions are continuously released without killing the host. To date, a relatively small number of inovirus isolates have been extensively studied, either for biotechnological applications, such as phage display, or because of their effect on the toxicity of known bacterial pathogens including Vibrio cholerae and Neisseria meningitidis. Here, we show that the current 56 members of the Inoviridae family represent a minute fraction of a highly diverse group of inoviruses. Using a machine learning approach leveraging a combination of marker gene and genome features, we identified 10,295 inovirus-like sequences from microbial genomes and metagenomes. Collectively, our results call for reclassification of the current Inoviridae family into a viral order including six distinct proposed families associated with nearly all bacterial phyla across virtually every ecosystem. Putative inoviruses were also detected in several archaeal genomes, suggesting that, collectively, members of this supergroup infect hosts across the domains Bacteria and Archaea. Finally, we identified an expansive diversity of inovirus-encoded toxin–antitoxin and gene expression modulation systems, alongside evidence of both synergistic (CRISPR evasion) and antagonistic (superinfection exclusion) interactions with co-infecting viruses, which we experimentally validated in a Pseudomonas model. Capturing this previously obscured component of the global virosphere may spark new avenues for microbial manipulation approaches and innovative biotechnological applications. A machine learning approach was used to recover over 10,000 inovirus-like sequences from existing microbial genomes and metagenomes, consequently proposing the reclassification of the Inoviridae family to a viral order, and uncover the previously unrecognized diversity of these viruses across hosts and environments.
Collapse
Affiliation(s)
- Simon Roux
- DOE Joint Genome Institute, Walnut Creek, CA, USA.
| | - Mart Krupovic
- Department of Microbiology, Institut Pasteur, Paris, France
| | - Rebecca A Daly
- Department of Soil and Crop Sciences, Colorado State University, Fort Collins, CO, USA
| | - Adair L Borges
- Department of Microbiology and Immunology, University of California, San Francisco, San Francisco, CA, USA
| | | | | | - Allison Sharrar
- Department of Earth & Planetary Sciences, University of California, Berkeley, Berkeley, CA, USA
| | | | | | | | - Joseph Bondy-Denomy
- Department of Microbiology and Immunology, University of California, San Francisco, San Francisco, CA, USA.,Quantitative Biosciences Institute, University of California, San Francisco, San Francisco, CA, USA
| | - Kelly C Wrighton
- Department of Soil and Crop Sciences, Colorado State University, Fort Collins, CO, USA
| | - Tanja Woyke
- DOE Joint Genome Institute, Walnut Creek, CA, USA
| | - Axel Visel
- DOE Joint Genome Institute, Walnut Creek, CA, USA
| | | | | |
Collapse
|
46
|
Amann RI, Baichoo S, Blencowe BJ, Bork P, Borodovsky M, Brooksbank C, Chain PSG, Colwell RR, Daffonchio DG, Danchin A, de Lorenzo V, Dorrestein PC, Finn RD, Fraser CM, Gilbert JA, Hallam SJ, Hugenholtz P, Ioannidis JPA, Jansson JK, Kim JF, Klenk HP, Klotz MG, Knight R, Konstantinidis KT, Kyrpides NC, Mason CE, McHardy AC, Meyer F, Ouzounis CA, Patrinos AAN, Podar M, Pollard KS, Ravel J, Muñoz AR, Roberts RJ, Rosselló-Móra R, Sansone SA, Schloss PD, Schriml LM, Setubal JC, Sorek R, Stevens RL, Tiedje JM, Turjanski A, Tyson GW, Ussery DW, Weinstock GM, White O, Whitman WB, Xenarios I. Consent insufficient for data release-Response. Science 2019; 364:446. [PMID: 31048484 DOI: 10.1126/science.aax7509] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
47
|
Spring S, Sorokin DY, Verbarg S, Rohde M, Woyke T, Kyrpides NC. Sulfate-Reducing Bacteria That Produce Exopolymers Thrive in the Calcifying Zone of a Hypersaline Cyanobacterial Mat. Front Microbiol 2019; 10:862. [PMID: 31068923 PMCID: PMC6491731 DOI: 10.3389/fmicb.2019.00862] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2018] [Accepted: 04/04/2019] [Indexed: 12/31/2022] Open
Abstract
Calcifying microbial mats in hypersaline environments are important model systems for the study of the earliest ecosystems on Earth that started to appear more than three billion years ago and have been preserved in the fossil record as laminated lithified structures known as stromatolites. It is believed that sulfate-reducing bacteria play a pivotal role in the lithification process by increasing the saturation index of calcium minerals within the mat. Strain L21-Syr-ABT was isolated from anoxic samples of a several centimeters-thick microbialite-forming cyanobacterial mat of a hypersaline lake on the Kiritimati Atoll (Kiribati, Central Pacific). The novel isolate was assigned to the family Desulfovibrionaceae within the Deltaproteobacteria. Available 16S rRNA-based population surveys obtained from discrete layers of the mat indicate that the occurrence of a species-level clade represented by strain L21-Syr-ABT is restricted to a specific layer of the suboxic zone, which is characterized by the presence of aragonitic spherulites. To elucidate a possible function of this sulfate-reducing bacterium in the mineral formation within the mat a comprehensive phenotypic characterization was combined with the results of a comparative genome analysis. Among the determined traits of strain L21-Syr-ABT, several features were identified that could play a role in the precipitation of calcium carbonate: (i) the potential deacetylation of polysaccharides and consumption of substrates such as lactate and sulfate could mobilize free calcium; (ii) under conditions that favor the utilization of formate and hydrogen, the alkalinity engine within the mat is stimulated, thereby increasing the availability of carbonate; (iii) the production of extracellular polysaccharides could provide nucleation sites for calcium mineralization. In addition, our data suggest the proposal of the novel species and genus Desulfohalovibrio reitneri represented by the type strain L21-Syr-ABT (=DSM 26903T = JCM 18662T).
Collapse
Affiliation(s)
- Stefan Spring
- Department Microorganisms, Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
| | - Dimitry Y Sorokin
- Winogradsky Institute of Microbiology, Research Centre of Biotechnology, Moscow, Russia.,Department of Biotechnology, Delft University of Technology, Delft, Netherlands
| | - Susanne Verbarg
- Department Services Microorganisms, Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
| | - Manfred Rohde
- Central Facility for Microscopy, Helmholtz Centre for Infection Research, Braunschweig, Germany
| | - Tanja Woyke
- DOE Joint Genome Institute, Walnut Creek, CA, United States
| | | |
Collapse
|
48
|
Avontuur JR, Palmer M, Beukes CW, Chan WY, Coetzee MPA, Blom J, Stępkowski T, Kyrpides NC, Woyke T, Shapiro N, Whitman WB, Venter SN, Steenkamp ET. Genome-informed Bradyrhizobium taxonomy: where to from here? Syst Appl Microbiol 2019; 42:427-439. [PMID: 31031014 DOI: 10.1016/j.syapm.2019.03.006] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2019] [Revised: 03/26/2019] [Accepted: 03/26/2019] [Indexed: 02/06/2023]
Abstract
Bradyrhizobium is thought to be the largest and most diverse rhizobial genus, but this is not reflected in the number of described species. Although it was one of the first rhizobial genera recognised, its taxonomy remains complex. Various contemporary studies are showing that genome sequence information may simplify taxonomic decisions. Therefore, the growing availability of genomes for Bradyrhizobium will likely aid in the delineation and characterization of new species. In this study, we addressed two aims: first, we reviewed the availability and quality of available genomic resources for Bradyrhizobium. This was achieved by comparing genome sequences in terms of sequencing technologies used and estimated level of completeness for inclusion in genome-based phylogenetic analyses. Secondly, we utilized these genomes to investigate the taxonomic standing of Bradyrhizobium in light of its diverse lifestyles. Although genome sequences differed in terms of their quality and completeness, our data indicate that the use of these genome sequences is adequate for taxonomic purposes. By using these resources, we inferred a fully resolved, well-supported phylogeny. It separated Bradyrhizobium into seven lineages, three of which corresponded to the so-called supergroups known for the genus. Wide distribution of key lifestyle traits such as nodulation, nitrogen fixation and photosynthesis revealed that these traits have complicated evolutionary histories. We present the first robust Bradyrhizobium species phylogeny based on genome sequence information for investigating the evolution of this important assemblage of bacteria. Furthermore, this study provides the basis for using genome sequence information as a resource to make important taxonomic decisions, particularly at the species and genus levels.
Collapse
Affiliation(s)
- Juanita R Avontuur
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South Africa
| | - Marike Palmer
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South Africa
| | - Chrizelle W Beukes
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South Africa
| | - Wai Y Chan
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South Africa; Biotechnology Platform, Agricultural Research Council Onderstepoort Veterinary Institute (ARC-OVI), Onderstepoort 0110, South Africa
| | - Martin P A Coetzee
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South Africa
| | - Jochen Blom
- Bioinformatics and Systems Biology, Justus-Liebig-University Giessen, Giessen, Germany
| | - Tomasz Stępkowski
- Autonomous Department of Microbial Biology, Faculty of Agriculture and Biology, Warsaw University of Life Sciences (SGGW), Poland
| | | | - Tanja Woyke
- DOE Joint Genome Institute, Walnut Creek, CA, United States
| | - Nicole Shapiro
- DOE Joint Genome Institute, Walnut Creek, CA, United States
| | - William B Whitman
- Department of Microbiology, University of Georgia, Athens, GA, United States
| | - Stephanus N Venter
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South Africa
| | - Emma T Steenkamp
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South Africa. http://emma.steenkamp.up.ac.za
| |
Collapse
|
49
|
Nayfach S, Shi ZJ, Seshadri R, Pollard KS, Kyrpides NC. New insights from uncultivated genomes of the global human gut microbiome. Nature 2019; 568:505-510. [PMID: 30867587 PMCID: PMC6784871 DOI: 10.1038/s41586-019-1058-x] [Citation(s) in RCA: 362] [Impact Index Per Article: 72.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2018] [Accepted: 03/06/2019] [Indexed: 12/22/2022]
Abstract
The genome sequences of many species of the human gut microbiome remain unknown, largely owing to challenges in cultivating microorganisms under laboratory conditions. Here we address this problem by reconstructing 60,664 draft prokaryotic genomes from 3,810 faecal metagenomes, from geographically and phenotypically diverse humans. These genomes provide reference points for 2,058 newly identified species-level operational taxonomic units (OTUs), which represents a 50% increase over the previously known phylogenetic diversity of sequenced gut bacteria. On average, the newly identified OTUs comprise 33% of richness and 28% of species abundance per individual, and are enriched in humans from rural populations. A meta-analysis of clinical gut-microbiome studies pinpointed numerous disease associations for the newly identified OTUs, which have the potential to improve predictive models. Finally, our analysis revealed that uncultured gut species have undergone genome reduction that has resulted in the loss of certain biosynthetic pathways, which may offer clues for improving cultivation strategies in the future.
Collapse
Affiliation(s)
- Stephen Nayfach
- United States Department of Energy Joint Genome Institute, Walnut Creek, CA, USA.
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| | - Zhou Jason Shi
- Gladstone Institutes, San Francisco, CA, USA
- Chan-Zuckerberg Biohub, San Francisco, CA, USA
| | - Rekha Seshadri
- United States Department of Energy Joint Genome Institute, Walnut Creek, CA, USA
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Katherine S Pollard
- Gladstone Institutes, San Francisco, CA, USA
- Chan-Zuckerberg Biohub, San Francisco, CA, USA
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA
- Institute for Computational Health Sciences, University of California San Francisco, San Francisco, CA, USA
- Quantitative Biology Institute, University of California San Francisco, San Francisco, CA, USA
- Department of Epidemiology and Biostatistics, University of California San Francisco, San Francisco, CA, USA
| | - Nikos C Kyrpides
- United States Department of Energy Joint Genome Institute, Walnut Creek, CA, USA.
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| |
Collapse
|
50
|
Wright AV, Wang JY, Burstein D, Harrington LB, Paez-Espino D, Kyrpides NC, Iavarone AT, Banfield JF, Doudna JA. A Functional Mini-Integrase in a Two-Protein-type V-C CRISPR System. Mol Cell 2019; 73:727-737.e3. [PMID: 30709710 PMCID: PMC6386590 DOI: 10.1016/j.molcel.2018.12.015] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2018] [Revised: 11/21/2018] [Accepted: 12/14/2018] [Indexed: 12/26/2022]
Abstract
CRISPR-Cas immunity requires integration of short, foreign DNA fragments into the host genome at the CRISPR locus, a site consisting of alternating repeat sequences and foreign-derived spacers. In most CRISPR systems, the proteins Cas1 and Cas2 form the integration complex and are both essential for DNA acquisition. Most type V-C and V-D systems lack the cas2 gene and have unusually short CRISPR repeats and spacers. Here, we show that a mini-integrase comprising the type V-C Cas1 protein alone catalyzes DNA integration with a preference for short (17- to 19-base-pair) DNA fragments. The mini-integrase has weak specificity for the CRISPR array. We present evidence that the Cas1 proteins form a tetramer for integration. Our findings support a model of a minimal integrase with an internal ruler mechanism that favors shorter repeats and spacers. This minimal integrase may represent the function of the ancestral Cas1 prior to Cas2 adoption.
Collapse
Affiliation(s)
- Addison V Wright
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Joy Y Wang
- Department of Chemistry, University of California, Berkeley, Berkeley, CA 94720, USA
| | - David Burstein
- California Institute for Quantitative Biosciences (QB3), University of California, Berkeley, Berkeley, CA 94720, USA
| | - Lucas B Harrington
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA
| | - David Paez-Espino
- Department of Energy, Joint Genome Institute, Walnut Creek, CA 94598, USA
| | - Nikos C Kyrpides
- Department of Energy, Joint Genome Institute, Walnut Creek, CA 94598, USA
| | - Anthony T Iavarone
- Department of Chemistry, University of California, Berkeley, Berkeley, CA 94720, USA; California Institute for Quantitative Biosciences (QB3), University of California, Berkeley, Berkeley, CA 94720, USA
| | - Jillian F Banfield
- Department of Earth and Planetary Sciences, University of California, Berkeley, Berkeley, CA 94720, USA; Department of Environmental Science, Policy, and Management, University of California, Berkeley, Berkeley, CA 94720, USA; Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Jennifer A Doudna
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720, USA; Department of Chemistry, University of California, Berkeley, Berkeley, CA 94720, USA; Innovative Genomics Institute, University of California, Berkeley, Berkeley, CA 94720, USA; Howard Hughes Medical Institute, University of California, Berkeley, Berkeley, CA 94720, USA; MBIB Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA; Gladstone Institutes, San Francisco, CA 94158, USA.
| |
Collapse
|