1
|
Dwivedi SL, Vetukuri RR, Kelbessa BG, Gepts P, Heslop-Harrison P, Araujo ASF, Sharma S, Ortiz R. Exploitation of rhizosphere microbiome biodiversity in plant breeding. TRENDS IN PLANT SCIENCE 2025:S1360-1385(25)00103-7. [PMID: 40335388 DOI: 10.1016/j.tplants.2025.04.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/12/2024] [Revised: 03/28/2025] [Accepted: 04/07/2025] [Indexed: 05/09/2025]
Abstract
Climate change-induced stresses are perceived by plants at the root-soil interface, where they are alleviated through interactions between the host plant and the rhizosphere microbiome. The recruitment of specific microbiomes helps mitigate stress, increases resistance to pathogens, and promotes plant growth, development, and reproduction. The structure of the rhizosphere microbiome is shaped by crop domestication and variations in ploidy levels. Here we list key genes that regulate rhizosphere microbiomes and host genetic traits. We also discuss the prospects for rigorous analysis of symbiotic interactions, research needs, and strategies for systematically utilizing microbe-crop interactions to improve crop performance. Finally, we highlight challenges of maintaining live rhizosphere microbiome collections and mining heritable variability to enhance interactions between host plants and their rhizosphere microbiomes.
Collapse
Affiliation(s)
| | - Ramesh Raju Vetukuri
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Lomma, Sweden
| | - Bekele Gelena Kelbessa
- Department of Plant Breeding, Swedish University of Agricultural Sciences, Lomma, Sweden
| | - Paul Gepts
- Department of Plant Sciences, University of California, Davis, CA 95616-8780, USA
| | - Pat Heslop-Harrison
- University of Leicester, Department of Genetics and Genome Biology, Institute for Environmental Futures, Leicester LE1 7RH, UK
| | - Ademir S F Araujo
- Soil Microbial Ecology Group, Agricultural Science Center, Federal University of Piauí, Teresina, PI, Brazil
| | - Shilpi Sharma
- Department of Biochemical Engineering and Biotechnology, Indian Institute of Technology Delhi, New Delhi 110016, India
| | - Rodomiro Ortiz
- Department of Plant Breeding, Swedish University of Agricultural Science, Alnarp, Sweden.
| |
Collapse
|
2
|
Zhou J, Zhao W, Wu M, Wu J, Zhu J, Liu X, Hu J, Cai Z, Chan LL. Quorum sensing regulates the efficiency of a microcystin-degrading microbial consortium. JOURNAL OF HAZARDOUS MATERIALS 2025; 494:138479. [PMID: 40339372 DOI: 10.1016/j.jhazmat.2025.138479] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/17/2025] [Revised: 04/04/2025] [Accepted: 05/01/2025] [Indexed: 05/10/2025]
Abstract
Microbial biodegradation represents an environmentally friendly solution for microcystin (MC) removal. However, the regulatory factors influencing MC biodegradation within microbial communities remain poorly understood. We hypothesized that a consortium of MC-degrading microorganisms can synergistically enhance MC biodegradation efficiency under quorum sensing (QS) signal regulation. Initially, analysis of publicly available data identified a widespread correlation between QS signals and MC-degrading genes. Subsequent laboratory studies confirmed that acyl-homoserine lactones (AHLs) represent the predominant QS signal type during the degradation of MCs by microbial consortia. A significant positive correlation was found among the AHL signal, MC degradation genes, and microbial members of the degradation process. Finally, we found that the absence of the AHL system reduced both the efficiency of MC degradation and the expression of mlr cluster genes in the microbial consortium, confirming the regulatory role of the AHL system in MC degradation at the community level. The mutualistic cooperation mechanisms were also demonstrated by metatranscriptomic and qRT-PCR analyses. These findings underscore the significant role played by the QS system in microbial community-mediated MC degradation and suggest that the manipulation of QS signals could be a promising strategy for enhancing MC treatment efficiency. Harnessing microbial cooperation through QS offers a sustainable approach for mitigating MC contamination and safeguarding water health.
Collapse
Affiliation(s)
- Jin Zhou
- Shenzhen Public Platform for Screening and Application of Marine Microbial Resources, Institute for Ocean Engineering, Shenzhen International Graduate School, Tsinghua University, Shenzhen, Guangdong Province 518055, PR China; Marine Ecology and Human Factors Assessment Technical Innovation Center of Natural Resources Ministry, Tsinghua Shenzhen International Graduate School, Shenzhen, Guangdong Province 518055, PR China; Shenzhen Key Laboratory of Advanced Technology for Marine Ecology, Institute for Ocean Engineering, Shenzhen International Graduate School, Tsinghua University, Shenzhen, Guangdong Province 518055, PR China.
| | - Wei Zhao
- Shenzhen Public Platform for Screening and Application of Marine Microbial Resources, Institute for Ocean Engineering, Shenzhen International Graduate School, Tsinghua University, Shenzhen, Guangdong Province 518055, PR China; Marine Ecology and Human Factors Assessment Technical Innovation Center of Natural Resources Ministry, Tsinghua Shenzhen International Graduate School, Shenzhen, Guangdong Province 518055, PR China
| | - Mengjie Wu
- Shenzhen Public Platform for Screening and Application of Marine Microbial Resources, Institute for Ocean Engineering, Shenzhen International Graduate School, Tsinghua University, Shenzhen, Guangdong Province 518055, PR China; Marine Ecology and Human Factors Assessment Technical Innovation Center of Natural Resources Ministry, Tsinghua Shenzhen International Graduate School, Shenzhen, Guangdong Province 518055, PR China; Shenzhen Key Laboratory of Advanced Technology for Marine Ecology, Institute for Ocean Engineering, Shenzhen International Graduate School, Tsinghua University, Shenzhen, Guangdong Province 518055, PR China
| | - Jiajun Wu
- State Key Laboratory of Marine Pollution and Department of Biomedical Sciences, City University of Hong Kong, Hong Kong 999077, PR China
| | - Jinming Zhu
- Shenzhen Public Platform for Screening and Application of Marine Microbial Resources, Institute for Ocean Engineering, Shenzhen International Graduate School, Tsinghua University, Shenzhen, Guangdong Province 518055, PR China; Marine Ecology and Human Factors Assessment Technical Innovation Center of Natural Resources Ministry, Tsinghua Shenzhen International Graduate School, Shenzhen, Guangdong Province 518055, PR China; Shenzhen Key Laboratory of Advanced Technology for Marine Ecology, Institute for Ocean Engineering, Shenzhen International Graduate School, Tsinghua University, Shenzhen, Guangdong Province 518055, PR China
| | - Xiaowan Liu
- State Key Laboratory of Marine Pollution and Department of Biomedical Sciences, City University of Hong Kong, Hong Kong 999077, PR China
| | - Jing Hu
- State Key Laboratory of Marine Pollution and Department of Biomedical Sciences, City University of Hong Kong, Hong Kong 999077, PR China
| | - Zhonghua Cai
- Shenzhen Public Platform for Screening and Application of Marine Microbial Resources, Institute for Ocean Engineering, Shenzhen International Graduate School, Tsinghua University, Shenzhen, Guangdong Province 518055, PR China; Marine Ecology and Human Factors Assessment Technical Innovation Center of Natural Resources Ministry, Tsinghua Shenzhen International Graduate School, Shenzhen, Guangdong Province 518055, PR China
| | - Leo Lai Chan
- State Key Laboratory of Marine Pollution and Department of Biomedical Sciences, City University of Hong Kong, Hong Kong 999077, PR China.
| |
Collapse
|
3
|
Langenfeld K, Hegarty B, Vidaurri S, Crossette E, Duhaime M, Wigginton K. Development of a quantitative metagenomic approach to establish quantitative limits and its application to viruses. Nucleic Acids Res 2025; 53:gkaf118. [PMID: 40036505 PMCID: PMC11878531 DOI: 10.1093/nar/gkaf118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2024] [Revised: 01/25/2025] [Accepted: 02/06/2025] [Indexed: 03/06/2025] Open
Abstract
Quantitative metagenomic methods are maturing but continue to lack clearly-defined analytical limits. Here, we developed a computational tool, QuantMeta, to determine the absolute abundance of targets in metagenomes spiked with synthetic DNA standards. The tool establishes (i) entropy-based detection thresholds to confidently determine the presence of targets, and (ii) an approach to identify and correct read mapping or assembly errors and thus improve the quantification accuracy. Together this allows for an approach to confidently quantify absolute abundance of targets, be they microbial populations, genes, contigs, or metagenome-assembled genomes. We applied the approach to quantify single- and double-stranded DNA viruses in wastewater viral metagenomes, including pathogens and bacteriophages. Concentrations of total DNA viruses in wastewater influent and effluent were >108 copies/ml using QuantMeta. Human-associated DNA viruses were detected and quantifiable with QuantMeta thresholds, including polyomavirus, papillomavirus, and crAss-like phages, at concentrations similar to previous reports that utilized quantitative polymerase chain reaction (PCR)-based assays. Our results highlight the higher detection thresholds of quantitative metagenomics (approximately 500 copies/μl) as compared to PCR-based quantification (approximately 10 copies/μl) despite a sequencing depth of 200 million reads per sample. The QuantMeta approach, applicable to both viral and cellular metagenomes, advances quantitative metagenomics by improving the accuracy of measured target absolute abundances.
Collapse
Affiliation(s)
- Kathryn Langenfeld
- Department of Civil and Environmental Engineering, University of Michigan, Ann Arbor, MI 48109, United States
| | - Bridget Hegarty
- Department of Civil and Environmental Engineering, Case Western Reserve University, Cleveland, OH 44106, United States
| | - Santiago Vidaurri
- Department of Civil and Environmental Engineering, University of Michigan, Ann Arbor, MI 48109, United States
| | - Emily Crossette
- Department of Civil and Environmental Engineering, University of Michigan, Ann Arbor, MI 48109, United States
| | - Melissa B Duhaime
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, United States
| | - Krista R Wigginton
- Department of Civil and Environmental Engineering, University of Michigan, Ann Arbor, MI 48109, United States
| |
Collapse
|
4
|
Wang D, Meng Y, Huang LN, Zhang XX, Luo X, Meng F. A comprehensive catalog encompassing 1376 species-level genomes reveals the core community and functional diversity of anammox microbiota. WATER RESEARCH 2024; 266:122356. [PMID: 39236503 DOI: 10.1016/j.watres.2024.122356] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/22/2024] [Revised: 08/21/2024] [Accepted: 08/28/2024] [Indexed: 09/07/2024]
Abstract
Research on the microbial community and function of the anammox process for environmentally friendly wastewater treatment has achieved certain success, which may mean more universal insights are needed. However, the comprehensive understanding of the anammox process is constrained by the limited taxonomic assignment and functional characterization of anammox microbiota, primarily due to the scarcity of high-quality genomes for most organisms. This study reported a global genome catalog of anammox microbiotas based on numerous metagenomes obtained from both lab- and full-scale systems. A total of 1376 candidate species from 7474 metagenome-assembled genomes were used to construct the genome catalog, providing extensive microbial coverage (averaged of 92.40 %) of anammox microbiota. Moreover, a total of 64 core genera and 44 core species were identified, accounting for approximately 64.25 % and 43.97 %, respectively, of anammox microbiota. The strict core genera encompassed not only functional bacteria (e.g., Brocadia, Desulfobacillus, Zeimonas, and Nitrosomonas) but also two candidate genera (UBA12294 and OLB14) affiliated with the order Anaerolineales. In particular, core denitrifying bacteria with observably taxonomic diversity exhibited diverse functional profiles; for instance, the potential of carbohydrate metabolism in Desulfobacillus and Zeimonas likely improves the mixotrophic lifestyle of anammox microbiota. Besides, a noteworthy association was detected between anammox microbiota and system type. Microbiota in coupling system exhibited complex diversity and interspecies interactions by limiting numerous core denitrifying bacteria. In summary, the constructed catalog substantially expands our understanding of the core community and their functions of anammox microbiota, providing a valuable resource for future studies on anammox systems.
Collapse
Affiliation(s)
- Depeng Wang
- School of Environmental Science and Engineering, Sun Yat-sen University, Guangzhou, 510275, China; State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing, 210023, China; Guangdong Provincial Key Laboratory of Environmental Pollution Control and Remediation Technology, Sun Yat-sen University, Guangzhou, 510275, China
| | - Yabing Meng
- School of Environmental Science and Engineering, Sun Yat-sen University, Guangzhou, 510275, China; Guangdong Provincial Key Laboratory of Environmental Pollution Control and Remediation Technology, Sun Yat-sen University, Guangzhou, 510275, China
| | - Li-Nan Huang
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, 510275, Guangzhou, China
| | - Xu-Xiang Zhang
- State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing, 210023, China
| | - Xiaonan Luo
- School of Environmental Science and Engineering, Sun Yat-sen University, Guangzhou, 510275, China; Guangdong Provincial Key Laboratory of Environmental Pollution Control and Remediation Technology, Sun Yat-sen University, Guangzhou, 510275, China
| | - Fangang Meng
- School of Environmental Science and Engineering, Sun Yat-sen University, Guangzhou, 510275, China; Guangdong Provincial Key Laboratory of Environmental Pollution Control and Remediation Technology, Sun Yat-sen University, Guangzhou, 510275, China.
| |
Collapse
|
5
|
Commichaux S, Luan T, Muralidharan HS, Pop M. Database size positively correlates with the loss of species-level taxonomic resolution for the 16S rRNA and other prokaryotic marker genes. PLoS Comput Biol 2024; 20:e1012343. [PMID: 39102435 PMCID: PMC11326629 DOI: 10.1371/journal.pcbi.1012343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Revised: 08/15/2024] [Accepted: 07/22/2024] [Indexed: 08/07/2024] Open
Abstract
For decades, the 16S rRNA gene has been used to taxonomically classify prokaryotic species and to taxonomically profile microbial communities. However, the 16S rRNA gene has been criticized for being too conserved to differentiate between distinct species. We argue that the inability to differentiate between species is not a unique feature of the 16S rRNA gene. Rather, we observe the gradual loss of species-level resolution for other nearly-universal prokaryotic marker genes as the number of gene sequences increases in reference databases. This trend was strongly correlated with how represented a taxonomic group was in the database and indicates that, at the gene-level, the boundaries between many species might be fuzzy. Through our study, we argue that any approach that relies on a single marker to distinguish bacterial taxa is fraught even if some markers appear to be discriminative in current databases.
Collapse
Affiliation(s)
- Seth Commichaux
- Center for Food Safety and Nutrition, Food and Drug Administration, Laurel, Maryland, United States of America
| | - Tu Luan
- Department of Computer Science, University of Maryland, College Park, Maryland, United States of America
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland, United States of America
| | - Harihara Subrahmaniam Muralidharan
- Department of Computer Science, University of Maryland, College Park, Maryland, United States of America
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland, United States of America
| | - Mihai Pop
- Department of Computer Science, University of Maryland, College Park, Maryland, United States of America
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland, United States of America
| |
Collapse
|
6
|
Zhang K, He C, Wang L, Suo L, Guo M, Guo J, Zhang T, Xu Y, Lei Y, Liu G, Qian Q, Mao Y, Kalds P, Wu Y, Cuoji A, Yang Y, Brugger D, Gan S, Wang M, Wang X, Zhao F, Chen Y. Compendium of 5810 genomes of sheep and goat gut microbiomes provides new insights into the glycan and mucin utilization. MICROBIOME 2024; 12:104. [PMID: 38845047 PMCID: PMC11155115 DOI: 10.1186/s40168-024-01806-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Accepted: 04/03/2024] [Indexed: 06/09/2024]
Abstract
BACKGROUND Ruminant gut microbiota are critical in ecological adaptation, evolution, and nutrition utilization because it regulates energy metabolism, promotes nutrient absorption, and improves immune function. To study the functional roles of key gut microbiota in sheep and goats, it is essential to construct reference microbial gene catalogs and high-quality microbial genomes database. RESULTS A total of 320 fecal samples were collected from 21 different sheep and goat breeds, originating from 32 distinct farms. Metagenomic deep sequencing and binning assembly were utilized to construct a comprehensive microbial genome information database for the gut microbiota. We successfully generated the largest reference gene catalogs for gut microbiota in sheep and goats, containing over 162 million and 82 million nonredundant predicted genes, respectively, with 49 million shared nonredundant predicted genes and 1138 shared species. We found that the rearing environment has a greater impact on microbial composition and function than the host's species effect. Through subsequent assembly, we obtained 5810 medium- and high-quality metagenome-assembled genomes (MAGs), out of which 2661 were yet unidentified species. Among these MAGs, we identified 91 bacterial taxa that specifically colonize the sheep gut, which encode polysaccharide utilization loci for glycan and mucin degradation. CONCLUSIONS By shedding light on the co-symbiotic microbial communities in the gut of small ruminants, our study significantly enhances the understanding of their nutrient degradation and disease susceptibility. Our findings emphasize the vast potential of untapped resources in functional bacterial species within ruminants, further expanding our knowledge of how the ruminant gut microbiota recognizes and processes glycan and mucins. Video Abstract.
Collapse
Affiliation(s)
- Ke Zhang
- International Joint Agriculture Research Center for Animal Bio-Breeding, Ministry of Agriculture and Rural Affairs/Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Chong He
- College of Information Engineering, Northwest A&F University, Yangling, 712100, China
| | - Lei Wang
- Plateau Livestock Genetic Resources Protection and Innovative Utilization Key Laboratory of Qinghai Province, Key Laboratory of Animal Genetics and Breeding On Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Qinghai Academy of Animal and Veterinary Medicine, Qinghai University, Xining, 810016, China
| | - Langda Suo
- Institute of Animal Sciences, Tibet Academy of Agricultural and Animal Husbandry Sciences, Lhasa, 850009, China
- Key Laboratory of Animal Genetics and Breeding On Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lhasa, 850009, China
| | - Mengmeng Guo
- College of Animal Engineering, Yangling Vocational and Technical College, Yangling, 712100, China
| | - Jiazhong Guo
- College of Animal Science and Technology, Sichuan Agricultural University, Chengdu, 611100, China
| | - Ting Zhang
- International Joint Agriculture Research Center for Animal Bio-Breeding, Ministry of Agriculture and Rural Affairs/Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Yangbin Xu
- International Joint Agriculture Research Center for Animal Bio-Breeding, Ministry of Agriculture and Rural Affairs/Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Yu Lei
- International Joint Agriculture Research Center for Animal Bio-Breeding, Ministry of Agriculture and Rural Affairs/Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Gongwei Liu
- International Joint Agriculture Research Center for Animal Bio-Breeding, Ministry of Agriculture and Rural Affairs/Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Quan Qian
- International Joint Agriculture Research Center for Animal Bio-Breeding, Ministry of Agriculture and Rural Affairs/Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Yunrui Mao
- International Joint Agriculture Research Center for Animal Bio-Breeding, Ministry of Agriculture and Rural Affairs/Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Peter Kalds
- International Joint Agriculture Research Center for Animal Bio-Breeding, Ministry of Agriculture and Rural Affairs/Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Yujiang Wu
- Institute of Animal Sciences, Tibet Academy of Agricultural and Animal Husbandry Sciences, Lhasa, 850009, China
- Key Laboratory of Animal Genetics and Breeding On Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lhasa, 850009, China
| | - Awang Cuoji
- Institute of Animal Sciences, Tibet Academy of Agricultural and Animal Husbandry Sciences, Lhasa, 850009, China
- Key Laboratory of Animal Genetics and Breeding On Tibetan Plateau, Ministry of Agriculture and Rural Affairs, Lhasa, 850009, China
| | - Yuxin Yang
- International Joint Agriculture Research Center for Animal Bio-Breeding, Ministry of Agriculture and Rural Affairs/Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China
| | - Daniel Brugger
- Institute of Animal Nutrition and Dietetics, Vetsuisse-Faculty, University of Zurich, 8057, Zurich, Switzerland
| | - Shangquan Gan
- College of Coastal Agricultural Sciences, Guangdong Ocean University, Zhanjiang, 524088, China
| | - Meili Wang
- College of Information Engineering, Northwest A&F University, Yangling, 712100, China
| | - Xiaolong Wang
- International Joint Agriculture Research Center for Animal Bio-Breeding, Ministry of Agriculture and Rural Affairs/Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China.
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, 712100, China.
- School of Future Technology On Bio-Breeding, Northwest A&F University, Yangling, 712100, China.
| | - Fangqing Zhao
- Computational Genomics Lab, Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, 102206, China.
| | - Yulin Chen
- International Joint Agriculture Research Center for Animal Bio-Breeding, Ministry of Agriculture and Rural Affairs/Key Laboratory of Animal Genetics, Breeding and Reproduction of Shaanxi Province, College of Animal Science and Technology, Northwest A&F University, Yangling, 712100, China.
- Key Laboratory of Livestock Biology, Northwest A&F University, Yangling, 712100, China.
- School of Future Technology On Bio-Breeding, Northwest A&F University, Yangling, 712100, China.
| |
Collapse
|
7
|
Qiu Z, Yuan L, Lian CA, Lin B, Chen J, Mu R, Qiao X, Zhang L, Xu Z, Fan L, Zhang Y, Wang S, Li J, Cao H, Li B, Chen B, Song C, Liu Y, Shi L, Tian Y, Ni J, Zhang T, Zhou J, Zhuang WQ, Yu K. BASALT refines binning from metagenomic data and increases resolution of genome-resolved metagenomic analysis. Nat Commun 2024; 15:2179. [PMID: 38467684 PMCID: PMC10928208 DOI: 10.1038/s41467-024-46539-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Accepted: 03/01/2024] [Indexed: 03/13/2024] Open
Abstract
Metagenomic binning is an essential technique for genome-resolved characterization of uncultured microorganisms in various ecosystems but hampered by the low efficiency of binning tools in adequately recovering metagenome-assembled genomes (MAGs). Here, we introduce BASALT (Binning Across a Series of Assemblies Toolkit) for binning and refinement of short- and long-read sequencing data. BASALT employs multiple binners with multiple thresholds to produce initial bins, then utilizes neural networks to identify core sequences to remove redundant bins and refine non-redundant bins. Using the same assemblies generated from Critical Assessment of Metagenome Interpretation (CAMI) datasets, BASALT produces up to twice as many MAGs as VAMB, DASTool, or metaWRAP. Processing assemblies from a lake sediment dataset, BASALT produces ~30% more MAGs than metaWRAP, including 21 unique class-level prokaryotic lineages. Functional annotations reveal that BASALT can retrieve 47.6% more non-redundant opening-reading frames than metaWRAP. These results highlight the robust handling of metagenomic sequencing data of BASALT.
Collapse
Affiliation(s)
- Zhiguang Qiu
- Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Peking University, Shenzhen, China
- AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, China
| | - Li Yuan
- AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, China
- School of Electronic and Computer Engineering, Peking University, Shenzhen, China
- Peng Cheng Laboratory, Shenzhen, China
| | - Chun-Ang Lian
- Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Peking University, Shenzhen, China
- AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, China
| | - Bin Lin
- School of Electronic and Computer Engineering, Peking University, Shenzhen, China
| | - Jie Chen
- AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, China
- School of Electronic and Computer Engineering, Peking University, Shenzhen, China
- Peng Cheng Laboratory, Shenzhen, China
| | - Rong Mu
- Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Peking University, Shenzhen, China
| | - Xuejiao Qiao
- Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Peking University, Shenzhen, China
| | - Liyu Zhang
- Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Peking University, Shenzhen, China
| | - Zheng Xu
- Southern University of Sciences and Technology Yantian Hospital, Shenzhen, China
- Institute of Biomedicine and Biotechnology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong, China
| | - Lu Fan
- Department of Ocean Science and Engineering, Southern University of Science and Technology (SUSTech), Shenzhen, China
| | - Yunzeng Zhang
- Joint International Research Laboratory of Agriculture and Agri-Product Safety, the Ministry of Education of China, Yangzhou University, Yangzhou, China
| | - Shanquan Wang
- Environmental Microbiomics Research Center, School of Environmental Science and Engineering, Sun Yat-Sen University, Guangzhou, China
| | - Junyi Li
- School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), Shenzhen, Guangdong, China
| | - Huiluo Cao
- Department of Microbiology, University of Hong Kong, Hong Kong, China
| | - Bing Li
- Shenzhen International Graduate School, Tsinghua University, Shenzhen, China
| | - Baowei Chen
- Guangdong Provincial Key Laboratory of Marine Resources and Coastal Engineering, School of Marine Sciences, Sun Yat-sen University, Zhuhai, China
| | - Chi Song
- Institute of Herbgenomics, Chengdu University of Traditional Chinese Medicine, Chengdu, China
- Wuhan Benagen Technology Co., Ltd, Wuhan, China
| | - Yongxin Liu
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Lili Shi
- AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, China
- State Key Laboratory of Chemical Oncogenomics, School of Chemical Biology and Biotechnology, Peking University Shenzhen Graduate School, Shenzhen, China
| | - Yonghong Tian
- AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, China
- School of Electronic and Computer Engineering, Peking University, Shenzhen, China
- Peng Cheng Laboratory, Shenzhen, China
| | - Jinren Ni
- Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Peking University, Shenzhen, China
- College of Environmental Sciences and Engineering, Key Laboratory of Water and Sediment Sciences, Ministry of Education, Peking University, Beijing, China
| | - Tong Zhang
- Department of Civil Engineering, University of Hong Kong, Hong Kong, China
| | - Jizhong Zhou
- Institute for Environmental Genomics, University of Oklahoma, Norman, OK, USA
| | - Wei-Qin Zhuang
- Department of Civil and Environmental Engineering, Faculty of Engineering, University of Auckland, Auckland, New Zealand
| | - Ke Yu
- Eco-environment and Resource Efficiency Research Laboratory, School of Environment and Energy, Shenzhen Graduate School, Peking University, Shenzhen, China.
- AI for Science (AI4S)-Preferred Program, Peking University, Shenzhen, China.
| |
Collapse
|
8
|
Cerk K, Ugalde‐Salas P, Nedjad CG, Lecomte M, Muller C, Sherman DJ, Hildebrand F, Labarthe S, Frioux C. Community-scale models of microbiomes: Articulating metabolic modelling and metagenome sequencing. Microb Biotechnol 2024; 17:e14396. [PMID: 38243750 PMCID: PMC10832553 DOI: 10.1111/1751-7915.14396] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Revised: 11/27/2023] [Accepted: 12/20/2023] [Indexed: 01/21/2024] Open
Abstract
Building models is essential for understanding the functions and dynamics of microbial communities. Metabolic models built on genome-scale metabolic network reconstructions (GENREs) are especially relevant as a means to decipher the complex interactions occurring among species. Model reconstruction increasingly relies on metagenomics, which permits direct characterisation of naturally occurring communities that may contain organisms that cannot be isolated or cultured. In this review, we provide an overview of the field of metabolic modelling and its increasing reliance on and synergy with metagenomics and bioinformatics. We survey the means of assigning functions and reconstructing metabolic networks from (meta-)genomes, and present the variety and mathematical fundamentals of metabolic models that foster the understanding of microbial dynamics. We emphasise the characterisation of interactions and the scaling of model construction to large communities, two important bottlenecks in the applicability of these models. We give an overview of the current state of the art in metagenome sequencing and bioinformatics analysis, focusing on the reconstruction of genomes in microbial communities. Metagenomics benefits tremendously from third-generation sequencing, and we discuss the opportunities of long-read sequencing, strain-level characterisation and eukaryotic metagenomics. We aim at providing algorithmic and mathematical support, together with tool and application resources, that permit bridging the gap between metagenomics and metabolic modelling.
Collapse
Affiliation(s)
- Klara Cerk
- Quadram Institute BioscienceNorwichUK
- Earlham InstituteNorwichUK
| | | | - Chabname Ghassemi Nedjad
- Inria, University of Bordeaux, INRAETalenceFrance
- University of Bordeaux, CNRS, Bordeaux INP, LaBRI, UMR 5800TalenceFrance
| | - Maxime Lecomte
- Inria, University of Bordeaux, INRAETalenceFrance
- INRAE STLO¸University of RennesRennesFrance
| | | | | | - Falk Hildebrand
- Quadram Institute BioscienceNorwichUK
- Earlham InstituteNorwichUK
| | - Simon Labarthe
- Inria, University of Bordeaux, INRAETalenceFrance
- INRAE, University of Bordeaux, BIOGECO, UMR 1202CestasFrance
| | | |
Collapse
|
9
|
Commichaux S, Luan T, Muralidharan HS, Pop M. Database size positively correlates with the loss of species-level taxonomic resolution for the 16S rRNA and other prokaryotic marker genes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.13.571439. [PMID: 38168205 PMCID: PMC10760110 DOI: 10.1101/2023.12.13.571439] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]
Abstract
For decades, the 16S rRNA gene has been used to taxonomically classify prokaryotic species and to taxonomically profile microbial communities. The 16S rRNA gene has been criticized for being too conserved to differentiate between distinct species. We argue that the inability to differentiate between species is not a unique feature of the 16S rRNA gene. Rather, we observe the gradual loss of species-level resolution for other marker genes as the number of gene sequences increases in reference databases. We demonstrate this effect through the analysis of three commonly used databases of nearly-universal prokaryotic marker genes: the SILVA 16S rRNA gene database, the Genome Taxonomy Database (GTDB), and a set of 40 taxonomically-informative single-copy genes. Our results reflect a more fundamental property of the taxonomies themselves and have broad implications for bioinformatic analyses beyond taxonomic classification. Effective solutions for fine-level taxonomic classification require a more precise, and operationally-relevant, definition of the taxonomic labels being sought, and the use of combinations of genomic markers in the classification process. Importance The use of reference databases for assigning taxonomic labels to genomic and metagenomic sequences is a fundamental bioinformatic task in the characterization of microbial communities. The increasing accessibility of high throughput sequencing has led to a rapid increase in the size and number of sequences in databases. This has been beneficial for improving our understanding of the global microbial genetic diversity. However, there is evidence that as the microbial diversity is more densely sampled, increasingly longer genomic segments are needed to differentiate between distinct species. The scientific community needs to be aware of this issue and needs to develop methods that better account for it when assigning taxonomic labels to metagenomic sequences from microbial communities.
Collapse
|
10
|
Luan T, Muralidharan HS, Alshehri M, Mittra I, Pop M. SCRAPT: an iterative algorithm for clustering large 16S rRNA gene data sets. Nucleic Acids Res 2023; 51:e46. [PMID: 36912074 PMCID: PMC10164572 DOI: 10.1093/nar/gkad158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Revised: 02/01/2023] [Accepted: 02/28/2023] [Indexed: 03/14/2023] Open
Abstract
16S rRNA gene sequence clustering is an important tool in characterizing the diversity of microbial communities. As 16S rRNA gene data sets are growing in size, existing sequence clustering algorithms increasingly become an analytical bottleneck. Part of this bottleneck is due to the substantial computational cost expended on small clusters and singleton sequences. We propose an iterative sampling-based 16S rRNA gene sequence clustering approach that targets the largest clusters in the data set, allowing users to stop the clustering process when sufficient clusters are available for the specific analysis being targeted. We describe a probabilistic analysis of the iterative clustering process that supports the intuition that the clustering process identifies the larger clusters in the data set first. Using real data sets of 16S rRNA gene sequences, we show that the iterative algorithm, coupled with an adaptive sampling process and a mode-shifting strategy for identifying cluster representatives, substantially speeds up the clustering process while being effective at capturing the large clusters in the data set. The experiments also show that SCRAPT (Sample, Cluster, Recruit, AdaPt and iTerate) is able to produce operational taxonomic units that are less fragmented than popular tools: UCLUST, CD-HIT and DNACLUST. The algorithm is implemented in the open-source package SCRAPT. The source code used to generate the results presented in this paper is available at https://github.com/hsmurali/SCRAPT.
Collapse
Affiliation(s)
- Tu Luan
- Department of Computer Science, University of Maryland, College Park, 20742 MD, USA
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA
| | - Harihara Subrahmaniam Muralidharan
- Department of Computer Science, University of Maryland, College Park, 20742 MD, USA
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA
| | - Marwan Alshehri
- Department of Computer Science, University of Maryland, College Park, 20742 MD, USA
| | - Ipsa Mittra
- Department of Computer Science, University of Maryland, College Park, 20742 MD, USA
| | - Mihai Pop
- Department of Computer Science, University of Maryland, College Park, 20742 MD, USA
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA
| |
Collapse
|
11
|
Zimmerman S, Tierney BT, Patel CJ, Kostic AD. Quantifying Shared and Unique Gene Content across 17 Microbial Ecosystems. mSystems 2023; 8:e0011823. [PMID: 37022232 PMCID: PMC10134805 DOI: 10.1128/msystems.00118-23] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Accepted: 02/27/2023] [Indexed: 04/07/2023] Open
Abstract
Measuring microbial diversity is traditionally based on microbe taxonomy. Here, in contrast, we aimed to quantify heterogeneity in microbial gene content across 14,183 metagenomic samples spanning 17 ecologies, including 6 human associated, 7 nonhuman host associated, and 4 in other nonhuman host environments. In total, we identified 117,629,181 nonredundant genes. The vast majority of genes (66%) occurred in only one sample (i.e., "singletons"). In contrast, we found 1,864 sequences present in every metagenome, but not necessarily every bacterial genome. Additionally, we report data sets of other ecology-associated genes (e.g., abundant in only gut ecosystems) and simultaneously demonstrated that prior microbiome gene catalogs are both incomplete and inaccurately cluster microbial genetic life (e.g., at gene sequence identities that are too restrictive). We provide our results and the sets of environmentally differentiating genes described above at http://www.microbial-genes.bio. IMPORTANCE The amount of shared genetic elements has not been quantified between the human microbiome and other host- and non-host-associated microbiomes. Here, we made a gene catalog of 17 different microbial ecosystems and compared them. We show that most species shared between environment and human gut microbiomes are pathogens and that prior gene catalogs described as "nearly complete" are far from it. Additionally, over two-thirds of all genes only appear in a single sample, and only 1,864 genes (0.001%) are found in all types of metagenomes. These results highlight the large diversity between metagenomes and reveal a new, rare class of genes, those found in every type of metagenome, but not every microbial genome.
Collapse
Affiliation(s)
- Samuel Zimmerman
- Section on Pathophysiology and Molecular Pharmacology, Joslin Diabetes Center, Boston, Massachusetts, USA
- Section on Islet Cell and Regenerative Biology, Joslin Diabetes Center, Boston, Massachusetts, USA
- Department of Microbiology, Harvard Medical School, Boston, Massachusetts, USA
| | - Braden T. Tierney
- Section on Pathophysiology and Molecular Pharmacology, Joslin Diabetes Center, Boston, Massachusetts, USA
- Section on Islet Cell and Regenerative Biology, Joslin Diabetes Center, Boston, Massachusetts, USA
- Department of Microbiology, Harvard Medical School, Boston, Massachusetts, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
| | - Chirag J. Patel
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
| | - Aleksandar D. Kostic
- Section on Pathophysiology and Molecular Pharmacology, Joslin Diabetes Center, Boston, Massachusetts, USA
- Section on Islet Cell and Regenerative Biology, Joslin Diabetes Center, Boston, Massachusetts, USA
- Department of Microbiology, Harvard Medical School, Boston, Massachusetts, USA
| |
Collapse
|
12
|
Behling AH, Wilson BC, Ho D, Virta M, O'Sullivan JM, Vatanen T. Addressing antibiotic resistance: computational answers to a biological problem? Curr Opin Microbiol 2023; 74:102305. [PMID: 37031568 DOI: 10.1016/j.mib.2023.102305] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2022] [Revised: 03/01/2023] [Accepted: 03/02/2023] [Indexed: 04/11/2023]
Abstract
The increasing prevalence of infections caused by antibiotic-resistant bacteria is a global healthcare crisis. Understanding the spread of resistance is predicated on the surveillance of antibiotic resistance genes within an environment. Bioinformatics and artificial intelligence (AI) methods applied to metagenomic sequencing data offer the capacity to detect known and infer yet-unknown resistance mechanisms, and predict future outbreaks of antibiotic-resistant infections. Machine learning methods, in particular, could revive the waning antibiotic discovery pipeline by helping to predict the molecular structure and function of antibiotic resistance compounds, and optimising their interactions with target proteins. Consequently, AI has the capacity to play a central role in guiding antibiotic stewardship and future clinical decision-making around antibiotic resistance.
Collapse
Affiliation(s)
- Anna H Behling
- Liggins Institute, University of Auckland, Auckland, New Zealand
| | - Brooke C Wilson
- Liggins Institute, University of Auckland, Auckland, New Zealand
| | - Daniel Ho
- Liggins Institute, University of Auckland, Auckland, New Zealand
| | - Marko Virta
- Department of Microbiology, University of Helsinki, Helsinki, Finland
| | - Justin M O'Sullivan
- Liggins Institute, University of Auckland, Auckland, New Zealand; The Maurice Wilkins Centre, The University of Auckland, Private Bag 92019, Auckland, New Zealand; Australian Parkinsons Mission, Garvan Institute of Medical Research, Sydney, New South Wales, 384 Victoria Street, Darlinghurst, NSW 2010, Australia; MRC Lifecourse Epidemiology Unit, University of Southampton, Southampton SO16 6YD, United Kingdom; Singapore Institute for Clinical Sciences, Agency for Science Technology and Research, Singapore.
| | - Tommi Vatanen
- Liggins Institute, University of Auckland, Auckland, New Zealand; Department of Microbiology, University of Helsinki, Helsinki, Finland; Research Program for Clinical and Molecular Metabolism, Faculty of Medicine, University of Helsinki, Helsinki, Finland; Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| |
Collapse
|
13
|
Wang D, Meng Y, Meng F. Genome-centric metagenomics insights into functional divergence and horizontal gene transfer of denitrifying bacteria in anammox consortia. WATER RESEARCH 2022; 224:119062. [PMID: 36116192 DOI: 10.1016/j.watres.2022.119062] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Revised: 08/21/2022] [Accepted: 09/03/2022] [Indexed: 06/15/2023]
Abstract
Denitrifying bacteria with high abundances in anammox communities play crucial roles in achieving stable anammox-based systems. Despite the relative constant composition of denitrifying bacteria, their functional diversity remains to be explored in anammox communities. Herein, a total of 77 high-quality metagenome-assembled genomes (MAGs) of denitrifying bacteria were recovered from the anammox community in a full-scale swine wastewater treatment plant. Among these microbes, a total of 26 MAGs were affiliated with the seven dominant denitrifying genera that have total abundances higher than 1%. A meta-analysis of these species suggested that external organics reduced the abundances of genus Ignavibacterium and species MAG.305 of UTPRO2 in anammox communities. Comparative genome analysis revealed functional divergence across different denitrifying bacteria, largely owing to their distinct capabilities for carbohydrate (including endogenous and exogenous) utilization and vitamin (e.g., pantothenate and thiamine) biosynthesis. Serval microbes in this system contained fewer genes encoding biotin, pantothenate and methionine biosynthesis compared with their related species from other habitats. In addition, the genes encoding energy production and conversion (73 genes) and inorganic ion transport (53 genes) putatively transferred from other species to denitrifying bacteria, while these denitrifying bacteria (especially genera UTPRO2 and SCN-69-89) likely donated the genes encoding nutrients (e.g., inorganic ion and amino acid) transporter (64 genes) for other members to utilize new metabolites. Collectively, these findings highlighted the functional divergence of these denitrifying bacteria and speculated that the genetic interactions within anammox communities through horizontal gene transfer may be one of the reasons for their functional divergence.
Collapse
Affiliation(s)
- Depeng Wang
- School of Environmental Science and Engineering, Sun Yat-sen University, Guangzhou 510275, PR China; Guangdong Provincial Key Laboratory of Environmental Pollution Control and Remediation Technology, Sun Yat-sen University, Guangzhou 510275, PR China; National Engineering Laboratory for Pollution Control and Waste Utilization in Livestock and Poultry Production, Changsha, Hunan 410125, PR China
| | - Yabing Meng
- School of Environmental Science and Engineering, Sun Yat-sen University, Guangzhou 510275, PR China; Guangdong Provincial Key Laboratory of Environmental Pollution Control and Remediation Technology, Sun Yat-sen University, Guangzhou 510275, PR China; National Engineering Laboratory for Pollution Control and Waste Utilization in Livestock and Poultry Production, Changsha, Hunan 410125, PR China
| | - Fangang Meng
- School of Environmental Science and Engineering, Sun Yat-sen University, Guangzhou 510275, PR China; Guangdong Provincial Key Laboratory of Environmental Pollution Control and Remediation Technology, Sun Yat-sen University, Guangzhou 510275, PR China; National Engineering Laboratory for Pollution Control and Waste Utilization in Livestock and Poultry Production, Changsha, Hunan 410125, PR China.
| |
Collapse
|
14
|
Meyer F, Fritz A, Deng ZL, Koslicki D, Lesker TR, Gurevich A, Robertson G, Alser M, Antipov D, Beghini F, Bertrand D, Brito JJ, Brown CT, Buchmann J, Buluç A, Chen B, Chikhi R, Clausen PTLC, Cristian A, Dabrowski PW, Darling AE, Egan R, Eskin E, Georganas E, Goltsman E, Gray MA, Hansen LH, Hofmeyr S, Huang P, Irber L, Jia H, Jørgensen TS, Kieser SD, Klemetsen T, Kola A, Kolmogorov M, Korobeynikov A, Kwan J, LaPierre N, Lemaitre C, Li C, Limasset A, Malcher-Miranda F, Mangul S, Marcelino VR, Marchet C, Marijon P, Meleshko D, Mende DR, Milanese A, Nagarajan N, Nissen J, Nurk S, Oliker L, Paoli L, Peterlongo P, Piro VC, Porter JS, Rasmussen S, Rees ER, Reinert K, Renard B, Robertsen EM, Rosen GL, Ruscheweyh HJ, Sarwal V, Segata N, Seiler E, Shi L, Sun F, Sunagawa S, Sørensen SJ, Thomas A, Tong C, Trajkovski M, Tremblay J, Uritskiy G, Vicedomini R, Wang Z, Wang Z, Wang Z, Warren A, Willassen NP, Yelick K, You R, Zeller G, Zhao Z, Zhu S, Zhu J, Garrido-Oter R, Gastmeier P, Hacquard S, Häußler S, Khaledi A, Maechler F, Mesny F, Radutoiu S, Schulze-Lefert P, Smit N, Strowig T, et alMeyer F, Fritz A, Deng ZL, Koslicki D, Lesker TR, Gurevich A, Robertson G, Alser M, Antipov D, Beghini F, Bertrand D, Brito JJ, Brown CT, Buchmann J, Buluç A, Chen B, Chikhi R, Clausen PTLC, Cristian A, Dabrowski PW, Darling AE, Egan R, Eskin E, Georganas E, Goltsman E, Gray MA, Hansen LH, Hofmeyr S, Huang P, Irber L, Jia H, Jørgensen TS, Kieser SD, Klemetsen T, Kola A, Kolmogorov M, Korobeynikov A, Kwan J, LaPierre N, Lemaitre C, Li C, Limasset A, Malcher-Miranda F, Mangul S, Marcelino VR, Marchet C, Marijon P, Meleshko D, Mende DR, Milanese A, Nagarajan N, Nissen J, Nurk S, Oliker L, Paoli L, Peterlongo P, Piro VC, Porter JS, Rasmussen S, Rees ER, Reinert K, Renard B, Robertsen EM, Rosen GL, Ruscheweyh HJ, Sarwal V, Segata N, Seiler E, Shi L, Sun F, Sunagawa S, Sørensen SJ, Thomas A, Tong C, Trajkovski M, Tremblay J, Uritskiy G, Vicedomini R, Wang Z, Wang Z, Wang Z, Warren A, Willassen NP, Yelick K, You R, Zeller G, Zhao Z, Zhu S, Zhu J, Garrido-Oter R, Gastmeier P, Hacquard S, Häußler S, Khaledi A, Maechler F, Mesny F, Radutoiu S, Schulze-Lefert P, Smit N, Strowig T, Bremges A, Sczyrba A, McHardy AC. Critical Assessment of Metagenome Interpretation: the second round of challenges. Nat Methods 2022; 19:429-440. [PMID: 35396482 PMCID: PMC9007738 DOI: 10.1038/s41592-022-01431-4] [Show More Authors] [Citation(s) in RCA: 155] [Impact Index Per Article: 51.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2021] [Accepted: 02/14/2022] [Indexed: 12/20/2022]
Abstract
Evaluating metagenomic software is key for optimizing metagenome interpretation and focus of the Initiative for the Critical Assessment of Metagenome Interpretation (CAMI). The CAMI II challenge engaged the community to assess methods on realistic and complex datasets with long- and short-read sequences, created computationally from around 1,700 new and known genomes, as well as 600 new plasmids and viruses. Here we analyze 5,002 results by 76 program versions. Substantial improvements were seen in assembly, some due to long-read data. Related strains still were challenging for assembly and genome recovery through binning, as was assembly quality for the latter. Profilers markedly matured, with taxon profilers and binners excelling at higher bacterial ranks, but underperforming for viruses and Archaea. Clinical pathogen detection results revealed a need to improve reproducibility. Runtime and memory usage analyses identified efficient programs, including top performers with other metrics. The results identify challenges and guide researchers in selecting methods for analyses. This study presents the results of the second round of the Critical Assessment of Metagenome Interpretation challenges (CAMI II), which is a community-driven effort for comprehensively benchmarking tools for metagenomics data analysis.
Collapse
Affiliation(s)
- Fernando Meyer
- Computational Biology of Infection Research, Helmholtz Centre for Infection Research, Braunschweig, Germany.,Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
| | - Adrian Fritz
- Computational Biology of Infection Research, Helmholtz Centre for Infection Research, Braunschweig, Germany.,Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany.,German Center for Infection Research (DZIF), Hannover-Braunschweig Site, Braunschweig, Germany
| | - Zhi-Luo Deng
- Computational Biology of Infection Research, Helmholtz Centre for Infection Research, Braunschweig, Germany.,Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany.,Cluster of Excellence RESIST (EXC 2155), Hannover Medical School, Hannover, Germany
| | | | - Till Robin Lesker
- German Center for Infection Research (DZIF), Hannover-Braunschweig Site, Braunschweig, Germany.,Helmholtz Centre for Infection Research, Braunschweig, Germany
| | | | - Gary Robertson
- Computational Biology of Infection Research, Helmholtz Centre for Infection Research, Braunschweig, Germany.,Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
| | - Mohammed Alser
- Department of Information Technology and Electrical Engineering, ETH Zürich, Zurich, Switzerland
| | - Dmitry Antipov
- Center for Algorithmic Biotechnology, Saint Petersburg State University, Saint Petersburg, Russia
| | | | | | | | | | - Jan Buchmann
- Institute for Biological Data Science, Heinrich-Heine-University, Düsseldorf, Germany
| | - Aydin Buluç
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA.,University of California, Berkeley, Berkeley, CA, USA
| | - Bo Chen
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA.,University of California, Berkeley, Berkeley, CA, USA
| | | | - Philip T L C Clausen
- National Food Institute, Division of Global Surveillance, Technical University of Denmark, Lyngby, Denmark
| | - Alexandru Cristian
- Drexel University, Philadelphia, PA, USA.,Google Inc., Philadelphia, PA, USA
| | - Piotr Wojciech Dabrowski
- Robert Koch-Institut, Berlin, Germany.,Hochschule für Technik und Wirtschaft Berlin, Berlin, Germany
| | | | - Rob Egan
- DOE Joint Genome Institute, Berkeley, CA, USA.,Lawrence Berkeley National Laboratories, Berkeley, CA, USA
| | - Eleazar Eskin
- University of California, Los Angeles, Los Angeles, CA, USA
| | | | - Eugene Goltsman
- DOE Joint Genome Institute, Berkeley, CA, USA.,Lawrence Berkeley National Laboratories, Berkeley, CA, USA
| | - Melissa A Gray
- Drexel University, Philadelphia, PA, USA.,Ecological and Evolutionary Signal-Processing and Informatics Laboratory, Philadelphia, PA, USA
| | - Lars Hestbjerg Hansen
- University of Copenhagen, Department of Plant and Environmental Science, Frederiksberg, Denmark
| | - Steven Hofmeyr
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA.,University of California, Berkeley, Berkeley, CA, USA
| | - Pingqin Huang
- School of Computer Science, Fudan University, Shanghai, China
| | - Luiz Irber
- University of California, Davis, Davis, CA, USA
| | - Huijue Jia
- BGI-Shenzhen, Shenzhen, China.,Shenzhen Key Laboratory of Human Commensal Microorganisms and Health Research, BGI-Shenzhen, Shenzhen, China
| | - Tue Sparholt Jørgensen
- Technical University of Denmark, Novo Nordisk Foundation Center for Biosustainability, Lyngby, Denmark.,Aarhus University, Department of Environmental Science, Roskilde, Denmark
| | - Silas D Kieser
- Department of Cell Physiology and Metabolism, Faculty of Medicine, University of Geneva, Geneva, Switzerland.,Swiss Institute of Bioinformatics, Geneva, Switzerland
| | | | - Axel Kola
- Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Mikhail Kolmogorov
- Department of Computer Science and Engineering, University of California San Diego, San Diego, CA, USA
| | - Anton Korobeynikov
- Center for Algorithmic Biotechnology, Saint Petersburg State University, Saint Petersburg, Russia.,Department of Statistical Modelling, Saint Petersburg State University, Saint Petersburg, Russia
| | - Jason Kwan
- University of Wisconsin-Madison, Madison, WI, USA
| | | | | | - Chenhao Li
- Genome Institute of Singapore, Singapore, Singapore
| | | | - Fabio Malcher-Miranda
- Hasso Plattner Institute, Digital Engineering Faculty, University of Potsdam, Potsdam, Germany
| | | | - Vanessa R Marcelino
- Sydney Medical School, The University of Sydney, Sydney, Australia.,Centre for Innate Immunity and Infectious Diseases, Hudson Institute of Medical Research, Clayton, Australia
| | | | - Pierre Marijon
- Department of Computer Science, Inria, University of Lille, CNRS, Lille, France
| | - Dmitry Meleshko
- Center for Algorithmic Biotechnology, Saint Petersburg State University, Saint Petersburg, Russia
| | - Daniel R Mende
- Amsterdam University Medical Center, Amsterdam, the Netherlands
| | - Alessio Milanese
- Department of Biology, Institute of Microbiology and Swiss Institute of Bioinformatics, ETH Zürich, Zürich, Switzerland.,Structural and Computational Biology Unit, EMBL, Heidelberg, Germany
| | - Niranjan Nagarajan
- Genome Institute of Singapore, A*STAR, Singapore, Singapore.,National University of Singapore, Singapore, Singapore
| | | | - Sergey Nurk
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Leonid Oliker
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA.,University of California, Berkeley, Berkeley, CA, USA
| | - Lucas Paoli
- Department of Biology, Institute of Microbiology and Swiss Institute of Bioinformatics, ETH Zürich, Zürich, Switzerland
| | | | - Vitor C Piro
- Hasso Plattner Institute, Digital Engineering Faculty, University of Potsdam, Potsdam, Germany
| | | | - Simon Rasmussen
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Evan R Rees
- University of Wisconsin-Madison, Madison, WI, USA
| | - Knut Reinert
- Institute for Bioinformatics, FU Berlin, Berlin, Germany
| | - Bernhard Renard
- Hasso Plattner Institute, Digital Engineering Faculty, University of Potsdam, Potsdam, Germany.,Bioinformatics Unit (MF1), Robert Koch Institute, Berlin, Germany
| | | | - Gail L Rosen
- Drexel University, Philadelphia, PA, USA.,Ecological and Evolutionary Signal-Processing and Informatics Laboratory, Philadelphia, PA, USA.,Center for Biological Discovery from Big Data, Philadelphia, PA, USA
| | - Hans-Joachim Ruscheweyh
- Department of Biology, Institute of Microbiology and Swiss Institute of Bioinformatics, ETH Zürich, Zürich, Switzerland
| | - Varuni Sarwal
- University of California, Los Angeles, Los Angeles, CA, USA
| | - Nicola Segata
- Department CIBIO, University of Trento, Trento, Italy
| | - Enrico Seiler
- Institute for Bioinformatics, FU Berlin, Berlin, Germany
| | - Lizhen Shi
- Florida Polytechnic University, Lakeland, FL, USA
| | - Fengzhu Sun
- Quantitative and Computational Biology Department, University of Southern California, Los Angeles, CA, USA
| | - Shinichi Sunagawa
- Department of Biology, Institute of Microbiology and Swiss Institute of Bioinformatics, ETH Zürich, Zürich, Switzerland
| | | | - Ashleigh Thomas
- DOE Joint Genome Institute, Berkeley, CA, USA.,University of British Columbia, Vancouver, British Columbia, Canada
| | | | - Mirko Trajkovski
- Department of Cell Physiology and Metabolism, Faculty of Medicine, University of Geneva, Geneva, Switzerland.,Diabetes Center, Faculty of Medicine, University of Geneva, Geneva, Switzerland
| | - Julien Tremblay
- Energy, Mining and Environment, National Research Council Canada, Montreal, Quebec, Canada
| | | | | | - Zhengyang Wang
- School of Computer Science, Fudan University, Shanghai, China
| | - Ziye Wang
- School of Mathematical Sciences, Fudan University, Shanghai, China
| | - Zhong Wang
- Department of Energy Joint Genome Institute, Berkeley, CA, USA.,Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.,School of Natural Sciences, University of California at Merced, Merced, CA, USA
| | | | | | - Katherine Yelick
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA.,University of California, Berkeley, Berkeley, CA, USA
| | - Ronghui You
- School of Computer Science, Fudan University, Shanghai, China
| | - Georg Zeller
- Structural and Computational Biology Unit, EMBL, Heidelberg, Germany
| | | | - Shanfeng Zhu
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, China.,Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence (Fudan University), Ministry of Education, Shanghai, China
| | - Jie Zhu
- BGI-Shenzhen, Shenzhen, China.,Shenzhen Key Laboratory of Human Commensal Microorganisms and Health Research, BGI-Shenzhen, Shenzhen, China
| | | | | | | | - Susanne Häußler
- Helmholtz Centre for Infection Research, Braunschweig, Germany
| | - Ariane Khaledi
- Helmholtz Centre for Infection Research, Braunschweig, Germany
| | | | - Fantin Mesny
- Max Planck Institute for Plant Breeding Research, Köln, Germany
| | | | | | - Nathiana Smit
- Helmholtz Centre for Infection Research, Braunschweig, Germany
| | - Till Strowig
- Helmholtz Centre for Infection Research, Braunschweig, Germany
| | - Andreas Bremges
- Computational Biology of Infection Research, Helmholtz Centre for Infection Research, Braunschweig, Germany.,German Center for Infection Research (DZIF), Hannover-Braunschweig Site, Braunschweig, Germany
| | - Alexander Sczyrba
- Center for Biotechnology (CeBiTec), Bielefeld University, Bielefeld, Germany
| | - Alice Carolyn McHardy
- Computational Biology of Infection Research, Helmholtz Centre for Infection Research, Braunschweig, Germany. .,Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany. .,German Center for Infection Research (DZIF), Hannover-Braunschweig Site, Braunschweig, Germany. .,Cluster of Excellence RESIST (EXC 2155), Hannover Medical School, Hannover, Germany.
| |
Collapse
|
15
|
Djemiel C, Maron PA, Terrat S, Dequiedt S, Cottin A, Ranjard L. Inferring microbiota functions from taxonomic genes: a review. Gigascience 2022; 11:giab090. [PMID: 35022702 PMCID: PMC8756179 DOI: 10.1093/gigascience/giab090] [Citation(s) in RCA: 50] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Revised: 12/02/2021] [Accepted: 12/02/2021] [Indexed: 12/13/2022] Open
Abstract
Deciphering microbiota functions is crucial to predict ecosystem sustainability in response to global change. High-throughput sequencing at the individual or community level has revolutionized our understanding of microbial ecology, leading to the big data era and improving our ability to link microbial diversity with microbial functions. Recent advances in bioinformatics have been key for developing functional prediction tools based on DNA metabarcoding data and using taxonomic gene information. This cheaper approach in every aspect serves as an alternative to shotgun sequencing. Although these tools are increasingly used by ecologists, an objective evaluation of their modularity, portability, and robustness is lacking. Here, we reviewed 100 scientific papers on functional inference and ecological trait assignment to rank the advantages, specificities, and drawbacks of these tools, using a scientific benchmarking. To date, inference tools have been mainly devoted to bacterial functions, and ecological trait assignment tools, to fungal functions. A major limitation is the lack of reference genomes-compared with the human microbiota-especially for complex ecosystems such as soils. Finally, we explore applied research prospects. These tools are promising and already provide relevant information on ecosystem functioning, but standardized indicators and corresponding repositories are still lacking that would enable them to be used for operational diagnosis.
Collapse
Affiliation(s)
- Christophe Djemiel
- Agroécologie, AgroSup Dijon, INRAE, Université de Bourgogne, Université de Bourgogne Franche-Comté, F-21000 Dijon, France
| | - Pierre-Alain Maron
- Agroécologie, AgroSup Dijon, INRAE, Université de Bourgogne, Université de Bourgogne Franche-Comté, F-21000 Dijon, France
| | - Sébastien Terrat
- Agroécologie, AgroSup Dijon, INRAE, Université de Bourgogne, Université de Bourgogne Franche-Comté, F-21000 Dijon, France
| | - Samuel Dequiedt
- Agroécologie, AgroSup Dijon, INRAE, Université de Bourgogne, Université de Bourgogne Franche-Comté, F-21000 Dijon, France
| | - Aurélien Cottin
- Agroécologie, AgroSup Dijon, INRAE, Université de Bourgogne, Université de Bourgogne Franche-Comté, F-21000 Dijon, France
| | - Lionel Ranjard
- Agroécologie, AgroSup Dijon, INRAE, Université de Bourgogne, Université de Bourgogne Franche-Comté, F-21000 Dijon, France
| |
Collapse
|