1
|
Zhou J, Wu H, Du K, Zhou W, Zhou CZ, Li H. PCVR: a pre-trained contextualized visual representation for DNA sequence classification. BMC Bioinformatics 2025; 26:125. [PMID: 40346458 PMCID: PMC12065381 DOI: 10.1186/s12859-025-06136-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2024] [Accepted: 04/07/2025] [Indexed: 05/11/2025] Open
Abstract
BACKGROUND The classification of DNA sequences is pivotal in bioinformatics, essentially for genetic information analysis. Traditional alignment-based tools tend to have slow speed and low recall. Machine learning methods learn implicit patterns from data with encoding techniques such as k-mer counting and ordinal encoding, which fail to handle long sequences or sacrifice structural and sequential information. Frequency chaos game representation (FCGR) converts DNA sequences of arbitrary lengths into fixed-size images, breaking free from the constraints of sequence length while preserving more sequential information than other representations. However, existing works merely consider local information, ignoring long-range dependencies and global contextual information within FCGR image. RESULTS We propose PCVR, a Pre-trained Contextualized Visual Representation for DNA sequence classification. PCVR encodes FCGR with a vision transformer into contextualized features containing more global information. To meet the substantial data requirements of the training of vision transformer and learn more robust features, we pre-train the encoder with a masked autoencoder. Pre-trained PCVR exhibits impressive performance on three datasets even with only unsupervised learning. After fine-tuning, PCVR outperforms existing methods on superkingdom and phylum levels. Additionally, our ablation studies confirm the contribution of the vision transformer encoder and masked autoencoder pre-training to performance improvement. CONCLUSIONS PCVR significantly improves DNA sequence classification accuracy and shows strong potential for new species discovery due to its effective capture of global information and robustness. Codes for PCVR are available at https://github.com/jiaruizhou/PCVR .
Collapse
Affiliation(s)
- Jiarui Zhou
- School of Artificial Intelligence and Data Science, University of Science and Technology of China, Hefei, 230026, Anhui Province, China
| | - Hui Wu
- Department of Electronic Engineering and Information Science, University of Science and Technology of China, Hefei, 230026, Anhui Province, China
| | - Kang Du
- Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, 230026, Anhui Province, China
| | - Wengang Zhou
- Department of Electronic Engineering and Information Science, University of Science and Technology of China, Hefei, 230026, Anhui Province, China.
| | - Cong-Zhao Zhou
- Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, 230026, Anhui Province, China
| | - Houqiang Li
- Department of Electronic Engineering and Information Science, University of Science and Technology of China, Hefei, 230026, Anhui Province, China
| |
Collapse
|
2
|
Jia Y, Huang D, Lan X, Sun X, Lin W, Sun W, Wang Y. Community structure and metabolic potentials of keystone taxa and their associated bacteriophages within rice root endophytic microbiome in response to metal(loid)s contamination. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2025; 372:126028. [PMID: 40064231 DOI: 10.1016/j.envpol.2025.126028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/14/2024] [Revised: 03/03/2025] [Accepted: 03/07/2025] [Indexed: 03/17/2025]
Abstract
Heavy metal (HM) contamination of agricultural products is of global environmental concern as it directly threatened the food safety. Plant-associated microbiome, particularly endophytic microbiome, hold the potential for mitigating HM stress as well as promoting plant growth. The metabolic potentials of the endophytes, especially those under the HM stresses, have not been well addressed. Rice, a major staple food worldwide, is more vulnerable to HM contamination compared to other crops and therefore requires special attentions. Therefore, this study selected rice as the target plants. Geochemical analysis and amplicon sequencing were combined to characterize the rice root endophytic bacterial communities and identify keystone taxa in two HM-contaminated rice fields. Metagenomic analysis was employed to investigate the metabolic potentials of these keystone taxa. Burkholderiales and Rhizobiales were identified as predominant keystone taxa. The metagenome-assembled genome (MAG)s associated with these keystone populations suggested that they possessed diverse genetic potentials related to metal resistance and transformation (e.g., As resistance and cycling, V reduction, Cr efflux and reduction), and plant growth promotion (nitrogen fixation, phosphate solubilization, oxidative stress resistance, indole-3-acetic acid, and siderophore production). Moreover, bacteriophages encoding auxiliary metabolism genes (AMGs) associated with the HM resistance as well as nitrogen and phosphate acquisition were identified, suggesting that these phages may contribute to these crucial biogeochemical processes within rice roots. The current findings revealed the beneficial roles of rice endophytic keystone taxa and their associated bacteriophages within HM-contaminated rice root endophytic microbiome, which may provide valuable insights on future applications of employing root microbiome for safety management of agriculture productions.
Collapse
Affiliation(s)
- Yanlong Jia
- School of Chemistry and Environmental Engineering, Hanshan Normal University, Chaozhou, 521041, China; School of Resources and Environmental Engineering, Guizhou Institute of Technology, Guiyang, 550002, China
| | - Duanyi Huang
- National-Regional Joint Engineering Research Center for Soil Pollution Control and Remediation in South China, Guangdong Key Laboratory of Integrated Agro-Environmental Pollution Control and Management, Institute of Eco-Environmental and Soil Sciences, Guangdong Academy of Sciences, Guangzhou, 510650, China; College of Environmental Science and Engineering, Hunan University, Changsha, 410082, China
| | - Xiaolong Lan
- School of Chemistry and Environmental Engineering, Hanshan Normal University, Chaozhou, 521041, China.
| | - Xiaoxu Sun
- National-Regional Joint Engineering Research Center for Soil Pollution Control and Remediation in South China, Guangdong Key Laboratory of Integrated Agro-Environmental Pollution Control and Management, Institute of Eco-Environmental and Soil Sciences, Guangdong Academy of Sciences, Guangzhou, 510650, China
| | - Wenjie Lin
- School of Chemistry and Environmental Engineering, Hanshan Normal University, Chaozhou, 521041, China
| | - Weimin Sun
- National-Regional Joint Engineering Research Center for Soil Pollution Control and Remediation in South China, Guangdong Key Laboratory of Integrated Agro-Environmental Pollution Control and Management, Institute of Eco-Environmental and Soil Sciences, Guangdong Academy of Sciences, Guangzhou, 510650, China; College of Environmental Science and Engineering, Hunan University, Changsha, 410082, China
| | - Yize Wang
- National-Regional Joint Engineering Research Center for Soil Pollution Control and Remediation in South China, Guangdong Key Laboratory of Integrated Agro-Environmental Pollution Control and Management, Institute of Eco-Environmental and Soil Sciences, Guangdong Academy of Sciences, Guangzhou, 510650, China.
| |
Collapse
|
3
|
Sirasani JP, Gardner C, Jung G, Lee H, Ahn TH. Bioinformatic approaches to blood and tissue microbiome analyses: challenges and perspectives. Brief Bioinform 2025; 26:bbaf176. [PMID: 40269515 PMCID: PMC12018304 DOI: 10.1093/bib/bbaf176] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2024] [Revised: 03/05/2025] [Accepted: 03/25/2025] [Indexed: 04/25/2025] Open
Abstract
Advances in next-generation sequencing have resulted in a growing understanding of the microbiome and its role in human health. Unlike traditional microbiome analysis, blood and tissue microbiome analyses focus on the detection and characterization of microbial DNA in blood and tissue, previously considered a sterile environment. In this review, we discuss the challenges and methodologies associated with analyzing these samples, particularly emphasizing blood and tissue microbiome research. Key preprocessing steps-including the removal of ribosomal RNA, host DNA, and other contaminants-are critical to reducing noise and accurately capturing microbial evidence. We also explore how taxonomic profiling tools, machine learning, and advanced normalization techniques address contamination and low microbial biomass, thereby improving reliability. While it offers the potential for identifying microbial involvement in systemic diseases previously undetectable by traditional methods, this methodology also carries risks and lacks universal acceptance due to concerns over reliability and interpretation errors. This paper critically reviews these factors, highlighting both the promise and pitfalls of using blood and tissue microbiome analyses as a tool for biomarker discovery.
Collapse
Affiliation(s)
- Jammi Prasanthi Sirasani
- Program of Bioinformatics and Computational Biology, Saint Louis University, St. Louis, MO, United States
| | - Cory Gardner
- Department of Computer Science, Saint Louis University, St. Louis, MO, United States
| | - Gihwan Jung
- Department of Computer Science, Saint Louis University, St. Louis, MO, United States
| | - Hyunju Lee
- AI Graduate School, Gwangju Institute of Science and Technology, Gwangju 61005, South Korea
| | - Tae-Hyuk Ahn
- Program of Bioinformatics and Computational Biology, Saint Louis University, St. Louis, MO, United States
- Department of Computer Science, Saint Louis University, St. Louis, MO, United States
| |
Collapse
|
4
|
Sáenz JS, Rios-Galicia B, Seifert J. Antiviral defense systems in the rumen microbiome. mSystems 2025; 10:e0152124. [PMID: 39807869 PMCID: PMC11834463 DOI: 10.1128/msystems.01521-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2024] [Accepted: 12/10/2024] [Indexed: 01/16/2025] Open
Abstract
The continuous interaction between phages and their respective hosts has resulted in the evolution of multiple bacterial immune mechanisms. However, the diversity and prevalence of antiviral defense systems in complex communities are still unknown. We therefore investigated the diversity and abundance of viral defense systems in 3,038 high-quality bacterial and archaeal genomes from the rumen. In total, 14,241 defense systems and 31,948 antiviral-related genes were identified. Those genes represented 114 unique system types grouped into 49 families. We observed a high prevalence of defense systems in the genomes. However, the number of defense systems, defense system families, and system density varied widely from genome to genome. Additionally, the number of defense system per genome correlated positively with the number of defense system families and the genome size. Restriction modification, Abi, and cas system families were the most common, but many rare systems were present in only 1% of the genomes. Antiviral defense systems are prevalent and diverse in the rumen, but only a few are dominant, indicating that most systems are rarely present. However, the collection of systems throughout the rumen may represent a pool of mechanisms that can be shared by different members of the community and modulate the phage-host interaction.IMPORTANCEPhages may act antagonistically at the cell level but have a mutualistic interaction at the microbiome level. This interaction shapes the structure of microbial communities and is mainly driven by the defense mechanism. However, the diversity of such mechanism is larger than previously thought. Because of that, we described the abundance and diversity of the antiviral defense system of a collection of genomes, metagenome-assembled genomes (MAGs) and isolates, from the rumen. While defense mechanisms seem to be prevalent among bacteria and archaea, only a few were common. This suggests that most of these defense mechanisms are not present in many rumen microbes but could be shared among different members of the microbial community. This is consistent with the "pan-immune system" model, which appears to be common across different environments.
Collapse
Affiliation(s)
- Johan S. Sáenz
- Institute of Animal Science, University of Hohenheim, Stuttgart, Germany
- HoLMiR—Hohenheim Center for Livestock Microbiome Research, University of Hohenheim, Stuttgart, Germany
| | - Bibiana Rios-Galicia
- Institute of Animal Science, University of Hohenheim, Stuttgart, Germany
- HoLMiR—Hohenheim Center for Livestock Microbiome Research, University of Hohenheim, Stuttgart, Germany
| | - Jana Seifert
- Institute of Animal Science, University of Hohenheim, Stuttgart, Germany
- HoLMiR—Hohenheim Center for Livestock Microbiome Research, University of Hohenheim, Stuttgart, Germany
| |
Collapse
|
5
|
Qayyum H, Talib MS, Ali A, Kayani MUR. Evaluating the potential of assembler-binner combinations in recovering low-abundance and strain-resolved genomes from human metagenomes. Heliyon 2025; 11:e41938. [PMID: 39897886 PMCID: PMC11786835 DOI: 10.1016/j.heliyon.2025.e41938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2024] [Revised: 01/08/2025] [Accepted: 01/13/2025] [Indexed: 02/04/2025] Open
Abstract
Human-associated microbial communities are a complex mixture of bacterial species and diverse strains prevalent at varying abundances. Due to the inherent limitations of metagenomic assemblers and genome binning tools in recovering low-abundance species (<1 %) and strains, we lack comprehensive insight into these communities. Although many bioinformatics approaches are available for recovering metagenome-assembled genomes, their effectiveness in recovering low-abundance species and strains is often questioned. Moreover, each tool has its trade-offs, making selecting the right tools challenging. In this study, we investigated the combinatory effect of various assemblers and binning tools on the recovery of low-abundance species and strain-resolved genomes from real and simulated human metagenomes. We evaluated the performance of nine combinations of metagenome assemblers and genome binning tools for their potential to recover genomes of useable quality. Our results revealed that the metaSPAdes-MetaBAT2 combination is highly effective in recovering low-abundance species, while MEGAHIT-MetaBAT2 excels in recovering strain-resolved genomes. These findings highlight the significant variation in the performance of different combinations, even when aiming for the same objective. This suggests the profound impact of selecting the right assembler-binner combination for metagenome analyses. We believe this study will be a cornerstone for the scientific community, guiding the choice of tools by highlighting their complementary effects. Furthermore, it underscores the potential of existing tools to address the current challenges in the field improving the recovery of information from metagenomes.
Collapse
Affiliation(s)
- Hajra Qayyum
- Integrative Biology Laboratory, Department of Microbiology and Biotechnology, Atta-ur-Rahman School of Applied Biosciences (ASAB), National University of Sciences and Technology (NUST), Srinagar Highway, Sector H-12, Islamabad, Pakistan
- Capital University of Science & Technology, Islamabad Expressway, Kahuta Road Zone-V Sihala, Islamabad, Pakistan
| | - Muhammad Sarfraz Talib
- Integrative Biology Laboratory, Department of Microbiology and Biotechnology, Atta-ur-Rahman School of Applied Biosciences (ASAB), National University of Sciences and Technology (NUST), Srinagar Highway, Sector H-12, Islamabad, Pakistan
| | - Amjad Ali
- Integrative Biology Laboratory, Department of Microbiology and Biotechnology, Atta-ur-Rahman School of Applied Biosciences (ASAB), National University of Sciences and Technology (NUST), Srinagar Highway, Sector H-12, Islamabad, Pakistan
| | - Masood Ur Rehman Kayani
- Metagenomics Discovery Lab, School of Interdisciplinary Engineering and Sciences (SINES), National University of Sciences and Technology (NUST), Srinagar Highway, Sector H-12, Islamabad, Pakistan
| |
Collapse
|
6
|
Nowak VV, Hou P, Owen JG. Microbial communities associated with marine sponges from diverse geographic locations harbor biosynthetic novelty. Appl Environ Microbiol 2024; 90:e0072624. [PMID: 39565113 DOI: 10.1128/aem.00726-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2024] [Accepted: 10/09/2024] [Indexed: 11/21/2024] Open
Abstract
Marine sponges are a prolific source of biologically active small molecules, many of which originate from sponge-associated bacteria. Identifying the producing bacteria is a key step in developing sustainable routes for the production of these metabolites. To facilitate the required computational analyses, we developed MetaSing, a reproducible singularity-based pipeline for assembly, identification of high-quality metagenome-assembled genomes (MAGs), and analysis of biosynthetic gene clusters (BGCs) from metagenomic short-read data. We applied this pipeline to metagenomic sequencing data from 16 marine sponges collected from New Zealand, Tonga, and the Mediterranean Sea. This analysis yielded 643 MAGs representing 510 species. Of the 2,670 BGCs identified across all samples, 70.8% were linked to a MAG. Comparison of BGCs to those identified from previously sequenced bacteria revealed high biosynthetic novelty in variety of underexplored phyla, including Poribacteria, Acidobacteriota, and Dadabacteria. Alongside the observation that each sample contains unique biosynthetic potential, this holds great promise for natural product discovery and for furthering the understanding of different sponge holobionts.IMPORTANCEDiscovery of new chemical compounds such as natural products is a crucial endeavor to combat the increasing resistance to antibiotics and other drugs. This manuscript demonstrates that microbial communities associated with marine sponges investigated in this work encode the potential to produce novel chemistry. Lesser studied bacterial taxa that are often difficult to cultivate are particularly rich in potential.
Collapse
Affiliation(s)
- Vincent V Nowak
- School of Biological Sciences, Victoria University of Wellington, Wellington, New Zealand
- Centre for Biodiscovery, Victoria University of Wellington, Wellington, New Zealand
- Maurice Wilkins Centre for Molecular Biodiscovery, Auckland, New Zealand
| | - Peng Hou
- School of Biological Sciences, Victoria University of Wellington, Wellington, New Zealand
- Centre for Biodiscovery, Victoria University of Wellington, Wellington, New Zealand
| | - Jeremy G Owen
- School of Biological Sciences, Victoria University of Wellington, Wellington, New Zealand
- Centre for Biodiscovery, Victoria University of Wellington, Wellington, New Zealand
- Maurice Wilkins Centre for Molecular Biodiscovery, Auckland, New Zealand
| |
Collapse
|
7
|
Aplakidou E, Vergoulidis N, Chasapi M, Venetsianou NK, Kokoli M, Panagiotopoulou E, Iliopoulos I, Karatzas E, Pafilis E, Georgakopoulos-Soares I, Kyrpides NC, Pavlopoulos GA, Baltoumas FA. Visualizing metagenomic and metatranscriptomic data: A comprehensive review. Comput Struct Biotechnol J 2024; 23:2011-2033. [PMID: 38765606 PMCID: PMC11101950 DOI: 10.1016/j.csbj.2024.04.060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2024] [Revised: 04/25/2024] [Accepted: 04/25/2024] [Indexed: 05/22/2024] Open
Abstract
The fields of Metagenomics and Metatranscriptomics involve the examination of complete nucleotide sequences, gene identification, and analysis of potential biological functions within diverse organisms or environmental samples. Despite the vast opportunities for discovery in metagenomics, the sheer volume and complexity of sequence data often present challenges in processing analysis and visualization. This article highlights the critical role of advanced visualization tools in enabling effective exploration, querying, and analysis of these complex datasets. Emphasizing the importance of accessibility, the article categorizes various visualizers based on their intended applications and highlights their utility in empowering bioinformaticians and non-bioinformaticians to interpret and derive insights from meta-omics data effectively.
Collapse
Affiliation(s)
- Eleni Aplakidou
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
- Department of Informatics and Telecommunications, Data Science and Information Technologies program, University of Athens, 15784 Athens, Greece
| | - Nikolaos Vergoulidis
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
| | - Maria Chasapi
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
- Department of Informatics and Telecommunications, Data Science and Information Technologies program, University of Athens, 15784 Athens, Greece
| | - Nefeli K. Venetsianou
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
| | - Maria Kokoli
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
| | - Eleni Panagiotopoulou
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
- Department of Informatics and Telecommunications, Data Science and Information Technologies program, University of Athens, 15784 Athens, Greece
| | - Ioannis Iliopoulos
- Department of Basic Sciences, School of Medicine, University of Crete, 71003 Heraklion, Greece
| | - Evangelos Karatzas
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Evangelos Pafilis
- Institute of Marine Biology, Biotechnology and Aquaculture (IMBBC), Hellenic Centre for Marine Research (HCMR), Heraklion, Greece
| | - Ilias Georgakopoulos-Soares
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Nikos C. Kyrpides
- DOE Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Georgios A. Pavlopoulos
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
- Institute for Personalized Medicine, Department of Biochemistry and Molecular Biology, The Pennsylvania State University College of Medicine, Hershey, PA, USA
- Center of New Biotechnologies & Precision Medicine, Department of Medicine, School of Health Sciences, National and Kapodistrian University of Athens, Greece
- Hellenic Army Academy, 16673 Vari, Greece
| | - Fotis A. Baltoumas
- Institute for Fundamental Biomedical Research, BSRC "Alexander Fleming", Vari, Greece
| |
Collapse
|
8
|
Zhang P, Wang H, Klima C, Yang X. Microbiota in lymph nodes of cattle harvested in a Canadian meat processing plant. Food Res Int 2024; 191:114693. [PMID: 39059949 DOI: 10.1016/j.foodres.2024.114693] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2024] [Revised: 06/19/2024] [Accepted: 06/26/2024] [Indexed: 07/28/2024]
Abstract
Lymph nodes (LN) harboring bacteria, when being incorporated into ground beef, may impact the microbial safety and quality of such products. We tested two main foodborne pathogens Salmonella and Shiga toxin-producing Escherichia coli (STEC) and profiled the microbiota in LNs (n = 160) of cattle harvested at a Canadian abattoir, by conventional plating methods, PCR, and high throughput sequencing. LNs at two anatomical locations, subiliac and popliteal from 80 cattle were included. All cattle had bacteria detected in popliteal and/or subiliac LNs with the maximum bacterial load of 5.4 and 2.8 log10CFU/g in popliteal and subiliac LNs, respectively. Neither Salmonella nor STEC was found in LNs although STEC was detected in a significant percentage of samples from beef hides (50.6 %) by plating and/or PCR. Both 16S rRNA gene amplicon and metagenome sequencing found the predominance of Escherichia (13-34.6 % among bacterial community), Clostridium (12.6-20.6 %) and Streptococcus (9.7-10 %) in popliteal LNs. Metagenomic sequencing was able to identify the predominant taxa at species level with E. coli (13 %), Clostridium perfringens (11.1 %) and Streptococcus uberis (6 %) predominant in LNs. Low prevalence/abundance of Salmonella was found by metagenomic sequencing. In conclusion, the relatively high bacterial load and diversity in LNs may affect the shelf life of ground beef and high relative abundance of E. coli would warrant further monitoring.
Collapse
Affiliation(s)
- Peipei Zhang
- Agriculture and Agri-Food Canada, Lacombe, Alberta, Canada
| | - Hui Wang
- Agriculture and Agri-Food Canada, Lacombe, Alberta, Canada
| | - Cassidy Klima
- Beef Cattle Research Council, Calgary, Alberta, Canada
| | - Xianqin Yang
- Agriculture and Agri-Food Canada, Lacombe, Alberta, Canada.
| |
Collapse
|
9
|
Meier TA, Refahi MS, Hearne G, Restifo DS, Munoz-Acuna R, Rosen GL, Woloszynek S. The Role and Applications of Artificial Intelligence in the Treatment of Chronic Pain. Curr Pain Headache Rep 2024; 28:769-784. [PMID: 38822995 DOI: 10.1007/s11916-024-01264-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/28/2024] [Indexed: 06/03/2024]
Abstract
PURPOSE OF REVIEW This review aims to explore the interface between artificial intelligence (AI) and chronic pain, seeking to identify areas of focus for enhancing current treatments and yielding novel therapies. RECENT FINDINGS In the United States, the prevalence of chronic pain is estimated to be upwards of 40%. Its impact extends to increased healthcare costs, reduced economic productivity, and strain on healthcare resources. Addressing this condition is particularly challenging due to its complexity and the significant variability in how patients respond to treatment. Current options often struggle to provide long-term relief, with their benefits rarely outweighing the risks, such as dependency or other side effects. Currently, AI has impacted four key areas of chronic pain treatment and research: (1) predicting outcomes based on clinical information; (2) extracting features from text, specifically clinical notes; (3) modeling 'omic data to identify meaningful patient subgroups with potential for personalized treatments and improved understanding of disease processes; and (4) disentangling complex neuronal signals responsible for pain, which current therapies attempt to modulate. As AI advances, leveraging state-of-the-art architectures will be essential for improving chronic pain treatment. Current efforts aim to extract meaningful representations from complex data, paving the way for personalized medicine. The identification of unique patient subgroups should reveal targets for tailored chronic pain treatments. Moreover, enhancing current treatment approaches is achievable by gaining a more profound understanding of patient physiology and responses. This can be realized by leveraging AI on the increasing volume of data linked to chronic pain.
Collapse
Affiliation(s)
| | - Mohammad S Refahi
- Ecological and Evolutionary Signal-Processing and Informatics (EESI) Laboratory, Department of Electrical and Computer Engineering, Drexel University, Philadelphia, PA, USA
| | - Gavin Hearne
- Ecological and Evolutionary Signal-Processing and Informatics (EESI) Laboratory, Department of Electrical and Computer Engineering, Drexel University, Philadelphia, PA, USA
| | | | - Ricardo Munoz-Acuna
- Anesthesia, Critical Care, and Pain Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA
| | - Gail L Rosen
- Ecological and Evolutionary Signal-Processing and Informatics (EESI) Laboratory, Department of Electrical and Computer Engineering, Drexel University, Philadelphia, PA, USA
| | - Stephen Woloszynek
- Anesthesia, Critical Care, and Pain Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA.
| |
Collapse
|
10
|
Mallawaarachchi V, Wickramarachchi A, Xue H, Papudeshi B, Grigson SR, Bouras G, Prahl RE, Kaphle A, Verich A, Talamantes-Becerra B, Dinsdale EA, Edwards RA. Solving genomic puzzles: computational methods for metagenomic binning. Brief Bioinform 2024; 25:bbae372. [PMID: 39082646 PMCID: PMC11289683 DOI: 10.1093/bib/bbae372] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Revised: 06/05/2024] [Accepted: 07/15/2024] [Indexed: 08/03/2024] Open
Abstract
Metagenomics involves the study of genetic material obtained directly from communities of microorganisms living in natural environments. The field of metagenomics has provided valuable insights into the structure, diversity and ecology of microbial communities. Once an environmental sample is sequenced and processed, metagenomic binning clusters the sequences into bins representing different taxonomic groups such as species, genera, or higher levels. Several computational tools have been developed to automate the process of metagenomic binning. These tools have enabled the recovery of novel draft genomes of microorganisms allowing us to study their behaviors and functions within microbial communities. This review classifies and analyzes different approaches of metagenomic binning and different refinement, visualization, and evaluation techniques used by these methods. Furthermore, the review highlights the current challenges and areas of improvement present within the field of research.
Collapse
Affiliation(s)
- Vijini Mallawaarachchi
- Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, SA 5042, Australia
| | - Anuradha Wickramarachchi
- Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia
| | - Hansheng Xue
- School of Computing, National University of Singapore, Singapore 119077, Singapore
| | - Bhavya Papudeshi
- Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, SA 5042, Australia
| | - Susanna R Grigson
- Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, SA 5042, Australia
| | - George Bouras
- Adelaide Medical School, Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, SA 5005, Australia
- The Department of Surgery—Otolaryngology Head and Neck Surgery, University of Adelaide and the Basil Hetzel Institute for Translational Health Research, Central Adelaide Local Health Network, Adelaide, SA 5011, Australia
| | - Rosa E Prahl
- Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia
| | - Anubhav Kaphle
- Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia
| | - Andrey Verich
- Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia
- The Kirby Institute, The University of New South Wales, Randwick, Sydney, NSW 2052, Australia
| | - Berenice Talamantes-Becerra
- Australian e-Health Research Centre, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Westmead, NSW 2145, Australia
| | - Elizabeth A Dinsdale
- Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, SA 5042, Australia
| | - Robert A Edwards
- Flinders Accelerator for Microbiome Exploration, College of Science and Engineering, Flinders University, Adelaide, SA 5042, Australia
| |
Collapse
|
11
|
Xu D, Zhang X, Usman S, Bai J, Sheoran N, Guo X. Reducing transmission of high-risk antibiotic resistance genes in whole-crop corn silage through lactic acid bacteria inoculation and increasing ensiling temperature. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 926:172114. [PMID: 38561127 DOI: 10.1016/j.scitotenv.2024.172114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/10/2024] [Revised: 03/28/2024] [Accepted: 03/28/2024] [Indexed: 04/04/2024]
Abstract
The microbial hosts of antibiotic resistance genes (ARGs) found epiphytically on plant materials could grow and flourish during silage fermentation. This study employed metagenomic analysis and elucidated the occurrence and transmission mechanisms of ARGs and their microbial hosts in whole-crop corn silage inoculated with homofermentative strain Lactiplantibacillus plantarum or heterofermentative strain Lentilactobacillus buchneri ensiled under different temperature (20 and 30 °C). The results revealed that the corn silage was dominated by Lactobacillus, Leuconostoc, Lentilactobacillus, and Latilactobacillus. Both the ensiling temperature and inoculation had greatly modified the silage microbiota. However, regardless of the ensiling temperature, L. buchneri had significantly higher ARGs, while it only exhibited significantly higher mobile genetic elements (MGEs) in low temperature treatments. The microbial community of the corn silage hosted highly diverse form of ARGs, which were primarily MacB, RanA, bcrA, msbA, TetA (58), and TetT and mainly corresponded to macrolides and tetracyclines drug classes. Plasmids were identified as the most abundant MGEs with significant correlation with some high-risk ARGs (tetM, TolC, mdtH, and NorA), and their abundances have been reduced by ensiling process. Furthermore, higher temperature and L. buchneri reduced abundances of high-risk ARGs by modifying their hosts and reduced their transmission in the silage. Therefore, ensiling, L. buchneri inoculation and higher storage temperature could improve the biosafety of corn silage.
Collapse
Affiliation(s)
- Dongmei Xu
- State Key Laboratory of Grassland and Agro-ecosystems, School of Life Sciences, Lanzhou University, Lanzhou 730000, PR China; Probiotics and Biological Feed Research Center, Lanzhou University, Lanzhou 730000, PR China
| | - Xia Zhang
- State Key Laboratory of Grassland and Agro-ecosystems, School of Life Sciences, Lanzhou University, Lanzhou 730000, PR China; Probiotics and Biological Feed Research Center, Lanzhou University, Lanzhou 730000, PR China
| | - Samaila Usman
- State Key Laboratory of Grassland and Agro-ecosystems, School of Life Sciences, Lanzhou University, Lanzhou 730000, PR China; Probiotics and Biological Feed Research Center, Lanzhou University, Lanzhou 730000, PR China
| | - Jie Bai
- State Key Laboratory of Grassland and Agro-ecosystems, School of Life Sciences, Lanzhou University, Lanzhou 730000, PR China; Probiotics and Biological Feed Research Center, Lanzhou University, Lanzhou 730000, PR China
| | - Neha Sheoran
- State Key Laboratory of Grassland and Agro-ecosystems, School of Life Sciences, Lanzhou University, Lanzhou 730000, PR China; Probiotics and Biological Feed Research Center, Lanzhou University, Lanzhou 730000, PR China
| | - Xusheng Guo
- State Key Laboratory of Grassland and Agro-ecosystems, School of Life Sciences, Lanzhou University, Lanzhou 730000, PR China; Probiotics and Biological Feed Research Center, Lanzhou University, Lanzhou 730000, PR China.
| |
Collapse
|
12
|
Macey MC. Genome-resolved metagenomics identifies novel active microbes in biogeochemical cycling within methanol-enriched soil. ENVIRONMENTAL MICROBIOLOGY REPORTS 2024; 16:e13246. [PMID: 38575138 PMCID: PMC10994693 DOI: 10.1111/1758-2229.13246] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Accepted: 03/15/2024] [Indexed: 04/06/2024]
Abstract
Metagenome assembled genomes (MAGs), generated from sequenced 13C-labelled DNA from 13C-methanol enriched soils, were binned using an ensemble approach. This method produced a significantly larger number of higher-quality MAGs compared to direct binning approaches. These MAGs represent both the primary methanol utilizers and the secondary utilizers labelled via cross-feeding and predation on the labelled methylotrophs, including numerous uncultivated taxa. Analysis of these MAGs enabled the identification of multiple metabolic pathways within these active taxa that have climatic relevance relating to nitrogen, sulfur and trace gas metabolism. This includes denitrification, dissimilatory nitrate reduction to ammonium, ammonia oxidation and metabolism of organic sulfur species. The binning of viral sequence data also yielded extensive viral MAGs, identifying active viral replication by both lytic and lysogenic phages within the methanol-enriched soils. These MAGs represent a valuable resource for characterizing biogeochemical cycling within terrestrial environments.
Collapse
Affiliation(s)
- Michael C. Macey
- AstrobiologyOU, Earth, Environment and Ecosystem SciencesThe Open UniversityMilton KeynesUK
| |
Collapse
|
13
|
Li J, Wu YJ, Liu MF, Li N, Dang LH, An GS, Lu XJ, Wang LL, Du QX, Cao J, Sun JH. Multi-omics integration strategy in the post-mortem interval of forensic science. Talanta 2024; 268:125249. [PMID: 37839320 DOI: 10.1016/j.talanta.2023.125249] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Revised: 08/13/2023] [Accepted: 09/25/2023] [Indexed: 10/17/2023]
Abstract
Estimates of post-mortem interval (PMI), which often serve as pivotal evidence in forensic contexts, are fundamentally based on assessments of variability among diverse molecular markers (including proteins and metabolites), their correlations, and their temporal changes in post-mortem organisms. Nevertheless, the present approach to estimating the PMI is not comprehensive and exhibits poor performance. We developed an innovative approach that integrates multi-omics and artificial intelligence, using multimolecular, multimarker, and multidimensional information to accurately describe the intricate biological processes that occur after death, ultimately enabling inference of the PMI. Called the multi-omics stacking model (MOSM), it combines metabolomics, protein microarray electrophoresis, and fourier transform-infrared spectroscopy data. It shows improved prediction accuracy of the PMI, which is urgently needed in the forensic field. It achieved an accuracy of 0.93, generalized area under the receiver operating characteristic curve of 0.98, and minimum mean absolute error of 0.07. The MOSM integration framework not only considers multiple markers but also incorporates machine-learning models with distinct algorithmic principles. The diversity of biological mechanisms and algorithmic models further ensures the generalizability and robustness of PMI estimation.
Collapse
Affiliation(s)
- Jian Li
- School of Forensic Medicine, Shanxi Medical University, No. 98, University Street, Wujinshan Town, Yuci District, Jinzhong City, Shanxi Province, 030604, PR China; Shanxi Key Laboratory of Forensic Medicine, Jinzhong, 030600, Shanxi, China
| | - Yan-Juan Wu
- School of Forensic Medicine, Shanxi Medical University, No. 98, University Street, Wujinshan Town, Yuci District, Jinzhong City, Shanxi Province, 030604, PR China; Shanxi Key Laboratory of Forensic Medicine, Jinzhong, 030600, Shanxi, China
| | - Ming-Feng Liu
- School of Forensic Medicine, Shanxi Medical University, No. 98, University Street, Wujinshan Town, Yuci District, Jinzhong City, Shanxi Province, 030604, PR China; Shanxi Key Laboratory of Forensic Medicine, Jinzhong, 030600, Shanxi, China
| | - Na Li
- School of Forensic Medicine, Shanxi Medical University, No. 98, University Street, Wujinshan Town, Yuci District, Jinzhong City, Shanxi Province, 030604, PR China; Shanxi Key Laboratory of Forensic Medicine, Jinzhong, 030600, Shanxi, China
| | - Li-Hong Dang
- School of Forensic Medicine, Shanxi Medical University, No. 98, University Street, Wujinshan Town, Yuci District, Jinzhong City, Shanxi Province, 030604, PR China; Shanxi Key Laboratory of Forensic Medicine, Jinzhong, 030600, Shanxi, China
| | - Guo-Shuai An
- School of Forensic Medicine, Shanxi Medical University, No. 98, University Street, Wujinshan Town, Yuci District, Jinzhong City, Shanxi Province, 030604, PR China; Shanxi Key Laboratory of Forensic Medicine, Jinzhong, 030600, Shanxi, China
| | - Xiao-Jun Lu
- Criminal Investigation Detachment, Baotou City Public Security Bureau, No. 191, Jianshe Road, Qingshan District, Baotou City, Inner Mongolia Autonomous Region, 014030, PR China
| | - Liang-Liang Wang
- School of Forensic Medicine, Shanxi Medical University, No. 98, University Street, Wujinshan Town, Yuci District, Jinzhong City, Shanxi Province, 030604, PR China; Shanxi Key Laboratory of Forensic Medicine, Jinzhong, 030600, Shanxi, China
| | - Qiu-Xiang Du
- School of Forensic Medicine, Shanxi Medical University, No. 98, University Street, Wujinshan Town, Yuci District, Jinzhong City, Shanxi Province, 030604, PR China; Shanxi Key Laboratory of Forensic Medicine, Jinzhong, 030600, Shanxi, China
| | - Jie Cao
- School of Forensic Medicine, Shanxi Medical University, No. 98, University Street, Wujinshan Town, Yuci District, Jinzhong City, Shanxi Province, 030604, PR China; Shanxi Key Laboratory of Forensic Medicine, Jinzhong, 030600, Shanxi, China.
| | - Jun-Hong Sun
- School of Forensic Medicine, Shanxi Medical University, No. 98, University Street, Wujinshan Town, Yuci District, Jinzhong City, Shanxi Province, 030604, PR China; Shanxi Key Laboratory of Forensic Medicine, Jinzhong, 030600, Shanxi, China.
| |
Collapse
|
14
|
Kim C, Pongpanich M, Porntaveetus T. Unraveling metagenomics through long-read sequencing: a comprehensive review. J Transl Med 2024; 22:111. [PMID: 38282030 PMCID: PMC10823668 DOI: 10.1186/s12967-024-04917-1] [Citation(s) in RCA: 19] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2023] [Accepted: 01/21/2024] [Indexed: 01/30/2024] Open
Abstract
The study of microbial communities has undergone significant advancements, starting from the initial use of 16S rRNA sequencing to the adoption of shotgun metagenomics. However, a new era has emerged with the advent of long-read sequencing (LRS), which offers substantial improvements over its predecessor, short-read sequencing (SRS). LRS produces reads that are several kilobases long, enabling researchers to obtain more complete and contiguous genomic information, characterize structural variations, and study epigenetic modifications. The current leaders in LRS technologies are Pacific Biotechnologies (PacBio) and Oxford Nanopore Technologies (ONT), each offering a distinct set of advantages. This review covers the workflow of long-read metagenomics sequencing, including sample preparation (sample collection, sample extraction, and library preparation), sequencing, processing (quality control, assembly, and binning), and analysis (taxonomic annotation and functional annotation). Each section provides a concise outline of the key concept of the methodology, presenting the original concept as well as how it is challenged or modified in the context of LRS. Additionally, the section introduces a range of tools that are compatible with LRS and can be utilized to execute the LRS process. This review aims to present the workflow of metagenomics, highlight the transformative impact of LRS, and provide researchers with a selection of tools suitable for this task.
Collapse
Affiliation(s)
- Chankyung Kim
- Center of Excellence in Genomics and Precision Dentistry, Department of Physiology, Faculty of Dentistry, Chulalongkorn University, Bangkok, Thailand
- Graduate Program in Bioinformatics and Computational Biology, Faculty of Science, Chulalongkorn University, Bangkok, Thailand
| | - Monnat Pongpanich
- Department of Mathematics and Computer Science, Faculty of Science, Chulalongkorn University, Bangkok, Thailand
- Center of Excellence for Cancer and Inflammation, Chulalongkorn University, Bangkok, Thailand
| | - Thantrira Porntaveetus
- Center of Excellence in Genomics and Precision Dentistry, Department of Physiology, Faculty of Dentistry, Chulalongkorn University, Bangkok, Thailand.
- Graduate Program in Geriatric and Special Patients Care, Faculty of Dentistry, Chulalongkorn University, Bangkok, Thailand.
| |
Collapse
|
15
|
Sanchez FB, Sato Guima SE, Setubal JC. How to Obtain and Compare Metagenome-Assembled Genomes. Methods Mol Biol 2024; 2802:135-163. [PMID: 38819559 DOI: 10.1007/978-1-0716-3838-5_6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2024]
Abstract
Metagenome-assembled genomes, or MAGs, are genomes retrieved from metagenome datasets. In the vast majority of cases, MAGs are genomes from prokaryotic species that have not been isolated or cultivated in the lab. They, therefore, provide us with information on these species that are impossible to obtain otherwise, at least until new cultivation methods are devised. Thanks to improvements and cost reductions of DNA sequencing technologies and growing interest in microbial ecology, the rise in number of MAGs in genome repositories has been exponential. This chapter covers the basics of MAG retrieval and processing and provides a practical step-by-step guide using a real dataset and state-of-the-art tools for MAG analysis and comparison.
Collapse
Affiliation(s)
- Fabio Beltrame Sanchez
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, São Paulo, SP, Brazil
| | - Suzana Eiko Sato Guima
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, São Paulo, SP, Brazil
| | - João Carlos Setubal
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, São Paulo, SP, Brazil.
| |
Collapse
|
16
|
Wiegand S, Sobol M, Schnepp-Pesch LK, Yan G, Iqbal S, Vollmers J, Müller JA, Kaster AK. Taxonomic Re-Classification and Expansion of the Phylum Chloroflexota Based on over 5000 Genomes and Metagenome-Assembled Genomes. Microorganisms 2023; 11:2612. [PMID: 37894270 PMCID: PMC10608941 DOI: 10.3390/microorganisms11102612] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Revised: 10/20/2023] [Accepted: 10/21/2023] [Indexed: 10/29/2023] Open
Abstract
The phylum Chloroflexota (formerly Chloroflexi) encompasses metabolically diverse bacteria that often have high prevalence in terrestrial and aquatic habitats, some even with biotechnological application. However, there is substantial disagreement in public databases which lineage should be considered a member of the phylum and at what taxonomic level. Here, we addressed these issues through extensive phylogenomic analyses. The analyses were based on a collection of >5000 Chloroflexota genomes and metagenome-assembled genomes (MAGs) from public databases, novel environmental sites, as well as newly generated MAGs from publicly available sequence reads via an improved binning approach incorporating covariance information. Based on calculated relative evolutionary divergence, we propose that Candidatus Dormibacterota should be listed as a class (i.e., Ca. Dormibacteria) within Chloroflexota together with the classes Anaerolineae, Chloroflexia, Dehalococcoidia, Ktedonobacteria, Ca. Limnocylindria, Thermomicrobia, and two other classes containing only uncultured members. All other Chloroflexota lineages previously listed at the class rank appear to be rather orders or families in the Anaerolineae and Dehalococcoidia, which contain the vast majority of genomes and exhibited the strongest phylogenetic radiation within the phylum. Furthermore, the study suggests that a common ecophysiological capability of members of the phylum is to successfully cope with low energy fluxes.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Anne-Kristin Kaster
- Institute for Biological Interfaces (IBG 5), Karlsruhe Institute of Technology, 76344 Eggenstein-Leopoldshafen, Germany; (S.W.); (M.S.); (L.K.S.-P.); (G.Y.); (S.I.); (J.V.); (J.A.M.)
| |
Collapse
|
17
|
Hassa J, Tubbesing TJ, Maus I, Heyer R, Benndorf D, Effenberger M, Henke C, Osterholz B, Beckstette M, Pühler A, Sczyrba A, Schlüter A. Uncovering Microbiome Adaptations in a Full-Scale Biogas Plant: Insights from MAG-Centric Metagenomics and Metaproteomics. Microorganisms 2023; 11:2412. [PMID: 37894070 PMCID: PMC10608942 DOI: 10.3390/microorganisms11102412] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Revised: 09/15/2023] [Accepted: 09/18/2023] [Indexed: 10/29/2023] Open
Abstract
The current focus on renewable energy in global policy highlights the importance of methane production from biomass through anaerobic digestion (AD). To improve biomass digestion while ensuring overall process stability, microbiome-based management strategies become more important. In this study, metagenomes and metaproteomes were used for metagenomically assembled genome (MAG)-centric analyses to investigate a full-scale biogas plant consisting of three differentially operated digesters. Microbial communities were analyzed regarding their taxonomic composition, functional potential, as well as functions expressed on the proteome level. Different abundances of genes and enzymes related to the biogas process could be mostly attributed to different process parameters. Individual MAGs exhibiting different abundances in the digesters were studied in detail, and their roles in the hydrolysis, acidogenesis and acetogenesis steps of anaerobic digestion could be assigned. Methanoculleus thermohydrogenotrophicum was an active hydrogenotrophic methanogen in all three digesters, whereas Methanothermobacter wolfeii was more prevalent at higher process temperatures. Further analysis focused on MAGs, which were abundant in all digesters, indicating their potential to ensure biogas process stability. The most prevalent MAG belonged to the class Limnochordia; this MAG was ubiquitous in all three digesters and exhibited activity in numerous pathways related to different steps of AD.
Collapse
Affiliation(s)
- Julia Hassa
- Genome Research of Industrial Microorganisms, Center for Biotechnology (CeBiTec), Bielefeld University, Universitätsstrasse 27, 33615 Bielefeld, Germany; (J.H.)
| | - Tom Jonas Tubbesing
- Computational Metagenomics Group, Center for Biotechnology (CeBiTec), Bielefeld University, Universitätsstraße 27, 33615 Bielefeld, Germany; (T.J.T.)
| | - Irena Maus
- Genome Research of Industrial Microorganisms, Center for Biotechnology (CeBiTec), Bielefeld University, Universitätsstrasse 27, 33615 Bielefeld, Germany; (J.H.)
| | - Robert Heyer
- Multidimensional Omics Data Analyses Group, Leibniz-Institut für Analytische Wissenschaften-ISAS-e.V., Bunsen-Kirchhoff-Straße 11, Dortmund 44139, Germany
- Multidimensional Omics Data Analyses Group, Faculty of Technology, Bielefeld University, Universitätsstraße 25, 33615 Bielefeld, Germany
| | - Dirk Benndorf
- Biosciences and Process Engineering, Anhalt University of Applied Sciences, Bernburger Straße 55, Postfach 1458, 06366 Köthen, Germany
- Bioprocess Engineering, Otto von Guericke University, Universitätsplatz 2, 39106 Magdeburg, Germany
- Bioprocess Engineering, Max Planck Institute for Dynamics of Complex Technical Systems, Sandtorstraße 1, 39106 Magdeburg, Germany
| | - Mathias Effenberger
- Bavarian State Research Center for Agriculture, Institute for Agricultural Engineering and Animal Husbandry, Vöttinger Straße 36, 85354 Freising, Germany
| | - Christian Henke
- Computational Metagenomics Group, Center for Biotechnology (CeBiTec), Bielefeld University, Universitätsstraße 27, 33615 Bielefeld, Germany; (T.J.T.)
| | - Benedikt Osterholz
- Computational Metagenomics Group, Center for Biotechnology (CeBiTec), Bielefeld University, Universitätsstraße 27, 33615 Bielefeld, Germany; (T.J.T.)
| | - Michael Beckstette
- Computational Metagenomics Group, Center for Biotechnology (CeBiTec), Bielefeld University, Universitätsstraße 27, 33615 Bielefeld, Germany; (T.J.T.)
| | - Alfred Pühler
- Genome Research of Industrial Microorganisms, Center for Biotechnology (CeBiTec), Bielefeld University, Universitätsstrasse 27, 33615 Bielefeld, Germany; (J.H.)
| | - Alexander Sczyrba
- Computational Metagenomics Group, Center for Biotechnology (CeBiTec), Bielefeld University, Universitätsstraße 27, 33615 Bielefeld, Germany; (T.J.T.)
| | - Andreas Schlüter
- Genome Research of Industrial Microorganisms, Center for Biotechnology (CeBiTec), Bielefeld University, Universitätsstrasse 27, 33615 Bielefeld, Germany; (J.H.)
| |
Collapse
|
18
|
Jia L, Wu Y, Dong Y, Chen J, Chen WH, Zhao XM. A survey on computational strategies for genome-resolved gut metagenomics. Brief Bioinform 2023; 24:7145904. [PMID: 37114640 DOI: 10.1093/bib/bbad162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Revised: 03/20/2023] [Accepted: 04/04/2023] [Indexed: 04/29/2023] Open
Abstract
Recovering high-quality metagenome-assembled genomes (HQ-MAGs) is critical for exploring microbial compositions and microbe-phenotype associations. However, multiple sequencing platforms and computational tools for this purpose may confuse researchers and thus call for extensive evaluation. Here, we systematically evaluated a total of 40 combinations of popular computational tools and sequencing platforms (i.e. strategies), involving eight assemblers, eight metagenomic binners and four sequencing technologies, including short-, long-read and metaHiC sequencing. We identified the best tools for the individual tasks (e.g. the assembly and binning) and combinations (e.g. generating more HQ-MAGs) depending on the availability of the sequencing data. We found that the combination of the hybrid assemblies and metaHiC-based binning performed best, followed by the hybrid and long-read assemblies. More importantly, both long-read and metaHiC sequencings link more mobile elements and antibiotic resistance genes to bacterial hosts and improve the quality of public human gut reference genomes with 32% (34/105) HQ-MAGs that were either of better quality than those in the Unified Human Gastrointestinal Genome catalog version 2 or novel.
Collapse
Affiliation(s)
- Longhao Jia
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China
| | - Yingjian Wu
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
| | - Yanqi Dong
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China
| | - Jingchao Chen
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
| | - Wei-Hua Chen
- Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular-imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China
- Institution of Medical Artificial Intelligence, Binzhou Medical University, Yantai 264003, China
| | - Xing-Ming Zhao
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China
- Key Laboratory of Computational Neuroscience and Brain-Inspired Intelligence, Ministry of Education, Ministry of Education, Shanghai 200433, China
- MOE Frontiers Center for Brain Science, Fudan University, Shanghai 200433, China
- State Key Laboratory of Medical Neurobiology, Institutes of Brain Science, Fudan University, Shanghai, China
| |
Collapse
|
19
|
Gabrielli M, Dai Z, Delafont V, Timmers PHA, van der Wielen PWJJ, Antonelli M, Pinto AJ. Identifying Eukaryotes and Factors Influencing Their Biogeography in Drinking Water Metagenomes. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2023; 57:3645-3660. [PMID: 36827617 PMCID: PMC9996835 DOI: 10.1021/acs.est.2c09010] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 02/13/2023] [Accepted: 02/13/2023] [Indexed: 06/18/2023]
Abstract
The biogeography of eukaryotes in drinking water systems is poorly understood relative to that of prokaryotes or viruses, limiting the understanding of their role and management. A challenge with studying complex eukaryotic communities is that metagenomic analysis workflows are currently not as mature as those that focus on prokaryotes or viruses. In this study, we benchmarked different strategies to recover eukaryotic sequences and genomes from metagenomic data and applied the best-performing workflow to explore the factors affecting the relative abundance and diversity of eukaryotic communities in drinking water distribution systems (DWDSs). We developed an ensemble approach exploiting k-mer- and reference-based strategies to improve eukaryotic sequence identification and identified MetaBAT2 as the best-performing binning approach for their clustering. Applying this workflow to the DWDS metagenomes showed that eukaryotic sequences typically constituted small proportions (i.e., <1%) of the overall metagenomic data with higher relative abundances in surface water-fed or chlorinated systems with high residuals. The α and β diversities of eukaryotes were correlated with those of prokaryotic and viral communities, highlighting the common role of environmental/management factors. Finally, a co-occurrence analysis highlighted clusters of eukaryotes whose members' presence and abundance in DWDSs were affected by disinfection strategies, climate conditions, and source water types.
Collapse
Affiliation(s)
- Marco Gabrielli
- Dipartimento
di Ingegneria Civile e Ambientale—Sezione Ambientale, Politecnico di Milano, Milan 20133, Italy
| | - Zihan Dai
- Research
Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, China
| | - Vincent Delafont
- Laboratoire
Ecologie et Biologie des Interactions (EBI), Equipe Microorganismes,
Hôtes, Environnements, Université
de Poitiers, Poitiers 86073, France
| | - Peer H. A. Timmers
- KWR
Watercycle Research Institute, 3433 PE Nieuwegein, The Netherlands
- Department
of Microbiology, Radboud University, Heyendaalseweg 135, 6525 AJ Nijmegen, The Netherlands
| | - Paul W. J. J. van der Wielen
- KWR
Watercycle Research Institute, 3433 PE Nieuwegein, The Netherlands
- Laboratory
of Microbiology, Wageningen University, 6700 HB Wageningen, The Netherlands
| | - Manuela Antonelli
- Dipartimento
di Ingegneria Civile e Ambientale—Sezione Ambientale, Politecnico di Milano, Milan 20133, Italy
| | - Ameet J. Pinto
- School
of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| |
Collapse
|
20
|
Baltoumas FA, Karatzas E, Paez-Espino D, Venetsianou NK, Aplakidou E, Oulas A, Finn RD, Ovchinnikov S, Pafilis E, Kyrpides NC, Pavlopoulos GA. Exploring microbial functional biodiversity at the protein family level-From metagenomic sequence reads to annotated protein clusters. FRONTIERS IN BIOINFORMATICS 2023; 3:1157956. [PMID: 36959975 PMCID: PMC10029925 DOI: 10.3389/fbinf.2023.1157956] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Accepted: 02/21/2023] [Indexed: 03/06/2023] Open
Abstract
Metagenomics has enabled accessing the genetic repertoire of natural microbial communities. Metagenome shotgun sequencing has become the method of choice for studying and classifying microorganisms from various environments. To this end, several methods have been developed to process and analyze the sequence data from raw reads to end-products such as predicted protein sequences or families. In this article, we provide a thorough review to simplify such processes and discuss the alternative methodologies that can be followed in order to explore biodiversity at the protein family level. We provide details for analysis tools and we comment on their scalability as well as their advantages and disadvantages. Finally, we report the available data repositories and recommend various approaches for protein family annotation related to phylogenetic distribution, structure prediction and metadata enrichment.
Collapse
Affiliation(s)
- Fotis A. Baltoumas
- Institute for Fundamental Biomedical Research, BSRC “Alexander Fleming”, Vari, Greece
| | - Evangelos Karatzas
- Institute for Fundamental Biomedical Research, BSRC “Alexander Fleming”, Vari, Greece
| | - David Paez-Espino
- Lawrence Berkeley National Laboratory, DOE Joint Genome Institute, Berkeley, CA, United States
| | - Nefeli K. Venetsianou
- Institute for Fundamental Biomedical Research, BSRC “Alexander Fleming”, Vari, Greece
| | - Eleni Aplakidou
- Institute for Fundamental Biomedical Research, BSRC “Alexander Fleming”, Vari, Greece
| | - Anastasis Oulas
- The Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus
| | - Robert D. Finn
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridge, United Kingdom
| | - Sergey Ovchinnikov
- John Harvard Distinguished Science Fellowship Program, Harvard University, Cambridge, MA, United States
| | - Evangelos Pafilis
- Institute of Marine Biology, Biotechnology and Aquaculture (IMBBC), Hellenic Centre for Marine Research (HCMR), Heraklion, Greece
| | - Nikos C. Kyrpides
- Lawrence Berkeley National Laboratory, DOE Joint Genome Institute, Berkeley, CA, United States
| | - Georgios A. Pavlopoulos
- Institute for Fundamental Biomedical Research, BSRC “Alexander Fleming”, Vari, Greece
- Center of New Biotechnologies and Precision Medicine, Department of Medicine, School of Health Sciences, National and Kapodistrian University of Athens, Athens, Greece
- Hellenic Army Academy, Vari, Greece
| |
Collapse
|
21
|
Arumugam K, Bessarab I, Haryono MAS, Williams RBH. Recovery and Analysis of Long-Read Metagenome-Assembled Genomes. Methods Mol Biol 2023; 2649:235-259. [PMID: 37258866 DOI: 10.1007/978-1-0716-3072-3_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
The development of long-read nucleic acid sequencing is beginning to make very substantive impact on the conduct of metagenome analysis, particularly in relation to the problem of recovering the genomes of member species of complex microbial communities. Here we outline bioinformatics workflows for the recovery and characterization of complete genomes from long-read metagenome data and some complementary procedures for comparison of cognate draft genomes and gene quality obtained from short-read sequencing and long-read sequencing.
Collapse
Affiliation(s)
- Krithika Arumugam
- Singapore Centre for Environmental Life Sciences Engineering, Nanyang Technological University, Singapore, Singapore
| | - Irina Bessarab
- Singapore Centre for Environmental Life Sciences Engineering, National University of Singapore, Singapore, Singapore
| | - Mindia A S Haryono
- Singapore Centre for Environmental Life Sciences Engineering, National University of Singapore, Singapore, Singapore
| | - Rohan B H Williams
- Singapore Centre for Environmental Life Sciences Engineering, National University of Singapore, Singapore, Singapore.
| |
Collapse
|
22
|
Metagenome-assembled genome extraction and analysis from microbiomes using KBase. Nat Protoc 2023; 18:208-238. [PMID: 36376589 DOI: 10.1038/s41596-022-00747-x] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Accepted: 06/28/2022] [Indexed: 11/16/2022]
Abstract
Uncultivated Bacteria and Archaea account for the vast majority of species on Earth, but obtaining their genomes directly from the environment, using shotgun sequencing, has only become possible recently. To realize the hope of capturing Earth's microbial genetic complement and to facilitate the investigation of the functional roles of specific lineages in a given ecosystem, technologies that accelerate the recovery of high-quality genomes are necessary. We present a series of analysis steps and data products for the extraction of high-quality metagenome-assembled genomes (MAGs) from microbiomes using the U.S. Department of Energy Systems Biology Knowledgebase (KBase) platform ( http://www.kbase.us/ ). Overall, these steps take about a day to obtain extracted genomes when starting from smaller environmental shotgun read libraries, or up to about a week from larger libraries. In KBase, the process is end-to-end, allowing a user to go from the initial sequencing reads all the way through to MAGs, which can then be analyzed with other KBase capabilities such as phylogenetic placement, functional assignment, metabolic modeling, pangenome functional profiling, RNA-Seq and others. While portions of such capabilities are available individually from other resources, the combination of the intuitive usability, data interoperability and integration of tools in a freely available computational resource makes KBase a powerful platform for obtaining MAGs from microbiomes. While this workflow offers tools for each of the key steps in the genome extraction process, it also provides a scaffold that can be easily extended with additional MAG recovery and analysis tools, via the KBase software development kit (SDK).
Collapse
|
23
|
Danneels B, Carlier A. Whole-Genome Sequencing of Bacterial Endophytes From Fresh and Preserved Plant Specimens. Methods Mol Biol 2022; 2605:133-155. [PMID: 36520392 DOI: 10.1007/978-1-0716-2871-3_7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Many plants harbor symbiotic bacteria in their leaves, sometimes within structures visible with the naked eye. These bacteria play critical roles for host development and defense, but are often not amenable to culture. Gaining insight into the functions of these obligate endophytic bacteria hinges on culture-independent omics approaches, which have seen tremendous development in recent years. We describe in this chapter a set of protocols for the extraction and bioinformatic analysis of bacterial genomic DNA from leaf samples of various origins, including fresh, silica-preserved, or herbarium specimens.
Collapse
Affiliation(s)
- Bram Danneels
- LIPME, Université de Toulouse, INRAE, CNRS, Castanet-Tolosan, France.,CBU, Department of Informatics, University of Bergen, Bergen, Norway
| | - Aurélien Carlier
- LIPME, Université de Toulouse, INRAE, CNRS, Castanet-Tolosan, France. .,Laboratory of Microbiology, Ghent University, Ghent, Belgium.
| |
Collapse
|
24
|
Ide K, Nishikawa Y, Maruyama T, Tsukada Y, Kogawa M, Takeda H, Ito H, Wagatsuma R, Miyaoka R, Nakano Y, Kinjo K, Ito M, Hosokawa M, Yura K, Suda S, Takeyama H. Targeted single-cell genomics reveals novel host adaptation strategies of the symbiotic bacteria Endozoicomonas in Acropora tenuis coral. MICROBIOME 2022; 10:220. [PMID: 36503599 PMCID: PMC9743535 DOI: 10.1186/s40168-022-01395-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Accepted: 10/13/2022] [Indexed: 06/17/2023]
Abstract
BACKGROUND Endozoicomonas bacteria symbiosis with various marine organisms is hypothesized as a potential indicator of health in corals. Although many amplicon analyses using 16S rRNA gene have suggested the diversity of Endozoicomonas species, genome analysis has been limited due to contamination of host-derived sequences and difficulties in culture and metagenomic analysis. Therefore, the evolutionary and functional potential of individual Endozoicomonas species symbiotic with the same coral species remains unresolved. RESULTS In this study, we applied a novel single-cell genomics technique using droplet microfluidics to obtain single-cell amplified genomes (SAGs) for uncultured coral-associated Endozoicomonas spp. We obtained seven novel Endozoicomonas genomes and quantitative bacterial composition from Acropora tenuis corals at four sites in Japan. Our quantitative 16S rRNA gene and comparative genomic analysis revealed that these Endozoicomonas spp. belong to different lineages (Clade A and Clade B), with widely varying abundance among individual corals. Furthermore, each Endozoicomonas species possessed various eukaryotic-like genes in clade-specific genes. It was suggested that these eukaryotic-like genes might have a potential ability of different functions in each clade, such as infection of the host coral or suppression of host immune pathways. These Endozoicomonas species may have adopted different host adaptation strategies despite living symbiotically on the same coral. CONCLUSIONS This study suggests that coral-associated Endozoicomonas spp. on the same species of coral have different evolutional strategies and functional potentials in each species and emphasizes the need to analyze the genome of each uncultured strain in future coral-Endozoicomonas relationships studies. Video Abstract.
Collapse
Affiliation(s)
- Keigo Ide
- Department of Life Science and Medical Bioscience, Waseda University, Tokyo, Japan
- Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology, Tokyo, Japan
- Research Organization for Nano and Life Innovation, Waseda University, Tokyo, Japan
| | - Yohei Nishikawa
- Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology, Tokyo, Japan
- Research Organization for Nano and Life Innovation, Waseda University, Tokyo, Japan
| | - Toru Maruyama
- Department of Life Science and Medical Bioscience, Waseda University, Tokyo, Japan
| | - Yuko Tsukada
- Department of Life Science and Medical Bioscience, Waseda University, Tokyo, Japan
- Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology, Tokyo, Japan
| | - Masato Kogawa
- Department of Life Science and Medical Bioscience, Waseda University, Tokyo, Japan
- Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology, Tokyo, Japan
- Research Organization for Nano and Life Innovation, Waseda University, Tokyo, Japan
| | - Hiroki Takeda
- Department of Life Science and Medical Bioscience, Waseda University, Tokyo, Japan
| | - Haruka Ito
- Department of Life Science and Medical Bioscience, Waseda University, Tokyo, Japan
| | - Ryota Wagatsuma
- Department of Life Science and Medical Bioscience, Waseda University, Tokyo, Japan
- Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology, Tokyo, Japan
| | - Rimi Miyaoka
- Department of Life Science and Medical Bioscience, Waseda University, Tokyo, Japan
| | - Yoshikatsu Nakano
- Tropical Biosphere Research Center, University of the Ryukyus, Okinawa, Japan
- Marine Science Section, Research Support Division, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
| | | | - Michihiro Ito
- Tropical Biosphere Research Center, University of the Ryukyus, Okinawa, Japan
| | - Masahito Hosokawa
- Department of Life Science and Medical Bioscience, Waseda University, Tokyo, Japan
- Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology, Tokyo, Japan
- Research Organization for Nano and Life Innovation, Waseda University, Tokyo, Japan
- Institute for Advanced Research of Biosystem Dynamics, Waseda Research Institute for Science and Engineering, Tokyo, Japan
| | - Kei Yura
- Department of Life Science and Medical Bioscience, Waseda University, Tokyo, Japan
- Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology, Tokyo, Japan
- Research Organization for Nano and Life Innovation, Waseda University, Tokyo, Japan
- Graduate School of Humanities and Sciences, Ochanomizu University, Tokyo, Japan
| | - Shoichiro Suda
- Faculty of Science, University of the Ryukyus, Okinawa, Japan
| | - Haruko Takeyama
- Department of Life Science and Medical Bioscience, Waseda University, Tokyo, Japan.
- Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology, Tokyo, Japan.
- Research Organization for Nano and Life Innovation, Waseda University, Tokyo, Japan.
- Institute for Advanced Research of Biosystem Dynamics, Waseda Research Institute for Science and Engineering, Tokyo, Japan.
| |
Collapse
|
25
|
Community-integrated multi-omics facilitates screening and isolation of the organohalide dehalogenation microorganism. Innovation (N Y) 2022; 4:100355. [PMID: 36506298 PMCID: PMC9730224 DOI: 10.1016/j.xinn.2022.100355] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2022] [Accepted: 11/17/2022] [Indexed: 11/23/2022] Open
Abstract
A variety of anthropogenic organohalide contaminants generated from industry are released into the environment and thus cause serious pollution that endangers human health. In the present study, we investigated the microbial community composition of industrial saponification wastewater using 16S rRNA sequencing, providing genomic insights of potential organohalide dehalogenation bacteria (OHDBs) by metagenomic sequencing. We also explored yet-to-culture OHDBs involved in the microbial community. Microbial diversity analysis reveals that Proteobacteria and Patescibacteria phyla dominate microbiome abundance of the wastewater. In addition, a total of six bacterial groups (Rhizobiales, Rhodobacteraceae, Rhodospirillales, Flavob a cteriales, Micrococcales, and Saccharimonadales) were found as biomarkers in the key organohalide removal module. Ninety-four metagenome-assembled genomes were reconstructed from the microbial community, and 105 hydrolytic dehalogenase genes within 42 metagenome-assembled genomes were identified, suggesting that the potential for organohalide hydrolytic dehalogenation is present in the microbial community. Subsequently, we characterized the organohalide dehalogenation of an isolated OHDB, Microbacterium sp. J1-1, which shows the dehalogenation activities of chloropropanol, dichloropropanol, and epichlorohydrin. This study provides a community-integrated multi-omics approach to gain functional OHDBs for industrial organohalide dehalogenation.
Collapse
|
26
|
Zhang Y, Jiang F, Yang B, Wang S, Wang H, Wang A, Xu D, Fan W. Improved microbial genomes and gene catalog of the chicken gut from metagenomic sequencing of high-fidelity long reads. Gigascience 2022; 11:giac116. [PMID: 36399059 PMCID: PMC9673493 DOI: 10.1093/gigascience/giac116] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2022] [Revised: 09/27/2022] [Accepted: 10/30/2022] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND Due to the importance of chicken production and the remarkable influence of the gut microbiota on host health and growth, tens of thousands of metagenome-assembled genomes (MAGs) have been constructed for the chicken gut microbiome. However, due to the limitations of short-read sequencing and assembly technologies, most of these MAGs are far from complete, are of lower quality, and include contaminant reads. RESULTS We generated 332 Gb of high-fidelity (HiFi) long reads from the 5 chicken intestinal compartments and assembled 461 and 337 microbial genomes, of which 53% and 55% are circular, at the species and strain levels, respectively. For the assembled microbial genomes, approximately 95% were regarded as complete according to the "RNA complete" criteria, which requires at least 1 full-length ribosomal RNA (rRNA) operon encoding all 3 types of rRNA (16S, 23S, and 5S) and at least 18 copies of full-length transfer RNA genes. In comparison with the short-read-derived chicken MAGs, 384 (83% of 461) and 89 (26% of 337) strain-level and species-level genomes in this study are novel, with no matches to previously reported sequences. At the gene level, one-third of the 2.5 million genes in the HiFi-derived gene catalog are novel and cannot be matched to the short-read-derived gene catalog. Moreover, the HiFi-derived genomes have much higher continuity and completeness, as well as lower contamination; the HiFi-derived gene catalog has a much higher ratio of complete gene structures. The dominant phylum in our HiFi-assembled genomes was Firmicutes (82.5%), and the foregut was highly enriched in 5 genera: Ligilactobacillus, Limosilactobacillus, Lactobacillus, Weissella, and Enterococcus, all of which belong to the order Lactobacillales. Using GTDB-Tk, all 337 species-level genomes were successfully classified at the order level; however, 2, 35, and 189 genomes could not be classified into any known family, genus, and species, respectively. Among these incompletely classified genomes, 9 and 49 may belong to novel genera and species, respectively, because their 16S rRNA genes have identities lower than 95% and 97% to any known 16S rRNA genes. CONCLUSIONS HiFi sequencing not only produced metagenome assemblies and gene structures with markedly improved quality but also recovered a substantial portion of novel genomes and genes that were missed in previous short-read-based metagenome studies. The novel genomes and species obtained in this study will facilitate gut microbiome and host-microbiota interaction studies, thereby contributing to the sustainable development of poultry resources.
Collapse
Affiliation(s)
- Yan Zhang
- Guangdong Laboratory for Lingnan Modern Agriculture (Shenzhen Branch), Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, 518120, China
| | - Fan Jiang
- Guangdong Laboratory for Lingnan Modern Agriculture (Shenzhen Branch), Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, 518120, China
| | - Boyuan Yang
- Guangdong Laboratory for Lingnan Modern Agriculture (Shenzhen Branch), Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, 518120, China
| | - Sen Wang
- Guangdong Laboratory for Lingnan Modern Agriculture (Shenzhen Branch), Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, 518120, China
| | - Hengchao Wang
- Guangdong Laboratory for Lingnan Modern Agriculture (Shenzhen Branch), Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, 518120, China
| | - Anqi Wang
- Guangdong Laboratory for Lingnan Modern Agriculture (Shenzhen Branch), Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, 518120, China
| | - Dong Xu
- Guangdong Laboratory for Lingnan Modern Agriculture (Shenzhen Branch), Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, 518120, China
| | - Wei Fan
- Guangdong Laboratory for Lingnan Modern Agriculture (Shenzhen Branch), Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong, 518120, China
| |
Collapse
|
27
|
Sun X, Chen Z, Kong T, Chen Z, Dong Y, Kolton M, Cao Z, Zhang X, Zhang H, Liu G, Gao P, Yang N, Lan L, Xu Y, Sun W. Mycobacteriaceae Mineralizes Micropolyethylene in Riverine Ecosystems. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2022; 56:15705-15717. [PMID: 36288260 DOI: 10.1021/acs.est.2c05346] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Microplastic (MP) contamination is a serious global environmental problem. Plastic contamination has attracted extensive attention during the past decades. While physiochemical weathering may influence the properties of MPs, biodegradation by microorganisms could ultimately mineralize plastics into CO2. Compared to the well-studied marine ecosystems, the MP biodegradation process in riverine ecosystems, however, is less understood. The current study focuses on the MP biodegradation in one of the world's most plastic contaminated rivers, Pearl River, using micropolyethylene (mPE) as a model substrate. Mineralization of 13C-labeled mPE into 13CO2 provided direct evidence of mPE biodegradation by indigenous microorganisms. Several Actinobacteriota genera were identified as putative mPE degraders. Furthermore, two Mycobacteriaceae isolates related to the putative mPE degraders, Mycobacterium sp. mPE3 and Nocardia sp. mPE12, were retrieved, and their ability to mineralize 13C-mPE into 13CO2 was confirmed. Pangenomic analysis reveals that the genes related to the proposed mPE biodegradation pathway are shared by members of Mycobacteriaceae. While both Mycobacterium and Nocardia are known for their pathogenicity, these populations on the plastisphere in this study were likely nonpathogenic as they lacked virulence factors. The current study provided direct evidence for MP mineralization by indigenous biodegraders and predicted their biodegradation pathway, which may be harnessed to improve bioremediation of MPs in urban rivers.
Collapse
Affiliation(s)
- Xiaoxu Sun
- National-Regional Joint Engineering Research Center for Soil Pollution Control and Remediation in South China, Guangdong Key Laboratory of Integrated Agro-environmental Pollution Control and Management, Institute of Eco-environmental and Soil Sciences, Guangdong Academy of Sciences, Guangzhou 510650, P. R. China
- Guangdong-Hong Kong-Macao Joint Laboratory for Environmental Pollution and Control, Guangzhou Institute of Geochemistry, Chinese Academy of Sciences, Guangzhou 510640, China
| | - Zhenyu Chen
- National-Regional Joint Engineering Research Center for Soil Pollution Control and Remediation in South China, Guangdong Key Laboratory of Integrated Agro-environmental Pollution Control and Management, Institute of Eco-environmental and Soil Sciences, Guangdong Academy of Sciences, Guangzhou 510650, P. R. China
- Guangdong-Hong Kong-Macao Joint Laboratory for Environmental Pollution and Control, Guangzhou Institute of Geochemistry, Chinese Academy of Sciences, Guangzhou 510640, China
- School of Environment, Key Laboratory of Yellow River and Huai River Water Environment and Pollution Control, Ministry of Education, Henan Normal University, Xinxiang 453007, China
| | - Tianle Kong
- National-Regional Joint Engineering Research Center for Soil Pollution Control and Remediation in South China, Guangdong Key Laboratory of Integrated Agro-environmental Pollution Control and Management, Institute of Eco-environmental and Soil Sciences, Guangdong Academy of Sciences, Guangzhou 510650, P. R. China
- Guangdong-Hong Kong-Macao Joint Laboratory for Environmental Pollution and Control, Guangzhou Institute of Geochemistry, Chinese Academy of Sciences, Guangzhou 510640, China
- College of Environmental Science and Engineering, Donghua University, Shanghai 201620, China
| | - Zheng Chen
- Department of Health and Environmental Sciences, Xi'an Jiaotong-Liverpool University, Suzhou 215123, China
| | - Yiran Dong
- School of Environmental Studies, China University of Geosciences (Wuhan), Wuhan 430074, China
| | - Max Kolton
- National-Regional Joint Engineering Research Center for Soil Pollution Control and Remediation in South China, Guangdong Key Laboratory of Integrated Agro-environmental Pollution Control and Management, Institute of Eco-environmental and Soil Sciences, Guangdong Academy of Sciences, Guangzhou 510650, P. R. China
- Guangdong-Hong Kong-Macao Joint Laboratory for Environmental Pollution and Control, Guangzhou Institute of Geochemistry, Chinese Academy of Sciences, Guangzhou 510640, China
- French Associates Institute for Agriculture and Biotechnology of Drylands, Ben-Gurion University of the Negev, Beer Sheva 849900, Israel
| | - Zhiguo Cao
- School of Environment, Key Laboratory of Yellow River and Huai River Water Environment and Pollution Control, Ministry of Education, Henan Normal University, Xinxiang 453007, China
| | - Xin Zhang
- School of Environment, Key Laboratory of Yellow River and Huai River Water Environment and Pollution Control, Ministry of Education, Henan Normal University, Xinxiang 453007, China
| | - Haihan Zhang
- School of Environmental and Municipal Engineering, Xi'an University of Architecture and Technology, Xi'an 710055, China
| | - Guoqiang Liu
- Guangdong Key Laboratory of Environmental Pollution and Health, School of Environment, Jinan University, Guangzhou 511443, China
| | - Pin Gao
- National-Regional Joint Engineering Research Center for Soil Pollution Control and Remediation in South China, Guangdong Key Laboratory of Integrated Agro-environmental Pollution Control and Management, Institute of Eco-environmental and Soil Sciences, Guangdong Academy of Sciences, Guangzhou 510650, P. R. China
- Guangdong-Hong Kong-Macao Joint Laboratory for Environmental Pollution and Control, Guangzhou Institute of Geochemistry, Chinese Academy of Sciences, Guangzhou 510640, China
- College of Environmental Science and Engineering, Donghua University, Shanghai 201620, China
| | - Nie Yang
- National-Regional Joint Engineering Research Center for Soil Pollution Control and Remediation in South China, Guangdong Key Laboratory of Integrated Agro-environmental Pollution Control and Management, Institute of Eco-environmental and Soil Sciences, Guangdong Academy of Sciences, Guangzhou 510650, P. R. China
- Guangdong-Hong Kong-Macao Joint Laboratory for Environmental Pollution and Control, Guangzhou Institute of Geochemistry, Chinese Academy of Sciences, Guangzhou 510640, China
- College of Environmental Science and Engineering, Donghua University, Shanghai 201620, China
| | - Ling Lan
- National-Regional Joint Engineering Research Center for Soil Pollution Control and Remediation in South China, Guangdong Key Laboratory of Integrated Agro-environmental Pollution Control and Management, Institute of Eco-environmental and Soil Sciences, Guangdong Academy of Sciences, Guangzhou 510650, P. R. China
- Guangdong-Hong Kong-Macao Joint Laboratory for Environmental Pollution and Control, Guangzhou Institute of Geochemistry, Chinese Academy of Sciences, Guangzhou 510640, China
| | - Yating Xu
- National-Regional Joint Engineering Research Center for Soil Pollution Control and Remediation in South China, Guangdong Key Laboratory of Integrated Agro-environmental Pollution Control and Management, Institute of Eco-environmental and Soil Sciences, Guangdong Academy of Sciences, Guangzhou 510650, P. R. China
- Guangdong-Hong Kong-Macao Joint Laboratory for Environmental Pollution and Control, Guangzhou Institute of Geochemistry, Chinese Academy of Sciences, Guangzhou 510640, China
| | - Weimin Sun
- National-Regional Joint Engineering Research Center for Soil Pollution Control and Remediation in South China, Guangdong Key Laboratory of Integrated Agro-environmental Pollution Control and Management, Institute of Eco-environmental and Soil Sciences, Guangdong Academy of Sciences, Guangzhou 510650, P. R. China
- Guangdong-Hong Kong-Macao Joint Laboratory for Environmental Pollution and Control, Guangzhou Institute of Geochemistry, Chinese Academy of Sciences, Guangzhou 510640, China
| |
Collapse
|
28
|
Mallawaarachchi V, Lin Y. Accurate Binning of Metagenomic Contigs Using Composition, Coverage, and Assembly Graphs. J Comput Biol 2022; 29:1357-1376. [DOI: 10.1089/cmb.2022.0262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Affiliation(s)
- Vijini Mallawaarachchi
- School of Computing, College of Engineering and Computer Science, Australian National University, Canberra, Australia
| | - Yu Lin
- School of Computing, College of Engineering and Computer Science, Australian National University, Canberra, Australia
| |
Collapse
|
29
|
Hickl O, Queirós P, Wilmes P, May P, Heintz-Buschart A. binny: an automated binning algorithm to recover high-quality genomes from complex metagenomic datasets. Brief Bioinform 2022; 23:6760137. [PMID: 36239393 PMCID: PMC9677464 DOI: 10.1093/bib/bbac431] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Revised: 09/03/2022] [Accepted: 09/06/2022] [Indexed: 12/14/2022] Open
Abstract
The reconstruction of genomes is a critical step in genome-resolved metagenomics and for multi-omic data integration from microbial communities. Here, we present binny, a binning tool that produces high-quality metagenome-assembled genomes (MAG) from both contiguous and highly fragmented genomes. Based on established metrics, binny outperforms or is highly competitive with commonly used and state-of-the-art binning methods and finds unique genomes that could not be detected by other methods. binny uses k-mer-composition and coverage by metagenomic reads for iterative, nonlinear dimension reduction of genomic signatures as well as subsequent automated contig clustering with cluster assessment using lineage-specific marker gene sets. When compared with seven widely used binning algorithms, binny provides substantial amounts of uniquely identified MAGs and almost always recovers the most near-complete ($\gt 95\%$ pure, $\gt 90\%$ complete) and high-quality ($\gt 90\%$ pure, $\gt 70\%$ complete) genomes from simulated datasets from the Critical Assessment of Metagenome Interpretation initiative, as well as substantially more high-quality draft genomes, as defined by the Minimum Information about a Metagenome-Assembled Genome standard, from a real-world benchmark comprised of metagenomes from various environments than any other tested method.
Collapse
Affiliation(s)
| | | | | | - Patrick May
- Corresponding authors: Patrick May, Bioinformatics Core, Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 1 Boulevard du Jazz, L-4370, Esch-sur-Alzette, Luxembourg. Tel: +352 46 6644 6263; E-mail: ; Anna Heintz-Buschart, Biosystems Data Analysis, Swammerdam Institute for Life Sciences, Faculty of Science, University of Amsterdam, Science Park 904, 1098 XH, Amsterdam, The Netherlands. Tel: +31 020 525 6547; E-mail:
| | - Anna Heintz-Buschart
- Corresponding authors: Patrick May, Bioinformatics Core, Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 1 Boulevard du Jazz, L-4370, Esch-sur-Alzette, Luxembourg. Tel: +352 46 6644 6263; E-mail: ; Anna Heintz-Buschart, Biosystems Data Analysis, Swammerdam Institute for Life Sciences, Faculty of Science, University of Amsterdam, Science Park 904, 1098 XH, Amsterdam, The Netherlands. Tel: +31 020 525 6547; E-mail:
| |
Collapse
|
30
|
Lamurias A, Sereika M, Albertsen M, Hose K, Nielsen TD. Metagenomic binning with assembly graph embeddings. Bioinformatics 2022; 38:4481-4487. [PMID: 35972375 PMCID: PMC9525014 DOI: 10.1093/bioinformatics/btac557] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Revised: 08/02/2022] [Accepted: 08/12/2022] [Indexed: 12/24/2022] Open
Abstract
MOTIVATION Despite recent advancements in sequencing technologies and assembly methods, obtaining high-quality microbial genomes from metagenomic samples is still not a trivial task. Current metagenomic binners do not take full advantage of assembly graphs and are not optimized for long-read assemblies. Deep graph learning algorithms have been proposed in other fields to deal with complex graph data structures. The graph structure generated during the assembly process could be integrated with contig features to obtain better bins with deep learning. RESULTS We propose GraphMB, which uses graph neural networks to incorporate the assembly graph into the binning process. We test GraphMB on long-read datasets of different complexities, and compare the performance with other binners in terms of the number of High Quality (HQ) genome bins obtained. With our approach, we were able to obtain unique bins on all real datasets, and obtain more bins on most datasets. In particular, we obtained on average 17.5% more HQ bins when compared with state-of-the-art binners and 13.7% when aggregating the results of our binner with the others. These results indicate that a deep learning model can integrate contig-specific and graph-structure information to improve metagenomic binning. AVAILABILITY AND IMPLEMENTATION GraphMB is available from https://github.com/MicrobialDarkMatter/GraphMB. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Mantas Sereika
- Center for Microbial Communities, Department of Chemistry and Bioscience, Aalborg University, 9000 Aalborg, Denmark
| | | | | | | |
Collapse
|
31
|
Zhao XL, Qi Z, Huang H, Tu J, Song XJ, Qi KZ, Shao Y. Coexistence of antibiotic resistance genes, fecal bacteria, and potential pathogens in anthropogenically impacted water. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2022; 29:46977-46990. [PMID: 35175529 DOI: 10.1007/s11356-022-19175-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/14/2021] [Accepted: 02/07/2022] [Indexed: 06/14/2023]
Abstract
Microbial indicators are often used to monitor microbial safety of aquatic environments. However, information regarding the correlation between microbial indicators and ecotoxicological factors such as potential pathogens and antibiotic resistance genes (ARGs) in anthropogenically impacted waters remains highly limited. Here, we investigated the bacterial community composition, potential pathogens, ARGs diversity, ARG hosts, and horizontal gene transfer (HGT) potential in urban river and wastewater samples from Chaohu Lake Basin using 16S rRNA and metagenomic sequencing. The composition of the microbial community and potential pathogens differed significantly in wastewater and river water samples, and the total relative abundance of fecal indicator bacteria was positively correlated with the total relative abundance of potential pathogens (p < 0.001 and Pearson's r = 0.758). Network analysis indicated that partial ARG subtypes such as dfrE, sul2, and PmrE were significantly correlated with indicator bacteria (p < 0.05 and Pearson's r > 0.6). Notably, Klebsiella was the indicator bacteria significantly correlated with 4 potential pathogens and 14 ARG subtypes. ARGs coexisting with mobile gene elements were mainly found in Thauera, Pseudomonas, Escherichia, and Acinetobacter. Next-generation sequencing (NGS) can be used to conduct preliminary surveys of environmental samples to access potential health risks, thereby facilitating water resources management.
Collapse
Affiliation(s)
- Xiang-Long Zhao
- Anhui Province Key Laboratory of Veterinary Pathobiology and Disease Control, College of Animal Science and Technology, Anhui Agricultural University, 130 West Changjiang Road, Hefei, 230036, People's Republic of China
| | - Zhao Qi
- School of Information and Computer, Anhui Agricultural University, Hefei, 230036, People's Republic of China
| | - Hao Huang
- Anhui Province Key Laboratory of Veterinary Pathobiology and Disease Control, College of Animal Science and Technology, Anhui Agricultural University, 130 West Changjiang Road, Hefei, 230036, People's Republic of China
| | - Jian Tu
- Anhui Province Key Laboratory of Veterinary Pathobiology and Disease Control, College of Animal Science and Technology, Anhui Agricultural University, 130 West Changjiang Road, Hefei, 230036, People's Republic of China
| | - Xiang-Jun Song
- Anhui Province Key Laboratory of Veterinary Pathobiology and Disease Control, College of Animal Science and Technology, Anhui Agricultural University, 130 West Changjiang Road, Hefei, 230036, People's Republic of China
| | - Ke-Zong Qi
- Anhui Province Key Laboratory of Veterinary Pathobiology and Disease Control, College of Animal Science and Technology, Anhui Agricultural University, 130 West Changjiang Road, Hefei, 230036, People's Republic of China.
| | - Ying Shao
- Anhui Province Key Laboratory of Veterinary Pathobiology and Disease Control, College of Animal Science and Technology, Anhui Agricultural University, 130 West Changjiang Road, Hefei, 230036, People's Republic of China.
| |
Collapse
|
32
|
Nishimura L, Fujito N, Sugimoto R, Inoue I. Detection of Ancient Viruses and Long-Term Viral Evolution. Viruses 2022; 14:v14061336. [PMID: 35746807 PMCID: PMC9230872 DOI: 10.3390/v14061336] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2022] [Revised: 06/15/2022] [Accepted: 06/16/2022] [Indexed: 12/22/2022] Open
Abstract
The COVID-19 outbreak has reminded us of the importance of viral evolutionary studies as regards comprehending complex viral evolution and preventing future pandemics. A unique approach to understanding viral evolution is the use of ancient viral genomes. Ancient viruses are detectable in various archaeological remains, including ancient people's skeletons and mummified tissues. Those specimens have preserved ancient viral DNA and RNA, which have been vigorously analyzed in the last few decades thanks to the development of sequencing technologies. Reconstructed ancient pathogenic viral genomes have been utilized to estimate the past pandemics of pathogenic viruses within the ancient human population and long-term evolutionary events. Recent studies revealed the existence of non-pathogenic viral genomes in ancient people's bodies. These ancient non-pathogenic viruses might be informative for inferring their relationships with ancient people's diets and lifestyles. Here, we reviewed the past and ongoing studies on ancient pathogenic and non-pathogenic viruses and the usage of ancient viral genomes to understand their long-term viral evolution.
Collapse
Affiliation(s)
- Luca Nishimura
- Human Genetics Laboratory, National Institute of Genetics, Mishima 411-8540, Japan; (L.N.); (N.F.); (R.S.)
- Department of Genetics, School of Life Science, The Graduate University for Advanced Studies (SOKENDAI), Mishima 411-8540, Japan
| | - Naoko Fujito
- Human Genetics Laboratory, National Institute of Genetics, Mishima 411-8540, Japan; (L.N.); (N.F.); (R.S.)
- Department of Genetics, School of Life Science, The Graduate University for Advanced Studies (SOKENDAI), Mishima 411-8540, Japan
| | - Ryota Sugimoto
- Human Genetics Laboratory, National Institute of Genetics, Mishima 411-8540, Japan; (L.N.); (N.F.); (R.S.)
| | - Ituro Inoue
- Human Genetics Laboratory, National Institute of Genetics, Mishima 411-8540, Japan; (L.N.); (N.F.); (R.S.)
- Department of Genetics, School of Life Science, The Graduate University for Advanced Studies (SOKENDAI), Mishima 411-8540, Japan
- Correspondence: ; Tel.: +81-55-981-6795
| |
Collapse
|
33
|
Nishimura Y, Yoshizawa S. The OceanDNA MAG catalog contains over 50,000 prokaryotic genomes originated from various marine environments. Sci Data 2022; 9:305. [PMID: 35715423 PMCID: PMC9205870 DOI: 10.1038/s41597-022-01392-5] [Citation(s) in RCA: 44] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2021] [Accepted: 05/12/2022] [Indexed: 12/22/2022] Open
Abstract
Marine microorganisms are immensely diverse and play fundamental roles in global geochemical cycling. Recent metagenome-assembled genome studies, with particular attention to large-scale projects such as Tara Oceans, have expanded the genomic repertoire of marine microorganisms. However, published marine metagenome data is still underexplored. We collected 2,057 marine metagenomes covering various marine environments and developed a new genome reconstruction pipeline. We reconstructed 52,325 qualified genomes composed of 8,466 prokaryotic species-level clusters spanning 59 phyla, including genomes from the deep-sea characterized as deeper than 1,000 m (n = 3,337), low-oxygen zones of <90 μmol O2 per kg water (n = 7,884), and polar regions (n = 7,752). Novelty evaluation using a genome taxonomy database shows that 6,256 species (73.9%) are novel and include genomes of high taxonomic novelty, such as new class candidates. These genomes collectively expanded the known phylogenetic diversity of marine prokaryotes by 34.2%, and the species representatives cover 26.5-42.0% of prokaryote-enriched metagenomes. Thoroughly leveraging accumulated metagenomic data, this genome resource, named the OceanDNA MAG catalog, illuminates uncharacterized marine microbial 'dark matter' lineages.
Collapse
Affiliation(s)
- Yosuke Nishimura
- Atmosphere and Ocean Research Institute, The University of Tokyo, Chiba, 277-8564, Japan.
- Research Center for Bioscience and Nanoscience (CeBN), Research Institute for Marine Resources Utilization, Japan Agency for Marine-Earth Science and Technology (JAMSTEC), Yokosuka, Kanagawa, 237-0061, Japan.
| | - Susumu Yoshizawa
- Atmosphere and Ocean Research Institute, The University of Tokyo, Chiba, 277-8564, Japan
- Graduate School of Frontier Sciences, The University of Tokyo, Chiba, 277-8563, Japan
- Collaborative Research Institute for Innovative Microbiology, The University of Tokyo, Tokyo, 113-8657, Japan
| |
Collapse
|
34
|
Goussarov G, Mysara M, Vandamme P, Van Houdt R. Introduction to the principles and methods underlying the recovery of metagenome-assembled genomes from metagenomic data. Microbiologyopen 2022; 11:e1298. [PMID: 35765182 PMCID: PMC9179125 DOI: 10.1002/mbo3.1298] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Revised: 05/19/2022] [Accepted: 05/19/2022] [Indexed: 11/18/2022] Open
Abstract
The rise of metagenomics offers a leap forward for understanding the genetic diversity of microorganisms in many different complex environments by providing a platform that can identify potentially unlimited numbers of known and novel microorganisms. As such, it is impossible to imagine new major initiatives without metagenomics. Nevertheless, it represents a relatively new discipline with various levels of complexity and demands on bioinformatics. The underlying principles and methods used in metagenomics are often seen as common knowledge and often not detailed or fragmented. Therefore, we reviewed these to guide microbiologists in taking the first steps into metagenomics. We specifically focus on a workflow aimed at reconstructing individual genomes, that is, metagenome-assembled genomes, integrating DNA sequencing, assembly, binning, identification and annotation.
Collapse
Affiliation(s)
- Gleb Goussarov
- Microbiology Unit, Belgian Nuclear Research Centre (SCK CEN)MolBelgium
- Laboratory of Microbiology and BCCM/LMG Bacteria Collection, Faculty of SciencesGhent UniversityGhentBelgium
| | - Mohamed Mysara
- Microbiology Unit, Belgian Nuclear Research Centre (SCK CEN)MolBelgium
| | - Peter Vandamme
- Laboratory of Microbiology and BCCM/LMG Bacteria Collection, Faculty of SciencesGhent UniversityGhentBelgium
| | - Rob Van Houdt
- Microbiology Unit, Belgian Nuclear Research Centre (SCK CEN)MolBelgium
| |
Collapse
|
35
|
Wang Y, Dong Q, Hu S, Zou H, Wu T, Shi J, Zhang H, Sheng Y, Sun W, Kong X, Chen L. Decoding microbial genomes to understand their functional roles in human complex diseases. IMETA 2022; 1:e14. [PMID: 38868571 PMCID: PMC10989872 DOI: 10.1002/imt2.14] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/15/2021] [Revised: 01/20/2022] [Accepted: 02/28/2022] [Indexed: 06/14/2024]
Abstract
Complex diseases such as cardiovascular disease (CVD), obesity, inflammatory bowel disease (IBD), kidney disease, type 2 diabetes (T2D), and cancer have become a major burden to public health and affect more than 20% of the population worldwide. The etiology of complex diseases is not yet clear, but they are traditionally thought to be caused by genetics and environmental factors (e.g., dietary habits), and by their interactions. Besides this, increasing pieces of evidence now highlight that the intestinal microbiota may contribute substantially to the health and disease of the human host via their metabolic molecules. Therefore, decoding the microbial genomes has been an important strategy to shed light on their functional potential. In this review, we summarize the roles of the gut microbiome in complex diseases from its functional perspective. We further introduce artificial tools in decoding microbial genomes to profile their functionalities. Finally, state-of-the-art techniques have been highlighted which may contribute to a mechanistic understanding of the gut microbiome in human complex diseases and promote the development of the gut microbiome-based personalized medicine.
Collapse
Affiliation(s)
- Yifeng Wang
- Department of Cardiology, The First Affiliated Hospital of Nanjing Medical UniversityNanjing Medical UniversityNanjingJiangsuChina
- Cardiovascular Research Center, The Affiliated Suzhou Hospital of Nanjing Medical University, Suzhou Municipal Hospital, Gusu SchoolNanjing Medical UniversitySuzhouJiangsuChina
| | - Quanbin Dong
- Department of Cardiology, The First Affiliated Hospital of Nanjing Medical UniversityNanjing Medical UniversityNanjingJiangsuChina
- Cardiovascular Research Center, The Affiliated Suzhou Hospital of Nanjing Medical University, Suzhou Municipal Hospital, Gusu SchoolNanjing Medical UniversitySuzhouJiangsuChina
| | - Shixian Hu
- Institute of Precision Medicine, The First Affiliated Hospital of Sun Yat‐Sen UniversitySun Yat‐Sen UniversityGuangzhouGuangdongChina
| | - Huayiyang Zou
- Department of Cardiology, The First Affiliated Hospital of Nanjing Medical UniversityNanjing Medical UniversityNanjingJiangsuChina
| | - Tingting Wu
- Department of Cardiology, The First Affiliated Hospital of Nanjing Medical UniversityNanjing Medical UniversityNanjingJiangsuChina
| | - Jing Shi
- Department of Cardiology, The First Affiliated Hospital of Nanjing Medical UniversityNanjing Medical UniversityNanjingJiangsuChina
| | - Haifeng Zhang
- Department of Cardiology, The First Affiliated Hospital of Nanjing Medical UniversityNanjing Medical UniversityNanjingJiangsuChina
- Cardiovascular Research Center, The Affiliated Suzhou Hospital of Nanjing Medical University, Suzhou Municipal Hospital, Gusu SchoolNanjing Medical UniversitySuzhouJiangsuChina
| | - Yanhui Sheng
- Department of Cardiology, The First Affiliated Hospital of Nanjing Medical UniversityNanjing Medical UniversityNanjingJiangsuChina
- Cardiovascular Research Center, The Affiliated Suzhou Hospital of Nanjing Medical University, Suzhou Municipal Hospital, Gusu SchoolNanjing Medical UniversitySuzhouJiangsuChina
| | - Wei Sun
- Department of Cardiology, The First Affiliated Hospital of Nanjing Medical UniversityNanjing Medical UniversityNanjingJiangsuChina
- Cardiovascular Research Center, The Affiliated Suzhou Hospital of Nanjing Medical University, Suzhou Municipal Hospital, Gusu SchoolNanjing Medical UniversitySuzhouJiangsuChina
| | - Xiangqing Kong
- Department of Cardiology, The First Affiliated Hospital of Nanjing Medical UniversityNanjing Medical UniversityNanjingJiangsuChina
- Cardiovascular Research Center, The Affiliated Suzhou Hospital of Nanjing Medical University, Suzhou Municipal Hospital, Gusu SchoolNanjing Medical UniversitySuzhouJiangsuChina
| | - Lianmin Chen
- Department of Cardiology, The First Affiliated Hospital of Nanjing Medical UniversityNanjing Medical UniversityNanjingJiangsuChina
- Cardiovascular Research Center, The Affiliated Suzhou Hospital of Nanjing Medical University, Suzhou Municipal Hospital, Gusu SchoolNanjing Medical UniversitySuzhouJiangsuChina
- Department of Genetics, University Medical Center GroningenUniversity of GroningenGroningenThe Netherlands
| |
Collapse
|
36
|
Goussarov G, Claesen J, Mysara M, Cleenwerck I, Leys N, Vandamme P, Van Houdt R. Accurate prediction of metagenome-assembled genome completeness by MAGISTA, a random forest model built on alignment-free intra-bin statistics. ENVIRONMENTAL MICROBIOME 2022; 17:9. [PMID: 35248155 PMCID: PMC8898458 DOI: 10.1186/s40793-022-00403-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/13/2022] [Accepted: 02/17/2022] [Indexed: 05/03/2023]
Abstract
BACKGROUND Although the total number of microbial taxa on Earth is under debate, it is clear that only a small fraction of these has been cultivated and validly named. Evidently, the inability to culture most bacteria outside of very specific conditions severely limits their characterization and further studies. In the last decade, a major part of the solution to this problem has been the use of metagenome sequencing, whereby the DNA of an entire microbial community is sequenced, followed by the in silico reconstruction of genomes of its novel component species. The large discrepancy between the number of sequenced type strain genomes (around 12,000) and total microbial diversity (106-1012 species) directs these efforts to de novo assembly and binning. Unfortunately, these steps are error-prone and as such, the results have to be intensely scrutinized to avoid publishing incomplete and low-quality genomes. RESULTS We developed MAGISTA (metagenome-assembled genome intra-bin statistics assessment), a novel approach to assess metagenome-assembled genome quality that tackles some of the often-neglected drawbacks of current reference gene-based methods. MAGISTA is based on alignment-free distance distributions between contig fragments within metagenomic bins, rather than a set of reference genes. For proper training, a highly complex genomic DNA mock community was needed and constructed by pooling genomic DNA of 227 bacterial strains, specifically selected to obtain a wide variety representing the major phylogenetic lineages of cultivable bacteria. CONCLUSIONS MAGISTA achieved a 20% reduction in root-mean-square error in comparison to the marker gene approach when tested on publicly available mock metagenomes. Furthermore, our highly complex genomic DNA mock community is a very valuable tool for benchmarking (new) metagenome analysis methods.
Collapse
Affiliation(s)
- Gleb Goussarov
- Microbiology Unit, Belgian Nuclear Research Centre (SCK CEN), Mol, Belgium
- Laboratory of Microbiology and BCCM/LMG Bacteria Collection, Faculty of Sciences, Ghent University, Ghent, Belgium
| | - Jürgen Claesen
- Microbiology Unit, Belgian Nuclear Research Centre (SCK CEN), Mol, Belgium
- Department of Epidemiology & Biostatistics, Amsterdam UMC, VU University, Amsterdam, The Netherlands
| | - Mohamed Mysara
- Microbiology Unit, Belgian Nuclear Research Centre (SCK CEN), Mol, Belgium
| | - Ilse Cleenwerck
- Laboratory of Microbiology and BCCM/LMG Bacteria Collection, Faculty of Sciences, Ghent University, Ghent, Belgium
| | - Natalie Leys
- Microbiology Unit, Belgian Nuclear Research Centre (SCK CEN), Mol, Belgium
| | - Peter Vandamme
- Laboratory of Microbiology and BCCM/LMG Bacteria Collection, Faculty of Sciences, Ghent University, Ghent, Belgium
| | - Rob Van Houdt
- Microbiology Unit, Belgian Nuclear Research Centre (SCK CEN), Mol, Belgium.
| |
Collapse
|
37
|
Ventolero MF, Wang S, Hu H, Li X. Computational analyses of bacterial strains from shotgun reads. Brief Bioinform 2022; 23:6524011. [PMID: 35136954 DOI: 10.1093/bib/bbac013] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2021] [Revised: 01/10/2022] [Accepted: 01/11/2022] [Indexed: 12/21/2022] Open
Abstract
Shotgun sequencing is routinely employed to study bacteria in microbial communities. With the vast amount of shotgun sequencing reads generated in a metagenomic project, it is crucial to determine the microbial composition at the strain level. This study investigated 20 computational tools that attempt to infer bacterial strain genomes from shotgun reads. For the first time, we discussed the methodology behind these tools. We also systematically evaluated six novel-strain-targeting tools on the same datasets and found that BHap, mixtureS and StrainFinder performed better than other tools. Because the performance of the best tools is still suboptimal, we discussed future directions that may address the limitations.
Collapse
Affiliation(s)
| | - Saidi Wang
- Department of Computer Science, University of Central Florida, Orlando, FL 32816, USA
| | - Haiyan Hu
- Department of Computer Science, University of Central Florida, Orlando, FL 32816, USA.,Genomics and Bioinformatics Cluster, University of Central Florida, Orlando, FL 32816, USA
| | - Xiaoman Li
- Burnett School of Biomedical Science, University of Central Florida, Orlando, FL 32816, USA
| |
Collapse
|
38
|
Vuong P, Wise MJ, Whiteley AS, Kaur P. Small investments with big returns: environmental genomic bioprospecting of microbial life. Crit Rev Microbiol 2022; 48:641-655. [PMID: 35100064 DOI: 10.1080/1040841x.2021.2011833] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Microorganisms and their natural products are major drivers of ecological processes and industrial applications. Microbial bioprospecting has been critical for the advancement in various fields such as pharmaceuticals, sustainable industries, food security and bioremediation. Next generation sequencing has been paramount in the exploration of diverse environmental microbiomes. It presents a culture-independent approach to investigating hitherto uncultured taxa, resulting in the creation of massive sequence databases, which are available in the public domain. Genome mining searches available (meta)genomic data for target biosynthetic genes, and combined with the large-scale public data, this in-silico bioprospecting method presents an efficient and extensive way to uncover microbial bioproducts. Bioinformatic tools have progressed to a stage where we can recover genomes from the environment; these metagenome-assembled genomes present a way to understand the metabolic capacity of microorganisms in a physiological and ecological context. Environmental sampling been extensive across various ecological settings, including microbiomes with unique physicochemical properties that could influence the discovery of novel functions and metabolic pathways. Although in-silico methods cannot completely substitute in-vitro studies, the contextual information it provides is invaluable for understanding the ecological and taxonomic distribution of microbial genotypes and to form effective strategies for future microbial bioprospecting efforts.
Collapse
Affiliation(s)
- Paton Vuong
- UWA School of Agriculture & Environment, University of Western Australia, Perth, Australia
| | - Michael J Wise
- School of Physics, Mathematics and Computing, University of Western Australia, Perth, Australia
| | - Andrew S Whiteley
- Centre for Environment & Life Sciences, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Floreat, Australia
| | - Parwinder Kaur
- UWA School of Agriculture & Environment, University of Western Australia, Perth, Australia
| |
Collapse
|
39
|
Fournier P, Pellan L, Barroso-Bergadà D, Bohan DA, Candresse T, Delmotte F, Dufour MC, Lauvergeat V, Le Marrec C, Marais A, Martins G, Masneuf-Pomarède I, Rey P, Sherman D, This P, Frioux C, Labarthe S, Vacher C. The functional microbiome of grapevine throughout plant evolutionary history and lifetime. ADV ECOL RES 2022. [DOI: 10.1016/bs.aecr.2022.09.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
40
|
Choudhari J, Choubey J, Verma M, Chatterjee T, Sahariah B. Metagenomics: the boon for microbial world knowledge and current challenges. Bioinformatics 2022. [DOI: 10.1016/b978-0-323-89775-4.00022-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
|
41
|
Yang C, Chowdhury D, Zhang Z, Cheung WK, Lu A, Bian Z, Zhang L. A review of computational tools for generating metagenome-assembled genomes from metagenomic sequencing data. Comput Struct Biotechnol J 2021; 19:6301-6314. [PMID: 34900140 PMCID: PMC8640167 DOI: 10.1016/j.csbj.2021.11.028] [Citation(s) in RCA: 86] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Revised: 11/17/2021] [Accepted: 11/17/2021] [Indexed: 12/16/2022] Open
Abstract
Metagenomic sequencing provides a culture-independent avenue to investigate the complex microbial communities by constructing metagenome-assembled genomes (MAGs). A MAG represents a microbial genome by a group of sequences from genome assembly with similar characteristics. It enables us to identify novel species and understand their potential functions in a dynamic ecosystem. Many computational tools have been developed to construct and annotate MAGs from metagenomic sequencing, however, there is a prominent gap to comprehensively introduce their background and practical performance. In this paper, we have thoroughly investigated the computational tools designed for both upstream and downstream analyses, including metagenome assembly, metagenome binning, gene prediction, functional annotation, taxonomic classification, and profiling. We have categorized the commonly used tools into unique groups based on their functional background and introduced the underlying core algorithms and associated information to demonstrate a comparative outlook. Furthermore, we have emphasized the computational requisition and offered guidance to the users to select the most efficient tools. Finally, we have indicated current limitations, potential solutions, and future perspectives for further improving the tools of MAG construction and annotation. We believe that our work provides a consolidated resource for the current stage of MAG studies and shed light on the future development of more effective MAG analysis tools on metagenomic sequencing.
Collapse
Key Words
- CNN, convolutional neural network
- DBG, De Bruijn graph
- GTDB, Genome Taxonomy Database
- Gene functional annotation
- Gene prediction
- Genome assembly
- HMM, Hidden Markov Model
- KEGG, Kyoto Encyclopedia of Genes and Genomes
- LCA, lowest common ancestor
- LPA, label propagation algorithm
- MAGs, metagenome-assembled genomes
- Metagenome binning
- Metagenome-assembled genomes
- Metagenomic sequencing
- Microbial abundance profiling
- OLC, overlap-layout consensus
- ONT, Oxford Nanopore Technologies
- ORFs, open reading frames
- PacBio, Pacific Biosciences
- QC, quality control
- SLR, synthetic long reads
- TNFs, tetranucleotide frequencies
- Taxonomic classification
Collapse
Affiliation(s)
- Chao Yang
- Department of Computer Science, Hong Kong Baptist University, Hong Kong Special Administrative Region
| | - Debajyoti Chowdhury
- Computational Medicine Lab, Hong Kong Baptist University, Hong Kong Special Administrative Region
- Institute of Integrated Bioinformedicine and Translational Sciences, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong Special Administrative Region
| | - Zhenmiao Zhang
- Department of Computer Science, Hong Kong Baptist University, Hong Kong Special Administrative Region
| | - William K. Cheung
- Department of Computer Science, Hong Kong Baptist University, Hong Kong Special Administrative Region
| | - Aiping Lu
- Computational Medicine Lab, Hong Kong Baptist University, Hong Kong Special Administrative Region
- Institute of Integrated Bioinformedicine and Translational Sciences, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong Special Administrative Region
| | - Zhaoxiang Bian
- Institute of Brain and Gut Research, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong Special Administrative Region
- Chinese Medicine Clinical Study Center, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong Special Administrative Region
| | - Lu Zhang
- Department of Computer Science, Hong Kong Baptist University, Hong Kong Special Administrative Region
- Computational Medicine Lab, Hong Kong Baptist University, Hong Kong Special Administrative Region
| |
Collapse
|
42
|
Ma T, McAllister TA, Guan LL. A review of the resistome within the digestive tract of livestock. J Anim Sci Biotechnol 2021; 12:121. [PMID: 34763729 PMCID: PMC8588621 DOI: 10.1186/s40104-021-00643-6] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Accepted: 10/07/2021] [Indexed: 12/25/2022] Open
Abstract
Antimicrobials have been widely used to prevent and treat infectious diseases and promote growth in food-production animals. However, the occurrence of antimicrobial resistance poses a huge threat to public and animal health, especially in less developed countries where food-producing animals often intermingle with humans. To limit the spread of antimicrobial resistance from food-production animals to humans and the environment, it is essential to have a comprehensive knowledge of the role of the resistome in antimicrobial resistance (AMR), The resistome refers to the collection of all antimicrobial resistance genes associated with microbiota in a given environment. The dense microbiota in the digestive tract is known to harbour one of the most diverse resistomes in nature. Studies of the resistome in the digestive tract of humans and animals are increasing exponentially as a result of advancements in next-generation sequencing and the expansion of bioinformatic resources/tools to identify and describe the resistome. In this review, we outline the various tools/bioinformatic pipelines currently available to characterize and understand the nature of the intestinal resistome of swine, poultry, and ruminants. We then propose future research directions including analysis of resistome using long-read sequencing, investigation in the role of mobile genetic elements in the expression, function and transmission of AMR. This review outlines the current knowledge and approaches to studying the resistome in food-producing animals and sheds light on future strategies to reduce antimicrobial usage and control the spread of AMR both within and from livestock production systems.
Collapse
Affiliation(s)
- Tao Ma
- Key laboratory of Feed Biotechnology of the Ministry of Agriculture, Institute of Feed Research, Chinese Academy of Agricultural Sciences, Beijing, 100081, China.,Department of Agricultural, Food and Nutritional Science, University of Alberta, T6G2P5, Edmonton, AB, Canada
| | - Tim A McAllister
- Lethbridge Research and Development Centre, Lethbridge, AB, T1J 4P4, Canada
| | - Le Luo Guan
- Department of Agricultural, Food and Nutritional Science, University of Alberta, T6G2P5, Edmonton, AB, Canada.
| |
Collapse
|
43
|
Metagenome-Assembled Genomes Contribute to Unraveling of the Microbiome of Cocoa Fermentation. Appl Environ Microbiol 2021; 87:e0058421. [PMID: 34105982 DOI: 10.1128/aem.00584-21] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Metagenomic studies about cocoa fermentation have mainly reported on the analysis of short reads for determination of operational taxonomic units. However, it is also important to determine metagenome-assembled genomes (MAGs), which are genomes deriving from the assembly of metagenomics. For this research, all the cocoa metagenomes from public databases were downloaded, resulting in five data sets: one from Ghana and four from Brazil. In addition, in silico approaches were used to describe putative phenotypes and the metabolic potential of MAGs. A total of 17 high-quality MAGs were recovered from these microbiomes, as follows: (i) for fungi, Yamadazyma tenuis (n = 1); (ii) lactic acid bacteria, Limosilactobacillus fermentum (n = 5), Liquorilactobacillus cacaonum (n = 1), Liquorilactobacillus nagelli (n = 1), Leuconostoc pseudomesenteroides (n = 1), and Lactiplantibacillus plantarum subsp. plantarum (n = 1); (iii) acetic acid bacteria, Acetobacter senegalensis (n = 2) and Kozakia baliensis (n = 1); and (iv) Bacillus subtilis (n = 1), Brevundimonas sp. (n = 2), and Pseudomonas sp. (n = 1). Medium-quality MAGs were also recovered from cocoa microbiomes, including some that, to our knowledge, were not previously detected in this environment (Liquorilactobacillus vini, Komagataeibacter saccharivorans, and Komagataeibacter maltaceti) and others previously described (Fructobacillus pseudoficulneus and Acetobacter pasteurianus). Taken together, the MAGs were useful for providing an additional description of the microbiome of cocoa fermentation, revealing previously overlooked microorganisms, with prediction of key phenotypes and biochemical pathways. IMPORTANCE The production of chocolate starts with the harvesting of cocoa fruits and the spontaneous fermentation of the seeds in a microbial succession that depends on yeasts, lactic acid bacteria, and acetic acid bacteria in order to eliminate bitter and astringent compounds present in the raw material, which will be further roasted and grinded to originate the cocoa powder that will enter the food processing industry. The microbiota of cocoa fermentation is not completely known, and yet it advanced from culture-based studies to the advent of next-generation DNA sequencing, with the generation of a myriad of data that need bioinformatic approaches to be properly analyzed. Although the majority of metagenomic studies have been based on short reads (operational taxonomic units), it is also important to analyze entire genomes to determine more precisely possible ecological roles of different species. Metagenome-assembled genomes (MAGs) are very useful for this purpose; here, MAGs from cocoa fermentation microbiomes are described, and the possible implications of their phenotypic and metabolic potentials are discussed.
Collapse
|
44
|
Martínez Arbas S, Busi SB, Queirós P, de Nies L, Herold M, May P, Wilmes P, Muller EEL, Narayanasamy S. Challenges, Strategies, and Perspectives for Reference-Independent Longitudinal Multi-Omic Microbiome Studies. Front Genet 2021; 12:666244. [PMID: 34194470 PMCID: PMC8236828 DOI: 10.3389/fgene.2021.666244] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Accepted: 04/30/2021] [Indexed: 12/21/2022] Open
Abstract
In recent years, multi-omic studies have enabled resolving community structure and interrogating community function of microbial communities. Simultaneous generation of metagenomic, metatranscriptomic, metaproteomic, and (meta) metabolomic data is more feasible than ever before, thus enabling in-depth assessment of community structure, function, and phenotype, thus resulting in a multitude of multi-omic microbiome datasets and the development of innovative methods to integrate and interrogate those multi-omic datasets. Specifically, the application of reference-independent approaches provides opportunities in identifying novel organisms and functions. At present, most of these large-scale multi-omic datasets stem from spatial sampling (e.g., water/soil microbiomes at several depths, microbiomes in/on different parts of the human anatomy) or case-control studies (e.g., cohorts of human microbiomes). We believe that longitudinal multi-omic microbiome datasets are the logical next step in microbiome studies due to their characteristic advantages in providing a better understanding of community dynamics, including: observation of trends, inference of causality, and ultimately, prediction of community behavior. Furthermore, the acquisition of complementary host-derived omics, environmental measurements, and suitable metadata will further enhance the aforementioned advantages of longitudinal data, which will serve as the basis to resolve drivers of community structure and function to understand the biotic and abiotic factors governing communities and specific populations. Carefully setup future experiments hold great potential to further unveil ecological mechanisms to evolution, microbe-microbe interactions, or microbe-host interactions. In this article, we discuss the challenges, emerging strategies, and best-practices applicable to longitudinal microbiome studies ranging from sampling, biomolecular extraction, systematic multi-omic measurements, reference-independent data integration, modeling, and validation.
Collapse
Affiliation(s)
- Susana Martínez Arbas
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Susheel Bhanu Busi
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Pedro Queirós
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Laura de Nies
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Malte Herold
- Department of Environmental Research and Innovation, Luxembourg Institute of Science and Technology, Belvaux, Luxembourg
| | - Patrick May
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Paul Wilmes
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
- Department of Life Sciences and Medicine, Faculty of Science, Technology and Medicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Emilie E. L. Muller
- Université de Strasbourg, UMR 7156 CNRS, Génétique Moléculaire, Génomique, Microbiologie, Strasbourg, France
| | - Shaman Narayanasamy
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| |
Collapse
|
45
|
Nearing JT, Comeau AM, Langille MGI. Identifying biases and their potential solutions in human microbiome studies. MICROBIOME 2021; 9:113. [PMID: 34006335 PMCID: PMC8132403 DOI: 10.1186/s40168-021-01059-0] [Citation(s) in RCA: 108] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/11/2020] [Accepted: 03/24/2021] [Indexed: 05/13/2023]
Abstract
Advances in DNA sequencing technology have vastly improved the ability of researchers to explore the microbial inhabitants of the human body. Unfortunately, while these studies have uncovered the importance of these microbial communities to our health, they often do not result in similar findings. One possible reason for the disagreement in these results is due to the multitude of systemic biases that are introduced during sequence-based microbiome studies. These biases begin with sample collection and continue to be introduced throughout the entire experiment leading to an observed community that is significantly altered from the true underlying microbial composition. In this review, we will highlight the various steps in typical sequence-based human microbiome studies where significant bias can be introduced, and we will review the current efforts within the field that aim to reduce the impact of these biases. Video abstract.
Collapse
Affiliation(s)
- Jacob T Nearing
- Department of Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia, Canada
| | - André M Comeau
- Integrated Microbiome Resource, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Morgan G I Langille
- Integrated Microbiome Resource, Dalhousie University, Halifax, Nova Scotia, Canada.
- Department of Pharmacology, Dalhousie University, Halifax, Nova Scotia, Canada.
| |
Collapse
|
46
|
Alanin KWS, Jørgensen TS, Browne PD, Petersen B, Riber L, Kot W, Hansen LH. An improved direct metamobilome approach increases the detection of larger-sized circular elements across kingdoms. Plasmid 2021; 115:102576. [PMID: 33872684 DOI: 10.1016/j.plasmid.2021.102576] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Revised: 04/12/2021] [Accepted: 04/12/2021] [Indexed: 10/21/2022]
Abstract
Mobile genetic elements (MGEs) are instrumental in natural prokaryotic genome editing, permitting genome plasticity and allowing microbes to accumulate genetic diversity. MGEs serve as a vast communal gene pool and include DNA elements such as plasmids and bacteriophages (phages) among others. These mobile DNA elements represent a human health risk as they can introduce new traits, such as antibiotic resistance or virulence, to a bacterial strain. Sequencing libraries targeting environmental circular MGEs, referred to as metamobilomes, may broaden our current understanding of the mechanisms behind the mobility, prevalence and content of these elements. However, metamobilomics is affected by a severe bias towards small circular elements, introduced by multiple displacement amplification (MDA). MDA is typically used to overcome limiting DNA quantities after the removal of non-circular DNA during library preparations. By examining the relationship between sequencing coverage and the size of circular MGEs in paired metamobilome datasets with and without MDA, we show that larger circular elements are lost when using MDA. This study is the first to systematically demonstrate that MDA is detrimental to detecting larger-sized plasmids if small plasmids are present. It is also the first to show that MDA can be omitted when using enzyme-based DNA fragmentation and PCR in library preparation kits such as Nextera XT® from Illumina.
Collapse
Affiliation(s)
- Katrine Wacenius Skov Alanin
- Department of Environmental Science, Aarhus University, Roskilde, Denmark; Department of Plant and Environmental Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Tue Sparholt Jørgensen
- Department of Environmental Science, Aarhus University, Roskilde, Denmark; Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Lyngby, Denmark; Department of Science and Environment, Roskilde University, Denmark
| | - Patrick Denis Browne
- Department of Environmental Science, Aarhus University, Roskilde, Denmark; Department of Plant and Environmental Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Bent Petersen
- Globe Institute, Faculty of Health and Biomedical Sciences, University of Copenhagen, Copenhagen, Denmark; Centre of Excellence for Omics-Driven Computational Biodiscovery (COMBio), Faculty of Applied Sciences, AIMST University, Kedah, Malaysia
| | - Leise Riber
- Department of Biology, Functional Genomics, University of Copenhagen, Copenhagen, Denmark
| | - Witold Kot
- Department of Environmental Science, Aarhus University, Roskilde, Denmark; Department of Plant and Environmental Sciences, University of Copenhagen, Copenhagen, Denmark.
| | - Lars Hestbjerg Hansen
- Department of Environmental Science, Aarhus University, Roskilde, Denmark; Department of Plant and Environmental Sciences, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
47
|
Jégousse C, Vannier P, Groben R, Glöckner FO, Marteinsson V. A total of 219 metagenome-assembled genomes of microorganisms from Icelandic marine waters. PeerJ 2021; 9:e11112. [PMID: 33859876 PMCID: PMC8020865 DOI: 10.7717/peerj.11112] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Accepted: 02/23/2021] [Indexed: 12/02/2022] Open
Abstract
Marine microorganisms contribute to the health of the global ocean by supporting the marine food web and regulating biogeochemical cycles. Assessing marine microbial diversity is a crucial step towards understanding the global ocean. The waters surrounding Iceland are a complex environment where relatively warm salty waters from the Atlantic cool down and sink down to the deep. Microbial studies in this area have focused on photosynthetic micro- and nanoplankton mainly using microscopy and chlorophyll measurements. However, the diversity and function of the bacterial and archaeal picoplankton remains unknown. Here, we used a co-assembly approach supported by a marine mock community to reconstruct metagenome-assembled genomes (MAGs) from 31 metagenomes from the sea surface and seafloor of four oceanographic sampling stations sampled between 2015 and 2018. The resulting 219 MAGs include 191 bacterial, 26 archaeal and two eukaryotic MAGs to bridge the gap in our current knowledge of the global marine microbiome.
Collapse
Affiliation(s)
- Clara Jégousse
- School of Health Sciences, University of Iceland, Reykjavik, Iceland.,Microbiology Group, Matís ohf., Reykjavik, Iceland
| | | | - René Groben
- Microbiology Group, Matís ohf., Reykjavik, Iceland
| | - Frank Oliver Glöckner
- Data at the Computing Center, Alfred Wegener Institute, Bremenhaven, Germany.,MARUM - Center for Marine Environmental Sciences,University of Bremen, Bremen, Germany
| | - Viggó Marteinsson
- School of Health Sciences, University of Iceland, Reykjavik, Iceland.,Microbiology Group, Matís ohf., Reykjavik, Iceland
| |
Collapse
|
48
|
Dreher TW, Davis EW, Mueller RS, Otten TG. Comparative genomics of the ADA clade within the Nostocales. HARMFUL ALGAE 2021; 104:102037. [PMID: 34023075 DOI: 10.1016/j.hal.2021.102037] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/27/2021] [Revised: 04/14/2021] [Accepted: 04/23/2021] [Indexed: 06/12/2023]
Abstract
The ADA clade of Nostocales cyanobacteria, a group that is prominent in current harmful algal bloom events, now includes over 40 genome sequences with the recent addition of sixteen novel sequenced genomes (Dreher et al., Harmful Algae, 2021). Fourteen genomes are complete (closed), enabling highly detailed assessments of gene content and genome architecture. ADA genomes contain 5 rRNA operons, genes expected to support a photoautotrophic and diazotrophic lifestyle, and a varied array of genes for the synthesis of bioactive secondary metabolites. Genes for the production of the taste-and-odor compound geosmin and the four major classes of cyanotoxins - anatoxin-a, cylindrospermopsin, microcystin and saxitoxin - are represented in members of the ADA clade. Notably, the gene array for the synthesis of cylindrospermopsin by Dolichospermum sp. DET69 was located on a plasmid, raising the possibility of facile horizontal transmission. However, genes supporting independent conjugative transfer of this plasmid are lacking. Further, analysis of genomic loci containing this and other cyanotoxin gene arrays shows evidence that these arrays have long-term stability and do not appear to be genomic islands easily capable of horizontal transmission to other cells. There is considerable diversity in the gene complements of individual ADA genomes, including the variable presence of physiologically important genes: genomes in three species-level subclades lack the gas vesicle genes that facilitate a planktonic lifestyle, and, surprisingly, the genome of Cuspidothrix issatschenkoi CHARLIE-1, a reported diazotroph, lacks the genes for nitrogen fixation. Notably, phylogenetically related genomes possess limited synteny, indicating a prominent role for chromosome rearrangements during ADA strain evolution. The genomes contain abundant insertion sequences and repetitive transposase genes, which could be the main drivers of genome rearrangement through active transposition and homologous recombination. No prophages were found, and no evidence of viral infection was observed in the bloom population samples from which the genomes discussed here were derived. Phages thus seem to have a limited influence on ADA evolution.
Collapse
Affiliation(s)
- Theo W Dreher
- Department of Microbiology, Oregon State University, Corvallis, Oregon 97331 USA; Center for Genome Research and Biocomputing, Oregon State University, Corvallis, Oregon 97331 USA.
| | - Edward W Davis
- Center for Genome Research and Biocomputing, Oregon State University, Corvallis, Oregon 97331 USA
| | - Ryan S Mueller
- Department of Microbiology, Oregon State University, Corvallis, Oregon 97331 USA
| | - Timothy G Otten
- Department of Microbiology, Oregon State University, Corvallis, Oregon 97331 USA.
| |
Collapse
|
49
|
Gayathri KV, Aishwarya S, Kumar PS, Rajendran UR, Gunasekaran K. Metabolic and molecular modelling of zebrafish gut biome to unravel antimicrobial peptides through metagenomics. Microb Pathog 2021; 154:104862. [PMID: 33781870 DOI: 10.1016/j.micpath.2021.104862] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Revised: 01/25/2021] [Accepted: 03/12/2021] [Indexed: 01/07/2023]
Abstract
Recently efforts have been taken for unravelling mysteries between host-microbe interactions in gut microbiome studies of model organisms through metagenomics. Co-existence and the co-evolution of the microorganisms is the significant cause of the growing antimicrobial menace. There needs a novel approach to develop potential antimicrobials with capabilities to act directly on the resistant microbes with reduced side effects. One such is to tap them from the natural resources, preferably the gut of the most closely related animal model. In this study, we employed metagenomics approaches to identify the large taxonomic genomes of the zebra fish gut. About 256 antimicrobial peptides were identified using gene ontology predictions from Macrel and Pubseed servers. Upon the property predictions, the top 10 antimicrobial peptides were screened based on their action against many resistant bacterial species, including Klebsiella pneumoniae, Pseudomonas aeruginosa, Staphylococcus aureus, E. coli, and Bacillus cereus. Metabolic modelling and flux balance analysis (FBA) were computed to conclude the antibiotic such as tetracycline, cephalosporins, puromycin, neomycin biosynthesis pathways were adopted by the microbiome as protection strategies. Molecular modelling strategies, including molecular docking and dynamics, were performed to estimate the antimicrobial peptides' binding against the target-putative nucleic acid binding lipoprotein and confirm stable binding. One specific antimicrobial peptide with the sequence "MPPYLHEIQPHTASNCQTELVIKL" showed promising results with 53% hydrophobic residues and a net charge +2.5, significant for the development of antimicrobial peptides. The said peptide also showed promising interactions with the target protein and expressed stable binding with docking energy of -429.34 kcal/mol and the average root mean square deviation of 1 A0. The study is a novel approach focusing on tapping out potential antimicrobial peptides to be developed against most resistant bacterial species.
Collapse
Affiliation(s)
- K Veena Gayathri
- Department of Bioinformatics, Stella Maris College (Autonomous), Chennai, 600086, India.
| | - S Aishwarya
- Department of Biotechnology, Stella Maris College (Autonomous), Chennai, 600086, India; CAS in Crystallography and Biophysics, University of Madras, Chennai, 600025, India
| | - P Senthil Kumar
- Department of Chemical Engineering, Sri Sivasubramaniya Nadar College of Engineering, Kalavakkam, Chennai, 603 110, India.
| | - U Rohini Rajendran
- Department of Bioinformatics, Stella Maris College (Autonomous), Chennai, 600086, India
| | - K Gunasekaran
- CAS in Crystallography and Biophysics, University of Madras, Chennai, 600025, India
| |
Collapse
|
50
|
Maguire F, Jia B, Gray KL, Lau WYV, Beiko RG, Brinkman FSL. Metagenome-assembled genome binning methods with short reads disproportionately fail for plasmids and genomic Islands. Microb Genom 2020; 6:mgen000436. [PMID: 33001022 PMCID: PMC7660262 DOI: 10.1099/mgen.0.000436] [Citation(s) in RCA: 62] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2020] [Accepted: 09/04/2020] [Indexed: 12/12/2022] Open
Abstract
Metagenomic methods enable the simultaneous characterization of microbial communities without time-consuming and bias-inducing culturing. Metagenome-assembled genome (MAG) binning methods aim to reassemble individual genomes from this data. However, the recovery of mobile genetic elements (MGEs), such as plasmids and genomic islands (GIs), by binning has not been well characterized. Given the association of antimicrobial resistance (AMR) genes and virulence factor (VF) genes with MGEs, studying their transmission is a public-health priority. The variable copy number and sequence composition of MGEs makes them potentially problematic for MAG binning methods. To systematically investigate this issue, we simulated a low-complexity metagenome comprising 30 GI-rich and plasmid-containing bacterial genomes. MAGs were then recovered using 12 current prediction pipelines and evaluated. While 82-94 % of chromosomes could be correctly recovered and binned, only 38-44 % of GIs and 1-29 % of plasmid sequences were found. Strikingly, no plasmid-borne VF nor AMR genes were recovered, and only 0-45 % of AMR or VF genes within GIs. We conclude that short-read MAG approaches, without further optimization, are largely ineffective for the analysis of mobile genes, including those of public-health importance, such as AMR and VF genes. We propose that researchers should explore developing methods that optimize for this issue and consider also using unassembled short reads and/or long-read approaches to more fully characterize metagenomic data.
Collapse
Affiliation(s)
- Finlay Maguire
- Faculty of Computer Science, Dalhousie University, 6050 University Avenue, Halifax, Nova Scotia, B3H 4R2, Canada
| | - Baofeng Jia
- Department of Molecular Biology and Biochemistry, Simon Fraser University, 8888 University Drive, Burnaby, BC V5A 1S6, Canada
| | - Kristen L. Gray
- Department of Molecular Biology and Biochemistry, Simon Fraser University, 8888 University Drive, Burnaby, BC V5A 1S6, Canada
| | - Wing Yin Venus Lau
- Department of Molecular Biology and Biochemistry, Simon Fraser University, 8888 University Drive, Burnaby, BC V5A 1S6, Canada
| | - Robert G. Beiko
- Faculty of Computer Science, Dalhousie University, 6050 University Avenue, Halifax, Nova Scotia, B3H 4R2, Canada
| | - Fiona S. L. Brinkman
- Department of Molecular Biology and Biochemistry, Simon Fraser University, 8888 University Drive, Burnaby, BC V5A 1S6, Canada
| |
Collapse
|