1
|
Zhang J, Zhao L, Wang W, Zhang Q, Wang XT, Xing DF, Ren NQ, Lee DJ, Chen C. Large language model for horizontal transfer of resistance gene: From resistance gene prevalence detection to plasmid conjugation rate evaluation. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 931:172466. [PMID: 38626826 DOI: 10.1016/j.scitotenv.2024.172466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/17/2024] [Revised: 04/10/2024] [Accepted: 04/11/2024] [Indexed: 05/07/2024]
Abstract
The burgeoning issue of plasmid-mediated resistance genes (ARGs) dissemination poses a significant threat to environmental integrity. However, the prediction of ARGs prevalence is overlooked, especially for emerging ARGs that are potentially evolving gene exchange hotspot. Here, we explored to classify plasmid or chromosome sequences and detect resistance gene prevalence by using DNABERT. Initially, the DNABERT fine-tuned in plasmid and chromosome sequences followed by multilayer perceptron (MLP) classifier could achieve 0.764 AUC (Area under curve) on external datasets across 23 genera, outperforming 0.02 AUC than traditional statistic-based model. Furthermore, Escherichia, Pseudomonas single genera based model were also be trained to explore its predict performance to ARGs prevalence detection. By integrating K-mer frequency attributes, our model could boost the performance to predict the prevalence of ARGs in an external dataset in Escherichia with 0.0281-0.0615 AUC and Pseudomonas with 0.0196-0.0928 AUC. Finally, we established a random forest model aimed at forecasting the relative conjugation transfer rate of plasmids with 0.7956 AUC, drawing on data from existing literature. It identifies the plasmid's repression status, cellular density, and temperature as the most important factors influencing transfer frequency. With these two models combined, they provide useful reference for quick and low-cost integrated evaluation of resistance gene transfer, accelerating the process of computer-assisted quantitative risk assessment of ARGs transfer in environmental field.
Collapse
Affiliation(s)
- Jiabin Zhang
- State Key Laboratory of Urban Water Resource and Environment, School of Environment, Harbin Institute of Technology, Harbin, Heilongjiang Province 150090, China
| | - Lei Zhao
- State Key Laboratory of Urban Water Resource and Environment, School of Environment, Harbin Institute of Technology, Harbin, Heilongjiang Province 150090, China
| | - Wei Wang
- State Key Laboratory of Urban Water Resource and Environment, School of Environment, Harbin Institute of Technology, Harbin, Heilongjiang Province 150090, China.
| | - Quan Zhang
- State Key Laboratory of Urban Water Resource and Environment, School of Environment, Harbin Institute of Technology, Harbin, Heilongjiang Province 150090, China
| | - Xue-Ting Wang
- State Key Laboratory of Urban Water Resource and Environment, School of Environment, Harbin Institute of Technology, Harbin, Heilongjiang Province 150090, China
| | - De-Feng Xing
- State Key Laboratory of Urban Water Resource and Environment, School of Environment, Harbin Institute of Technology, Harbin, Heilongjiang Province 150090, China
| | - Nan-Qi Ren
- State Key Laboratory of Urban Water Resource and Environment, School of Environment, Harbin Institute of Technology, Harbin, Heilongjiang Province 150090, China; Shenzhen Graduate School, Harbin Institute of Technology, Shenzhen 518055, China
| | - Duu-Jong Lee
- Department of Mechanical Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong
| | - Chuan Chen
- State Key Laboratory of Urban Water Resource and Environment, School of Environment, Harbin Institute of Technology, Harbin, Heilongjiang Province 150090, China.
| |
Collapse
|
2
|
Hou S, Tang T, Cheng S, Liu Y, Xia T, Chen T, Fuhrman J, Sun F. DeepMicroClass sorts metagenomic contigs into prokaryotes, eukaryotes and viruses. NAR Genom Bioinform 2024; 6:lqae044. [PMID: 38711860 PMCID: PMC11071121 DOI: 10.1093/nargab/lqae044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Revised: 03/18/2024] [Accepted: 04/18/2024] [Indexed: 05/08/2024] Open
Abstract
Sequence classification facilitates a fundamental understanding of the structure of microbial communities. Binary metagenomic sequence classifiers are insufficient because environmental metagenomes are typically derived from multiple sequence sources. Here we introduce a deep-learning based sequence classifier, DeepMicroClass, that classifies metagenomic contigs into five sequence classes, i.e. viruses infecting prokaryotic or eukaryotic hosts, eukaryotic or prokaryotic chromosomes, and prokaryotic plasmids. DeepMicroClass achieved high performance for all sequence classes at various tested sequence lengths ranging from 500 bp to 100 kbps. By benchmarking on a synthetic dataset with variable sequence class composition, we showed that DeepMicroClass obtained better performance for eukaryotic, plasmid and viral contig classification than other state-of-the-art predictors. DeepMicroClass achieved comparable performance on viral sequence classification with geNomad and VirSorter2 when benchmarked on the CAMI II marine dataset. Using a coastal daily time-series metagenomic dataset as a case study, we showed that microbial eukaryotes and prokaryotic viruses are integral to microbial communities. By analyzing monthly metagenomes collected at HOT and BATS, we found relatively higher viral read proportions in the subsurface layer in late summer, consistent with the seasonal viral infection patterns prevalent in these areas. We expect DeepMicroClass will promote metagenomic studies of under-appreciated sequence types.
Collapse
Affiliation(s)
- Shengwei Hou
- Department of Ocean Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
- Marine and Environmental Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
| | - Tianqi Tang
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Siliangyu Cheng
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Yuanhao Liu
- Department of Ocean Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
| | - Tian Xia
- Department of Ocean Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China
| | - Ting Chen
- Department of Computer Science and Technology, Institute of Artificial Intelligence & BNRist, Tsinghua University, Beijing 100084, China
| | - Jed A Fuhrman
- Marine and Environmental Biology, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA
| | - Fengzhu Sun
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| |
Collapse
|
3
|
Kan CM, Tsang HF, Pei XM, Ng SSM, Yim AKY, Yu ACS, Wong SCC. Enhancing Clinical Utility: Utilization of International Standards and Guidelines for Metagenomic Sequencing in Infectious Disease Diagnosis. Int J Mol Sci 2024; 25:3333. [PMID: 38542307 PMCID: PMC10970082 DOI: 10.3390/ijms25063333] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Revised: 03/11/2024] [Accepted: 03/12/2024] [Indexed: 11/11/2024] Open
Abstract
Metagenomic sequencing has emerged as a transformative tool in infectious disease diagnosis, offering a comprehensive and unbiased approach to pathogen detection. Leveraging international standards and guidelines is essential for ensuring the quality and reliability of metagenomic sequencing in clinical practice. This review explores the implications of international standards and guidelines for the application of metagenomic sequencing in infectious disease diagnosis. By adhering to established standards, such as those outlined by regulatory bodies and expert consensus, healthcare providers can enhance the accuracy and clinical utility of metagenomic sequencing. The integration of international standards and guidelines into metagenomic sequencing workflows can streamline diagnostic processes, improve pathogen identification, and optimize patient care. Strategies in implementing these standards for infectious disease diagnosis using metagenomic sequencing are discussed, highlighting the importance of standardized approaches in advancing precision infectious disease diagnosis initiatives.
Collapse
Affiliation(s)
- Chau-Ming Kan
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Hong Kong, China; (C.-M.K.); (H.F.T.)
| | - Hin Fung Tsang
- Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Hong Kong, China; (C.-M.K.); (H.F.T.)
| | - Xiao Meng Pei
- Department of Applied Biology & Chemical Technology, The Hong Kong Polytechnic University, Hong Kong, China;
| | - Simon Siu Man Ng
- Department of Surgery, Faculty of Medicine, The Chinese University of Hong Kong, Hong Kong, China;
| | | | - Allen Chi-Shing Yu
- Codex Genetics Limited, Shatin, Hong Kong, China; (A.K.-Y.Y.); (A.C.-S.Y.)
| | - Sze Chuen Cesar Wong
- Department of Applied Biology & Chemical Technology, The Hong Kong Polytechnic University, Hong Kong, China;
| |
Collapse
|
4
|
Yu MK, Fogarty EC, Eren AM. Diverse plasmid systems and their ecology across human gut metagenomes revealed by PlasX and MobMess. Nat Microbiol 2024; 9:830-847. [PMID: 38443576 PMCID: PMC10914615 DOI: 10.1038/s41564-024-01610-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Accepted: 01/17/2024] [Indexed: 03/07/2024]
Abstract
Plasmids alter microbial evolution and lifestyles by mobilizing genes that often confer fitness in changing environments across clades. Yet our ecological and evolutionary understanding of naturally occurring plasmids is far from complete. Here we developed a machine-learning model, PlasX, which identified 68,350 non-redundant plasmids across human gut metagenomes and organized them into 1,169 evolutionarily cohesive 'plasmid systems' using our sequence containment-aware network-partitioning algorithm, MobMess. Individual plasmids were often country specific, yet most plasmid systems spanned across geographically distinct human populations. Cargo genes in plasmid systems included well-known determinants of fitness, such as antibiotic resistance, but also many others including enzymes involved in the biosynthesis of essential nutrients and modification of transfer RNAs, revealing a wide repertoire of likely fitness determinants in complex environments. Our study introduces computational tools to recognize and organize plasmids, and uncovers the ecological and evolutionary patterns of diverse plasmids in naturally occurring habitats through plasmid systems.
Collapse
Affiliation(s)
- Michael K Yu
- Toyota Technological Institute at Chicago, Chicago, IL, USA.
| | - Emily C Fogarty
- Department of Medicine, University of Chicago, Chicago, IL, USA
- Committee On Microbiology, University of Chicago, Chicago, IL, USA
| | - A Murat Eren
- Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, MA, USA.
- Alfred Wegener Institute for Polar and Marine Research, Bremerhaven, Germany.
- Institute for Chemistry and Biology of the Marine Environment, University of Oldenburg, Oldenburg, Germany.
- Helmholtz Institute for Functional Marine Biodiversity, Oldenburg, Germany.
- Marine 'Omics Group, Max Planck Institute for Marine Microbiology, Bremen, Germany.
| |
Collapse
|
5
|
Fogarty EC, Schechter MS, Lolans K, Sheahan ML, Veseli I, Moore RM, Kiefl E, Moody T, Rice PA, Yu MK, Mimee M, Chang EB, Ruscheweyh HJ, Sunagawa S, Mclellan SL, Willis AD, Comstock LE, Eren AM. A cryptic plasmid is among the most numerous genetic elements in the human gut. Cell 2024; 187:1206-1222.e16. [PMID: 38428395 PMCID: PMC10973873 DOI: 10.1016/j.cell.2024.01.039] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Revised: 10/03/2023] [Accepted: 01/25/2024] [Indexed: 03/03/2024]
Abstract
Plasmids are extrachromosomal genetic elements that often encode fitness-enhancing features. However, many bacteria carry "cryptic" plasmids that do not confer clear beneficial functions. We identified one such cryptic plasmid, pBI143, which is ubiquitous across industrialized gut microbiomes and is 14 times as numerous as crAssphage, currently established as the most abundant extrachromosomal genetic element in the human gut. The majority of mutations in pBI143 accumulate in specific positions across thousands of metagenomes, indicating strong purifying selection. pBI143 is monoclonal in most individuals, likely due to the priority effect of the version first acquired, often from one's mother. pBI143 can transfer between Bacteroidales, and although it does not appear to impact bacterial host fitness in vivo, it can transiently acquire additional genetic content. We identified important practical applications of pBI143, including its use in identifying human fecal contamination and its potential as an alternative approach to track human colonic inflammatory states.
Collapse
Affiliation(s)
- Emily C Fogarty
- Committee on Microbiology, University of Chicago, Chicago, IL 60637, USA; Duchossois Family Institute, University of Chicago, Chicago, IL 60637, USA; Department of Medicine, University of Chicago, Chicago, IL 60637, USA.
| | - Matthew S Schechter
- Committee on Microbiology, University of Chicago, Chicago, IL 60637, USA; Duchossois Family Institute, University of Chicago, Chicago, IL 60637, USA; Department of Medicine, University of Chicago, Chicago, IL 60637, USA
| | - Karen Lolans
- Department of Medicine, University of Chicago, Chicago, IL 60637, USA
| | - Madeline L Sheahan
- Duchossois Family Institute, University of Chicago, Chicago, IL 60637, USA; Department of Microbiology, University of Chicago, Chicago, IL 60637, USA
| | - Iva Veseli
- Department of Medicine, University of Chicago, Chicago, IL 60637, USA; Graduate Program in Biophysical Sciences, University of Chicago, Chicago, IL 60637, USA
| | - Ryan M Moore
- Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE, USA
| | - Evan Kiefl
- Department of Medicine, University of Chicago, Chicago, IL 60637, USA; Graduate Program in Biophysical Sciences, University of Chicago, Chicago, IL 60637, USA
| | - Thomas Moody
- Department of Systems Biology, Columbia University, New York, NY 10032, USA
| | - Phoebe A Rice
- Committee on Microbiology, University of Chicago, Chicago, IL 60637, USA; Department of Biochemistry, University of Chicago, Chicago, IL 60637, USA
| | - Michael K Yu
- Toyota Technological Institute at Chicago, Chicago, IL 60637, USA
| | - Mark Mimee
- Committee on Microbiology, University of Chicago, Chicago, IL 60637, USA; Department of Microbiology, University of Chicago, Chicago, IL 60637, USA; Pritzker School of Molecular Engineering, The University of Chicago, Chicago, IL 60637, USA
| | - Eugene B Chang
- Department of Medicine, University of Chicago, Chicago, IL 60637, USA
| | - Hans-Joachim Ruscheweyh
- Department of Biology, Institute of Microbiology and Swiss Institute of Bioinformatics, ETH Zurich, Zurich 8093, Switzerland
| | - Shinichi Sunagawa
- Department of Biology, Institute of Microbiology and Swiss Institute of Bioinformatics, ETH Zurich, Zurich 8093, Switzerland
| | - Sandra L Mclellan
- School of Freshwater Sciences, University of Wisconsin-Milwaukee, Milwaukee, WI 53204, USA
| | - Amy D Willis
- Department of Biostatistics, University of Washington, Seattle, WA 98195, USA
| | - Laurie E Comstock
- Committee on Microbiology, University of Chicago, Chicago, IL 60637, USA; Duchossois Family Institute, University of Chicago, Chicago, IL 60637, USA; Department of Microbiology, University of Chicago, Chicago, IL 60637, USA.
| | - A Murat Eren
- Department of Medicine, University of Chicago, Chicago, IL 60637, USA; Marine Biological Laboratory, Woods Hole, MA 02543, USA; Alfred Wegener Institute Helmholtz Centre for Polar and Marine Research, 27570 Bremerhaven, Germany; Institute for Chemistry and Biology of the Marine Environment, University of Oldenburg, 26129 Oldenburg, Germany; Max Planck Institute for Marine Microbiology, 28359 Bremen, Germany; Helmholtz Institute for Functional Marine Biodiversity, 26129 Oldenburg, Germany.
| |
Collapse
|
6
|
Delogu F, Kunath BJ, Queirós PM, Halder R, Lebrun LA, Pope PB, May P, Widder S, Muller EEL, Wilmes P. Forecasting the dynamics of a complex microbial community using integrated meta-omics. Nat Ecol Evol 2024; 8:32-44. [PMID: 37957315 PMCID: PMC10781640 DOI: 10.1038/s41559-023-02241-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Accepted: 10/02/2023] [Indexed: 11/15/2023]
Abstract
Predicting the behaviour of complex microbial communities is challenging. However, this is essential for complex biotechnological processes such as those in biological wastewater treatment plants (BWWTPs), which require sustainable operation. Here we summarize 14 months of longitudinal meta-omics data from a BWWTP anaerobic tank into 17 temporal signals, explaining 91.1% of the temporal variance, and link those signals to ecological events within the community. We forecast the signals over the subsequent five years and use 21 extra samples collected at defined time intervals for testing and validation. Our forecasts are correct for six signals and hint on phenomena such as predation cycles. Using all the 17 forecasts and the environmental variables, we predict gene abundance and expression, with a coefficient of determination ≥0.87 for the subsequent three years. Our study demonstrates the ability to forecast the dynamics of open microbial ecosystems using interactions between community cycles and environmental parameters.
Collapse
Affiliation(s)
- Francesco Delogu
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg.
| | - Benoit J Kunath
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Pedro M Queirós
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Rashi Halder
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Laura A Lebrun
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Phillip B Pope
- Faculty of Biosciences, Norwegian University of Life Sciences, Ås, Norway
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Ås, Norway
| | - Patrick May
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Stefanie Widder
- Department of Medicine 1, Research Division Infection Biology, Medical University of Vienna, Vienna, Austria
| | - Emilie E L Muller
- Génétique Moléculaire, Génomique, Microbiologie, UMR 7156 CNRS, Université de Strasbourg, Strasbourg, France
| | - Paul Wilmes
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg.
- Department of Life Sciences and Medicine, Faculty of Science, Technology and Medicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg.
| |
Collapse
|
7
|
Sielemann J, Sielemann K, Brejová B, Vinař T, Chauve C. plASgraph2: using graph neural networks to detect plasmid contigs from an assembly graph. Front Microbiol 2023; 14:1267695. [PMID: 37869681 PMCID: PMC10587606 DOI: 10.3389/fmicb.2023.1267695] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Accepted: 09/08/2023] [Indexed: 10/24/2023] Open
Abstract
Identification of plasmids from sequencing data is an important and challenging problem related to antimicrobial resistance spread and other One-Health issues. We provide a new architecture for identifying plasmid contigs in fragmented genome assemblies built from short-read data. We employ graph neural networks (GNNs) and the assembly graph to propagate the information from nearby nodes, which leads to more accurate classification, especially for short contigs that are difficult to classify based on sequence features or database searches alone. We trained plASgraph2 on a data set of samples from the ESKAPEE group of pathogens. plASgraph2 either outperforms or performs on par with a wide range of state-of-the-art methods on testing sets of independent ESKAPEE samples and samples from related pathogens. On one hand, our study provides a new accurate and easy to use tool for contig classification in bacterial isolates; on the other hand, it serves as a proof-of-concept for the use of GNNs in genomics. Our software is available at https://github.com/cchauve/plasgraph2 and the training and testing data sets are available at https://github.com/fmfi-compbio/plasgraph2-datasets.
Collapse
Affiliation(s)
- Janik Sielemann
- Computational Biology, Faculty of Biology, Center for Biotechnology & Graduate School Digital Infrastructures for the Life Sciences (DILS), Bielefeld Institute for Bioinformatics Infrastructure, Bielefeld University, Bielefeld, Germany
| | - Katharina Sielemann
- Genetics and Genomics of Plants, Faculty of Biology, Center for Biotechnology & Graduate School Digital Infrastructures for the Life Sciences (DILS), Bielefeld Institute for Bioinformatics Infrastructure, Bielefeld University, Bielefeld, Germany
| | - Broňa Brejová
- Department of Computer Science, Faculty of Mathematics, Physics and Informatics, Comenius University in Bratislava, Bratislava, Slovakia
| | - Tomáš Vinař
- Department of Applied Informatics, Faculty of Mathematics, Physics and Informatics, Comenius University in Bratislava, Bratislava, Slovakia
| | - Cedric Chauve
- Department of Mathematics, Simon Fraser University, Burnaby, BC, Canada
| |
Collapse
|
8
|
Tang X, Shang J, Ji Y, Sun Y. PLASMe: a tool to identify PLASMid contigs from short-read assemblies using transformer. Nucleic Acids Res 2023; 51:e83. [PMID: 37427782 PMCID: PMC10450166 DOI: 10.1093/nar/gkad578] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Revised: 06/19/2023] [Accepted: 06/26/2023] [Indexed: 07/11/2023] Open
Abstract
Plasmids are mobile genetic elements that carry important accessory genes. Cataloging plasmids is a fundamental step to elucidate their roles in promoting horizontal gene transfer between bacteria. Next generation sequencing (NGS) is the main source for discovering new plasmids today. However, NGS assembly programs tend to return contigs, making plasmid detection difficult. This problem is particularly grave for metagenomic assemblies, which contain short contigs of heterogeneous origins. Available tools for plasmid contig detection still suffer from some limitations. In particular, alignment-based tools tend to miss diverged plasmids while learning-based tools often have lower precision. In this work, we develop a plasmid detection tool PLASMe that capitalizes on the strength of alignment and learning-based methods. Closely related plasmids can be easily identified using the alignment component in PLASMe while diverged plasmids can be predicted using order-specific Transformer models. By encoding plasmid sequences as a language defined on the protein cluster-based token set, Transformer can learn the importance of proteins and their correlation through positionally token embedding and the attention mechanism. We compared PLASMe and other tools on detecting complete plasmids, plasmid contigs, and contigs assembled from CAMI2 simulated data. PLASMe achieved the highest F1-score. After validating PLASMe on data with known labels, we also tested it on real metagenomic and plasmidome data. The examination of some commonly used marker genes shows that PLASMe exhibits more reliable performance than other tools.
Collapse
Affiliation(s)
- Xubo Tang
- Department of Electrical Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong SAR, China
| | - Jiayu Shang
- Department of Electrical Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong SAR, China
| | - Yongxin Ji
- Department of Electrical Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong SAR, China
| | - Yanni Sun
- Department of Electrical Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong SAR, China
| |
Collapse
|
9
|
Zhu Q, Gao S, Xiao B, He Z, Hu S. Plasmer: an Accurate and Sensitive Bacterial Plasmid Prediction Tool Based on Machine Learning of Shared k-mers and Genomic Features. Microbiol Spectr 2023; 11:e0464522. [PMID: 37191574 PMCID: PMC10269668 DOI: 10.1128/spectrum.04645-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Accepted: 04/26/2023] [Indexed: 05/17/2023] Open
Abstract
Identification of plasmids in bacterial genomes is critical for many factors, including horizontal gene transfer, antibiotic resistance genes, host-microbe interactions, cloning vectors, and industrial production. There are several in silico methods to predict plasmid sequences in assembled genomes. However, existing methods have evident shortcomings, such as unbalance in sensitivity and specificity, dependency on species-specific models, and performance reduction in sequences shorter than 10 kb, which has limited their scope of applicability. In this work, we proposed Plasmer, a novel plasmid predictor based on machine-learning of shared k-mers and genomic features. Unlike existing k-mer or genomic-feature based methods, Plasmer employs the random forest algorithm to make predictions using the percent of shared k-mers with plasmid and chromosome databases combined with other genomic features, including alignment E value and replicon distribution scores (RDS). Plasmer can predict on multiple species and has achieved an average the area under the curve (AUC) of 0.996 with accuracy of 98.4%. Compared to existing methods, tests of both sliding sequences and simulated and de novo assemblies have consistently shown that Plasmer has outperforming accuracy and stable performance across long and short contigs above 500 bp, demonstrating its applicability for fragmented assemblies. Plasmer also has excellent and balanced performance on both sensitivity and specificity (both >0.95 above 500 bp) with the highest F1-score, which has eliminated the bias on sensitivity or specificity that was common in existing methods. Plasmer also provides taxonomy classification to help identify the origin of plasmids. IMPORTANCE In this study, we proposed a novel plasmid prediction tool named Plasmer. Technically, unlike existing k-mer or genomic features-based methods, Plasmer is the first tool to combine the advantages of the percent of shared k-mers and the alignment score of genomic features. This has given Plasmer (i) evident improvement in performance compared to other methods, with the best F1-score and accuracy on sliding sequences, simulated contigs, and de novo assemblies; (ii) applicability for contigs above 500 bp with highest accuracy, enabling plasmid prediction in fragmented short-read assemblies; (iii) excellent and balanced performance between sensitivity and specificity (both >0.95 above 500 bp) with the highest F1-score, which eliminated the bias on sensitivity or specificity that commonly existed in other methods; and (iv) no dependency of species-specific training models. We believe that Plasmer provides a more reliable alternative for plasmid prediction in bacterial genome assemblies.
Collapse
Affiliation(s)
- Qianhui Zhu
- State Key Laboratory of Microbial Resources, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Shenghan Gao
- State Key Laboratory of Microbial Resources, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Binghan Xiao
- State Key Laboratory of Microbial Resources, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
- Sino-Danish College, University of Chinese Academy of Sciences, Beijing, China
| | - Zilong He
- School of Engineering Medicine, Beihang University, Beijing, China
- Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, Interdisciplinary Innovation Institute of Medicine and Engineering, Beihang University, Beijing, China
| | - Songnian Hu
- State Key Laboratory of Microbial Resources, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
- Sino-Danish College, University of Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
10
|
Cai Z, Li P, Zhu W, Wei J, Lu J, Song X, Li K, Li S, Li M. Metagenomic analysis reveals gut plasmids as diagnosis markers for colorectal cancer. Front Microbiol 2023; 14:1130446. [PMID: 37283932 PMCID: PMC10239823 DOI: 10.3389/fmicb.2023.1130446] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Accepted: 05/09/2023] [Indexed: 06/08/2023] Open
Abstract
Background Colorectal cancer (CRC) is linked to distinct gut microbiome patterns. The efficacy of gut bacteria as diagnostic biomarkers for CRC has been confirmed. Despite the potential to influence microbiome physiology and evolution, the set of plasmids in the gut microbiome remains understudied. Methods We investigated the essential features of gut plasmid using metagenomic data of 1,242 samples from eight distinct geographic cohorts. We identified 198 plasmid-related sequences that differed in abundance between CRC patients and controls and screened 21 markers for the CRC diagnosis model. We utilize these plasmid markers combined with bacteria to construct a random forest classifier model to diagnose CRC. Results The plasmid markers were able to distinguish between the CRC patients and controls [mean area under the receiver operating characteristic curve (AUC = 0.70)] and maintained accuracy in two independent cohorts. In comparison to the bacteria-only model, the performance of the composite panel created by combining plasmid and bacteria features was significantly improved in all training cohorts (mean AUCcomposite = 0.804 and mean AUCbacteria = 0.787) and maintained high accuracy in all independent cohorts (mean AUCcomposite = 0.839 and mean AUCbacteria = 0.821). In comparison to controls, we found that the bacteria-plasmid correlation strength was weaker in CRC patients. Additionally, the KEGG orthology (KO) genes in plasmids that are independent of bacteria or plasmids significantly correlated with CRC. Conclusion We identified plasmid features associated with CRC and showed how plasmid and bacterial markers could be combined to further enhance CRC diagnosis accuracy.
Collapse
Affiliation(s)
- Zhiyuan Cai
- Guangdong Provincial Key Laboratory of Biomedical Imaging and Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital of Sun Yat-sen University, Zhuhai, China
| | - Ping Li
- Guangdong Provincial Key Laboratory of Biomedical Imaging and Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital of Sun Yat-sen University, Zhuhai, China
| | - Wen Zhu
- Guangdong Provincial Key Laboratory of Biomedical Imaging and Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital of Sun Yat-sen University, Zhuhai, China
| | - Jingyue Wei
- Guangdong Provincial Key Laboratory of Biomedical Imaging and Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital of Sun Yat-sen University, Zhuhai, China
| | - Jieyu Lu
- Guangdong Provincial Key Laboratory of Biomedical Imaging and Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital of Sun Yat-sen University, Zhuhai, China
| | - Xiaoyi Song
- Guangdong Provincial Key Laboratory of Biomedical Imaging and Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital of Sun Yat-sen University, Zhuhai, China
| | - Kunwei Li
- Radiology Department, The Fifth Affiliated Hospital of Sun Yat-sen University, Zhuhai, China
| | - Sikai Li
- Guangdong Provincial Key Laboratory of Biomedical Imaging and Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital of Sun Yat-sen University, Zhuhai, China
| | - Man Li
- Guangdong Provincial Key Laboratory of Biomedical Imaging and Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital of Sun Yat-sen University, Zhuhai, China
| |
Collapse
|
11
|
Zhang Y, Wang Y, Tang M, Zhou J, Zhang T. The microbial dark matter and "wanted list" in worldwide wastewater treatment plants. MICROBIOME 2023; 11:59. [PMID: 36973807 PMCID: PMC10045942 DOI: 10.1186/s40168-023-01503-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Accepted: 02/24/2023] [Indexed: 06/18/2023]
Abstract
BACKGROUND Wastewater treatment plants (WWTPs) are one of the largest biotechnology applications in the world and are of critical importance to modern urban societies. An accurate evaluation of the microbial dark matter (MDM, microorganisms whose genomes remain uncharacterized) proportions in WWTPs is of great value, while there is no such research yet. This study conducted a global meta-analysis of MDM in WWTPs with 317,542 prokaryotic genomes from the Genome Taxonomy Database and proposed a "wanted list" for priority targets in further investigations of activated sludge. RESULTS Compared with the Earth Microbiome Project data, WWTPs had relatively lower genome-sequenced proportions of prokaryotes than other ecosystems, such as the animal related environments. Analysis showed that the median proportions of the genome-sequenced cells and taxa (100% identity and 100% coverage in 16S rRNA gene region) in WWTPs reached 56.3% and 34.5% for activated sludge, 48.6% and 28.5% for aerobic biofilm, and 48.3% and 28.5% for anaerobic digestion sludge, respectively. This result meant MDM had high proportions in WWTPs. Besides, all of the samples were occupied by a few predominant taxa, and the majority of the sequenced genomes were from pure cultures. The global-scale "wanted list" for activated sludge contained four phyla that have few representatives and 71 operational taxonomic units with the majority of them having no genome or isolate yet. Finally, several genome mining methods were verified to successfully recover genomes from activated sludge such as hybrid assembly of the second- and third-generation sequencing. CONCLUSIONS This work elucidated the proportion of MDM in WWTPs, defined the "wanted list" of activated sludge for future investigations, and certified potential genome recovery methods. The proposed methodology of this study can be applied to other ecosystems and improve understanding of ecosystem structure across diverse habitats. Video Abstract.
Collapse
Affiliation(s)
- Yulin Zhang
- Environmental Microbiome Engineering and Biotechnology Lab, Department of Civil Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong, China
| | - Yulin Wang
- Environmental Microbiome Engineering and Biotechnology Lab, Department of Civil Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong, China
| | - Mingxi Tang
- Environmental Microbiome Engineering and Biotechnology Lab, Department of Civil Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong, China
| | - Jizhong Zhou
- Institute for Environmental Genomics, Department of Microbiology and Plant Biology, and School of Civil Engineering and Environmental Sciences, University of Oklahoma, Norman, OK, USA
| | - Tong Zhang
- Environmental Microbiome Engineering and Biotechnology Lab, Department of Civil Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong, China.
- Shenzhen Bay Laboratory, Shenzhen, China.
- Peking University Shenzhen Graduate School, Shenzhen, China.
- Macau Institute for Applied Research in Medicine and Health, Macau University of Science and Technology, Macau, China.
| |
Collapse
|
12
|
Fogarty EC, Schechter MS, Lolans K, Sheahan ML, Veseli I, Moore R, Kiefl E, Moody T, Rice PA, Yu MK, Mimee M, Chang EB, Mclellan SL, Willis AD, Comstock LE, Eren AM. A highly conserved and globally prevalent cryptic plasmid is among the most numerous mobile genetic elements in the human gut. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.25.534219. [PMID: 36993556 PMCID: PMC10055365 DOI: 10.1101/2023.03.25.534219] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 04/13/2023]
Abstract
Plasmids are extrachromosomal genetic elements that often encode fitness enhancing features. However, many bacteria carry 'cryptic' plasmids that do not confer clear beneficial functions. We identified one such cryptic plasmid, pBI143, which is ubiquitous across industrialized gut microbiomes, and is 14 times as numerous as crAssphage, currently established as the most abundant genetic element in the human gut. The majority of mutations in pBI143 accumulate in specific positions across thousands of metagenomes, indicating strong purifying selection. pBI143 is monoclonal in most individuals, likely due to the priority effect of the version first acquired, often from one's mother. pBI143 can transfer between Bacteroidales and although it does not appear to impact bacterial host fitness in vivo, can transiently acquire additional genetic content. We identified important practical applications of pBI143, including its use in identifying human fecal contamination and its potential as an inexpensive alternative for detecting human colonic inflammatory states.
Collapse
Affiliation(s)
- Emily C Fogarty
- Committee on Microbiology, University of Chicago, Chicago, IL 60637, USA
- Duchossois Family Institute, University of Chicago, Chicago, IL 60637, USA
- Department of Medicine, University of Chicago, Chicago, IL 60637, USA
| | - Matthew S Schechter
- Committee on Microbiology, University of Chicago, Chicago, IL 60637, USA
- Duchossois Family Institute, University of Chicago, Chicago, IL 60637, USA
- Department of Medicine, University of Chicago, Chicago, IL 60637, USA
| | - Karen Lolans
- Department of Medicine, University of Chicago, Chicago, IL 60637, USA
| | - Madeline L. Sheahan
- Duchossois Family Institute, University of Chicago, Chicago, IL 60637, USA
- Department of Microbiology, University of Chicago, Chicago, IL, 60637, USA
| | - Iva Veseli
- Department of Medicine, University of Chicago, Chicago, IL 60637, USA
- Graduate Program in Biophysical Sciences, University of Chicago, Chicago, IL 60637, USA
| | - Ryan Moore
- Center for Bioinformatics and Computational Biology, University of Delaware, Newark, DE, USA
| | - Evan Kiefl
- Department of Medicine, University of Chicago, Chicago, IL 60637, USA
- Graduate Program in Biophysical Sciences, University of Chicago, Chicago, IL 60637, USA
| | - Thomas Moody
- Department of Systems Biology, Columbia University, New York, NY, 10032 USA
| | - Phoebe A Rice
- Committee on Microbiology, University of Chicago, Chicago, IL 60637, USA
- Department of Biochemistry, University of Chicago, Chicago, IL, 60637, USA
| | | | - Mark Mimee
- Committee on Microbiology, University of Chicago, Chicago, IL 60637, USA
- Department of Microbiology, University of Chicago, Chicago, IL, 60637, USA
- Pritzker School of Molecular Engineering, The University of Chicago, Chicago, IL 60637, USA
| | - Eugene B Chang
- Department of Medicine, University of Chicago, Chicago, IL 60637, USA
| | - Sandra L Mclellan
- School of Freshwater Sciences, University of Wisconsin-Milwaukee, Milwaukee, WI, 53204, USA
| | - Amy D Willis
- Department of Biostatistics, University of Washington, Seattle, WA, 98195, USA
| | - Laurie E Comstock
- Duchossois Family Institute, University of Chicago, Chicago, IL 60637, USA
- Department of Microbiology, University of Chicago, Chicago, IL, 60637, USA
| | - A Murat Eren
- Marine Biological Laboratory, Woods Hole, MA, 02543, USA
- Alfred Wegener Institute, Helmholtz Centre for Polar and Marine Research, 27570 Bremerhaven, Germany
- Institute for Chemistry and Biology of the Marine Environment, University of Oldenburg, 26129 Oldenburg, Germany
- Helmholtz Institute for Functional Marine Biodiversity, 26129 Oldenburg, Germany
| |
Collapse
|
13
|
Li Z, Pang B, Lu X, Kan B, Kan B. The Establishment and Application of a Kraken Classifier for Salmonella Plasmid Sequence Prediction. China CDC Wkly 2022; 4:1110-1116. [PMID: 36751662 PMCID: PMC9889229 DOI: 10.46234/ccdcw2022.225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2022] [Accepted: 12/05/2022] [Indexed: 12/13/2022] Open
Abstract
Introduction Salmonella is a key intestinal pathogen of foodborne disease, and the plasmids in Salmonella are related to many biological characteristics, including virulence and drug resistance. A large number of plasmid contigs have been sequenced in bacterial draft genomes, however, these are often difficult to distinguish from chromosomal contigs. Methods In this study, three different customized Kraken databases were used to build three different Kraken classifiers. Complete genome benchmark datasets and simulated draft genome benchmark datasets were constructed. Five-fold cross-validation was used to evaluate the performance of the three different Kraken classifiers by two benchmark datasets. Results The predictive performance of the classifier based on all National Center for Biotechnology Information plasmids and Salmonella complete genomes was optimal. This optimal Kraken classifier was performed with Salmonella isolated in China. The plasmid carrying rate of Salmonella in China is 91.01%, and it was found that the Kraken classifier could find more plasmid contigs and antibiotic resistance genes (ARGs) than results derived from a plasmid replicon-based method (PlasmidFinder). Moreover, it was found that in the strains carrying ARGs, plasmids carried more ARGs [three, 95% confidence interval (CI): 1-14] than chromosomes (one, 95% CI: 1-7). Discussion We found building a high-quality customized database as a Kraken classifier to be ideal for the prediction of Salmonella plasmid sequences from bacterial draft genomes. In the future, the Kraken classifier established in this study will play a significant role in ARG monitoring.
Collapse
Affiliation(s)
- Zhenpeng Li
- State Key Laboratory of Infectious Disease Prevention and Control; National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing Municipality, China
| | - Bo Pang
- State Key Laboratory of Infectious Disease Prevention and Control; National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing Municipality, China
| | - Xin Lu
- State Key Laboratory of Infectious Disease Prevention and Control; National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing Municipality, China,Xin Lu,
| | - Biao Kan
- State Key Laboratory of Infectious Disease Prevention and Control; National Institute for Communicable Disease Control and Prevention, Chinese Center for Disease Control and Prevention, Beijing Municipality, China,School of Public Health, Shandong University, Jinan City, China,Biao Kan,
| | | | | |
Collapse
|
14
|
Johnson G, Bataclan S, So M, Banerjee S, Wolfe AJ, Putonti C. Plasmids of the urinary microbiota. Access Microbiol 2022; 4:acmi000429. [PMID: 36644432 PMCID: PMC9833419 DOI: 10.1099/acmi.0.000429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2022] [Accepted: 09/09/2022] [Indexed: 12/03/2022] Open
Abstract
Studies of the last decade have identified a phylogenetically diverse community of bacteria within the urinary tract of individuals with and without urinary symptoms. Mobile genetic elements (MGEs), including plasmids and phages, within this niche have only recently begun to be explored. These MGEs can expand metabolic capacity and increase virulence, as well as confer antibiotic resistance. As such, they have the potential to contribute to urinary symptoms. While plasmids for some of the bacterial taxa found within the urinary microbiota (urobiome) have been well characterized, many urinary species are under-studied with few genomes sequenced to date. Using a two-pronged bioinformatic approach, we have conducted a comprehensive investigation of the plasmid content of urinary isolates representative of 102 species. The bioinformatic tools plasmidSPAdes and Recycler were used in tandem to identify plasmid sequences from raw short-read sequence data followed by manual curation. In total, we identified 603 high-confidence plasmid sequences in 20 different genera of the urobiome. In total, 70 % of these high-confidence plasmids exhibit sequence similarity to plasmid sequences from the gut. This observation is primarily driven by plasmids from E. coli , which is found in both anatomical niches. To confirm our bioinformatic predictions, long-read sequencing was performed for 23 of the E. coli isolates in addition to two E. coli strains that were sequenced as part of a prior study. Overall, 66.95 % of these predictions were confirmed highlighting the strengths and weaknesses of current bioinformatic tools. Future studies of the urobiome, especially concerning under-studied species in the urobiome, should employ long-read sequencing to expand the catalogue of plasmids for this niche.
Collapse
Affiliation(s)
| | - Seanna Bataclan
- Biology Program, Division of Natural Sciences, University of Guam, Mangilao, GU, USA
| | - Minerva So
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Swarnali Banerjee
- Department of Mathematics and Statistics, Loyola University Chicago, Chicago, IL, USA
| | - Alan J. Wolfe
- Department of Microbiology and Immunology, Loyola University Chicago, Maywood, IL, USA
| | - Catherine Putonti
- Bioinformatics Program, Loyola University Chicago, Chicago, IL, USA,Department of Microbiology and Immunology, Loyola University Chicago, Maywood, IL, USA,Department of Biology, Loyola University Chicago, Chicago, IL, USA,*Correspondence: Catherine Putonti,
| |
Collapse
|
15
|
Behra PRK, Pettersson BMF, Ramesh M, Das S, Dasgupta S, Kirsebom LA. Comparative genome analysis of mycobacteria focusing on tRNA and non-coding RNA. BMC Genomics 2022; 23:704. [PMID: 36243697 PMCID: PMC9569102 DOI: 10.1186/s12864-022-08927-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Accepted: 10/04/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The Mycobacterium genus encompasses at least 192 named species, many of which cause severe diseases such as tuberculosis. Non-tuberculosis mycobacteria (NTM) can also infect humans and animals. Some are of emerging concern because they show high resistance to commonly used antibiotics while others are used and evaluated in bioremediation or included in anticancer vaccines. RESULTS We provide the genome sequences for 114 mycobacterial type strains and together with 130 available mycobacterial genomes we generated a phylogenetic tree based on 387 core genes and supported by average nucleotide identity (ANI) data. The 244 genome sequences cover most of the species constituting the Mycobacterium genus. The genome sizes ranged from 3.2 to 8.1 Mb with an average of 5.7 Mb, and we identified 14 new plasmids. Moreover, mycobacterial genomes consisted of phage-like sequences ranging between 0 and 4.64% dependent on mycobacteria while the number of IS elements varied between 1 and 290. Our data also revealed that, depending on the mycobacteria, the number of tRNA and non-coding (nc) RNA genes differ and that their positions on the chromosome varied. We identified a conserved core set of 12 ncRNAs, 43 tRNAs and 18 aminoacyl-tRNA synthetases among mycobacteria. CONCLUSIONS Phages, IS elements, tRNA and ncRNAs appear to have contributed to the evolution of the Mycobacterium genus where several tRNA and ncRNA genes have been horizontally transferred. On the basis of our phylogenetic analysis, we identified several isolates of unnamed species as new mycobacterial species or strains of known mycobacteria. The predicted number of coding sequences correlates with genome size while the number of tRNA, rRNA and ncRNA genes does not. Together these findings expand our insight into the evolution of the Mycobacterium genus and as such they establish a platform to understand mycobacterial pathogenicity, their evolution, antibiotic resistance/tolerance as well as the function and evolution of ncRNA among mycobacteria.
Collapse
Affiliation(s)
- Phani Rama Krishna Behra
- Department of Cell and Molecular Biology, Uppsala University, Biomedical Centre, Box 596, SE-751 24 Uppsala, Sweden
| | - B. M. Fredrik Pettersson
- Department of Cell and Molecular Biology, Uppsala University, Biomedical Centre, Box 596, SE-751 24 Uppsala, Sweden
| | - Malavika Ramesh
- Department of Cell and Molecular Biology, Uppsala University, Biomedical Centre, Box 596, SE-751 24 Uppsala, Sweden
| | - Sarbashis Das
- Department of Cell and Molecular Biology, Uppsala University, Biomedical Centre, Box 596, SE-751 24 Uppsala, Sweden
| | - Santanu Dasgupta
- Department of Cell and Molecular Biology, Uppsala University, Biomedical Centre, Box 596, SE-751 24 Uppsala, Sweden
| | - Leif A. Kirsebom
- Department of Cell and Molecular Biology, Uppsala University, Biomedical Centre, Box 596, SE-751 24 Uppsala, Sweden
| |
Collapse
|
16
|
Pu L, Shamir R. 3CAC: improving the classification of phages and plasmids in metagenomic assemblies using assembly graphs. Bioinformatics 2022; 38:ii56-ii61. [PMID: 36124804 DOI: 10.1093/bioinformatics/btac468] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
MOTIVATION Bacteriophages and plasmids usually coexist with their host bacteria in microbial communities and play important roles in microbial evolution. Accurately identifying sequence contigs as phages, plasmids and bacterial chromosomes in mixed metagenomic assemblies is critical for further unraveling their functions. Many classification tools have been developed for identifying either phages or plasmids in metagenomic assemblies. However, only two classifiers, PPR-Meta and viralVerify, were proposed to simultaneously identify phages and plasmids in mixed metagenomic assemblies. Due to the very high fraction of chromosome contigs in the assemblies, both tools achieve high precision in the classification of chromosomes but perform poorly in classifying phages and plasmids. Short contigs in these assemblies are often wrongly classified or classified as uncertain. RESULTS Here we present 3CAC, a new three-class classifier that improves the precision of phage and plasmid classification. 3CAC starts with an initial three-class classification generated by existing classifiers and improves the classification of short contigs and contigs with low confidence classification by using proximity in the assembly graph. Evaluation on simulated metagenomes and on real human gut microbiome samples showed that 3CAC outperformed PPR-Meta and viralVerify in both precision and recall, and increased F1-score by 10-60 percentage points. AVAILABILITY AND IMPLEMENTATION The 3CAC software is available on https://github.com/Shamir-Lab/3CAC. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Lianrong Pu
- The Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, 69978, Israel
| | - Ron Shamir
- The Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, 69978, Israel
| |
Collapse
|
17
|
Shalon N, Relman DA, Yaffe E. Precise genotyping of circular mobile elements from metagenomic data uncovers human-associated plasmids with recent common ancestors. Genome Res 2022; 32:986-1003. [PMID: 35414589 PMCID: PMC9104695 DOI: 10.1101/gr.275894.121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Accepted: 04/01/2022] [Indexed: 11/25/2022]
Abstract
Mobile genetic elements with circular genomes play a key role in the evolution of microbial communities. Their circular genomes correspond to circular walks in metagenome graphs, and yet, assemblies derived from natural microbial communities produce graphs riddled with spurious cycles, complicating the accurate reconstruction of circular genomes. We present DomCycle, an algorithm that reconstructs likely circular genomes based on the identification of so-called 'dominant' graph cycles. In the implementation we leverage paired reads to bridge assembly gaps and scrutinize cycles through a nucleotide-level analysis, making the approach robust to misassembly artifacts. We validated the approach using simulated and real sequencing data. Application of DomCycle to 32 publicly available DNA shotgun sequence data sets from diverse natural environments led to the reconstruction of hundreds of circular mobile genomes. Clustering revealed 20 highly prevalent and cryptic plasmids that have clonal population structures with recent common ancestors. This method facilitates the study of microbial communities that evolve through horizontal gene transfer.
Collapse
|
18
|
Wickramarachchi A, Lin Y. GraphPlas: Refined Classification of Plasmid Sequences Using Assembly Graphs. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:57-67. [PMID: 34029192 DOI: 10.1109/tcbb.2021.3082915] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Plasmids are extra-chromosomal genetic materials with important markers that affect the function and behaviour of the microorganisms supporting their environmental adaptations. Hence the identification and recovery of such plasmid sequences from assemblies is a crucial task in metagenomics analysis. In the past, machine learning approaches have been developed to separate chromosomes and plasmids. However, there is always a compromise between precision and recall in the existing classification approaches. The similarity of compositions between chromosomes and their plasmids makes it difficult to separate plasmids and chromosomes with high accuracy. However, high confidence classifications are accurate with a significant compromise of recall, and vice versa. Hence, the requirement exists to have more sophisticated approaches to separate plasmids and chromosomes accurately while retaining an acceptable trade-off between precision and recall. We present GraphPlas, a novel approach for plasmid recovery using coverage, composition and assembly graph topology. We evaluated GraphPlas on simulated and real short read assemblies with varying compositions of plasmids and chromosomes. Our experiments show that GraphPlas is able to significantly improve accuracy in detecting plasmid and chromosomal contigs on top of popular state-of-the-art plasmid detection tools. The source code is freely available at: https://github.com/anuradhawick/GraphPlas.
Collapse
|
19
|
Schmartz GP, Hirsch P, Amand J, Dastbaz J, Fehlmann T, Kern F, Müller R, Keller A. OUP accepted manuscript. Nucleic Acids Res 2022; 50:W132-W137. [PMID: 35489067 PMCID: PMC9252796 DOI: 10.1093/nar/gkac298] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2022] [Revised: 04/07/2022] [Accepted: 04/14/2022] [Indexed: 11/13/2022] Open
Abstract
Despite recent methodology and reference database improvements for taxonomic profiling tools, metagenomic assembly and genomic binning remain important pillars of metagenomic analysis workflows. In case reference information is lacking, genomic binning is considered to be a state-of-the-art method in mixed culture metagenomic data analysis. In this light, our previously published tool BusyBee Web implements a composition-based binning method efficient enough to function as a rapid online utility. Handling assembled contigs and long nanopore generated reads alike, the webserver provides a wide range of supplementary annotations and visualizations. Half a decade after the initial publication, we revisited existing functionality, added comprehensive visualizations, and increased the number of data analysis customization options for further experimentation. The webserver now allows for visualization-supported differential analysis of samples, which is computationally expensive and typically only performed in coverage-based binning methods. Further, users may now optionally check their uploaded samples for plasmid sequences using PLSDB as a reference database. Lastly, a new application programming interface with a supporting python package was implemented, to allow power users fully automated access to the resource and integration into existing workflows. The webserver is freely available under: https://www.ccb.uni-saarland.de/busybee.
Collapse
Affiliation(s)
- Georges P Schmartz
- Chair for Clinical Bioinformatics, Saarland University, 66123 Saarbrücken, Germany
| | - Pascal Hirsch
- Chair for Clinical Bioinformatics, Saarland University, 66123 Saarbrücken, Germany
- Clinical Bioinformatics (CLIB), Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research, 66123 Saarbrücken, Germany
| | - Jérémy Amand
- Chair for Clinical Bioinformatics, Saarland University, 66123 Saarbrücken, Germany
- Clinical Bioinformatics (CLIB), Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research, 66123 Saarbrücken, Germany
| | - Jan Dastbaz
- Microbial Natural Products (MINS), Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research, 66123 Saarbrücken, Germany
- Deutsches Zentrum für Infektionsforschung (DZIF), Standort Hannover-Braunschweig, 38124 Braunschweig, Germany
| | - Tobias Fehlmann
- Chair for Clinical Bioinformatics, Saarland University, 66123 Saarbrücken, Germany
| | - Fabian Kern
- Chair for Clinical Bioinformatics, Saarland University, 66123 Saarbrücken, Germany
- Clinical Bioinformatics (CLIB), Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research, 66123 Saarbrücken, Germany
| | - Rolf Müller
- Microbial Natural Products (MINS), Helmholtz Institute for Pharmaceutical Research Saarland (HIPS), Helmholtz Centre for Infection Research, 66123 Saarbrücken, Germany
- Deutsches Zentrum für Infektionsforschung (DZIF), Standort Hannover-Braunschweig, 38124 Braunschweig, Germany
| | - Andreas Keller
- To whom correspondence should be addressed. Tel: +49 681 30268611; Fax: +49 681 30268610;
| |
Collapse
|
20
|
Andreopoulos WB, Geller AM, Lucke M, Balewski J, Clum A, Ivanova NN, Levy A. Deeplasmid: deep learning accurately separates plasmids from bacterial chromosomes. Nucleic Acids Res 2021; 50:e17. [PMID: 34871418 PMCID: PMC8860608 DOI: 10.1093/nar/gkab1115] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Revised: 10/11/2021] [Accepted: 10/23/2021] [Indexed: 11/13/2022] Open
Abstract
Plasmids are mobile genetic elements that play a key role in microbial ecology and evolution by mediating horizontal transfer of important genes, such as antimicrobial resistance genes. Many microbial genomes have been sequenced by short read sequencers and have resulted in a mix of contigs that derive from plasmids or chromosomes. New tools that accurately identify plasmids are needed to elucidate new plasmid-borne genes of high biological importance. We have developed Deeplasmid, a deep learning tool for distinguishing plasmids from bacterial chromosomes based on the DNA sequence and its encoded biological data. It requires as input only assembled sequences generated by any sequencing platform and assembly algorithm and its runtime scales linearly with the number of assembled sequences. Deeplasmid achieves an AUC–ROC of over 89%, and it was more accurate than five other plasmid classification methods. Finally, as a proof of concept, we used Deeplasmid to predict new plasmids in the fish pathogen Yersinia ruckeri ATCC 29473 that has no annotated plasmids. Deeplasmid predicted with high reliability that a long assembled contig is part of a plasmid. Using long read sequencing we indeed validated the existence of a 102 kb long plasmid, demonstrating Deeplasmid's ability to detect novel plasmids.
Collapse
Affiliation(s)
- William B Andreopoulos
- Joint Genome Institute, US Department of Energy, LBNL Berkeley, CA, USA.,Department of Computer Science, San Jose State University, CA, USA
| | - Alexander M Geller
- Department of Plant Pathology and Microbiology, The Institute of Environmental Science, The Robert H. Smith Faculty of Agriculture, Food, and Environment, The Hebrew University of Jerusalem, Rehovot, Israel
| | - Miriam Lucke
- Department of Plant Pathology and Microbiology, The Institute of Environmental Science, The Robert H. Smith Faculty of Agriculture, Food, and Environment, The Hebrew University of Jerusalem, Rehovot, Israel
| | - Jan Balewski
- National Energy Research Scientific Computing Center (NERSC), Berkeley, CA, USA
| | - Alicia Clum
- Joint Genome Institute, US Department of Energy, LBNL Berkeley, CA, USA
| | - Natalia N Ivanova
- Joint Genome Institute, US Department of Energy, LBNL Berkeley, CA, USA
| | - Asaf Levy
- Department of Plant Pathology and Microbiology, The Institute of Environmental Science, The Robert H. Smith Faculty of Agriculture, Food, and Environment, The Hebrew University of Jerusalem, Rehovot, Israel
| |
Collapse
|
21
|
van der Graaf-van Bloois L, Wagenaar JA, Zomer AL. RFPlasmid: predicting plasmid sequences from short-read assembly data using machine learning. Microb Genom 2021; 7. [PMID: 34846288 PMCID: PMC8743549 DOI: 10.1099/mgen.0.000683] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Antimicrobial-resistance (AMR) genes in bacteria are often carried on plasmids and these plasmids can transfer AMR genes between bacteria. For molecular epidemiology purposes and risk assessment, it is important to know whether the genes are located on highly transferable plasmids or in the more stable chromosomes. However, draft whole-genome sequences are fragmented, making it difficult to discriminate plasmid and chromosomal contigs. Current methods that predict plasmid sequences from draft genome sequences rely on single features, like k-mer composition, circularity of the DNA molecule, copy number or sequence identity to plasmid replication genes, all of which have their drawbacks, especially when faced with large single-copy plasmids, which often carry resistance genes. With our newly developed prediction tool RFPlasmid, we use a combination of multiple features, including k-mer composition and databases with plasmid and chromosomal marker proteins, to predict whether the likely source of a contig is plasmid or chromosomal. The tool RFPlasmid supports models for 17 different bacterial taxa, including Campylobacter, Escherichia coli and Salmonella, and has a taxon agnostic model for metagenomic assemblies or unsupported organisms. RFPlasmid is available both as a standalone tool and via a web interface.
Collapse
Affiliation(s)
- Linda van der Graaf-van Bloois
- Faculty of Veterinary Medicine, Department of Infectious Diseases and Immunology, Utrecht University, Utrecht, The Netherlands.,WHO Collaborating Centre for Reference and Research on Campylobacter and Antimicrobial Resistance from an One Health Perspective/OIE Reference Laboratory for Campylobacteriosis, Utrecht, The Netherlands
| | - Jaap A Wagenaar
- Faculty of Veterinary Medicine, Department of Infectious Diseases and Immunology, Utrecht University, Utrecht, The Netherlands.,WHO Collaborating Centre for Reference and Research on Campylobacter and Antimicrobial Resistance from an One Health Perspective/OIE Reference Laboratory for Campylobacteriosis, Utrecht, The Netherlands.,Wageningen Bioveterinary Research, Lelystad, The Netherlands
| | - Aldert L Zomer
- Faculty of Veterinary Medicine, Department of Infectious Diseases and Immunology, Utrecht University, Utrecht, The Netherlands.,WHO Collaborating Centre for Reference and Research on Campylobacter and Antimicrobial Resistance from an One Health Perspective/OIE Reference Laboratory for Campylobacteriosis, Utrecht, The Netherlands
| |
Collapse
|
22
|
Uddin TM, Chakraborty AJ, Khusro A, Zidan BRM, Mitra S, Emran TB, Dhama K, Ripon MKH, Gajdács M, Sahibzada MUK, Hossain MJ, Koirala N. Antibiotic resistance in microbes: History, mechanisms, therapeutic strategies and future prospects. J Infect Public Health 2021; 14:1750-1766. [PMID: 34756812 DOI: 10.1016/j.jiph.2021.10.020] [Citation(s) in RCA: 425] [Impact Index Per Article: 106.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2021] [Revised: 10/04/2021] [Accepted: 10/14/2021] [Indexed: 12/22/2022] Open
Abstract
Antibiotics have been used to cure bacterial infections for more than 70 years, and these low-molecular-weight bioactive agents have also been used for a variety of other medicinal applications. In the battle against microbes, antibiotics have certainly been a blessing to human civilization by saving millions of lives. Globally, infections caused by multidrug-resistant (MDR) bacteria are on the rise. Antibiotics are being used to combat diversified bacterial infections. Synthetic biology techniques, in combination with molecular, functional genomic, and metagenomic studies of bacteria, plants, and even marine invertebrates are aimed at unlocking the world's natural products faster than previous methods of antibiotic discovery. There are currently only few viable remedies, potential preventive techniques, and a limited number of antibiotics, thereby necessitating the discovery of innovative medicinal approaches and antimicrobial therapies. MDR is also facilitated by biofilms, which makes infection control more complex. In this review, we have spotlighted comprehensively various aspects of antibiotics viz. overview of antibiotics era, mode of actions of antibiotics, development and mechanisms of antibiotic resistance in bacteria, and future strategies to fight the emerging antimicrobial resistant threat.
Collapse
Affiliation(s)
- Tanvir Mahtab Uddin
- Department of Pharmacy, Faculty of Pharmacy, University of Dhaka, Dhaka 1000, Bangladesh.
| | - Arka Jyoti Chakraborty
- Department of Pharmacy, Faculty of Pharmacy, University of Dhaka, Dhaka 1000, Bangladesh.
| | - Ameer Khusro
- Research Department of Plant Biology and Biotechnology, Loyola College, Nungambakkam, Chennai, Tamil Nadu, India.
| | - Bm Redwan Matin Zidan
- Department of Pharmacy, Faculty of Pharmacy, University of Dhaka, Dhaka 1000, Bangladesh.
| | - Saikat Mitra
- Department of Pharmacy, Faculty of Pharmacy, University of Dhaka, Dhaka 1000, Bangladesh.
| | - Talha Bin Emran
- Department of Pharmacy, BGC Trust University Bangladesh, Chittagong 4381, Bangladesh.
| | - Kuldeep Dhama
- Division of Pathology, ICAR-Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, Uttar Pradesh, India.
| | - Md Kamal Hossain Ripon
- Department of Pharmacy, Mawlana Bhashani Science and Technology University, Santosh, Tangail 1902, Bangladesh.
| | - Márió Gajdács
- Department of Oral Biology and Experimental Dental Research, Faculty of Dentistry, University of Szeged, 6720 Szeged, Hungary.
| | | | - Md Jamal Hossain
- Department of Pharmacy, State University of Bangladesh, 77 Satmasjid Road, Dhanmondi, Dhaka 1205, Bangladesh.
| | - Niranjan Koirala
- Department of Natural Products Research, Dr. Koirala Research Institute for Biotechnology and Biodiversity, Kathmandu 44600, Nepal.
| |
Collapse
|
23
|
Cafiero JH, Martini MC, Lozano MJ, Vacca C, Lagares A, Tomatis PE, Del Papa MF. BioF is a novel B2 metallo-β-lactamase from Pseudomonas sp. isolated from an on-farm biopurification system. Environ Microbiol 2021; 24:1247-1262. [PMID: 34725905 DOI: 10.1111/1462-2920.15822] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2021] [Revised: 09/13/2021] [Accepted: 10/13/2021] [Indexed: 11/30/2022]
Abstract
Antimicrobial resistance represents a major global health concern and environmental bacteria are considered a source of resistance genes. Carbapenems are often used as the last antibiotic option to treat multidrug-resistant bacteria. Metallo-β-lactamases (MBLs) are able to render resistance to almost all β-lactam antibiotics, including carbapenems. Unfortunately, there are no inhibitors against MBLs for clinical use. Subclass B2 MBLs are the only enzymes working as strict carbapenemases, under-represented, encoded in chromosome genes and only functional as mono-zinc enzymes. Despite current efforts in MBLs inhibitor development, B2 carbapenemase activity is especially difficult to suppress, even in vitro. In this study we characterized BioF, a novel subclass B2 MBL identified in a new environmental Pseudomonas sp. strain isolated from an on-farm biopurification system (BPS). Although blaBioF is most likely a chromosomal gene, it is found in a genomic island and may represent a step previous to the horizontal transmission of B2 genes. The new B2 MBL is active as a mono-zinc enzyme and is a potent carbapenemase with incipient activity against some cephalosporins. BioF activity is not affected by excess zinc and is only inhibited at high metal chelator concentrations. The discovery and characterization of B2 MBL BioF as a potent carbapenemase in a BPS bacterial isolate emphasizes the importance of exploring antibiotic resistances existing in the environmental microbiota under the influence of human activities before they could emerge clinically.
Collapse
Affiliation(s)
- Juan Hilario Cafiero
- Departamento de Ciencias Biológicas, Facultad de Ciencias Exactas, Instituto de Biotecnología y Biología Molecular (IBBM, CCT-CONICET-La Plata), Universidad Nacional de La Plata, Calle 115 entre 49 y 50, La Plata, 1900, Argentina
| | - María Carla Martini
- Departamento de Ciencias Biológicas, Facultad de Ciencias Exactas, Instituto de Biotecnología y Biología Molecular (IBBM, CCT-CONICET-La Plata), Universidad Nacional de La Plata, Calle 115 entre 49 y 50, La Plata, 1900, Argentina
| | - Mauricio Javier Lozano
- Departamento de Ciencias Biológicas, Facultad de Ciencias Exactas, Instituto de Biotecnología y Biología Molecular (IBBM, CCT-CONICET-La Plata), Universidad Nacional de La Plata, Calle 115 entre 49 y 50, La Plata, 1900, Argentina
| | - Carolina Vacca
- Departamento de Ciencias Biológicas, Facultad de Ciencias Exactas, Instituto de Biotecnología y Biología Molecular (IBBM, CCT-CONICET-La Plata), Universidad Nacional de La Plata, Calle 115 entre 49 y 50, La Plata, 1900, Argentina
| | - Antonio Lagares
- Departamento de Ciencias Biológicas, Facultad de Ciencias Exactas, Instituto de Biotecnología y Biología Molecular (IBBM, CCT-CONICET-La Plata), Universidad Nacional de La Plata, Calle 115 entre 49 y 50, La Plata, 1900, Argentina
| | - Pablo Emiliano Tomatis
- Instituto de Biología Molecular y Celular de Rosario (IBR, CONICET-UNR), Ocampo y Esmeralda, Rosario, 2000, Argentina.,Área Biofísica, Facultad de Ciencias Bioquímicas y Farmacéuticas, Universidad Nacional de Rosario, Suipacha 531, Rosario, 2000, Argentina
| | - María Florencia Del Papa
- Departamento de Ciencias Biológicas, Facultad de Ciencias Exactas, Instituto de Biotecnología y Biología Molecular (IBBM, CCT-CONICET-La Plata), Universidad Nacional de La Plata, Calle 115 entre 49 y 50, La Plata, 1900, Argentina
| |
Collapse
|
24
|
Pradier L, Tissot T, Fiston-Lavier AS, Bedhomme S. PlasForest: a homology-based random forest classifier for plasmid detection in genomic datasets. BMC Bioinformatics 2021; 22:349. [PMID: 34174810 PMCID: PMC8236179 DOI: 10.1186/s12859-021-04270-w] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2020] [Accepted: 06/14/2021] [Indexed: 11/10/2022] Open
Abstract
Background Plasmids are mobile genetic elements that often carry accessory genes, and are vectors for horizontal transfer between bacterial genomes. Plasmid detection in large genomic datasets is crucial to analyze their spread and quantify their role in bacteria adaptation and particularly in antibiotic resistance propagation. Bioinformatics methods have been developed to detect plasmids. However, they suffer from low sensitivity (i.e., most plasmids remain undetected) or low precision (i.e., these methods identify chromosomes as plasmids), and are overall not adapted to identify plasmids in whole genomes that are not fully assembled (contigs and scaffolds). Results We developed PlasForest, a homology-based random forest classifier identifying bacterial plasmid sequences in partially assembled genomes. Without knowing the taxonomical origin of the samples, PlasForest identifies contigs as plasmids or chromosomes with a F1 score of 0.950. Notably, it can detect 77.4% of plasmid contigs below 1 kb with 2.8% of false positives and 99.9% of plasmid contigs over 50 kb with 2.2% of false positives. Conclusions PlasForest outperforms other currently available tools on genomic datasets by being both sensitive and precise. The performance of PlasForest on metagenomic assemblies are currently well below those of other k-mer-based methods, and we discuss how homology-based approaches could improve plasmid detection in such datasets. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-021-04270-w.
Collapse
Affiliation(s)
- Léa Pradier
- Centre d'Ecologie Fonctionnelle et Evolutive, CNRS, Université de Montpellier, Université Paul Valéry Montpellier 3, Ecole Pratique des Hautes Etudes, Institut de Recherche Pour le Développement, 34000, Montpellier, France.
| | - Tazzio Tissot
- Genomics, Bioinformatics and Evolution. Departament de Genètica i Microbiologia, Universitat Autònoma de Barcelona, 08193, Cerdanyola del Vallès, Spain.,Centre de Recerca Matemàtica, 08193, Cerdanyola del Vallès, Spain
| | - Anna-Sophie Fiston-Lavier
- Institut des Sciences de l'Evolution de Montpellier (ISE-M), Equipe Evolution, Vecteurs, Adaptation et Symbiose, UMR 5554, CNRS-Université Montpellier, 34090, Montpellier Cedex 05, France
| | - Stéphanie Bedhomme
- Centre d'Ecologie Fonctionnelle et Evolutive, CNRS, Université de Montpellier, Université Paul Valéry Montpellier 3, Ecole Pratique des Hautes Etudes, Institut de Recherche Pour le Développement, 34000, Montpellier, France.
| |
Collapse
|
25
|
Pellow D, Zorea A, Probst M, Furman O, Segal A, Mizrahi I, Shamir R. SCAPP: an algorithm for improved plasmid assembly in metagenomes. MICROBIOME 2021; 9:144. [PMID: 34172093 PMCID: PMC8228940 DOI: 10.1186/s40168-021-01068-z] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Accepted: 04/01/2021] [Indexed: 05/28/2023]
Abstract
BACKGROUND Metagenomic sequencing has led to the identification and assembly of many new bacterial genome sequences. These bacteria often contain plasmids: usually small, circular double-stranded DNA molecules that may transfer across bacterial species and confer antibiotic resistance. These plasmids are generally less studied and understood than their bacterial hosts. Part of the reason for this is insufficient computational tools enabling the analysis of plasmids in metagenomic samples. RESULTS We developed SCAPP (Sequence Contents-Aware Plasmid Peeler)-an algorithm and tool to assemble plasmid sequences from metagenomic sequencing. SCAPP builds on some key ideas from the Recycler algorithm while improving plasmid assemblies by integrating biological knowledge about plasmids. We compared the performance of SCAPP to Recycler and metaplasmidSPAdes on simulated metagenomes, real human gut microbiome samples, and a human gut plasmidome dataset that we generated. We also created plasmidome and metagenome data from the same cow rumen sample and used the parallel sequencing data to create a novel assessment procedure. Overall, SCAPP outperformed Recycler and metaplasmidSPAdes across this wide range of datasets. CONCLUSIONS SCAPP is an easy to use Python package that enables the assembly of full plasmid sequences from metagenomic samples. It outperformed existing metagenomic plasmid assemblers in most cases and assembled novel and clinically relevant plasmids in samples we generated such as a human gut plasmidome. SCAPP is open-source software available from: https://github.com/Shamir-Lab/SCAPP . Video abstract.
Collapse
Affiliation(s)
- David Pellow
- Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, 6997801 Israel
| | - Alvah Zorea
- Department of Life Sciences, Ben-Gurion University of the Negev and the National Institute for Biotechnology in the Negev, Beer-Sheva, 8410501 Israel
| | - Maraike Probst
- Institute of Microbiology, University of Innsbruck, Innsbruck, A-6020 Austria
| | - Ori Furman
- Department of Life Sciences, Ben-Gurion University of the Negev and the National Institute for Biotechnology in the Negev, Beer-Sheva, 8410501 Israel
| | - Arik Segal
- Health Sciences, Ben-Gurion University of the Negev, Beer-Sheva, 8410501 Israel
- Soroka University Medical Center, Beer-Sheva, 8410501 Israel
| | - Itzhak Mizrahi
- Department of Life Sciences, Ben-Gurion University of the Negev and the National Institute for Biotechnology in the Negev, Beer-Sheva, 8410501 Israel
| | - Ron Shamir
- Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, 6997801 Israel
| |
Collapse
|
26
|
Geller AM, Pollin I, Zlotkin D, Danov A, Nachmias N, Andreopoulos WB, Shemesh K, Levy A. The extracellular contractile injection system is enriched in environmental microbes and associates with numerous toxins. Nat Commun 2021; 12:3743. [PMID: 34145238 PMCID: PMC8213781 DOI: 10.1038/s41467-021-23777-7] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2020] [Accepted: 05/14/2021] [Indexed: 12/26/2022] Open
Abstract
The extracellular Contractile Injection System (eCIS) is a toxin-delivery particle that evolved from a bacteriophage tail. Four eCISs have previously been shown to mediate interactions between bacteria and their invertebrate hosts. Here, we identify eCIS loci in 1,249 bacterial and archaeal genomes and reveal an enrichment of these loci in environmental microbes and their apparent absence from mammalian pathogens. We show that 13 eCIS-associated toxin genes from diverse microbes can inhibit the growth of bacteria and/or yeast. We identify immunity genes that protect bacteria from self-intoxication, further supporting an antibacterial role for some eCISs. We also identify previously undescribed eCIS core genes, including a conserved eCIS transcriptional regulator. Finally, we present our data through an extensive eCIS repository, termed eCIStem. Our findings support eCIS as a toxin-delivery system that is widespread among environmental prokaryotes and likely mediates antagonistic interactions with eukaryotes and other prokaryotes.
Collapse
Affiliation(s)
- Alexander Martin Geller
- Department of Plant Pathology and Microbiology, the Robert H. Smith Faculty of Food and Environment, the Hebrew University of Jerusalem, Rehovot, Israel
| | - Inbal Pollin
- Department of Plant Pathology and Microbiology, the Robert H. Smith Faculty of Food and Environment, the Hebrew University of Jerusalem, Rehovot, Israel
| | - David Zlotkin
- Department of Plant Pathology and Microbiology, the Robert H. Smith Faculty of Food and Environment, the Hebrew University of Jerusalem, Rehovot, Israel
| | - Aleks Danov
- Department of Plant Pathology and Microbiology, the Robert H. Smith Faculty of Food and Environment, the Hebrew University of Jerusalem, Rehovot, Israel
| | - Nimrod Nachmias
- Department of Plant Pathology and Microbiology, the Robert H. Smith Faculty of Food and Environment, the Hebrew University of Jerusalem, Rehovot, Israel
| | | | - Keren Shemesh
- Department of Plant Pathology and Microbiology, the Robert H. Smith Faculty of Food and Environment, the Hebrew University of Jerusalem, Rehovot, Israel
| | - Asaf Levy
- Department of Plant Pathology and Microbiology, the Robert H. Smith Faculty of Food and Environment, the Hebrew University of Jerusalem, Rehovot, Israel.
| |
Collapse
|
27
|
Alanin KWS, Jørgensen TS, Browne PD, Petersen B, Riber L, Kot W, Hansen LH. An improved direct metamobilome approach increases the detection of larger-sized circular elements across kingdoms. Plasmid 2021; 115:102576. [PMID: 33872684 DOI: 10.1016/j.plasmid.2021.102576] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Revised: 04/12/2021] [Accepted: 04/12/2021] [Indexed: 10/21/2022]
Abstract
Mobile genetic elements (MGEs) are instrumental in natural prokaryotic genome editing, permitting genome plasticity and allowing microbes to accumulate genetic diversity. MGEs serve as a vast communal gene pool and include DNA elements such as plasmids and bacteriophages (phages) among others. These mobile DNA elements represent a human health risk as they can introduce new traits, such as antibiotic resistance or virulence, to a bacterial strain. Sequencing libraries targeting environmental circular MGEs, referred to as metamobilomes, may broaden our current understanding of the mechanisms behind the mobility, prevalence and content of these elements. However, metamobilomics is affected by a severe bias towards small circular elements, introduced by multiple displacement amplification (MDA). MDA is typically used to overcome limiting DNA quantities after the removal of non-circular DNA during library preparations. By examining the relationship between sequencing coverage and the size of circular MGEs in paired metamobilome datasets with and without MDA, we show that larger circular elements are lost when using MDA. This study is the first to systematically demonstrate that MDA is detrimental to detecting larger-sized plasmids if small plasmids are present. It is also the first to show that MDA can be omitted when using enzyme-based DNA fragmentation and PCR in library preparation kits such as Nextera XT® from Illumina.
Collapse
Affiliation(s)
- Katrine Wacenius Skov Alanin
- Department of Environmental Science, Aarhus University, Roskilde, Denmark; Department of Plant and Environmental Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Tue Sparholt Jørgensen
- Department of Environmental Science, Aarhus University, Roskilde, Denmark; Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Lyngby, Denmark; Department of Science and Environment, Roskilde University, Denmark
| | - Patrick Denis Browne
- Department of Environmental Science, Aarhus University, Roskilde, Denmark; Department of Plant and Environmental Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Bent Petersen
- Globe Institute, Faculty of Health and Biomedical Sciences, University of Copenhagen, Copenhagen, Denmark; Centre of Excellence for Omics-Driven Computational Biodiscovery (COMBio), Faculty of Applied Sciences, AIMST University, Kedah, Malaysia
| | - Leise Riber
- Department of Biology, Functional Genomics, University of Copenhagen, Copenhagen, Denmark
| | - Witold Kot
- Department of Environmental Science, Aarhus University, Roskilde, Denmark; Department of Plant and Environmental Sciences, University of Copenhagen, Copenhagen, Denmark.
| | - Lars Hestbjerg Hansen
- Department of Environmental Science, Aarhus University, Roskilde, Denmark; Department of Plant and Environmental Sciences, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
28
|
Gomi R, Wyres KL, Holt KE. Detection of plasmid contigs in draft genome assemblies using customized Kraken databases. Microb Genom 2021; 7:000550. [PMID: 33826492 PMCID: PMC8208688 DOI: 10.1099/mgen.0.000550] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Accepted: 03/01/2021] [Indexed: 11/22/2022] Open
Abstract
Plasmids play an important role in bacterial evolution and mediate horizontal transfer of genes including virulence and antimicrobial resistance genes. Although short-read sequencing technologies have enabled large-scale bacterial genomics, the resulting draft genome assemblies are often fragmented into hundreds of discrete contigs. Several tools and approaches have been developed to identify plasmid sequences in such assemblies, but require trade-off between sensitivity and specificity. Here we propose using the Kraken classifier, together with a custom Kraken database comprising known chromosomal and plasmid sequences of Klebsiella pneumoniae species complex (KpSC), to identify plasmid-derived contigs in draft assemblies. We assessed performance using Illumina-based draft genome assemblies for 82 KpSC isolates, for which complete genomes were available to supply ground truth. When benchmarked against five other classifiers (Centrifuge, RFPlasmid, mlplasmids, PlaScope and Platon), Kraken showed balanced performance in terms of overall sensitivity and specificity (90.8 and 99.4 %, respectively, for contig count; 96.5 and >99.9 %, respectively, for cumulative contig length), and the highest accuracy (96.8% vs 91.8-96.6% for contig count; 99.8% vs 99.0-99.7 % for cumulative contig length), and F1-score (94.5 % vs 84.5-94.1 %, for contig count; 98.0 % vs 88.9-96.7 % for cumulative contig length). Kraken also achieved consistent performance across our genome collection. Furthermore, we demonstrate that expanding the Kraken database with additional known chromosomal and plasmid sequences can further improve classification performance. Although we have focused here on the KpSC, this methodology could easily be applied to other species with a sufficient number of completed genomes.
Collapse
Affiliation(s)
- Ryota Gomi
- Department of Infectious Diseases, Central Clinical School, Monash University, Melbourne, Victoria 3004, Australia
- Department of Environmental Engineering, Graduate School of Engineering, Kyoto University, Kyoto, Japan
| | - Kelly L. Wyres
- Department of Infectious Diseases, Central Clinical School, Monash University, Melbourne, Victoria 3004, Australia
| | - Kathryn E. Holt
- Department of Infectious Diseases, Central Clinical School, Monash University, Melbourne, Victoria 3004, Australia
- London School of Hygiene & Tropical Medicine, London WC1E 7HT, UK
| |
Collapse
|
29
|
Lapidus AL, Korobeynikov AI. Metagenomic Data Assembly - The Way of Decoding Unknown Microorganisms. Front Microbiol 2021; 12:613791. [PMID: 33833738 PMCID: PMC8021871 DOI: 10.3389/fmicb.2021.613791] [Citation(s) in RCA: 56] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2020] [Accepted: 03/03/2021] [Indexed: 01/08/2023] Open
Abstract
Metagenomics is a segment of conventional microbial genomics dedicated to the sequencing and analysis of combined genomic DNA of entire environmental samples. The most critical step of the metagenomic data analysis is the reconstruction of individual genes and genomes of the microorganisms in the communities using metagenomic assemblers - computational programs that put together small fragments of sequenced DNA generated by sequencing instruments. Here, we describe the challenges of metagenomic assembly, a wide spectrum of applications in which metagenomic assemblies were used to better understand the ecology and evolution of microbial ecosystems, and present one of the most efficient microbial assemblers, SPAdes that was upgraded to become applicable for metagenomics.
Collapse
Affiliation(s)
- Alla L. Lapidus
- Center for Algorithmic Biotechnology, St. Petersburg State University, Saint Petersburg, Russia
| | | |
Collapse
|
30
|
Chen S, He C, Li Y, Li Z, Melançon CE. A computational toolset for rapid identification of SARS-CoV-2, other viruses and microorganisms from sequencing data. Brief Bioinform 2021; 22:924-935. [PMID: 33003197 PMCID: PMC7543257 DOI: 10.1093/bib/bbaa231] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2020] [Revised: 08/03/2020] [Accepted: 08/26/2020] [Indexed: 12/17/2022] Open
Abstract
In this paper, we present a toolset and related resources for rapid identification of viruses and microorganisms from short-read or long-read sequencing data. We present fastv as an ultra-fast tool to detect microbial sequences present in sequencing data, identify target microorganisms and visualize coverage of microbial genomes. This tool is based on the k-mer mapping and extension method. K-mer sets are generated by UniqueKMER, another tool provided in this toolset. UniqueKMER can generate complete sets of unique k-mers for each genome within a large set of viral or microbial genomes. For convenience, unique k-mers for microorganisms and common viruses that afflict humans have been generated and are provided with the tools. As a lightweight tool, fastv accepts FASTQ data as input and directly outputs the results in both HTML and JSON formats. Prior to the k-mer analysis, fastv automatically performs adapter trimming, quality pruning, base correction and other preprocessing to ensure the accuracy of k-mer analysis. Specifically, fastv provides built-in support for rapid severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) identification and typing. Experimental results showed that fastv achieved 100% sensitivity and 100% specificity for detecting SARS-CoV-2 from sequencing data; and can distinguish SARS-CoV-2 from SARS, Middle East respiratory syndrome and other coronaviruses. This toolset is available at: https://github.com/OpenGene/fastv.
Collapse
Affiliation(s)
- Shifu Chen
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences. He also serves as chief technology officer of HaploX Biotechnology. He is the initiator of OpenGene projects and a contributor to many open source tools
| | - Changshou He
- department of bioinformatics, HaploX Biotechnology
| | - Yingqiang Li
- department of bioinformatics, HaploX Biotechnology
| | - Zhicheng Li
- Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences. His research interests lie mainly in imaging genomics
| | - Charles E Melançon
- department of research and development, HaploX Biotechnology. His research interests lie mainly in next-generation sequencing and bioinformatics
| |
Collapse
|
31
|
Blake KS, Choi J, Dantas G. Approaches for characterizing and tracking hospital-associated multidrug-resistant bacteria. Cell Mol Life Sci 2021; 78:2585-2606. [PMID: 33582841 PMCID: PMC8005480 DOI: 10.1007/s00018-020-03717-2] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2020] [Revised: 10/26/2020] [Accepted: 11/17/2020] [Indexed: 12/24/2022]
Abstract
Hospital-associated infections are a major concern for global public health. Infections with antibiotic-resistant pathogens can cause empiric treatment failure, and for infections with multidrug-resistant bacteria which can overcome antibiotics of "last resort" there exists no alternative treatments. Despite extensive sanitization protocols, the hospital environment is a potent reservoir and vector of antibiotic-resistant organisms. Pathogens can persist on hospital surfaces and plumbing for months to years, acquire new antibiotic resistance genes by horizontal gene transfer, and initiate outbreaks of hospital-associated infections by spreading to patients via healthcare workers and visitors. Advancements in next-generation sequencing of bacterial genomes and metagenomes have expanded our ability to (1) identify species and track distinct strains, (2) comprehensively profile antibiotic resistance genes, and (3) resolve the mobile elements that facilitate intra- and intercellular gene transfer. This information can, in turn, be used to characterize the population dynamics of hospital-associated microbiota, track outbreaks to their environmental reservoirs, and inform future interventions. This review provides a detailed overview of the approaches and bioinformatic tools available to study isolates and metagenomes of hospital-associated bacteria, and their multi-layered networks of transmission.
Collapse
Affiliation(s)
- Kevin S Blake
- The Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine, St. Louis, MO, 63110, USA.
| | - JooHee Choi
- The Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine, St. Louis, MO, 63110, USA
| | - Gautam Dantas
- The Edison Family Center for Genome Sciences & Systems Biology, Washington University School of Medicine, St. Louis, MO, 63110, USA.
- Department of Molecular Microbiology, Washington University School of Medicine, St. Louis, MO, 63110, USA.
- Department of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO, 63110, USA.
- Department of Biomedical Engineering, Washington University in St. Louis, St. Louis, MO, 63130, USA.
| |
Collapse
|
32
|
Fang Z, Zhou H. VirionFinder: Identification of Complete and Partial Prokaryote Virus Virion Protein From Virome Data Using the Sequence and Biochemical Properties of Amino Acids. Front Microbiol 2021; 12:615711. [PMID: 33613485 PMCID: PMC7894196 DOI: 10.3389/fmicb.2021.615711] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Accepted: 01/04/2021] [Indexed: 01/22/2023] Open
Abstract
Viruses are some of the most abundant biological entities on Earth, and prokaryote virus are the dominant members of the viral community. Because of the diversity of prokaryote virus, functional annotation cannot be performed on a large number of genes from newly discovered prokaryote virus by searching the current database; therefore, the development of an alignment-free algorithm for functional annotation of prokaryote virus proteins is important to understand the viral community. The identification of prokaryote virus proteins (PVVPs) is a critical step for many viral analyses, such as species classification, phylogenetic analysis and the exploration of how prokaryote virus interact with their hosts. Although a series of PVVP prediction tools have been developed, the performance of these tools is still not satisfactory. Moreover, viral metagenomic data contains fragmented sequences, leading to the existence of some incomplete genes. Therefore, a tool that can identify partial prokaryote virus proteins is also needed. In this work, we present a novel algorithm, called VirionFinder, to identify the complete and partial PVVPs from non-prokaryote virus virion proteins (non-PVVPs). VirionFinder uses the sequence and biochemical properties of 20 amino acids as the mathematical model to encode the protein sequences and uses a deep learning technique to identify whether a given protein is a PVVP. Compared with the state-of-the-art tools using artificial benchmark datasets, the results show that under the same specificity (Sp), the sensitivity (Sn) of VirionFinder is approximately 10-34% much higher than the Sn of these tools on both complete and partial proteins. When evaluating related tools using real virome data, the recognition rate of PVVP-like sequences of VirionFinder is also much higher than that of the other tools. We expect that VirionFinder will be a powerful tool for identifying novel virion proteins from both complete prokaryote virus genomes and viral metagenomic data. VirionFinder is freely available at https://github.com/zhenchengfang/VirionFinder.
Collapse
Affiliation(s)
- Zhencheng Fang
- Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital, Southern Medical University, Guangzhou, China
- Center for Quantitative Biology, Peking University, Beijing, China
| | - Hongwei Zhou
- Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital, Southern Medical University, Guangzhou, China
- State Key Laboratory of Organ Failure Research, Southern Medical University, Guangzhou, China
| |
Collapse
|
33
|
Martínez Arbas S, Narayanasamy S, Herold M, Lebrun LA, Hoopmann MR, Li S, Lam TJ, Kunath BJ, Hicks ND, Liu CM, Price LB, Laczny CC, Gillece JD, Schupp JM, Keim PS, Moritz RL, Faust K, Tang H, Ye Y, Skupin A, May P, Muller EEL, Wilmes P. Roles of bacteriophages, plasmids and CRISPR immunity in microbial community dynamics revealed using time-series integrated meta-omics. Nat Microbiol 2021; 6:123-135. [PMID: 33139880 PMCID: PMC7752763 DOI: 10.1038/s41564-020-00794-8] [Citation(s) in RCA: 47] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2019] [Accepted: 09/11/2020] [Indexed: 02/07/2023]
Abstract
Viruses and plasmids (invasive mobile genetic elements (iMGEs)) have important roles in shaping microbial communities, but their dynamic interactions with CRISPR-based immunity remain unresolved. We analysed generation-resolved iMGE-host dynamics spanning one and a half years in a microbial consortium from a biological wastewater treatment plant using integrated meta-omics. We identified 31 bacterial metagenome-assembled genomes encoding complete CRISPR-Cas systems and their corresponding iMGEs. CRISPR-targeted plasmids outnumbered their bacteriophage counterparts by at least fivefold, highlighting the importance of CRISPR-mediated defence against plasmids. Linear modelling of our time-series data revealed that the variation in plasmid abundance over time explained more of the observed community dynamics than phages. Community-scale CRISPR-based plasmid-host and phage-host interaction networks revealed an increase in CRISPR-mediated interactions coinciding with a decrease in the dominant 'Candidatus Microthrix parvicella' population. Protospacers were enriched in sequences targeting genes involved in the transmission of iMGEs. Understanding the factors shaping the fitness of specific populations is necessary to devise control strategies for undesirable species and to predict or explain community-wide phenotypes.
Collapse
Affiliation(s)
- Susana Martínez Arbas
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Shaman Narayanasamy
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
- Megeno S.A., Esch-sur-Alzette, Luxembourg
| | - Malte Herold
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Laura A Lebrun
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | | | - Sujun Li
- School of Informatics, Computing and Engineering, Indiana University, Bloomington, IN, USA
| | - Tony J Lam
- School of Informatics, Computing and Engineering, Indiana University, Bloomington, IN, USA
| | - Benoît J Kunath
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Nathan D Hicks
- TGen North, Flagstaff, AZ, USA
- Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA, USA
| | - Cindy M Liu
- TGen North, Flagstaff, AZ, USA
- Department of Environmental and Occupational Health, Miken Institute School of Public Health, George Washington University, Washington, DC, USA
| | - Lance B Price
- TGen North, Flagstaff, AZ, USA
- Department of Environmental and Occupational Health, Miken Institute School of Public Health, George Washington University, Washington, DC, USA
| | - Cedric C Laczny
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | | | | | - Paul S Keim
- TGen North, Flagstaff, AZ, USA
- The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, AZ, USA
| | | | - Karoline Faust
- Laboratory of Molecular Bacteriology, KU Leuven, Leuven, Belgium
| | - Haixu Tang
- School of Informatics, Computing and Engineering, Indiana University, Bloomington, IN, USA
| | - Yuzhen Ye
- School of Informatics, Computing and Engineering, Indiana University, Bloomington, IN, USA
| | - Alexander Skupin
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
- Department of Neuroscience, University of California, La Jolla, CA, USA
| | - Patrick May
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Emilie E L Muller
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
- Department of Microbiology, Genomics and the Environment, UMR 7156 UNISTRA-CNRS, Université de Strasbourg, Strasbourg, France
| | - Paul Wilmes
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg.
- Department of Life Sciences and Medicine, Faculty of Science, Technology and Medicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg.
| |
Collapse
|
34
|
Rios Miguel AB, Jetten MS, Welte CU. The role of mobile genetic elements in organic micropollutant degradation during biological wastewater treatment. WATER RESEARCH X 2020; 9:100065. [PMID: 32984801 PMCID: PMC7494797 DOI: 10.1016/j.wroa.2020.100065] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/29/2020] [Revised: 08/19/2020] [Accepted: 08/28/2020] [Indexed: 05/24/2023]
Abstract
Wastewater treatment plants (WWTPs) are crucial for producing clean effluents from polluting sources such as hospitals, industries, and municipalities. In recent decades, many new organic compounds have ended up in surface waters in concentrations that, while very low, cause (chronic) toxicity to countless organisms. These organic micropollutants (OMPs) are usually quite recalcitrant and not sufficiently removed during wastewater treatment. Microbial degradation plays a pivotal role in OMP conversion. Microorganisms can adapt their metabolism to the use of novel molecules via mutations and rearrangements of existing genes in new clusters. Many catabolic genes have been found adjacent to mobile genetic elements (MGEs), which provide a stable scaffold to host new catabolic pathways and spread these genes in the microbial community. These mobile systems could be engineered to enhance OMP degradation in WWTPs, and this review aims to summarize and better understand the role that MGEs might play in the degradation and wastewater treatment process. Available data about the presence of catabolic MGEs in WWTPs are reviewed, and current methods used to identify and measure MGEs in environmental samples are critically evaluated. Finally, examples of how these MGEs could be used to improve micropollutant degradation in WWTPs are outlined. In the near future, advances in the use of MGEs will hopefully enable us to apply selective augmentation strategies to improve OMP conversion in WWTPs.
Collapse
Affiliation(s)
- Ana B. Rios Miguel
- Department of Microbiology, Institute for Water and Wetland Research, Radboud University, Heyendaalseweg 135, 6525, AJ Nijmegen, the Netherlands
| | - Mike S.M. Jetten
- Department of Microbiology, Institute for Water and Wetland Research, Radboud University, Heyendaalseweg 135, 6525, AJ Nijmegen, the Netherlands
- Soehngen Institute of Anaerobic Microbiology, Radboud University, Heyendaalseweg 135, 6525, AJ Nijmegen, the Netherlands
| | - Cornelia U. Welte
- Department of Microbiology, Institute for Water and Wetland Research, Radboud University, Heyendaalseweg 135, 6525, AJ Nijmegen, the Netherlands
- Soehngen Institute of Anaerobic Microbiology, Radboud University, Heyendaalseweg 135, 6525, AJ Nijmegen, the Netherlands
| |
Collapse
|
35
|
Fang Z, Zhou H. Identification of the conjugative and mobilizable plasmid fragments in the plasmidome using sequence signatures. Microb Genom 2020; 6:mgen000459. [PMID: 33074084 PMCID: PMC7725325 DOI: 10.1099/mgen.0.000459] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2020] [Accepted: 10/03/2020] [Indexed: 12/24/2022] Open
Abstract
Plasmids are the key element in horizontal gene transfer in the microbial community. Recently, a large number of experimental and computational methods have been developed to obtain the plasmidomes of microbial communities. Distinguishing transmissible plasmid sequences, which are derived from conjugative or at least mobilizable plasmids, from non-transmissible plasmid sequences in the plasmidome is essential for understanding the diversity of plasmids and how they regulate the microbial community. Unfortunately, due to the highly fragmented characteristics of DNA sequences in the plasmidome, effective identification methods are lacking. In this work, we used information entropy from information theory to assess the randomness of synonymous codon usage over 4424 plasmid genomes. The results showed that for all amino acids, the choice of a synonymous codon in conjugative and mobilizable plasmids is more random than that in non-transmissible plasmids, indicating that transmissible plasmids have different sequence signatures from non-transmissible plasmids. Inspired by this phenomenon, we further developed a novel algorithm named PlasTrans. PlasTrans takes the triplet code sequences and base sequences of plasmid DNA fragments as input and uses the convolutional neural network of the deep learning technique to further extract the more complex signatures of the plasmid sequences and identify the conjugative and mobilizable DNA fragments. Tests showed that PlasTrans could achieve an AUC of as high as 84-91%, even though the fragments only contained hundreds of base pairs. To the best of our knowledge, this is the first quantitative analysis of the difference in sequence signatures between transmissible and non-transmissible plasmids, and we developed the first tool to perform transferability annotation for DNA fragments in the plasmidome. We expect that PlasTrans will be a useful tool for researchers who analyse the properties of novel plasmids in the microbial community and horizontal gene transfer, especially the spread of resistance genes and virulence factors associated with plasmids. PlasTrans is freely available via https://github.com/zhenchengfang/PlasTrans.
Collapse
Affiliation(s)
- Zhencheng Fang
- Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital, Southern Medical University, Guangzhou, 510280, PR China
- Center for Quantitative Biology, Peking University, No. 5 Yiheyuan Road Haidian District, Beijing 100871, PR China
| | - Hongwei Zhou
- Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital, Southern Medical University, Guangzhou, 510280, PR China
- State Key Laboratory of Organ Failure Research, Southern Medical University, Guangzhou, PR China
| |
Collapse
|
36
|
Fang Z, Tan J, Wu S, Li M, Wang C, Liu Y, Zhu H. PlasGUN: gene prediction in plasmid metagenomic short reads using deep learning. Bioinformatics 2020; 36:3239-3241. [PMID: 32091572 PMCID: PMC7214025 DOI: 10.1093/bioinformatics/btaa103] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2019] [Revised: 01/16/2020] [Accepted: 02/10/2020] [Indexed: 12/27/2022] Open
Abstract
SUMMARY We present the first tool of gene prediction, PlasGUN, for plasmid metagenomic short-read data. The tool, developed based on deep learning algorithm of multiple input Convolutional Neural Network, demonstrates much better performance when tested on a benchmark dataset of artificial short reads and presents more reliable results for real plasmid metagenomic data than traditional gene prediction tools designed primarily for chromosome-derived short reads. AVAILABILITY AND IMPLEMENTATION The PlasGUN software is available at http://cqb.pku.edu.cn/ZhuLab/PlasGUN/ or https://github.com/zhenchengfang/PlasGUN/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Zhencheng Fang
- State Key Laboratory for Turbulence and Complex Systems, Department of Biomedical Engineering, College of Engineering and Center for Quantitative Biology, Peking University, Beijing 100871, China
| | - Jie Tan
- State Key Laboratory for Turbulence and Complex Systems, Department of Biomedical Engineering, College of Engineering and Center for Quantitative Biology, Peking University, Beijing 100871, China
| | - Shufang Wu
- State Key Laboratory for Turbulence and Complex Systems, Department of Biomedical Engineering, College of Engineering and Center for Quantitative Biology, Peking University, Beijing 100871, China
| | - Mo Li
- State Key Laboratory for Turbulence and Complex Systems, Department of Biomedical Engineering, College of Engineering and Center for Quantitative Biology, Peking University, Beijing 100871, China
| | - Chunhui Wang
- State Key Laboratory for Turbulence and Complex Systems, Department of Biomedical Engineering, College of Engineering and Center for Quantitative Biology, Peking University, Beijing 100871, China
| | - Yongchu Liu
- State Key Laboratory for Turbulence and Complex Systems, Department of Biomedical Engineering, College of Engineering and Center for Quantitative Biology, Peking University, Beijing 100871, China
| | - Huaiqiu Zhu
- State Key Laboratory for Turbulence and Complex Systems, Department of Biomedical Engineering, College of Engineering and Center for Quantitative Biology, Peking University, Beijing 100871, China
| |
Collapse
|
37
|
Schwengers O, Barth P, Falgenhauer L, Hain T, Chakraborty T, Goesmann A. Platon: identification and characterization of bacterial plasmid contigs in short-read draft assemblies exploiting protein sequence-based replicon distribution scores. Microb Genom 2020; 6:mgen000398. [PMID: 32579097 PMCID: PMC7660248 DOI: 10.1099/mgen.0.000398] [Citation(s) in RCA: 94] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2019] [Accepted: 06/02/2020] [Indexed: 12/22/2022] Open
Abstract
Plasmids are extrachromosomal genetic elements that replicate independently of the chromosome and play a vital role in the environmental adaptation of bacteria. Due to potential mobilization or conjugation capabilities, plasmids are important genetic vehicles for antimicrobial resistance genes and virulence factors with huge and increasing clinical implications. They are therefore subject to large genomic studies within the scientific community worldwide. As a result of rapidly improving next-generation sequencing methods, the quantity of sequenced bacterial genomes is constantly increasing, in turn raising the need for specialized tools to (i) extract plasmid sequences from draft assemblies, (ii) derive their origin and distribution, and (iii) further investigate their genetic repertoire. Recently, several bioinformatic methods and tools have emerged to tackle this issue; however, a combination of high sensitivity and specificity in plasmid sequence identification is rarely achieved in a taxon-independent manner. In addition, many software tools are not appropriate for large high-throughput analyses or cannot be included in existing software pipelines due to their technical design or software implementation. In this study, we investigated differences in the replicon distributions of protein-coding genes on a large scale as a new approach to distinguish plasmid-borne from chromosome-borne contigs. We defined and computed statistical discrimination thresholds for a new metric: the replicon distribution score (RDS), which achieved an accuracy of 96.6 %. The final performance was further improved by the combination of the RDS metric with heuristics exploiting several plasmid-specific higher-level contig characterizations. We implemented this workflow in a new high-throughput taxon-independent bioinformatics software tool called Platon for the recruitment and characterization of plasmid-borne contigs from short-read draft assemblies. Compared to PlasFlow, Platon achieved a higher accuracy (97.5 %) and more balanced predictions (F1=82.6 %) tested on a broad range of bacterial taxa and better or equal performance against the targeted tools PlasmidFinder and PlaScope on sequenced Escherichia coli isolates. Platon is available at: http://platon.computational.bio/.
Collapse
Affiliation(s)
- Oliver Schwengers
- Bioinformatics and Systems Biology, Justus Liebig University Giessen, Giessen, Germany
- Institute of Medical Microbiology, Justus Liebig University Giessen, Giessen, Germany
- German Center for Infection Research (DZIF), partner site Giessen-Marburg-Langen, Giessen, Germany
| | - Patrick Barth
- Bioinformatics and Systems Biology, Justus Liebig University Giessen, Giessen, Germany
| | - Linda Falgenhauer
- Institute of Medical Microbiology, Justus Liebig University Giessen, Giessen, Germany
- German Center for Infection Research (DZIF), partner site Giessen-Marburg-Langen, Giessen, Germany
- Present address: Institute of Hygiene and Environmental Health, Justus Liebig University, Giessen, Germany
| | - Torsten Hain
- Institute of Medical Microbiology, Justus Liebig University Giessen, Giessen, Germany
- German Center for Infection Research (DZIF), partner site Giessen-Marburg-Langen, Giessen, Germany
| | - Trinad Chakraborty
- Institute of Medical Microbiology, Justus Liebig University Giessen, Giessen, Germany
- German Center for Infection Research (DZIF), partner site Giessen-Marburg-Langen, Giessen, Germany
| | - Alexander Goesmann
- Bioinformatics and Systems Biology, Justus Liebig University Giessen, Giessen, Germany
- German Center for Infection Research (DZIF), partner site Giessen-Marburg-Langen, Giessen, Germany
| |
Collapse
|
38
|
Maguire F, Jia B, Gray KL, Lau WYV, Beiko RG, Brinkman FSL. Metagenome-assembled genome binning methods with short reads disproportionately fail for plasmids and genomic Islands. Microb Genom 2020; 6:mgen000436. [PMID: 33001022 PMCID: PMC7660262 DOI: 10.1099/mgen.0.000436] [Citation(s) in RCA: 62] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2020] [Accepted: 09/04/2020] [Indexed: 12/12/2022] Open
Abstract
Metagenomic methods enable the simultaneous characterization of microbial communities without time-consuming and bias-inducing culturing. Metagenome-assembled genome (MAG) binning methods aim to reassemble individual genomes from this data. However, the recovery of mobile genetic elements (MGEs), such as plasmids and genomic islands (GIs), by binning has not been well characterized. Given the association of antimicrobial resistance (AMR) genes and virulence factor (VF) genes with MGEs, studying their transmission is a public-health priority. The variable copy number and sequence composition of MGEs makes them potentially problematic for MAG binning methods. To systematically investigate this issue, we simulated a low-complexity metagenome comprising 30 GI-rich and plasmid-containing bacterial genomes. MAGs were then recovered using 12 current prediction pipelines and evaluated. While 82-94 % of chromosomes could be correctly recovered and binned, only 38-44 % of GIs and 1-29 % of plasmid sequences were found. Strikingly, no plasmid-borne VF nor AMR genes were recovered, and only 0-45 % of AMR or VF genes within GIs. We conclude that short-read MAG approaches, without further optimization, are largely ineffective for the analysis of mobile genes, including those of public-health importance, such as AMR and VF genes. We propose that researchers should explore developing methods that optimize for this issue and consider also using unassembled short reads and/or long-read approaches to more fully characterize metagenomic data.
Collapse
Affiliation(s)
- Finlay Maguire
- Faculty of Computer Science, Dalhousie University, 6050 University Avenue, Halifax, Nova Scotia, B3H 4R2, Canada
| | - Baofeng Jia
- Department of Molecular Biology and Biochemistry, Simon Fraser University, 8888 University Drive, Burnaby, BC V5A 1S6, Canada
| | - Kristen L. Gray
- Department of Molecular Biology and Biochemistry, Simon Fraser University, 8888 University Drive, Burnaby, BC V5A 1S6, Canada
| | - Wing Yin Venus Lau
- Department of Molecular Biology and Biochemistry, Simon Fraser University, 8888 University Drive, Burnaby, BC V5A 1S6, Canada
| | - Robert G. Beiko
- Faculty of Computer Science, Dalhousie University, 6050 University Avenue, Halifax, Nova Scotia, B3H 4R2, Canada
| | - Fiona S. L. Brinkman
- Department of Molecular Biology and Biochemistry, Simon Fraser University, 8888 University Drive, Burnaby, BC V5A 1S6, Canada
| |
Collapse
|
39
|
The Salmonella enterica Plasmidome as a Reservoir of Antibiotic Resistance. Microorganisms 2020; 8:microorganisms8071016. [PMID: 32650601 PMCID: PMC7409225 DOI: 10.3390/microorganisms8071016] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2020] [Revised: 06/19/2020] [Accepted: 06/29/2020] [Indexed: 01/08/2023] Open
Abstract
The emergence of multidrug-resistant bacterial strains worldwide has become a serious problem for public health over recent decades. The increase in antimicrobial resistance has been expanding via plasmids as mobile genetic elements encoding antimicrobial resistance (AMR) genes that are transferred vertically and horizontally. This study focuses on Salmonella enterica, one of the leading foodborne pathogens in industrialized countries. S. enterica is known to carry several plasmids involved not only in virulence but also in AMR. In the current paper, we present an integrated strategy to detect plasmid scaffolds in whole genome sequencing (WGS) assemblies. We developed a two-step procedure to predict plasmids based on i) the presence of essential elements for plasmid replication and mobility, as well as ii) sequence similarity to a reference plasmid. Next, to confirm the accuracy of the prediction in 1750 S. enterica short-read sequencing data, we combined Oxford Nanopore MinION long-read sequencing with Illumina MiSeq short-read sequencing in hybrid assemblies for 84 isolates to evaluate the proportion of plasmid that has been detected. At least one scaffold with an origin of replication (ORI) was predicted in 61.3% of the Salmonella isolates tested. The results indicated that IncFII and IncI1 ORIs were distributed in many S. enterica serotypes and were the most prevalent AMR genes carrier, whereas IncHI2A/IncHI2 and IncA/C2 were more serotype restricted but bore several AMR genes. Comparison between hybrid and short-read assemblies revealed that 81.1% of plasmids were found in the short-read sequencing using our pipeline. Through this process, we established that plasmids are prevalent in S. enterica and we also substantially expand the AMR genes in the resistome of this species.
Collapse
|
40
|
Stohr JJ, Kluytmans-van den Bergh MF, Wedema R, Friedrich AW, Kluytmans JA, Rossen JW. Detection of extended-spectrum beta-lactamase (ESBL) genes and plasmid replicons in Enterobacteriaceae using PlasmidSPAdes assembly of short-read sequence data. Microb Genom 2020; 6:mgen000400. [PMID: 32589571 PMCID: PMC7478632 DOI: 10.1099/mgen.0.000400] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2019] [Accepted: 06/04/2020] [Indexed: 01/11/2023] Open
Abstract
Knowledge of the epidemiology of plasmids is essential for understanding the evolution and spread of antimicrobial resistance. PlasmidSPAdes attempts to reconstruct plasmids using short-read sequence data. Accurate detection of extended-spectrum beta-lactamase (ESBL) genes and plasmid replicon genes is a prerequisite for the use of plasmid assembly tools to investigate the role of plasmids in the spread and evolution of ESBL production in Enterobacteriaceae. This study evaluated the performance of PlasmidSPAdes plasmid assembly for Enterobacteriaceae in terms of detection of ESBL-encoding genes, plasmid replicons and chromosomal wgMLST genes, and assessed the effect of k-mer size. Short-read sequence data for 59 ESBL-producing Enterobacteriaceae were assembled with PlasmidSPAdes using different k-mer sizes (21, 33, 55, 77, 99 and 127). For every k-mer size, the presence of ESBL genes, plasmid replicons and a selection of chromosomal wgMLST genes in the plasmid assembly was determined. Out of 241 plasmid replicons and 66 ESBL genes detected by whole-genome assembly, 213 plasmid replicons [88 %; 95 % confidence interval (CI): 83.9-91.9] and 43 ESBL genes (65 %; 95 % CI: 53.1-75.6) were detected in the plasmid assemblies obtained by PlasmidSPAdes. For most ESBL genes (83.3 %) and plasmid replicons (72.0 %), detection results did not differ between the k-mer sizes used in the plasmid assembly. No optimal k-mer size could be defined for the number of ESBL genes and plasmid replicons detected. For most isolates, the number of chromosomal wgMLST genes detected in the plasmid assemblies decreased with increasing k-mer size. Based on our findings, PlasmidSPAdes is not a suitable plasmid assembly tool for short-read sequence data for ESBL-encoding plasmids of Enterobacteriaceae.
Collapse
Affiliation(s)
- Joep J.J.M. Stohr
- Department of Infection Control, Amphia Hospital, Breda, The Netherlands
- Laboratory for Medical Microbiology and Immunology, Elisabeth-TweeSteden Hospital, Tilburg, The Netherlands
| | - Marjolein F.Q. Kluytmans-van den Bergh
- Department of Infection Control, Amphia Hospital, Breda, The Netherlands
- Amphia Academy Infectious Disease Foundation, Amphia Hospital, Breda, The Netherlands
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Ronald Wedema
- Department of Life Science and Technology, Hanze University of Applied Sciences, Groningen, The Netherlands
| | - Alexander W. Friedrich
- Department of Medical Microbiology and Infection Prevention, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Jan A.J.W. Kluytmans
- Department of Infection Control, Amphia Hospital, Breda, The Netherlands
- Amphia Academy Infectious Disease Foundation, Amphia Hospital, Breda, The Netherlands
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, The Netherlands
| | - John W.A. Rossen
- Department of Medical Microbiology and Infection Prevention, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| |
Collapse
|
41
|
Hilpert C, Bricheux G, Debroas D. Reconstruction of plasmids by shotgun sequencing from environmental DNA: which bioinformatic workflow? Brief Bioinform 2020; 22:5838452. [PMID: 32427283 DOI: 10.1093/bib/bbaa059] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2019] [Revised: 03/24/2020] [Accepted: 03/25/2020] [Indexed: 12/19/2022] Open
Abstract
Plasmids play important roles in microbial evolution and also in the spread of antibiotic resistance. Plasmid sequences are extensively studied from clinical isolates but rarely from the environment with a metagenomic approach focused on the plasmid fraction referred to as the plasmidome. A clear challenge in this context is to define a workflow for discriminating plasmids from chromosomal contaminants existing in the plasmidome. For this purpose, we benchmarked existing tools from assembly to detection of the plasmids by reference-free methods (cBar and PlasFlow) and database-guided approaches. Our simulations took into account short-reads alone or combined with moderate long-reads like those actually generated in environmental genomics experiments. This benchmark allowed us to select the best tools for limiting false-positives associated to plasmid prediction tools and a combination of reference-guided methods based on plasmid and bacterial databases.
Collapse
Affiliation(s)
- Cécile Hilpert
- Université Clermont Auvergne, CNRS, Laboratoire Microorganismes: Genome et Environnement, F-63000 Clermont-Ferrand, France
| | - Geneviève Bricheux
- Université Clermont Auvergne, CNRS, Laboratoire Microorganismes: Genome et Environnement, F-63000 Clermont-Ferrand, France
| | - Didier Debroas
- Université Clermont Auvergne, CNRS, Laboratoire Microorganismes: Genome et Environnement, F-63000 Clermont-Ferrand, France
| |
Collapse
|
42
|
Jesus TF, Ribeiro-Gonçalves B, Silva DN, Bortolaia V, Ramirez M, Carriço JA. Plasmid ATLAS: plasmid visual analytics and identification in high-throughput sequencing data. Nucleic Acids Res 2020; 47:D188-D194. [PMID: 30395323 PMCID: PMC6323984 DOI: 10.1093/nar/gky1073] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2018] [Accepted: 10/19/2018] [Indexed: 11/13/2022] Open
Abstract
Plasmid ATLAS (pATLAS, http://www.patlas.site) provides an easy-to-use web accessible database with visual analytics tools to explore the relationships of plasmids available in NCBI's RefSeq database. pATLAS has two main goals: (i) to provide an easy way to search for plasmids deposited in NCBI RefSeq and their associated metadata; (ii) to visualize the relationships of plasmids in a graph, allowing the exploration of plasmid evolution. pATLAS allows searching by plasmid name, bacterial host taxa, antibiotic resistance and virulence genes, plasmid families, and by sequence length and similarity. pATLAS is also able to represent in the plasmid network, plasmid sets identified by external pipelines using mapping, mash screen or assembly from high-throughput sequencing data. By representing the identified hits within the network of relationships between plasmids, allowing the possibility of removing redundant results, and by taking advantage of the browsing capabilities of pATLAS, users can more easily interpret the pipelines' results. All these analyses can be saved to a JSON file for sharing and future re-evaluation. Furthermore, by offering a REST-API, the pATLAS database and network display are easily accessible by other interfaces or pipelines.
Collapse
Affiliation(s)
- Tiago F Jesus
- Instituto de Microbiologia and Instituto de Medicina Molecular João Lobo Antunes, Faculdade de Medicina, Universidade de Lisboa, Av. Professor Egaz Moniz, 1649-028 Lisboa, Portugal
| | - Bruno Ribeiro-Gonçalves
- Instituto de Microbiologia and Instituto de Medicina Molecular João Lobo Antunes, Faculdade de Medicina, Universidade de Lisboa, Av. Professor Egaz Moniz, 1649-028 Lisboa, Portugal
| | - Diogo N Silva
- Instituto de Microbiologia and Instituto de Medicina Molecular João Lobo Antunes, Faculdade de Medicina, Universidade de Lisboa, Av. Professor Egaz Moniz, 1649-028 Lisboa, Portugal
| | - Valeria Bortolaia
- National Food Institute, Technical University of Denmark, Kemitorvet, Building 204, DK-2800 Kgs. Lyngby, Denmark
| | - Mário Ramirez
- Instituto de Microbiologia and Instituto de Medicina Molecular João Lobo Antunes, Faculdade de Medicina, Universidade de Lisboa, Av. Professor Egaz Moniz, 1649-028 Lisboa, Portugal
| | - João A Carriço
- Instituto de Microbiologia and Instituto de Medicina Molecular João Lobo Antunes, Faculdade de Medicina, Universidade de Lisboa, Av. Professor Egaz Moniz, 1649-028 Lisboa, Portugal
| |
Collapse
|
43
|
Carr VR, Shkoporov A, Hill C, Mullany P, Moyes DL. Probing the Mobilome: Discoveries in the Dynamic Microbiome. Trends Microbiol 2020; 29:158-170. [PMID: 32448763 DOI: 10.1016/j.tim.2020.05.003] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Revised: 04/30/2020] [Accepted: 05/05/2020] [Indexed: 02/06/2023]
Abstract
There has been an explosion of metagenomic data representing human, animal, and environmental microbiomes. This provides an unprecedented opportunity for comparative and longitudinal studies of many functional aspects of the microbiome that go beyond taxonomic classification, such as profiling genetic determinants of antimicrobial resistance, interactions with the host, potentially clinically relevant functions, and the role of mobile genetic elements (MGEs). One of the most important but least studied of these aspects are the MGEs, collectively referred to as the 'mobilome'. Here we elaborate on the benefits and limitations of using different metagenomic protocols, discuss the relative merits of various sequencing technologies, and highlight relevant bioinformatics tools and pipelines to predict the presence of MGEs and their microbial hosts.
Collapse
Affiliation(s)
- Victoria R Carr
- Centre for Host-Microbiome Interactions, Faculty of Dentistry, Oral and Craniofacial Sciences, King's College London, London, UK; The Alan Turing Institute, British Library, London, UK.
| | - Andrey Shkoporov
- APC Microbiome Ireland, School of Microbiology, University College Cork, Cork, Ireland
| | - Colin Hill
- APC Microbiome Ireland, School of Microbiology, University College Cork, Cork, Ireland
| | - Peter Mullany
- Eastman Dental Institute, University College London, London, UK
| | - David L Moyes
- Centre for Host-Microbiome Interactions, Faculty of Dentistry, Oral and Craniofacial Sciences, King's College London, London, UK.
| |
Collapse
|
44
|
PlasClass improves plasmid sequence classification. PLoS Comput Biol 2020; 16:e1007781. [PMID: 32243433 PMCID: PMC7159247 DOI: 10.1371/journal.pcbi.1007781] [Citation(s) in RCA: 58] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2019] [Revised: 04/15/2020] [Accepted: 03/08/2020] [Indexed: 01/15/2023] Open
Abstract
Many bacteria contain plasmids, but separating between contigs that originate on the plasmid and those that are part of the bacterial genome can be difficult. This is especially true in metagenomic assembly, which yields many contigs of unknown origin. Existing tools for classifying sequences of plasmid origin give less reliable results for shorter sequences, are trained using a fraction of the known plasmids, and can be difficult to use in practice. We present PlasClass, a new plasmid classifier. It uses a set of standard classifiers trained on the most current set of known plasmid sequences for different sequence lengths. We tested PlasClass sequence classification on held-out data and simulations, as well as publicly available bacterial isolates and plasmidome samples and plasmids assembled from metagenomic samples. PlasClass outperforms the state-of-the-art plasmid classification tool on shorter sequences, which constitute the majority of assembly contigs, allowing it to achieve higher F1 scores in classifying sequences from a wide range of datasets. PlasClass also uses significantly less time and memory. PlasClass can be used to easily classify plasmid and bacterial genome sequences in metagenomic or isolate assemblies. It is available under the MIT license from: https://github.com/Shamir-Lab/PlasClass.
Collapse
|
45
|
Laczny CC, Galata V, Plum A, Posch AE, Keller A. Assessing the heterogeneity of in silico plasmid predictions based on whole-genome-sequenced clinical isolates. Brief Bioinform 2020; 20:857-865. [PMID: 29220507 DOI: 10.1093/bib/bbx162] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2017] [Revised: 10/27/2017] [Indexed: 12/28/2022] Open
Abstract
High-throughput next-generation shotgun sequencing of pathogenic bacteria is growing in clinical relevance, especially for chromosomal DNA-based taxonomic identification and for antibiotic resistance prediction. Genetic exchange is facilitated for extrachromosomal DNA, e.g. plasmid-borne antibiotic resistance genes. Consequently, accurate identification of plasmids from whole-genome sequencing (WGS) data remains one of the major challenges for sequencing-based precision medicine in infectious diseases. Here, we assess the heterogeneity of four state-of-the-art tools (cBar, PlasmidFinder, plasmidSPAdes and Recycler) for the in silico prediction of plasmid-derived sequences from WGS data. Heterogeneity, sensitivity and precision were evaluated by reference-independent and reference-dependent benchmarking using 846 Gram-negative clinical isolates. Interestingly, the majority of predicted sequences were tool-specific, resulting in a pronounced heterogeneity across tools for the reference-independent assessment. In the reference-dependent assessment, sensitivity and precision values were found to substantially vary between tools and across taxa, with cBar exhibiting the highest median sensitivity (87.45%) but a low median precision (27.05%). Furthermore, integrating the individual tools into an ensemble approach showed increased sensitivity (95.55%) while reducing the precision (25.62%). CBar and plasmidSPAdes exhibited the strongest concordance with respect to identified antibiotic resistance factors. Moreover, false-positive plasmid predictions typically contained only few antibiotic resistance factors. In conclusion, while high degrees of heterogeneity and variation in sensitivity and precision were observed across the different tools and taxa, existing tools are valuable for investigating the plasmid-borne resistome. Nevertheless, additional studies on representative clinical data sets will be necessary to translate in silico plasmid prediction approaches from research to clinical application.
Collapse
Affiliation(s)
| | | | | | | | - Andreas Keller
- Chair for Clinical Bioinformatics at Saarland University
| |
Collapse
|
46
|
Douarre PE, Mallet L, Radomski N, Felten A, Mistou MY. Analysis of COMPASS, a New Comprehensive Plasmid Database Revealed Prevalence of Multireplicon and Extensive Diversity of IncF Plasmids. Front Microbiol 2020; 11:483. [PMID: 32265894 PMCID: PMC7105883 DOI: 10.3389/fmicb.2020.00483] [Citation(s) in RCA: 58] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2019] [Accepted: 03/05/2020] [Indexed: 12/20/2022] Open
Abstract
Plasmids are genetic elements that enable rapid adaptation and evolution by transferring genes conferring selective advantages to their hosts. Conjugative plasmids are predominantly responsible for the global dissemination of antimicrobial resistance, representing an important threat to global health. As the number of plasmid sequences grows exponentially, it becomes critical to depict the global diversity and decipher the distribution of circulating plasmids in the bacterial community. To this end, we created COMPASS, a novel and comprehensive database compiling 12,084 complete plasmids with associated metadata from 1571 distinct species isolated worldwide over more than 100 years. The curation of the database allowed us to identify identical plasmids across different bacteria revealing mainly intraspecies dissemination and rare cases of horizontal transmission. We outlined and analyzed all relevant features, plasmid properties, host range and characterized their replication and mobilization systems. After an exhaustive comparison of PlasmidFinder and MOB-typer, the MOB-typer-based analysis revealed that the current knowledge embedded in the current typing schemes fails to classify all the plasmid sequences collected in COMPASS. We were able to categorize 6828 and 5229 plasmids by replicon and MOB typing, respectively, mostly associated with Proteobacteria and Firmicutes. We then searched for the presence of multiple core genes involved in replication and propagation. Our results showed that 2403 plasmids carried multiple replicons that were distributed in 206 bacterial species. The co-integration of replicon types from different incompatibility (Inc) groups is an adaptive mechanism, which plays an important role in plasmid survival and dissemination by extending their host range. Our results highlight the crucial role of IncF alleles (present in 56% of all multireplicons) and revealed that IncH, IncR, and IncU replicons were also frequently carried in multireplicons. Here, we provided a comprehensive picture of the different IncF subtypes by identifying 20 different profiles in 849 IncF multireplicons, which were mostly associated with Enterobacteriaceae. These results could provide the basis for a novel IncF plasmid nomenclature based on different allelic profiles.
Collapse
Affiliation(s)
- Pierre-Emmanuel Douarre
- Agence Nationale de Sécurité Sanitaire de l’Alimentation, de l’Environnement et du Travail, Laboratory for Food Safety, Paris, France
| | - Ludovic Mallet
- Agence Nationale de Sécurité Sanitaire de l’Alimentation, de l’Environnement et du Travail, Laboratory for Food Safety, Paris, France
| | - Nicolas Radomski
- Agence Nationale de Sécurité Sanitaire de l’Alimentation, de l’Environnement et du Travail, Laboratory for Food Safety, Paris, France
| | - Arnaud Felten
- Agence Nationale de Sécurité Sanitaire de l’Alimentation, de l’Environnement et du Travail, Laboratory for Food Safety, Paris, France
| | | |
Collapse
|
47
|
Ndagi U, Falaki AA, Abdullahi M, Lawal MM, Soliman ME. Antibiotic resistance: bioinformatics-based understanding as a functional strategy for drug design. RSC Adv 2020; 10:18451-18468. [PMID: 35685616 PMCID: PMC9122625 DOI: 10.1039/d0ra01484b] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2020] [Accepted: 05/01/2020] [Indexed: 12/19/2022] Open
Abstract
The use of antibiotics to manage infectious diseases dates back to ancient civilization, but the lack of a clear distinction between the therapeutic and toxic dose has been a major challenge. This precipitates the notion that antibiotic resistance was from time immemorial, principally because of a lack of adequate knowledge of therapeutic doses and continuous exposure of these bacteria to suboptimal plasma concentration of antibiotics. With the discovery of penicillin by Alexander Fleming in 1924, a milestone in bacterial infections' treatment was achieved. This forms the foundation for the modern era of antibiotic drugs. Antibiotics such as penicillins, cephalosporins, quinolones, tetracycline, macrolides, sulphonamides, aminoglycosides and glycopeptides are the mainstay in managing severe bacterial infections, but resistant strains of bacteria have emerged and hampered the progress of research in this field. Recently, new approaches to research involving bacteria resistance to antibiotics have appeared; these involve combining the molecular understanding of bacteria systems with the knowledge of bioinformatics. Consequently, many molecules have been developed to curb resistance associated with different bacterial infections. However, because of increased emphasis on the clinical relevance of antibiotics, the synergy between in silico study and in vivo study is well cemented and this facilitates the discovery of potent antibiotics. In this review, we seek to give an overview of earlier reviews and molecular and structural understanding of bacteria resistance to antibiotics, while focusing on the recent bioinformatics approach to antibacterial drug discovery. Understanding the evolution of antibiotic resistance at the molecular level as a functional tool for bioinformatic-based drug design.![]()
Collapse
Affiliation(s)
- Umar Ndagi
- Centre for Trans-Sahara Disease, Vaccine and Drug Research
- Ibrahim Badamasi Babangida University
- Lapai
- Nigeria
| | - Abubakar A. Falaki
- Department of Microbiology
- School of Agriculture and Applied Sciences
- University of KwaZulu-Natal
- Durban 4001
- South Africa
| | - Maryam Abdullahi
- Faculty of Pharmaceutical Sciences
- Ahmadu Bello University Zaria
- Nigeria
| | - Monsurat M. Lawal
- School of Laboratory Medicine and Medical Sciences
- University of KwaZulu-Natal
- Durban 4001
- South Africa
| | - Mahmoud E. Soliman
- Molecular Modeling and Drug Design Research Group
- School of Health Sciences
- University of KwaZulu Natal
- Durban 4001
- South Africa
| |
Collapse
|
48
|
Plasmid Reconstruction from Next-Gen Data: A Detailed Protocol for the Use of PLACNETw for the Reconstruction of Plasmids from WGS Datasets. Methods Mol Biol 2020; 2075:323-339. [PMID: 31584173 DOI: 10.1007/978-1-4939-9877-7_23] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
Mobile Genetic Elements (MGE) play essential roles in adaptive bacterial evolution, facilitating genetic exchange for extrachromosomal DNA, especially antibiotic resistance genes and virulence factors. For this reason, high-throughput next-generation sequencing of bacteria is of great relevance, especially for clinical pathogenic bacteria. Accurate identification of MGE from whole-genome sequencing (WGS) datasets is one of the major challenges, still hindered by methodological limitations and high sequencing costs.This chapter encompasses the protocol used for plasmid reconstruction by applying the PLACNETw methodology, from raw reads to assembled plasmids and chromosome. PLACNETw is a graphical user-friendly interface to visualize and reconstruct MGE from short-read WGS datasets. No bioinformatic background or sophisticated computational resources are required and high precision and sensitivity are achieved.
Collapse
|
49
|
Douglas GM, Langille MGI. Current and Promising Approaches to Identify Horizontal Gene Transfer Events in Metagenomes. Genome Biol Evol 2019; 11:2750-2766. [PMID: 31504488 PMCID: PMC6777429 DOI: 10.1093/gbe/evz184] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/19/2019] [Indexed: 12/16/2022] Open
Abstract
High-throughput shotgun metagenomics sequencing has enabled the profiling of myriad natural communities. These data are commonly used to identify gene families and pathways that were potentially gained or lost in an environment and which may be involved in microbial adaptation. Despite the widespread interest in these events, there are no established best practices for identifying gene gain and loss in metagenomics data. Horizontal gene transfer (HGT) represents several mechanisms of gene gain that are especially of interest in clinical microbiology due to the rapid spread of antibiotic resistance genes in natural communities. Several additional mechanisms of gene gain and loss, including gene duplication, gene loss-of-function events, and de novo gene birth are also important to consider in the context of metagenomes but have been less studied. This review is largely focused on detecting HGT in prokaryotic metagenomes, but methods for detecting these other mechanisms are first discussed. For this article to be self-contained, we provide a general background on HGT and the different possible signatures of this process. Lastly, we discuss how improved assembly of genomes from metagenomes would be the most straight-forward approach for improving the inference of gene gain and loss events. Several recent technological advances could help improve metagenome assemblies: long-read sequencing, determining the physical proximity of contigs, optical mapping of short sequences along chromosomes, and single-cell metagenomics. The benefits and limitations of these advances are discussed and open questions in this area are highlighted.
Collapse
Affiliation(s)
- Gavin M Douglas
- Department of Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Morgan G I Langille
- Department of Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia, Canada
| |
Collapse
|
50
|
Krawczyk PS, Lipinski L, Dziembowski A. PlasFlow: predicting plasmid sequences in metagenomic data using genome signatures. Nucleic Acids Res 2019; 46:e35. [PMID: 29346586 PMCID: PMC5887522 DOI: 10.1093/nar/gkx1321] [Citation(s) in RCA: 345] [Impact Index Per Article: 57.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2017] [Accepted: 12/28/2017] [Indexed: 12/14/2022] Open
Abstract
Plasmids are mobile genetics elements that play an important role in the environmental adaptation of microorganisms. Although plasmids are usually analyzed in cultured microorganisms, there is a need for methods that allow for the analysis of pools of plasmids (plasmidomes) in environmental samples. To that end, several molecular biology and bioinformatics methods have been developed; however, they are limited to environments with low diversity and cannot recover large plasmids. Here, we present PlasFlow, a novel tool based on genomic signatures that employs a neural network approach for identification of bacterial plasmid sequences in environmental samples. PlasFlow can recover plasmid sequences from assembled metagenomes without any prior knowledge of the taxonomical or functional composition of samples with an accuracy up to 96%. It can also recover sequences of both circular and linear plasmids and can perform initial taxonomical classification of sequences. Compared to other currently available tools, PlasFlow demonstrated significantly better performance on test datasets. Analysis of two samples from heavy metal-contaminated microbial mats revealed that plasmids may constitute an important fraction of their metagenomes and carry genes involved in heavy-metal homeostasis, proving the pivotal role of plasmids in microorganism adaptation to environmental conditions.
Collapse
Affiliation(s)
- Pawel S Krawczyk
- Institute of Biochemistry and Biophysics, Polish Academy of Sciences, Pawinskiego 5a, 02-106 Warsaw, Poland.,Department of Genetics and Biotechnology, Faculty of Biology, University of Warsaw, Pawinskiego 5a, 02-106 Warsaw, Poland
| | - Leszek Lipinski
- Institute of Biochemistry and Biophysics, Polish Academy of Sciences, Pawinskiego 5a, 02-106 Warsaw, Poland
| | - Andrzej Dziembowski
- Institute of Biochemistry and Biophysics, Polish Academy of Sciences, Pawinskiego 5a, 02-106 Warsaw, Poland.,Department of Genetics and Biotechnology, Faculty of Biology, University of Warsaw, Pawinskiego 5a, 02-106 Warsaw, Poland
| |
Collapse
|