1
|
Asthana V, Bugga P, Smith CE, Wellman C, Dwight Z, Ranjan P, Nieves EM, Dickson RP, VanEpps JS. GenomicGapID: leveraging spatial distribution of conserved genomic sites for broad-spectrum microbial identification. Microbiol Spectr 2025; 13:e0281724. [PMID: 40084960 DOI: 10.1128/spectrum.02817-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2024] [Accepted: 01/30/2025] [Indexed: 03/16/2025] Open
Abstract
Bacterial detection and identification methods can be broadly classified as either untargeted with expansive taxonomic coverage or targeted with narrow taxonomic focus. Untargeted approaches, such as culture and sequencing, are often time-consuming and/or costly, whereas targeted methods, such as PCR, can offer faster and more cost-effective results but require a priori knowledge of the likely pathogen to select the appropriate assay. GenomicGapID, a novel approach that leverages the spatial distribution of conserved genetic regions across microbial genomes, represents a significant advancement in the field of microbial identification. This technique has the potential to provide the taxonomic breadth of culture and sequencing while maintaining the speed, simplicity, and cost-effectiveness of PCR. By leveraging the conservation and relative positioning of highly conserved coding regions across different species, GenomicGapID enables the development of universal primer sets that amplify the non-conserved gaps between these regions. This creates a unique electrophoretic signature that facilitates rapid and accurate target-agnostic microbial identification. In this study, we apply the principles of GenomicGapID to the critical task of identifying clinical pathogens. We focus on expanding the coverage of a previously developed universal bacterial identification system, which initially targeted the 16s-23s internal transcribed spacer (ITS) region and was capable of discerning 45 pathogens. To enhance this system, we assembled a comprehensive database of 189 clinically relevant bacterial species. We then identified conserved primer binding sites that produce unique amplicon size signatures for each species. While we found that the use of amplicon size signatures alone would require an impractical number of universal primer sets, we demonstrate that this challenge can be effectively mitigated through concurrent melt analysis. Ultimately, we show that just three universal primer sets, guided by the GenomicGapID framework, are sufficient to cover 189 clinical bacterial pathogens, representing a majority of microbes identified in positive cultures in a clinical microbiology setting, with experimental validation of a subset of these pathogens. This study not only enhances the existing universal bacterial identification system but also establishes GenomicGapID as a versatile and powerful tool in microbial diagnostics and beyond, paving the way for new avenues of research in genomics with the potential to advance molecular biology, clinical practice, and public health.IMPORTANCERapid and accurate microbial identification is critical in both clinical and research settings. Traditional untargeted methods, such as culture and sequencing, are often time-consuming and expensive, while targeted techniques like PCR offer speed and cost-effectiveness but require pre-selection of pathogens. Our work introduces GenomicGapID, a novel bacterial identification system that provides the taxonomic breadth of untargeted methods, coupled with the speed, simplicity, and affordability of targeted PCR-based techniques. By leveraging the gap between conserved genetic regions and analyzing the associated unique electrophoretic and melt analysis signatures, GenomicGapID enables target-agnostic bacterial identification using a parsimonious set of universal primers.Our work has significant implications not only in clinical microbiology but also in genomics, environmental microbiology, and public health. We believe this manuscript aligns well with the mission of Microbiology Spectrum to publish innovative and impactful research that advances the field of microbial sciences.
Collapse
Affiliation(s)
- Vishwaratn Asthana
- Department of Internal Medicine, Division of Hospital Medicine, University of Michigan, Ann Arbor, Michigan, USA
| | - Pallavi Bugga
- Department of Emergency Medicine-Adult, University of Michigan, Ann Arbor, Michigan, USA
| | - Clara Elaine Smith
- Department of Emergency Medicine-Adult, University of Michigan, Ann Arbor, Michigan, USA
| | - Catherine Wellman
- Department of Emergency Medicine-Adult, University of Michigan, Ann Arbor, Michigan, USA
| | - Zachary Dwight
- Precision Biomarker Laboratories, Cedars-Sinai Medical Center, Beverly Hills, California, USA
| | - Piyush Ranjan
- Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, Michigan, USA
| | - Erika Martínez Nieves
- Department of Emergency Medicine-Adult, University of Michigan, Ann Arbor, Michigan, USA
| | - Robert P Dickson
- Division of Pulmonary and Critical Care Medicine, Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, Michigan, USA
- Department of Microbiology and Immunology, University of Michigan Medical School, Ann Arbor, Michigan, USA
- Weil Institute for Critical Research and Innovation, University of Michigan, Ann Arbor, Michigan, USA
| | - J Scott VanEpps
- Department of Emergency Medicine-Adult, University of Michigan, Ann Arbor, Michigan, USA
- Weil Institute for Critical Research and Innovation, University of Michigan, Ann Arbor, Michigan, USA
- Department of Biomedical Engineering, University of Michigan, Ann Arbor, Michigan, USA
- Program in Macromolecular Science and Engineering, University of Michigan, Ann Arbor, Michigan, USA
- Biointerfaces Institute, University of Michigan, Ann Arbor, Michigan, USA
| |
Collapse
|
2
|
Román-Escrivá P, Bernabeu M, Paganin E, Díaz-Villanueva W, Verdú M, Oliver JL, Arnau V, Moya A. Metrics of Genomic Complexity in the Evolution of Bacterial Endosymbiosis. BIOLOGY 2025; 14:338. [PMID: 40282203 PMCID: PMC12024623 DOI: 10.3390/biology14040338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/03/2025] [Revised: 03/07/2025] [Accepted: 03/18/2025] [Indexed: 04/29/2025]
Abstract
Endosymbiosis can be considered a regressive or degenerative evolutionary process characterized at the genomic level by genome erosion and degeneration due to high mutational pressure toward AT (adenine and thymine) bases. The genomic and biological complexity of endosymbionts must be lower than that of the free-living bacteria from which they evolved. In the present work, we contrasted whether two proposed metrics for measuring genomic complexity in both types of bacteria, GS and BB, reflect their complexity, expecting higher values in free-living bacteria than in endosymbionts. On the other hand, we endeavored to delve into the factors that contribute to the reduction in metric values in endosymbionts, as well as their eventual relationship with six genomic parameters associated with functionality. This study aimed to test the robustness of these proposed metrics in a well-known biological scenario, such as the endosymbiosis process.
Collapse
Affiliation(s)
- Pablo Román-Escrivá
- Institute for Integrative Systems Biology (I2SysBio), Spanish National Research Council (CSIC), University of València, 46980 València, Spain; (P.R.-E.); (M.B.); (E.P.); (W.D.-V.); (V.A.)
| | - Moisès Bernabeu
- Institute for Integrative Systems Biology (I2SysBio), Spanish National Research Council (CSIC), University of València, 46980 València, Spain; (P.R.-E.); (M.B.); (E.P.); (W.D.-V.); (V.A.)
- Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
| | - Eleonora Paganin
- Institute for Integrative Systems Biology (I2SysBio), Spanish National Research Council (CSIC), University of València, 46980 València, Spain; (P.R.-E.); (M.B.); (E.P.); (W.D.-V.); (V.A.)
| | - Wladimiro Díaz-Villanueva
- Institute for Integrative Systems Biology (I2SysBio), Spanish National Research Council (CSIC), University of València, 46980 València, Spain; (P.R.-E.); (M.B.); (E.P.); (W.D.-V.); (V.A.)
- Biomedical Research Networking Center for Epidemiology and Public Health (CIBERESP), 28029 Madrid, Spain
- Area of Genomics and Health, Foundation for the Promotion of Sanitary and Biomedical Research of València Region (FISABIO), 46020 València, Spain
| | - Miguel Verdú
- Centro de Investigaciones Sobre Desertificación, Consejo Superior de Investigaciones Científicas (CSIC), University of València, Generalitat Valenciana, 46113 València, Spain;
| | - José L. Oliver
- Department of Genetics, Faculty of Sciences, University of Granada, 18071 Granada, Spain;
| | - Vicente Arnau
- Institute for Integrative Systems Biology (I2SysBio), Spanish National Research Council (CSIC), University of València, 46980 València, Spain; (P.R.-E.); (M.B.); (E.P.); (W.D.-V.); (V.A.)
- Biomedical Research Networking Center for Epidemiology and Public Health (CIBERESP), 28029 Madrid, Spain
- Area of Genomics and Health, Foundation for the Promotion of Sanitary and Biomedical Research of València Region (FISABIO), 46020 València, Spain
| | - Andrés Moya
- Institute for Integrative Systems Biology (I2SysBio), Spanish National Research Council (CSIC), University of València, 46980 València, Spain; (P.R.-E.); (M.B.); (E.P.); (W.D.-V.); (V.A.)
- Biomedical Research Networking Center for Epidemiology and Public Health (CIBERESP), 28029 Madrid, Spain
- Area of Genomics and Health, Foundation for the Promotion of Sanitary and Biomedical Research of València Region (FISABIO), 46020 València, Spain
| |
Collapse
|
3
|
Dasgupta P, Vinil K, Kanaujia SP. Evolutionary trends indicate a coherent organization of sap operons. Res Microbiol 2024; 175:104228. [PMID: 38972435 DOI: 10.1016/j.resmic.2024.104228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2024] [Revised: 06/30/2024] [Accepted: 07/02/2024] [Indexed: 07/09/2024]
Abstract
Human hosts possess a complex network of immune responses against microbial pathogens. The production of antimicrobial peptides (AMPs), which target the pathogen cell membranes and inhibit them from inhabiting the hosts, is one such mechanism. However, pathogens have evolved systems that encounter these host-produced AMPs. The Sap (sensitivity to antimicrobial peptides) transporter uptakes AMPs inside the microbial cell and proteolytically degrades them. The Sap transporters comprise five subunits encoded by genes in an operon. Despite its ubiquitous nature, its subunits are not found to be in tandem with many organisms. In this study, a total of 421 Sap transporters were analyzed for their operonic arrangement. Out of 421, a total of 352 operons were found to be in consensus arrangement, while the remaining 69 show a varying arrangement of genes. The analysis of the intergenic distance between the subunits of the sap operon suggests a signature pattern with sapAB (-4), sapBC (-14), sapCD (-1), and sapDF (-4 to 1). An evolutionary analysis of these operons favors the consensus arrangement of the Sap transporter systems, substantiating its prevalence in most of the Gram-negative pathogens. Overall, this study provides insight into bacterial evolution, favoring the maintenance of the genetic organization of essential pathogenicity factors.
Collapse
Affiliation(s)
- Pratik Dasgupta
- Department of Biosciences and Bioengineering, Indian Institute of Technology Guwahati, Guwahati - 781039, Assam, India
| | - Kavya Vinil
- Department of Biosciences and Bioengineering, Indian Institute of Technology Guwahati, Guwahati - 781039, Assam, India
| | - Shankar Prasad Kanaujia
- Department of Biosciences and Bioengineering, Indian Institute of Technology Guwahati, Guwahati - 781039, Assam, India.
| |
Collapse
|
4
|
Wiegert J, Höhler D, Haag J, Stamatakis A. Predicting Phylogenetic Bootstrap Values via Machine Learning. Mol Biol Evol 2024; 41:msae215. [PMID: 39418337 PMCID: PMC11523138 DOI: 10.1093/molbev/msae215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Revised: 08/28/2024] [Accepted: 09/15/2024] [Indexed: 10/19/2024] Open
Abstract
Estimating the statistical robustness of the inferred tree(s) constitutes an integral part of most phylogenetic analyses. Commonly, one computes and assigns a branch support value to each inner branch of the inferred phylogeny. The still most widely used method for calculating branch support on trees inferred under maximum likelihood (ML) is the Standard, nonparametric Felsenstein bootstrap support (SBS). Due to the high computational cost of the SBS, a plethora of methods has been developed to approximate it, for instance, via the rapid bootstrap (RB) algorithm. There have also been attempts to devise faster, alternative support measures, such as the SH-aLRT (Shimodaira-Hasegawa-like approximate likelihood ratio test) or the UltraFast bootstrap 2 (UFBoot2) method. Those faster alternatives exhibit some limitations, such as the need to assess model violations (UFBoot2) or unstable behavior in the low support interval range (SH-aLRT). Here, we present the educated bootstrap guesser (EBG), a machine learning-based tool that predicts SBS branch support values for a given input phylogeny. EBG is on average 9.4 (σ=5.5) times faster than UFBoot2. EBG-based SBS estimates exhibit a median absolute error of 5 when predicting SBS values between 0 and 100. Furthermore, EBG also provides uncertainty measures for all per-branch SBS predictions and thereby allows for a more rigorous and careful interpretation. EBG can, for instance, predict SBS support values on a phylogeny comprising 1,654 SARS-CoV2 genome sequences within 3 h on a mid-class laptop. EBG is available under GNU GPL3.
Collapse
Affiliation(s)
- Julius Wiegert
- Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
| | - Dimitri Höhler
- Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
| | - Julia Haag
- Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
| | - Alexandros Stamatakis
- Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
- Biodiversity Computing Group, Institute of Computer Science, Foundation for Research and Technology - Hellas, Heraklion, Crete, Greece
- Institute for Theoretical Informatics, Karlsruhe Institute of Technology, Karlsruhe, Germany
| |
Collapse
|
5
|
Saksena S, Forbes K, Rajan N, Giles D. Phylogenetic investigation of Gammaproteobacteria proteins involved in exogenous long-chain fatty acid acquisition and assimilation. Biochem Biophys Rep 2023; 35:101504. [PMID: 37601446 PMCID: PMC10439403 DOI: 10.1016/j.bbrep.2023.101504] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Revised: 06/12/2023] [Accepted: 06/21/2023] [Indexed: 08/22/2023] Open
Abstract
Background The incorporation of exogenous fatty acids into the cell membrane yields structural modifications that directly influence membrane phospholipid composition and indirectly contribute to virulence. FadL and FadD are responsible for importing and activating exogenous fatty acids, while acyltransferases (PlsB, PlsC, PlsX, PlsY) incorporate fatty acids into the cell membrane. Many Gammaproteobacteria species possess multiple homologs of these proteins involved in exogenous fatty acid metabolism, suggesting the evolutionary acquisition and maintenance of this transport pathway. Methods This study developed phylogenetic trees based on amino acid and nucleotide sequences of homologs of FadL, FadD, PlsB, PlsC, PlsX, and PlsY via Mr. Bayes and RAxML algorithms. We also explored the operon arrangement of genes encoding for FadL. Additionally, FadL homologs were modeled via SWISS-MODEL, validated and refined by SAVES, Galaxy Refine, and GROMACS, and docked with fatty acids via AutoDock Vina. Resulting affinities were analyzed by 2-way ANOVA test and Tukey's post-hoc test. Results Our phylogenetic trees revealed grouping based on operon structure, original homolog blasted from, and order of the homolog, suggesting a more ancestral origin of the multiple homolog phenomena. Our molecular docking simulations indicated a similar binding pattern for the fatty acids between the different FadL homologs. General significance Our study is the first to illustrate the phylogeny of these proteins and to investigate the binding of various FadL homologs across orders with fatty acids. This study helps unravel the mystery surrounding these proteins and presents topics for future research.
Collapse
Affiliation(s)
- Saksham Saksena
- College of Arts and Sciences, Vanderbilt University, 2201 West End Ave., Nashville, TN, 37235, USA
| | - Kwame Forbes
- College of Science and Mathematics, The University of the Virgin Islands, 2 John Brewers Bay, St. Thomas, USVI, 00802-9990, USA
| | - Nipun Rajan
- East Hamilton High School, 2015 Ooltewah Ringgold Road, Ootlewah, TN, 37363, USA
| | - David Giles
- Department of Biology, Geology and Environmental Science, The University of Tennessee at Chattanooga, 615 McCallie Ave, Chattanooga, TN, 37403, USA
| |
Collapse
|
6
|
Cao S, Brandis G, Huseby DL, Hughes D. Positive selection during niche adaptation results in large-scale and irreversible rearrangement of chromosomal gene order in bacteria. Mol Biol Evol 2022; 39:6554941. [PMID: 35348727 PMCID: PMC9016547 DOI: 10.1093/molbev/msac069] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Analysis of bacterial genomes shows that, whereas diverse species share many genes in common, their linear order on the chromosome is often not conserved. Whereas rearrangements in gene order could occur by genetic drift, an alternative hypothesis is rearrangement driven by positive selection during niche adaptation (SNAP). Here, we provide the first experimental support for the SNAP hypothesis. We evolved Salmonella to adapt to growth on malate as the sole carbon source and followed the evolutionary trajectories. The initial adaptation to growth in the new environment involved the duplication of 1.66 Mb, corresponding to one-third of the Salmonella chromosome. This duplication is selected to increase the copy number of a single gene, dctA, involved in the uptake of malate. Continuing selection led to the rapid loss or mutation of duplicate genes from either copy of the duplicated region. After 2000 generations, only 31% of the originally duplicated genes remained intact and the gene order within the Salmonella chromosome has been significantly and irreversibly altered. These results experientially validate predictions made by the SNAP hypothesis and show that SNAP can be a strong driving force for rearrangements in chromosomal gene order.
Collapse
Affiliation(s)
- Sha Cao
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden.,These authors contributed equally: Sha Cao, Gerrit Brandis
| | - Gerrit Brandis
- Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden.,These authors contributed equally: Sha Cao, Gerrit Brandis
| | - Douglas L Huseby
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
| | - Diarmaid Hughes
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
| |
Collapse
|
7
|
Kanai Y, Tsuru S, Furusawa C. OUP accepted manuscript. Nucleic Acids Res 2022; 50:1673-1686. [PMID: 35066585 PMCID: PMC8860574 DOI: 10.1093/nar/gkac004] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Revised: 12/23/2021] [Accepted: 01/11/2022] [Indexed: 11/12/2022] Open
Abstract
Operons are a hallmark of the genomic and regulatory architecture of prokaryotes. However, the mechanism by which two genes placed far apart gradually come close and form operons remains to be elucidated. Here, we propose a new model of the origin of operons: Mobile genetic elements called insertion sequences can facilitate the formation of operons by consecutive insertion–deletion–excision reactions. This mechanism barely leaves traces of insertion sequences and thus difficult to detect in nature. In this study, as a proof-of-concept, we reproducibly demonstrated operon formation in the laboratory. The insertion sequence IS3 and the insertion sequence excision enhancer are genes found in a broad range of bacterial species. We introduced these genes into insertion sequence-less Escherichia coli and found that, supporting our hypothesis, the activity of the two genes altered the expression of genes surrounding IS3, closed a 2.7 kb gap between a pair of genes, and formed new operons. This study shows how insertion sequences can facilitate the rapid formation of operons through locally increasing the structural mutation rates and highlights how coevolution with mobile elements may shape the organization of prokaryotic genomes and gene regulation.
Collapse
Affiliation(s)
- Yuki Kanai
- Department of Biological Sciences, Graduate School of Science, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan
| | - Saburo Tsuru
- To whom correspondence should be addressed. Tel: +81 3 5841 4229; Fax: +81 3 5841 4229;
| | | |
Collapse
|
8
|
Abstract
The transcriptomes of Pseudomonas aeruginosa clone C isolates NN2 and SG17M during the mid-exponential and early stationary phase of planktonic growth were evaluated by direct RNA sequencing on the nanopore platform and compared with established short-read cDNA sequencing on the Illumina platform. Fifty to ninety percent of the sense RNAs turned out to be rRNA molecules followed by similar proportions of mRNA transcripts and non-coding RNAs. Both platforms detected similar proportions of uncharged tRNAs and 29 yet undescribed antisense tRNAs. For example, the rarest arginine codon was paired with the most abundant tRNAArg, and the tRNAArg gene is missing for the most frequent arginine codon. More than 90% of the antisense RNA molecules were complementary to a coding sequence. The antisense RNAs were evenly distributed in the genomes. Direct RNA sequencing identified more than 4,000 distinct non-overlapping antisense RNAs during exponential and stationary growth. Besides highly expressed small antisense RNAs less than 200 bases in size, a population of longer antisense RNAs was sequenced that covered a broad range of a few hundred to thousands of bases and could be complementary to a contig of several genes. In summary, direct RNA sequencing identified yet undescribed RNA molecules and an unexpected composition of the pools of tRNAs, sense and antisense RNAs. IMPORTANCE Genome-wide gene expression of bacteria is commonly studied by high-throughput sequencing of size-selected cDNA fragment libraries of reverse-transcribed RNA preparations. However, the depletion of ribosomal RNAs, enzymatic reverse transcription and the fragmentation, size selection and amplification during library preparation lead to inevitable losses of information about the initial composition of the RNA pool. We demonstrate that direct RNA sequencing on the nanopore platform can overcome these limitations. Nanopore sequencing of total RNA yielded novel insights into the Pseudomonas aeruginosa transcriptome that - if replicated in other species - will change our view of the bacterial RNA world. The discovery of sense - antisense pairs of tmRNA, tRNAs and mRNAs indicates a further and unknown level of gene regulation in bacteria.
Collapse
|