1
|
Gopikrishnan M, Haryini S, C GPD. Emerging strategies and therapeutic innovations for combating drug resistance in Staphylococcus aureus strains: A comprehensive review. J Basic Microbiol 2024; 64:e2300579. [PMID: 38308076 DOI: 10.1002/jobm.202300579] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Revised: 01/03/2024] [Accepted: 01/04/2024] [Indexed: 02/04/2024]
Abstract
In recent years, antibiotic therapy has encountered significant challenges due to the rapid emergence of multidrug resistance among bacteria responsible for life-threatening illnesses, creating uncertainty about the future management of infectious diseases. The escalation of antimicrobial resistance in the post-COVID era compared to the pre-COVID era has raised global concern. The prevalence of nosocomial-related infections, especially outbreaks of drug-resistant strains of Staphylococcus aureus, have been reported worldwide, with India being a notable hotspot for such occurrences. Various virulence factors and mutations characterize nosocomial infections involving S. aureus. The lack of proper alternative treatments leading to increased drug resistance emphasizes the need to investigate and examine recent research to combat future pandemics. In the current genomics era, the application of advanced technologies such as next-generation sequencing (NGS), machine learning (ML), and quantum computing (QC) for genomic analysis and resistance prediction has significantly increased the pace of diagnosing drug-resistant pathogens and insights into genetic intricacies. Despite prompt diagnosis, the elimination of drug-resistant infections remains unattainable in the absence of effective alternative therapies. Researchers are exploring various alternative therapeutic approaches, including phage therapy, antimicrobial peptides, photodynamic therapy, vaccines, host-directed therapies, and more. The proposed review mainly focuses on the resistance journey of S. aureus over the past decade, detailing its resistance mechanisms, prevalence in the subcontinent, innovations in rapid diagnosis of the drug-resistant strains, including the applicants of NGS and ML application along with QC, it helps to design alternative novel therapeutics approaches against S. aureus infection.
Collapse
Affiliation(s)
- Mohanraj Gopikrishnan
- Department of Integrative Biology, Vellore Institute of Technology (VIT), Vellore, Tamil Nadu, India
| | - Sree Haryini
- Department of Biomedical Sciences, Vellore Institute of Technology (VIT), Vellore, Tamil Nadu, India
| | - George Priya Doss C
- Department of Integrative Biology, Vellore Institute of Technology (VIT), Vellore, Tamil Nadu, India
| |
Collapse
|
2
|
Rusic D, Kumric M, Seselja Perisin A, Leskur D, Bukic J, Modun D, Vilovic M, Vrdoljak J, Martinovic D, Grahovac M, Bozic J. Tackling the Antimicrobial Resistance "Pandemic" with Machine Learning Tools: A Summary of Available Evidence. Microorganisms 2024; 12:842. [PMID: 38792673 DOI: 10.3390/microorganisms12050842] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2024] [Revised: 04/16/2024] [Accepted: 04/19/2024] [Indexed: 05/26/2024] Open
Abstract
Antimicrobial resistance is recognised as one of the top threats healthcare is bound to face in the future. There have been various attempts to preserve the efficacy of existing antimicrobials, develop new and efficient antimicrobials, manage infections with multi-drug resistant strains, and improve patient outcomes, resulting in a growing mass of routinely available data, including electronic health records and microbiological information that can be employed to develop individualised antimicrobial stewardship. Machine learning methods have been developed to predict antimicrobial resistance from whole-genome sequencing data, forecast medication susceptibility, recognise epidemic patterns for surveillance purposes, or propose new antibacterial treatments and accelerate scientific discovery. Unfortunately, there is an evident gap between the number of machine learning applications in science and the effective implementation of these systems. This narrative review highlights some of the outstanding opportunities that machine learning offers when applied in research related to antimicrobial resistance. In the future, machine learning tools may prove to be superbugs' kryptonite. This review aims to provide an overview of available publications to aid researchers that are looking to expand their work with new approaches and to acquaint them with the current application of machine learning techniques in this field.
Collapse
Affiliation(s)
- Doris Rusic
- Department of Pharmacy, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia
| | - Marko Kumric
- Department of Pathophysiology, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia
- Laboratory for Cardiometabolic Research, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia
| | - Ana Seselja Perisin
- Department of Pharmacy, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia
| | - Dario Leskur
- Department of Pharmacy, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia
| | - Josipa Bukic
- Department of Pharmacy, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia
| | - Darko Modun
- Department of Pharmacy, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia
| | - Marino Vilovic
- Department of Pathophysiology, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia
- Laboratory for Cardiometabolic Research, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia
| | - Josip Vrdoljak
- Department of Pathophysiology, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia
- Laboratory for Cardiometabolic Research, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia
| | - Dinko Martinovic
- Department of Pathophysiology, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia
- Department of Maxillofacial Surgery, University Hospital of Split, Spinciceva 1, 21000 Split, Croatia
| | - Marko Grahovac
- Department of Pharmacology, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia
| | - Josko Bozic
- Department of Pathophysiology, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia
- Laboratory for Cardiometabolic Research, University of Split School of Medicine, Soltanska 2A, 21000 Split, Croatia
| |
Collapse
|
3
|
Dillon L, Dimonaco NJ, Creevey CJ. Accessory genes define species-specific routes to antibiotic resistance. Life Sci Alliance 2024; 7:e202302420. [PMID: 38228374 DOI: 10.26508/lsa.202302420] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Revised: 12/29/2023] [Accepted: 01/03/2024] [Indexed: 01/18/2024] Open
Abstract
A deeper understanding of the relationship between the antimicrobial resistance (AMR) gene carriage and phenotype is necessary to develop effective response strategies against this global burden. AMR phenotype is often a result of multi-gene interactions; therefore, we need approaches that go beyond current simple AMR gene identification tools. Machine-learning (ML) methods may meet this challenge and allow the development of rapid computational approaches for AMR phenotype classification. To examine this, we applied multiple ML techniques to 16,950 bacterial genomes across 28 genera, with corresponding MICs for 23 antibiotics with the aim of training models to accurately determine the AMR phenotype from sequenced genomes. This resulted in a >1.5-fold increase in AMR phenotype prediction accuracy over AMR gene identification alone. Furthermore, we revealed 528 unique (often species-specific) genomic routes to antibiotic resistance, including genes not previously linked to the AMR phenotype. Our study demonstrates the utility of ML in predicting AMR phenotypes across diverse clinically relevant organisms and antibiotics. This research proposes a rapid computational method to support laboratory-based identification of the AMR phenotype in pathogens.
Collapse
Affiliation(s)
- Lucy Dillon
- School of Biological Sciences, Queen's University Belfast, Belfast, UK
| | - Nicholas J Dimonaco
- School of Biological Sciences, Queen's University Belfast, Belfast, UK
- https://ror.org/02fa3aq29 Department of Medicine, McMaster University, Hamilton, Ontario, Canada
- https://ror.org/02fa3aq29 Farncombe Family Digestive Health Research Institute, McMaster University, Hamilton, Canada
| | | |
Collapse
|
4
|
Kherabi Y, Thy M, Bouzid D, Antcliffe DB, Rawson TM, Peiffer-Smadja N. Machine learning to predict antimicrobial resistance: future applications in clinical practice? Infect Dis Now 2024; 54:104864. [PMID: 38355048 DOI: 10.1016/j.idnow.2024.104864] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Revised: 02/05/2024] [Accepted: 02/07/2024] [Indexed: 02/16/2024]
Abstract
INTRODUCTION Machine learning (ML) is increasingly being used to predict antimicrobial resistance (AMR). This review aims to provide physicians with an overview of the literature on ML as a means of AMR prediction. METHODS References for this review were identified through searches of MEDLINE/PubMed, EMBASE, Google Scholar, ACM Digital Library, and IEEE Xplore Digital Library up to December 2023. RESULTS Thirty-six studies were included in this review. Thirty-two studies (32/36, 89 %) were based on hospital data and four (4/36, 11 %) on outpatient data. The vast majority of them were conducted in high-resource settings (33/36, 92 %). Twenty-four (24/36, 67 %) studies developed systems to predict drug resistance in infected patients, eight (8/36, 22 %) tested the performances of ML-assisted antibiotic prescription, two (2/36, 6 %) assessed ML performances in predicting colonization with carbapenem-resistant bacteria and, finally, two assessed national and international AMR trends. The most common inputs were demographic characteristics (25/36, 70 %), previous antibiotic susceptibility testing (19/36, 53 %) and prior antibiotic exposure (15/36, 42 %). Thirty-three (92 %) studies targeted prediction of Gram-negative bacteria (GNB) resistance as an output (92 %). The studies included showed moderate to high performances, with AUROC ranging from 0.56 to 0.93. CONCLUSION ML can potentially provide valuable assistance in AMR prediction. Although the literature on this topic is growing, future studies are needed to design, implement, and evaluate the use and impact of ML decision support systems.
Collapse
Affiliation(s)
- Yousra Kherabi
- Infectious and Tropical Disease Department, Bichat-Claude Bernard Hospital, Assistance Publique-Hôpitaux de Paris, Université Paris Cité, Paris, France; Université Paris Cité and Université Sorbonne Paris Nord, Inserm, IAME, Paris, France.
| | - Michaël Thy
- Medical and Infectious Diseases ICU (MI2) - Bichat-Claude Bernard Hospital, Assistance Publique-Hôpitaux de Paris, Université Paris Cité, Paris, France; EA 7323 - Pharmacology and Therapeutic Evaluation in Children and Pregnant Women, Université Paris Cité, Paris, France
| | - Donia Bouzid
- Université Paris Cité and Université Sorbonne Paris Nord, Inserm, IAME, Paris, France; Emergency Department, Bichat Claude Bernard Hospital, Assistance Publique-Hôpitaux de Paris, Paris, France
| | - David B Antcliffe
- Division of Anaesthetics, Pain Medicine and Intensive Care, Department of Surgery and Cancer, Imperial College London, London, UK; Department of Intensive Care Unit, Charing Cross Hospital, Imperial College Healthcare NHS Trust, London, UK
| | - Timothy Miles Rawson
- National Institute for Health Research Health Protection Research Unit in Healthcare Associated Infections and Antimicrobial Resistance, Imperial College London, London, UK; Centre for Antimicrobial Optimisation Imperial College London, London, UK
| | - Nathan Peiffer-Smadja
- Infectious and Tropical Disease Department, Bichat-Claude Bernard Hospital, Assistance Publique-Hôpitaux de Paris, Université Paris Cité, Paris, France; Université Paris Cité and Université Sorbonne Paris Nord, Inserm, IAME, Paris, France; National Institute for Health Research Health Protection Research Unit in Healthcare Associated Infections and Antimicrobial Resistance, Imperial College London, London, UK
| |
Collapse
|
5
|
Hu K, Meyer F, Deng ZL, Asgari E, Kuo TH, Münch PC, McHardy AC. Assessing computational predictions of antimicrobial resistance phenotypes from microbial genomes. Brief Bioinform 2024; 25:bbae206. [PMID: 38706320 PMCID: PMC11070729 DOI: 10.1093/bib/bbae206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Revised: 04/08/2024] [Accepted: 04/11/2024] [Indexed: 05/07/2024] Open
Abstract
The advent of rapid whole-genome sequencing has created new opportunities for computational prediction of antimicrobial resistance (AMR) phenotypes from genomic data. Both rule-based and machine learning (ML) approaches have been explored for this task, but systematic benchmarking is still needed. Here, we evaluated four state-of-the-art ML methods (Kover, PhenotypeSeeker, Seq2Geno2Pheno and Aytan-Aktug), an ML baseline and the rule-based ResFinder by training and testing each of them across 78 species-antibiotic datasets, using a rigorous benchmarking workflow that integrates three evaluation approaches, each paired with three distinct sample splitting methods. Our analysis revealed considerable variation in the performance across techniques and datasets. Whereas ML methods generally excelled for closely related strains, ResFinder excelled for handling divergent genomes. Overall, Kover most frequently ranked top among the ML approaches, followed by PhenotypeSeeker and Seq2Geno2Pheno. AMR phenotypes for antibiotic classes such as macrolides and sulfonamides were predicted with the highest accuracies. The quality of predictions varied substantially across species-antibiotic combinations, particularly for beta-lactams; across species, resistance phenotyping of the beta-lactams compound, aztreonam, amoxicillin/clavulanic acid, cefoxitin, ceftazidime and piperacillin/tazobactam, alongside tetracyclines demonstrated more variable performance than the other benchmarked antibiotics. By organism, Campylobacter jejuni and Enterococcus faecium phenotypes were more robustly predicted than those of Escherichia coli, Staphylococcus aureus, Salmonella enterica, Neisseria gonorrhoeae, Klebsiella pneumoniae, Pseudomonas aeruginosa, Acinetobacter baumannii, Streptococcus pneumoniae and Mycobacterium tuberculosis. In addition, our study provides software recommendations for each species-antibiotic combination. It furthermore highlights the need for optimization for robust clinical applications, particularly for strains that diverge substantially from those used for training.
Collapse
Affiliation(s)
- Kaixin Hu
- Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany
- Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
| | - Fernando Meyer
- Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany
- Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
| | - Zhi-Luo Deng
- Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany
- Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
| | - Ehsaneddin Asgari
- Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany
- Molecular Cell Biomechanics Laboratory, Department of Bioengineering and Mechanical Engineering, University of California, Berkeley, USA
| | - Tzu-Hao Kuo
- Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany
- Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
| | - Philipp C Münch
- Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany
- Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
- Cluster of Excellence RESIST (EXC 2155), Hannover Medical School, Hannover, Germany
- German Center for Infection Research (DZIF), partner site Hannover Braunschweig, Braunschweig, Germany
- Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA
| | - Alice C McHardy
- Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany
- Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
| |
Collapse
|
6
|
Nsubuga M, Galiwango R, Jjingo D, Mboowa G. Generalizability of machine learning in predicting antimicrobial resistance in E. coli: a multi-country case study in Africa. BMC Genomics 2024; 25:287. [PMID: 38500034 PMCID: PMC10946178 DOI: 10.1186/s12864-024-10214-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Accepted: 03/11/2024] [Indexed: 03/20/2024] Open
Abstract
BACKGROUND Antimicrobial resistance (AMR) remains a significant global health threat particularly impacting low- and middle-income countries (LMICs). These regions often grapple with limited healthcare resources and access to advanced diagnostic tools. Consequently, there is a pressing need for innovative approaches that can enhance AMR surveillance and management. Machine learning (ML) though underutilized in these settings, presents a promising avenue. This study leverages ML models trained on whole-genome sequencing data from England, where such data is more readily available, to predict AMR in E. coli, targeting key antibiotics such as ciprofloxacin, ampicillin, and cefotaxime. A crucial part of our work involved the validation of these models using an independent dataset from Africa, specifically from Uganda, Nigeria, and Tanzania, to ascertain their applicability and effectiveness in LMICs. RESULTS Model performance varied across antibiotics. The Support Vector Machine excelled in predicting ciprofloxacin resistance (87% accuracy, F1 Score: 0.57), Light Gradient Boosting Machine for cefotaxime (92% accuracy, F1 Score: 0.42), and Gradient Boosting for ampicillin (58% accuracy, F1 Score: 0.66). In validation with data from Africa, Logistic Regression showed high accuracy for ampicillin (94%, F1 Score: 0.97), while Random Forest and Light Gradient Boosting Machine were effective for ciprofloxacin (50% accuracy, F1 Score: 0.56) and cefotaxime (45% accuracy, F1 Score:0.54), respectively. Key mutations associated with AMR were identified for these antibiotics. CONCLUSION As the threat of AMR continues to rise, the successful application of these models, particularly on genomic datasets from LMICs, signals a promising avenue for improving AMR prediction to support large AMR surveillance programs. This work thus not only expands our current understanding of the genetic underpinnings of AMR but also provides a robust methodological framework that can guide future research and applications in the fight against AMR.
Collapse
Affiliation(s)
- Mike Nsubuga
- Department of Immunology and Molecular Biology, School of Biomedical Sciences, College of Health Sciences, Makerere University, P.O Box 7072, Kampala, Uganda
- The African Center of Excellence in Bioinformatics and Data-Intensive Sciences, Infectious Diseases Institute, College of Health Sciences, Makerere University, P.O Box 22418, Kampala, Uganda
- Faculty of Health Sciences, University of Bristol, Bristol, BS40 5DU, UK
- Jean Golding Institute, University of Bristol, Bristol, BS8 1UH, UK
| | - Ronald Galiwango
- Department of Immunology and Molecular Biology, School of Biomedical Sciences, College of Health Sciences, Makerere University, P.O Box 7072, Kampala, Uganda
- The African Center of Excellence in Bioinformatics and Data-Intensive Sciences, Infectious Diseases Institute, College of Health Sciences, Makerere University, P.O Box 22418, Kampala, Uganda
| | - Daudi Jjingo
- Department of Computer Science, College of Computing and Information Sciences, Makerere University, P.O Box 7062, Kampala, Uganda
- The African Center of Excellence in Bioinformatics and Data-Intensive Sciences, Infectious Diseases Institute, College of Health Sciences, Makerere University, P.O Box 22418, Kampala, Uganda
| | - Gerald Mboowa
- Department of Immunology and Molecular Biology, School of Biomedical Sciences, College of Health Sciences, Makerere University, P.O Box 7072, Kampala, Uganda.
- The African Center of Excellence in Bioinformatics and Data-Intensive Sciences, Infectious Diseases Institute, College of Health Sciences, Makerere University, P.O Box 22418, Kampala, Uganda.
- Africa Centres for Disease Control and Prevention, African Union Commission, P.O Box 3243, Roosevelt Street, Addis Ababa, W21 K19, Ethiopia.
| |
Collapse
|
7
|
Batisti Biffignandi G, Chindelevitch L, Corbella M, Feil EJ, Sassera D, Lees JA. Optimising machine learning prediction of minimum inhibitory concentrations in Klebsiella pneumoniae. Microb Genom 2024; 10:001222. [PMID: 38529944 PMCID: PMC10995625 DOI: 10.1099/mgen.0.001222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Accepted: 03/07/2024] [Indexed: 03/27/2024] Open
Abstract
Minimum Inhibitory Concentrations (MICs) are the gold standard for quantitatively measuring antibiotic resistance. However, lab-based MIC determination can be time-consuming and suffers from low reproducibility, and interpretation as sensitive or resistant relies on guidelines which change over time. Genome sequencing and machine learning promise to allow in silico MIC prediction as an alternative approach which overcomes some of these difficulties, albeit the interpretation of MIC is still needed. Nevertheless, precisely how we should handle MIC data when dealing with predictive models remains unclear, since they are measured semi-quantitatively, with varying resolution, and are typically also left- and right-censored within varying ranges. We therefore investigated genome-based prediction of MICs in the pathogen Klebsiella pneumoniae using 4367 genomes with both simulated semi-quantitative traits and real MICs. As we were focused on clinical interpretation, we used interpretable rather than black-box machine learning models, namely, Elastic Net, Random Forests, and linear mixed models. Simulated traits were generated accounting for oligogenic, polygenic, and homoplastic genetic effects with different levels of heritability. Then we assessed how model prediction accuracy was affected when MICs were framed as regression and classification. Our results showed that treating the MICs differently depending on the number of concentration levels of antibiotic available was the most promising learning strategy. Specifically, to optimise both prediction accuracy and inference of the correct causal variants, we recommend considering the MICs as continuous and framing the learning problem as a regression when the number of observed antibiotic concentration levels is large, whereas with a smaller number of concentration levels they should be treated as a categorical variable and the learning problem should be framed as a classification. Our findings also underline how predictive models can be improved when prior biological knowledge is taken into account, due to the varying genetic architecture of each antibiotic resistance trait. Finally, we emphasise that incrementing the population database is pivotal for the future clinical implementation of these models to support routine machine-learning based diagnostics.
Collapse
Affiliation(s)
- Gherard Batisti Biffignandi
- Department of Biology and Biotechnology, University of Pavia, Pavia, Italy
- MRC Centre for Global Infectious Disease Analysis, Imperial College, London, England, UK
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Leonid Chindelevitch
- MRC Centre for Global Infectious Disease Analysis, Imperial College, London, England, UK
| | - Marta Corbella
- Microbiology and Virology Unit, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy
| | - Edward J. Feil
- The Milner Centre for Evolution, Department of Life Sciences, University of Bath, Bath, UK
| | - Davide Sassera
- Department of Biology and Biotechnology, University of Pavia, Pavia, Italy
- Fondazione IRCCS Policlinico San Matteo, Pavia, Italy
| | - John A. Lees
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| |
Collapse
|
8
|
Wu S, Feng T, Tang W, Qi C, Gao J, He X, Wang J, Zhou H, Fang Z. metaProbiotics: a tool for mining probiotic from metagenomic binning data based on a language model. Brief Bioinform 2024; 25:bbae085. [PMID: 38487846 PMCID: PMC10940841 DOI: 10.1093/bib/bbae085] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Revised: 01/26/2024] [Accepted: 02/15/2024] [Indexed: 03/18/2024] Open
Abstract
Beneficial bacteria remain largely unexplored. Lacking systematic methods, understanding probiotic community traits becomes challenging, leading to various conclusions about their probiotic effects among different publications. We developed language model-based metaProbiotics to rapidly detect probiotic bins from metagenomes, demonstrating superior performance in simulated benchmark datasets. Testing on gut metagenomes from probiotic-treated individuals, it revealed the probioticity of intervention strains-derived bins and other probiotic-associated bins beyond the training data, such as a plasmid-like bin. Analyses of these bins revealed various probiotic mechanisms and bai operon as probiotic Ruminococcaceae's potential marker. In different health-disease cohorts, these bins were more common in healthy individuals, signifying their probiotic role, but relevant health predictions based on the abundance profiles of these bins faced cross-disease challenges. To better understand the heterogeneous nature of probiotics, we used metaProbiotics to construct a comprehensive probiotic genome set from global gut metagenomic data. Module analysis of this set shows that diseased individuals often lack certain probiotic gene modules, with significant variation of the missing modules across different diseases. Additionally, different gene modules on the same probiotic have heterogeneous effects on various diseases. We thus believe that gene function integrity of the probiotic community is more crucial in maintaining gut homeostasis than merely increasing specific gene abundance, and adding probiotics indiscriminately might not boost health. We expect that the innovative language model-based metaProbiotics tool will promote novel probiotic discovery using large-scale metagenomic data and facilitate systematic research on bacterial probiotic effects. The metaProbiotics program can be freely downloaded at https://github.com/zhenchengfang/metaProbiotics.
Collapse
Affiliation(s)
- Shufang Wu
- Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital, Southern Medical University, Guangzhou, China
| | - Tao Feng
- Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital, Southern Medical University, Guangzhou, China
| | - Waijiao Tang
- Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital, Southern Medical University, Guangzhou, China
| | - Cancan Qi
- Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital, Southern Medical University, Guangzhou, China
| | - Jie Gao
- Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital, Southern Medical University, Guangzhou, China
- Department of Gastroenterology, The Second Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
| | - Xiaolong He
- Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital, Southern Medical University, Guangzhou, China
| | - Jiaxuan Wang
- Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital, Southern Medical University, Guangzhou, China
| | - Hongwei Zhou
- Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital, Southern Medical University, Guangzhou, China
| | - Zhencheng Fang
- Microbiome Medicine Center, Department of Laboratory Medicine, Zhujiang Hospital, Southern Medical University, Guangzhou, China
| |
Collapse
|
9
|
Gao Y, Li H, Zhao C, Li S, Yin G, Wang H. Machine learning and feature extraction for rapid antimicrobial resistance prediction of Acinetobacter baumannii from whole-genome sequencing data. Front Microbiol 2024; 14:1320312. [PMID: 38274740 PMCID: PMC10808480 DOI: 10.3389/fmicb.2023.1320312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2023] [Accepted: 12/22/2023] [Indexed: 01/27/2024] Open
Abstract
Background Whole-genome sequencing (WGS) has contributed significantly to advancements in machine learning methods for predicting antimicrobial resistance (AMR). However, the comparisons of different methods for AMR prediction without requiring prior knowledge of resistance remains to be conducted. Methods We aimed to predict the minimum inhibitory concentrations (MICs) of 13 antimicrobial agents against Acinetobacter baumannii using three machine learning algorithms (random forest, support vector machine, and XGBoost) combined with k-mer features extracted from WGS data. Results A cohort of 339 isolates was used for model construction. The average essential agreement and category agreement of the best models exceeded 90.90% (95%CI, 89.03-92.77%) and 95.29% (95%CI, 94.91-95.67%), respectively; the exceptions being levofloxacin, minocycline and imipenem. The very major error rates ranged from 0.0 to 5.71%. We applied feature selection pipelines to extract the top-ranked 11-mers to optimise training time and computing resources. This approach slightly improved the prediction performance and enabled us to obtain prediction results within 10 min. Notably, when employing these top-ranked 11-mers in an independent test dataset (120 isolates), we achieved an average accuracy of 0.96. Conclusion Our study is the first to demonstrate that AMR prediction for A. baumannii using machine learning methods based on k-mer features has competitive performance over traditional workflows; hence, sequence-based AMR prediction and its application could be further promoted. The k-mer-based workflow developed in this study demonstrated high recall/sensitivity and specificity, making it a dependable tool for MIC prediction in clinical settings.
Collapse
Affiliation(s)
- Yue Gao
- Institute of Medical Technology, Peking University Health Science Center, Beijing, China
- Department of Clinical Laboratory, Peking University People's Hospital, Beijing, China
| | - Henan Li
- Department of Clinical Laboratory, Peking University People's Hospital, Beijing, China
| | - Chunjiang Zhao
- Department of Clinical Laboratory, Peking University People's Hospital, Beijing, China
| | - Shuguang Li
- Department of Clinical Laboratory, Peking University People's Hospital, Beijing, China
| | - Guankun Yin
- Department of Clinical Laboratory, Peking University People's Hospital, Beijing, China
| | - Hui Wang
- Institute of Medical Technology, Peking University Health Science Center, Beijing, China
- Department of Clinical Laboratory, Peking University People's Hospital, Beijing, China
| |
Collapse
|
10
|
Hyun JC, Monk JM, Szubin R, Hefner Y, Palsson BO. Global pathogenomic analysis identifies known and candidate genetic antimicrobial resistance determinants in twelve species. Nat Commun 2023; 14:7690. [PMID: 38001096 PMCID: PMC10673929 DOI: 10.1038/s41467-023-43549-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2022] [Accepted: 11/14/2023] [Indexed: 11/26/2023] Open
Abstract
Surveillance programs for managing antimicrobial resistance (AMR) have yielded thousands of genomes suited for data-driven mechanism discovery. We present a workflow integrating pangenomics, gene annotation, and machine learning to identify AMR genes at scale. When applied to 12 species, 27,155 genomes, and 69 drugs, we 1) find AMR gene transfer mostly confined within related species, with 925 genes in multiple species but just eight in multiple phylogenetic classes, 2) demonstrate that discovery-oriented support vector machines outperform contemporary methods at recovering known AMR genes, recovering 263 genes compared to 145 by Pyseer, and 3) identify 142 AMR gene candidates. Validation of two candidates in E. coli BW25113 reveals cases of conditional resistance: ΔcycA confers ciprofloxacin resistance in minimal media with D-serine, and frdD V111D confers ampicillin resistance in the presence of ampC by modifying the overlapping promoter. We expect this approach to be adaptable to other species and phenotypes.
Collapse
Affiliation(s)
- Jason C Hyun
- Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, CA, USA
| | - Jonathan M Monk
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Richard Szubin
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Ying Hefner
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Bernhard O Palsson
- Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, CA, USA.
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA.
- Department of Pediatrics, University of California, San Diego, La Jolla, CA, USA.
- Center for Microbiome Innovation, University of California, San Diego, La Jolla, CA, USA.
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Building 220, 2800, Kongens, Lyngby, Denmark.
| |
Collapse
|
11
|
Yang J, Eyre DW, Lu L, Clifton DA. Interpretable machine learning-based decision support for prediction of antibiotic resistance for complicated urinary tract infections. NPJ Antimicrob Resist 2023; 1:14. [PMID: 38686216 PMCID: PMC11057209 DOI: 10.1038/s44259-023-00015-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Accepted: 10/04/2023] [Indexed: 05/02/2024]
Abstract
Urinary tract infections are one of the most common bacterial infections worldwide; however, increasing antimicrobial resistance in bacterial pathogens is making it challenging for clinicians to correctly prescribe patients appropriate antibiotics. In this study, we present four interpretable machine learning-based decision support algorithms for predicting antimicrobial resistance. Using electronic health record data from a large cohort of patients diagnosed with potentially complicated UTIs, we demonstrate high predictability of antibiotic resistance across four antibiotics - nitrofurantoin, co-trimoxazole, ciprofloxacin, and levofloxacin. We additionally demonstrate the generalizability of our methods on a separate cohort of patients with uncomplicated UTIs, demonstrating that machine learning-driven approaches can help alleviate the potential of administering non-susceptible treatments, facilitate rapid effective clinical interventions, and enable personalized treatment suggestions. Additionally, these techniques present the benefit of providing model interpretability, explaining the basis for generated predictions.
Collapse
Affiliation(s)
- Jenny Yang
- Institute of Biomedical Engineering, Department Engineering Science, University of Oxford, Oxford, UK
| | - David W. Eyre
- Big Data Institute, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Lei Lu
- Institute of Biomedical Engineering, Department Engineering Science, University of Oxford, Oxford, UK
| | - David A. Clifton
- Institute of Biomedical Engineering, Department Engineering Science, University of Oxford, Oxford, UK
- Oxford-Suzhou Centre for Advanced Research (OSCAR), Suzhou, China
| |
Collapse
|
12
|
Yamin D, Uskoković V, Wakil AM, Goni MD, Shamsuddin SH, Mustafa FH, Alfouzan WA, Alissa M, Alshengeti A, Almaghrabi RH, Fares MAA, Garout M, Al Kaabi NA, Alshehri AA, Ali HM, Rabaan AA, Aldubisi FA, Yean CY, Yusof NY. Current and Future Technologies for the Detection of Antibiotic-Resistant Bacteria. Diagnostics (Basel) 2023; 13:3246. [PMID: 37892067 PMCID: PMC10606640 DOI: 10.3390/diagnostics13203246] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2023] [Revised: 10/14/2023] [Accepted: 10/15/2023] [Indexed: 10/29/2023] Open
Abstract
Antibiotic resistance is a global public health concern, posing a significant threat to the effectiveness of antibiotics in treating bacterial infections. The accurate and timely detection of antibiotic-resistant bacteria is crucial for implementing appropriate treatment strategies and preventing the spread of resistant strains. This manuscript provides an overview of the current and emerging technologies used for the detection of antibiotic-resistant bacteria. We discuss traditional culture-based methods, molecular techniques, and innovative approaches, highlighting their advantages, limitations, and potential future applications. By understanding the strengths and limitations of these technologies, researchers and healthcare professionals can make informed decisions in combating antibiotic resistance and improving patient outcomes.
Collapse
Affiliation(s)
- Dina Yamin
- Al-Karak Public Hospital, Karak 61210, Jordan;
- Institute for Research in Molecular Medicine, University Sains Malaysia, Health Campus, Kubang Kerian 16150, Kelantan, Malaysia
- Department of Veterinary Clinical Studies, Faculty of Veterinary Medicine, University Malaysia Kelantan, Kota Bharu 16100, Kelantan, Malaysia;
| | - Vuk Uskoković
- TardigradeNano LLC., Irvine, CA 92604, USA;
- Department of Mechanical Engineering, San Diego State University, San Diego, CA 92182, USA
| | - Abubakar Muhammad Wakil
- Department of Veterinary Clinical Studies, Faculty of Veterinary Medicine, University Malaysia Kelantan, Kota Bharu 16100, Kelantan, Malaysia;
- Department of Veterinary Physiology and Biochemistry, Faculty of Veterinary Medicine, University of Maiduguri, Maiduguri 600104, Borno, Nigeria
| | - Mohammed Dauda Goni
- Public Health and Zoonoses Research Group, Faculty of Veterinary Medicine, University Malaysia Kelantan, Pengkalan Chepa 16100, Kelantan, Malaysia;
| | - Shazana Hilda Shamsuddin
- Department of Pathology, School of Medical Sciences, University Sains Malaysia, Health Campus, Kubang Kerian 16150, Kelantan, Malaysia;
| | - Fatin Hamimi Mustafa
- Department of Electronic & Computer Engineering, Faculty of Electrical Engineering, University Teknologi Malaysia, Johor Bharu 81310, Johor, Malaysia;
| | - Wadha A. Alfouzan
- Department of Microbiology, Faculty of Medicine, Kuwait University, Safat 13110, Kuwait;
- Microbiology Unit, Department of Laboratories, Farwania Hospital, Farwania 85000, Kuwait
| | - Mohammed Alissa
- Department of Medical Laboratory Sciences, College of Applied Medical Sciences, Prince Sattam bin Abdulaziz University, Al-Kharj 11942, Saudi Arabia;
| | - Amer Alshengeti
- Department of Pediatrics, College of Medicine, Taibah University, Al-Madinah 41491, Saudi Arabia;
- Department of Infection Prevention and Control, Prince Mohammad Bin Abdulaziz Hospital, National Guard Health Affairs, Al-Madinah 41491, Saudi Arabia
| | - Rana H. Almaghrabi
- Pediatric Department, Prince Sultan Medical Military City, Riyadh 12233, Saudi Arabia;
- College of Medicine, Alfaisal University, Riyadh 11533, Saudi Arabia;
| | - Mona A. Al Fares
- Department of Internal Medicine, King Abdulaziz University Hospital, Jeddah 21589, Saudi Arabia;
| | - Mohammed Garout
- Department of Community Medicine and Health Care for Pilgrims, Faculty of Medicine, Umm Al-Qura University, Makkah 21955, Saudi Arabia;
| | - Nawal A. Al Kaabi
- College of Medicine and Health Science, Khalifa University, Abu Dhabi 127788, United Arab Emirates;
- Sheikh Khalifa Medical City, Abu Dhabi Health Services Company (SEHA), Abu Dhabi 51900, United Arab Emirates
| | - Ahmad A. Alshehri
- Department of Clinical Laboratory Sciences, Faculty of Applied Medical Sciences, Najran University, Najran 61441, Saudi Arabia;
| | - Hamza M. Ali
- Department of Medical Laboratories Technology, College of Applied Medical Sciences, Taibah University, Madinah 41411, Saudi Arabia;
| | - Ali A. Rabaan
- College of Medicine, Alfaisal University, Riyadh 11533, Saudi Arabia;
- Molecular Diagnostic Laboratory, Johns Hopkins Aramco Healthcare, Dhahran 31311, Saudi Arabia
- Department of Public Health and Nutrition, The University of Haripur, Haripur 22610, Pakistan
| | | | - Chan Yean Yean
- Department of Medical Microbiology & Parasitology, School of Medical Sciences, University Sains Malaysia, Kubang Kerian 16150, Kelantan, Malaysia
| | - Nik Yusnoraini Yusof
- Institute for Research in Molecular Medicine, University Sains Malaysia, Health Campus, Kubang Kerian 16150, Kelantan, Malaysia
| |
Collapse
|
13
|
Nguyen M, Elmore Z, Ihle C, Moen FS, Slater AD, Turner BN, Parrello B, Best AA, Davis JJ. Predicting variable gene content in Escherichia coli using conserved genes. mSystems 2023; 8:e0005823. [PMID: 37314210 PMCID: PMC10469788 DOI: 10.1128/msystems.00058-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Accepted: 04/25/2023] [Indexed: 06/15/2023] Open
Abstract
Having the ability to predict the protein-encoding gene content of an incomplete genome or metagenome-assembled genome is important for a variety of bioinformatic tasks. In this study, as a proof of concept, we built machine learning classifiers for predicting variable gene content in Escherichia coli genomes using only the nucleotide k-mers from a set of 100 conserved genes as features. Protein families were used to define orthologs, and a single classifier was built for predicting the presence or absence of each protein family occurring in 10%-90% of all E. coli genomes. The resulting set of 3,259 extreme gradient boosting classifiers had a per-genome average macro F1 score of 0.944 [0.943-0.945, 95% CI]. We show that the F1 scores are stable across multi-locus sequence types and that the trend can be recapitulated by sampling a smaller number of core genes or diverse input genomes. Surprisingly, the presence or absence of poorly annotated proteins, including "hypothetical proteins" was accurately predicted (F1 = 0.902 [0.898-0.906, 95% CI]). Models for proteins with horizontal gene transfer-related functions had slightly lower F1 scores but were still accurate (F1s = 0.895, 0.872, 0.824, and 0.841 for transposon, phage, plasmid, and antimicrobial resistance-related functions, respectively). Finally, using a holdout set of 419 diverse E. coli genomes that were isolated from freshwater environmental sources, we observed an average per-genome F1 score of 0.880 [0.876-0.883, 95% CI], demonstrating the extensibility of the models. Overall, this study provides a framework for predicting variable gene content using a limited amount of input sequence data. IMPORTANCE Having the ability to predict the protein-encoding gene content of a genome is important for assessing genome quality, binning genomes from shotgun metagenomic assemblies, and assessing risk due to the presence of antimicrobial resistance and other virulence genes. In this study, we built a set of binary classifiers for predicting the presence or absence of variable genes occurring in 10%-90% of all publicly available E. coli genomes. Overall, the results show that a large portion of the E. coli variable gene content can be predicted with high accuracy, including genes with functions relating to horizontal gene transfer. This study offers a strategy for predicting gene content using limited input sequence data.
Collapse
Affiliation(s)
- Marcus Nguyen
- Data Science and Learning Division, Argonne National Laboratory, Lemont, Illinois, USA
- Consortium for Advanced Science and Engineering, University of Chicago, Chicago, Illinois, USA
| | - Zachary Elmore
- Biology Department, Hope College, Holland, Michigan, USA
| | - Clay Ihle
- Biology Department, Hope College, Holland, Michigan, USA
| | | | - Adam D. Slater
- Biology Department, Hope College, Holland, Michigan, USA
| | | | - Bruce Parrello
- Consortium for Advanced Science and Engineering, University of Chicago, Chicago, Illinois, USA
- Fellowship for Interpretation of Genomes, Burr Ridge, Illinois, USA
| | - Aaron A. Best
- Biology Department, Hope College, Holland, Michigan, USA
| | - James J. Davis
- Data Science and Learning Division, Argonne National Laboratory, Lemont, Illinois, USA
- Consortium for Advanced Science and Engineering, University of Chicago, Chicago, Illinois, USA
| |
Collapse
|
14
|
Karlsen ST, Rau MH, Sánchez BJ, Jensen K, Zeidan AA. From genotype to phenotype: computational approaches for inferring microbial traits relevant to the food industry. FEMS Microbiol Rev 2023; 47:fuad030. [PMID: 37286882 PMCID: PMC10337747 DOI: 10.1093/femsre/fuad030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 05/31/2023] [Accepted: 06/06/2023] [Indexed: 06/09/2023] Open
Abstract
When selecting microbial strains for the production of fermented foods, various microbial phenotypes need to be taken into account to achieve target product characteristics, such as biosafety, flavor, texture, and health-promoting effects. Through continuous advances in sequencing technologies, microbial whole-genome sequences of increasing quality can now be obtained both cheaper and faster, which increases the relevance of genome-based characterization of microbial phenotypes. Prediction of microbial phenotypes from genome sequences makes it possible to quickly screen large strain collections in silico to identify candidates with desirable traits. Several microbial phenotypes relevant to the production of fermented foods can be predicted using knowledge-based approaches, leveraging our existing understanding of the genetic and molecular mechanisms underlying those phenotypes. In the absence of this knowledge, data-driven approaches can be applied to estimate genotype-phenotype relationships based on large experimental datasets. Here, we review computational methods that implement knowledge- and data-driven approaches for phenotype prediction, as well as methods that combine elements from both approaches. Furthermore, we provide examples of how these methods have been applied in industrial biotechnology, with special focus on the fermented food industry.
Collapse
Affiliation(s)
- Signe T Karlsen
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| | - Martin H Rau
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| | - Benjamín J Sánchez
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| | - Kristian Jensen
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| | - Ahmad A Zeidan
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| |
Collapse
|
15
|
Chong YY, Chan PK, Chan VWK, Cheung A, Luk MH, Cheung MH, Fu H, Chiu KY. Application of machine learning in the prevention of periprosthetic joint infection following total knee arthroplasty: a systematic review. Arthroplasty 2023; 5:38. [PMID: 37316877 DOI: 10.1186/s42836-023-00195-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Accepted: 05/11/2023] [Indexed: 06/16/2023] Open
Abstract
BACKGROUND Machine learning is a promising and powerful technology with increasing use in orthopedics. Periprosthetic joint infection following total knee arthroplasty results in increased morbidity and mortality. This systematic review investigated the use of machine learning in preventing periprosthetic joint infection. METHODS A systematic review was conducted according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines. PubMed was searched in November 2022. All studies that investigated the clinical applications of machine learning in the prevention of periprosthetic joint infection following total knee arthroplasty were included. Non-English studies, studies with no full text available, studies focusing on non-clinical applications of machine learning, reviews and meta-analyses were excluded. For each included study, its characteristics, machine learning applications, algorithms, statistical performances, strengths and limitations were summarized. Limitations of the current machine learning applications and the studies, including their 'black box' nature, overfitting, the requirement of a large dataset, the lack of external validation, and their retrospective nature were identified. RESULTS Eleven studies were included in the final analysis. Machine learning applications in the prevention of periprosthetic joint infection were divided into four categories: prediction, diagnosis, antibiotic application and prognosis. CONCLUSION Machine learning may be a favorable alternative to manual methods in the prevention of periprosthetic joint infection following total knee arthroplasty. It aids in preoperative health optimization, preoperative surgical planning, the early diagnosis of infection, the early application of suitable antibiotics, and the prediction of clinical outcomes. Future research is warranted to resolve the current limitations and bring machine learning into clinical settings.
Collapse
Affiliation(s)
- Yuk Yee Chong
- Department of Orthopaedics and Traumatology, School of Clinical Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Ping Keung Chan
- Department of Orthopaedics and Traumatology, School of Clinical Medicine, The University of Hong Kong, Hong Kong SAR, China.
| | - Vincent Wai Kwan Chan
- Department of Orthopaedics and Traumatology, Queen Mary Hospital, Hong Kong SAR, China
| | - Amy Cheung
- Department of Orthopaedics and Traumatology, Queen Mary Hospital, Hong Kong SAR, China
| | - Michelle Hilda Luk
- Department of Orthopaedics and Traumatology, Queen Mary Hospital, Hong Kong SAR, China
| | - Man Hong Cheung
- Department of Orthopaedics and Traumatology, School of Clinical Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Henry Fu
- Department of Orthopaedics and Traumatology, School of Clinical Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Kwong Yuen Chiu
- Department of Orthopaedics and Traumatology, School of Clinical Medicine, The University of Hong Kong, Hong Kong SAR, China
| |
Collapse
|
16
|
Yang MR, Su SF, Wu YW. Using bacterial pan-genome-based feature selection approach to improve the prediction of minimum inhibitory concentration (MIC). Front Genet 2023; 14:1054032. [PMID: 37323667 PMCID: PMC10267731 DOI: 10.3389/fgene.2023.1054032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 05/16/2023] [Indexed: 06/17/2023] Open
Abstract
Background: Predicting the resistance profiles of antimicrobial resistance (AMR) pathogens is becoming more and more important in treating infectious diseases. Various attempts have been made to build machine learning models to classify resistant or susceptible pathogens based on either known antimicrobial resistance genes or the entire gene set. However, the phenotypic annotations are translated from minimum inhibitory concentration (MIC), which is the lowest concentration of antibiotic drugs in inhibiting certain pathogenic strains. Since the MIC breakpoints that classify a strain to be resistant or susceptible to specific antibiotic drug may be revised by governing institutes, we refrained from translating these MIC values into the categories "susceptible" or "resistant" but instead attempted to predict the MIC values using machine learning approaches. Results: By applying a machine learning feature selection approach on a Salmonella enterica pan-genome, in which the protein sequences were clustered to identify highly similar gene families, we showed that the selected features (genes) performed better than known AMR genes, and that models built on the selected genes achieved very accurate MIC prediction. Functional analysis revealed that about half of the selected genes were annotated as hypothetical proteins (i.e., with unknown functional roles), and that only a small portion of known AMR genes were among the selected genes, indicating that applying feature selection on the entire gene set has the potential of uncovering novel genes that may be associated with and may contribute to pathogenic antimicrobial resistances. Conclusion: The application of the pan-genome-based machine learning approach was indeed capable of predicting MIC values with very high accuracy. The feature selection process may also identify novel AMR genes for inferring bacterial antimicrobial resistance phenotypes.
Collapse
Affiliation(s)
- Ming-Ren Yang
- Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan
- Department of Electrical Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan
| | - Shun-Feng Su
- Department of Electrical Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan
| | - Yu-Wei Wu
- Graduate Institute of Biomedical Informatics, College of Medical Science and Technology, Taipei Medical University, Taipei, Taiwan
- Clinical Big Data Research Center, Taipei Medical University Hospital, Taipei, Taiwan
- TMU Research Center for Digestive Medicine, Taipei Medical University, Taipei, Taiwan
| |
Collapse
|
17
|
Farhat F, Athar MT, Ahmad S, Madsen DØ, Sohail SS. Antimicrobial resistance and machine learning: past, present, and future. Front Microbiol 2023; 14:1179312. [PMID: 37303800 PMCID: PMC10250749 DOI: 10.3389/fmicb.2023.1179312] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Accepted: 05/15/2023] [Indexed: 06/13/2023] Open
Abstract
Machine learning has become ubiquitous across all industries, including the relatively new application of predicting antimicrobial resistance. As the first bibliometric review in this field, we expect it to inspire further research in this area. The review employs standard bibliometric indicators such as article count, citation count, and the Hirsch index (H-index) to evaluate the relevance and impact of the leading countries, organizations, journals, and authors in this field. VOSviewer and Biblioshiny programs are utilized to analyze citation and co-citation networks, collaboration networks, keyword co-occurrence, and trend analysis. The United States has the highest contribution with 254 articles, accounting for over 37.57% of the total corpus, followed by China (103) and the United Kingdom (78). Among 58 publishers, the top four publishers account for 45% of the publications, with Elsevier leading with 15% of the publications, followed by Springer Nature (12%), MDPI, and Frontiers Media SA with 9% each. Frontiers in Microbiology is the most frequent publication source (33 articles), followed by Scientific Reports (29 articles), PLoS One (17 articles), and Antibiotics (16 articles). The study reveals a substantial increase in research and publications on the use of machine learning to predict antibiotic resistance. Recent research has focused on developing advanced machine learning algorithms that can accurately forecast antibiotic resistance, and a range of algorithms are now being used to address this issue.
Collapse
Affiliation(s)
- Faiza Farhat
- Department of Zoology, Aligarh Muslim University, Aligarh, India
| | - Md Tanwir Athar
- Department of Pharmacognosy and Pharmaceutical Chemistry, College of Dentistry and Pharmacy, Buraydah Colleges, Buraydah, Al-Qassim, Saudi Arabia
| | - Sultan Ahmad
- Department of Computer Science, College of Computer Engineering and Sciences, Prince Sattam bin Abdulaziz University, Al-Kharj, Saudi Arabia
- Department of Computer Science and Engineering, University Center for Research and Development (UCRD), Chandigarh University, Mohali, Punjab, India
| | - Dag Øivind Madsen
- School of Business, University of South-Eastern Norway, Hønefoss, Norway
| | - Shahab Saquib Sohail
- Department of Computer Science and Engineering, Jamia Hamdard University, New Delhi, India
| |
Collapse
|
18
|
Kim J, Rupasinghe R, Halev A, Huang C, Rezaei S, Clavijo MJ, Robbins RC, Martínez-López B, Liu X. Predicting antimicrobial resistance of bacterial pathogens using time series analysis. Front Microbiol 2023; 14:1160224. [PMID: 37250043 PMCID: PMC10213968 DOI: 10.3389/fmicb.2023.1160224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Accepted: 04/12/2023] [Indexed: 05/31/2023] Open
Abstract
Antimicrobial resistance (AMR) is arguably one of the major health and economic challenges in our society. A key aspect of tackling AMR is rapid and accurate detection of the emergence and spread of AMR in food animal production, which requires routine AMR surveillance. However, AMR detection can be expensive and time-consuming considering the growth rate of the bacteria and the most commonly used analytical procedures, such as Minimum Inhibitory Concentration (MIC) testing. To mitigate this issue, we utilized machine learning to predict the future AMR burden of bacterial pathogens. We collected pathogen and antimicrobial data from >600 farms in the United States from 2010 to 2021 to generate AMR time series data. Our prediction focused on five bacterial pathogens (Escherichia coli, Streptococcus suis, Salmonella sp., Pasteurella multocida, and Bordetella bronchiseptica). We found that Seasonal Auto-Regressive Integrated Moving Average (SARIMA) outperformed five baselines, including Auto-Regressive Moving Average (ARMA) and Auto-Regressive Integrated Moving Average (ARIMA). We hope this study provides valuable tools to predict the AMR burden not only of the pathogens assessed in this study but also of other bacterial pathogens.
Collapse
Affiliation(s)
- Jeonghoon Kim
- Department of Mathematics, University of California, Davis, Davis, CA, United States
| | - Ruwini Rupasinghe
- Department of Medicine and Epidemiology, Center for Animal Disease Modeling and Surveillance (CADMS), School of Veterinary Medicine, University of California, Davis, Davis, CA, United States
| | - Avishai Halev
- Department of Mathematics, University of California, Davis, Davis, CA, United States
| | - Chao Huang
- Department of Computer Science, University of California, Davis, Davis, CA, United States
| | - Shahbaz Rezaei
- Department of Computer Science, University of California, Davis, Davis, CA, United States
| | - Maria J. Clavijo
- Department of Veterinary Diagnostic & Production Animal Medicine (VDPAM), Iowa State University, Ames, IA, United States
| | - Rebecca C. Robbins
- R.C. Robbins Swine Consulting Services, PLLC, Amarillo, TX, United States
| | - Beatriz Martínez-López
- Department of Medicine and Epidemiology, Center for Animal Disease Modeling and Surveillance (CADMS), School of Veterinary Medicine, University of California, Davis, Davis, CA, United States
| | - Xin Liu
- Department of Computer Science, University of California, Davis, Davis, CA, United States
| |
Collapse
|
19
|
Lüftinger L, Májek P, Rattei T, Beisken S. Metagenomic Antimicrobial Susceptibility Testing from Simulated Native Patient Samples. Antibiotics (Basel) 2023; 12. [PMID: 36830277 DOI: 10.3390/antibiotics12020366] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Revised: 02/06/2023] [Accepted: 02/08/2023] [Indexed: 02/12/2023] Open
Abstract
Genomic antimicrobial susceptibility testing (AST) has been shown to be accurate for many pathogens and antimicrobials. However, these methods have not been systematically evaluated for clinical metagenomic data. We investigate the performance of in-silico AST from clinical metagenomes (MG-AST). Using isolate sequencing data from a multi-center study on antimicrobial resistance (AMR) as well as shotgun-sequenced septic urine samples, we simulate over 2000 complicated urinary tract infection (cUTI) metagenomes with known resistance phenotype to 5 antimicrobials. Applying rule-based and machine learning-based genomic AST classifiers, we explore the impact of sequencing depth and technology, metagenome complexity, and bioinformatics processing approaches on AST accuracy. By using an optimized metagenomics assembly and binning workflow, MG-AST achieved balanced accuracy within 5.1% of isolate-derived genomic AST. For poly-microbial infections, taxonomic sample complexity and relatedness of taxa in the sample is a key factor influencing metagenomic binning and downstream MG-AST accuracy. We show that the reassignment of putative plasmid contigs by their predicted host range and investigation of whole resistome capabilities improved MG-AST performance on poly-microbial samples. We further demonstrate that machine learning-based methods enable MG-AST with superior accuracy compared to rule-based approaches on simulated native patient samples.
Collapse
|
20
|
Li S, Wu J, Ma N, Liu W, Shao M, Ying N, Zhu L. Prediction of genome-wide imipenem resistance features in Klebsiella pneumoniae using machine learning. J Med Microbiol 2023; 72. [PMID: 36753438 DOI: 10.1099/jmm.0.001657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/09/2023] Open
Abstract
Introduction. The resistance rate of Klebsiella pneumoniae (K. pneumoniae) to imipenem is increasing year by year, and the imipenem resistance mechanism of K. pneumoniae is complex. Therefore, it is urgent to develop new strategies to explore the resistance mechanism of imipenem for its effective and accurate use in clinical practice.Hypothesis/Gap sStatement. Machine learning could identify resistance features and biological process that influence microbial resistance from whole-genome sequencing (WGS) data.Aims. This work aimed to predict imipenem resistance genetic features in K. pneumoniae from whole-genome k-mer features, and analyse their function for understanding its resistance mechanism.Methods. This study analysed WGS data of K. pneumoniae combined with resistance phenotype for imipenem, and established K. pneumoniae to imipenem genotype-phenotype model to predict resistance features using chi-squared test and random forest. An external clinical dataset was used to verify prediction power of resistance features. The potential genes were identified through alignment the resistance features with the K. pneumoniae reference genome using blastn, the functions of potential genes were further analysed to explore its resistance-related signalling pathways with GO and KEGG analysis, the resistance sequence patterns were screened using streme software. Finally, the resistance features were combined and modelled through four machine-learning algorithms (logistic regression, SVM, GBDT and XGBoost) to evaluate their phenotype prediction ability.Results. A total of 16 670 imipenem resistance features were predicted from genotype-phenotype model. The 30 potential genes were identified by annotating the resistance features and corresponded to known antibiotic-related genes (mdtM, dedA, rne, etc.). GO and KEGG pathway analyses indicated the possible association of imipenem resistance with metabolism process and cell membrane. CRYCAGCDN and CGRDAAAN were found from the imipenem resistance features, which were widely presented in the reported β-lactam resistance genes (bla SHV, bla CTX-M, bla TEM, etc.), and YCYAGCMCAST with metabolic functions (organic substance metabolic process, nitrogen compound metabolic process and cellular metabolic process) was identified from the top 50 resistance features. The 25 resistance genes in the training dataset included 19 genes in the external dataset, which verified the accuracy of prediction. The area under curve values of logistics regression, SVM, GBDT and XGBoost were 0.965, 0.966, 0.969 and 0.969, respectively, indicating that the imipenem resistance features have a strong prediction power.Conclusion. Machine-learning methods could effectively predict the imipenem resistance feature in K. pneumoniae, and provide resistance sequence profiles for predicting resistance phenotype and exploring potential resistance mechanisms. It provides an important insight into the potential therapeutic strategies of K. pneumoniae resistance to imipenem, and speed up the application of machine learning in routine diagnosis.
Collapse
Affiliation(s)
- Shanshan Li
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China
| | - Jun Wu
- Lin'an Center for Disease Control and Prevention, Lin'an, 311300, PR China
| | - Nan Ma
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China
| | - Wenjia Liu
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China.,College of Electronics and Information Engineering, Hangzhou Dianzi University, Hangzhou 310018, PR China
| | - Mengjie Shao
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China
| | - Nanjiao Ying
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China.,Institute of Biomedical Engineering and Instrument, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China
| | - Lei Zhu
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China.,Institute of Biomedical Engineering and Instrument, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China
| |
Collapse
|
21
|
Gupta E, Saxena J, Kumar S, Sharma U, Rastogi S, Srivastava VK, Kaushik S, Jyoti A. Fast Track Diagnostic Tools for Clinical Management of Sepsis: Paradigm Shift from Conventional to Advanced Methods. Diagnostics (Basel) 2023; 13. [PMID: 36673087 DOI: 10.3390/diagnostics13020277] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2022] [Revised: 12/24/2022] [Accepted: 01/09/2023] [Indexed: 01/15/2023] Open
Abstract
Sepsis is one of the deadliest disorders in the new century due to specific limitations in early and differential diagnosis. Moreover, antimicrobial resistance (AMR) is becoming the dominant threat to human health globally. The only way to encounter the spread and emergence of AMR is through the active detection and identification of the pathogen along with the quantification of resistance. For better management of such disease, there is an essential requirement to approach many suitable diagnostic techniques for the proper administration of antibiotics and elimination of these infectious diseases. The current method employed for the diagnosis of sepsis relies on the conventional culture of blood suspected infection. However, this method is more time consuming and generates results that are false negative in the case of antibiotic pretreated samples as well as slow-growing microbes. In comparison to the conventional method, modern methods are capable of analyzing blood samples, obtaining accurate results from the suspicious patient of sepsis, and giving all the necessary information to identify the pathogens as well as AMR in a short period. The present review is intended to highlight the culture shift from conventional to modern and advanced technologies including their limitations for the proper and prompt diagnosing of bloodstream infections and AMR detection.
Collapse
|
22
|
Vereecke N, Van Hoorde S, Sperling D, Theuns S, Devriendt B, Cox E. Virotyping and genetic antimicrobial susceptibility testing of porcine ETEC/STEC strains and associated plasmid types. Front Microbiol 2023; 14:1139312. [PMID: 37143544 PMCID: PMC10151945 DOI: 10.3389/fmicb.2023.1139312] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Accepted: 03/13/2023] [Indexed: 05/06/2023] Open
Abstract
Introduction Enterotoxigenic Escherichia coli (ETEC) infections are the most common cause of secretory diarrhea in suckling and post-weaning piglets. For the latter, Shiga toxin-producing Escherichia coli (STEC) also cause edema disease. This pathogen leads to significant economic losses. ETEC/STEC strains can be distinguished from general E. coli by the presence of different host colonization factors (e.g., F4 and F18 fimbriae) and various toxins (e.g., LT, Stx2e, STa, STb, EAST-1). Increased resistance against a wide variety of antimicrobial drugs, such as paromomycin, trimethoprim, and tetracyclines, has been observed. Nowadays, diagnosing an ETEC/STEC infection requires culture-dependent antimicrobial susceptibility testing (AST) and multiplex PCRs, which are costly and time-consuming. Methods Here, nanopore sequencing was used on 94 field isolates to assess the predictive power, using the meta R package to determine sensitivity and specificity and associated credibility intervals of genotypes associated with virulence and AMR. Results Genetic markers associated with resistance for amoxicillin (plasmid-encoded TEM genes), cephalosporins (ampC promoter mutations), colistin (mcr genes), aminoglycosides (aac(3) and aph(3) genes), florfenicol (floR), tetracyclines (tet genes), and trimethoprim-sulfa (dfrA genes) could explain most acquired resistance phenotypes. Most of the genes were plasmid-encoded, of which some collocated on a multi-resistance plasmid (12 genes against 4 antimicrobial classes). For fluoroquinolones, AMR was addressed by point mutations within the ParC and GyrA proteins and the qnrS1 gene. In addition, long-read data allowed to study the genetic landscape of virulence- and AMR-carrying plasmids, highlighting a complex interplay of multi-replicon plasmids with varying host ranges. Conclusion Our results showed promising sensitivity and specificity for the detection of all common virulence factors and most resistance genotypes. The use of the identified genetic hallmarks will contribute to the simultaneous identification, pathotyping, and genetic AST within a single diagnostic test. This will revolutionize future quicker and more cost-efficient (meta)genomics-driven diagnostics in veterinary medicine and contribute to epidemiological studies, monitoring, tailored vaccination, and management.
Collapse
Affiliation(s)
- Nick Vereecke
- Laboratory of Virology, Department of Translational Physiology, Infectiology and Public Health, Faculty of Veterinary Medicine, Ghent University, Merelbeke, Belgium
- PathoSense BV, Lier, Belgium
- *Correspondence: Nick Vereecke,
| | - Sander Van Hoorde
- Laboratory of Immunology, Department of Translational Physiology, Infectiology and Public Health, Faculty of Veterinary Medicine, Ghent University, Merelbeke, Belgium
| | | | - Sebastiaan Theuns
- Laboratory of Virology, Department of Translational Physiology, Infectiology and Public Health, Faculty of Veterinary Medicine, Ghent University, Merelbeke, Belgium
| | - Bert Devriendt
- Laboratory of Immunology, Department of Translational Physiology, Infectiology and Public Health, Faculty of Veterinary Medicine, Ghent University, Merelbeke, Belgium
| | - Eric Cox
- Laboratory of Immunology, Department of Translational Physiology, Infectiology and Public Health, Faculty of Veterinary Medicine, Ghent University, Merelbeke, Belgium
| |
Collapse
|
23
|
Yee R, Simner PJ. Next-Generation Sequencing Approaches to Predicting Antimicrobial Susceptibility Testing Results. Clin Lab Med 2022; 42:557-572. [PMID: 36368782 DOI: 10.1016/j.cll.2022.09.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Rebecca Yee
- Division of Medical Microbiology, Department of Pathology, Johns Hopkins University School of Medicine, Meyer B1-193, 600 North Wolfe Street, Baltimore, MD 21287-7093, USA
| | - Patricia J Simner
- Division of Medical Microbiology, Department of Pathology, Johns Hopkins University School of Medicine, Meyer B1-193, 600 North Wolfe Street, Baltimore, MD 21287-7093, USA.
| |
Collapse
|
24
|
Pei N, Sun W, He J, Li Y, Chen X, Liang T, Kristiansen K, Liu W, Li J. Genome-wide association study of Klebsiella pneumoniae identifies variations linked to carbapenems resistance. Front Microbiol 2022; 13:997769. [PMID: 36386631 PMCID: PMC9664935 DOI: 10.3389/fmicb.2022.997769] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Accepted: 10/10/2022] [Indexed: 01/25/2023] Open
Abstract
Klebsiella pneumoniae (KP) is one of the microorganisms that can acquire carbapenem-resistance (CR), and few antimicrobial therapy options exist for infections caused by Carbapenem-Resistant KP (CRKP). In recent years, with the increase of carbapenem resistance rates, treating CRKP has become a serious public health threat in clinical practice. We have collected 2,035 clinical KP isolates from a tertiary hospital in China. Whole genome sequencing data coupled with their binary antimicrobial susceptibility testing data were obtained to conduct the genome-wide association study using a bayesian-based method, including single nucleotide polymorphisms (SNPs) and genes. We identified 28 and 37 potential maker genes associated with imipenem and meropenem resistance, respectively. Among which 19 of them were selected in both drugs by genome-wide association study (GWAS), 11 genes among them were simultaneously validated in independent datasets. These genes were likely related to biofilm formation, efflux pump, and DNA repairing. Moreover, we identified 13 significant CR related SNPs in imipenem or meropenem, with one SNP located in the non-coding region and validated in the independent datasets. Our study indicates complex mechanisms of carbapenems resistance and further investigation of CRKP-related factors are warranted to better understand their contributions to carbapenems resistance. These identified biomarkers may provide targets for future drug interventions or treatments.
Collapse
Affiliation(s)
- Na Pei
- BGI-Shenzhen, Shenzhen, China,Laboratory of Genomics and Molecular Biomedicine, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | | | - Jingxuan He
- Department of Clinical Laboratory, Xiangya Hospital, Central South University, Changsha, China
| | - Yanming Li
- Department of Clinical Laboratory, Xiangya Hospital, Central South University, Changsha, China
| | - Xia Chen
- Department of Clinical Laboratory, Xiangya Hospital, Central South University, Changsha, China
| | - Tianzhu Liang
- BGI-Shenzhen, Shenzhen, China,Shenzhen Key Laboratory of Unknown Pathogen Identification, Shenzhen, China
| | - Karsten Kristiansen
- BGI-Shenzhen, Shenzhen, China,Laboratory of Genomics and Molecular Biomedicine, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Wenen Liu
- Department of Clinical Laboratory, Xiangya Hospital, Central South University, Changsha, China,*Correspondence: Junhua Li, ; Wenen Liu,
| | - Junhua Li
- BGI-Shenzhen, Shenzhen, China,Shenzhen Key Laboratory of Unknown Pathogen Identification, Shenzhen, China,*Correspondence: Junhua Li, ; Wenen Liu,
| |
Collapse
|
25
|
Pilarczyk-Zurek M, Sitkiewicz I, Koziel J. The Clinical View on Streptococcus anginosus Group – Opportunistic Pathogens Coming Out of Hiding. Front Microbiol 2022; 13:956677. [PMID: 35898914 PMCID: PMC9309248 DOI: 10.3389/fmicb.2022.956677] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Accepted: 06/17/2022] [Indexed: 11/13/2022] Open
Abstract
Three distinct streptococcal species: Streptococcus anginosus, Streptococcus intermedius, and Streptococcus constellatus, belonging to the Streptococcus anginosus group (SAG), also known as Streptococcus milleri group, have been attracting clinicians and microbiologists, not only as oral commensals but also as opportunistic pathogens. For years they have been simply classified as so called viridans streptococci, and distinct species were not associated with particular clinical manifestations. Therefore, description of SAG members are clearly underrepresented in the literature, compared to other medically relevant streptococci. However, the increasing number of reports of life-threatening infections caused by SAG indicates their emerging pathogenicity. The improved clinical data generated with the application of modern molecular diagnostic techniques allow for precise identification of individual species belonging to SAG. This review summarizes clinical reports on SAG infections and systematizes data on the occurrence of individual species at the site of infection. We also discuss the issue of proper microbiological diagnostics, which is crucial for further clinical treatment.
Collapse
Affiliation(s)
- Magdalena Pilarczyk-Zurek
- Department of Microbiology, Faculty of Biochemistry, Biophysics and Biotechnology, Jagiellonian University, Krakow, Poland
| | - Izabela Sitkiewicz
- Center for Translational Medicine, Warsaw University of Life Sciences (SGGW), Warszawa, Poland
| | - Joanna Koziel
- Department of Microbiology, Faculty of Biochemistry, Biophysics and Biotechnology, Jagiellonian University, Krakow, Poland
- *Correspondence: Joanna Koziel,
| |
Collapse
|
26
|
Kim JI, Maguire F, Tsang KK, Gouliouris T, Peacock SJ, McAllister TA, McArthur AG, Beiko RG. Machine Learning for Antimicrobial Resistance Prediction: Current Practice, Limitations, and Clinical Perspective. Clin Microbiol Rev 2022;:e0017921. [PMID: 35612324 DOI: 10.1128/cmr.00179-21] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Antimicrobial resistance (AMR) is a global health crisis that poses a great threat to modern medicine. Effective prevention strategies are urgently required to slow the emergence and further dissemination of AMR. Given the availability of data sets encompassing hundreds or thousands of pathogen genomes, machine learning (ML) is increasingly being used to predict resistance to different antibiotics in pathogens based on gene content and genome composition. A key objective of this work is to advocate for the incorporation of ML into front-line settings but also highlight the further refinements that are necessary to safely and confidently incorporate these methods. The question of what to predict is not trivial given the existence of different quantitative and qualitative laboratory measures of AMR. ML models typically treat genes as independent predictors, with no consideration of structural and functional linkages; they also may not be accurate when new mutational variants of known AMR genes emerge. Finally, to have the technology trusted by end users in public health settings, ML models need to be transparent and explainable to ensure that the basis for prediction is clear. We strongly advocate that the next set of AMR-ML studies should focus on the refinement of these limitations to be able to bridge the gap to diagnostic implementation.
Collapse
|
27
|
Coll F, Gouliouris T, Bruchmann S, Phelan J, Raven KE, Clark TG, Parkhill J, Peacock SJ. PowerBacGWAS: a computational pipeline to perform power calculations for bacterial genome-wide association studies. Commun Biol 2022; 5:266. [PMID: 35338232 PMCID: PMC8956664 DOI: 10.1038/s42003-022-03194-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Accepted: 02/25/2022] [Indexed: 12/14/2022] Open
Abstract
Genome-wide association studies (GWAS) are increasingly being applied to investigate the genetic basis of bacterial traits. However, approaches to perform power calculations for bacterial GWAS are limited. Here we implemented two alternative approaches to conduct power calculations using existing collections of bacterial genomes. First, a sub-sampling approach was undertaken to reduce the allele frequency and effect size of a known and detectable genotype-phenotype relationship by modifying phenotype labels. Second, a phenotype-simulation approach was conducted to simulate phenotypes from existing genetic variants. We implemented both approaches into a computational pipeline (PowerBacGWAS) that supports power calculations for burden testing, pan-genome and variant GWAS; and applied it to collections of Enterococcus faecium, Klebsiella pneumoniae and Mycobacterium tuberculosis. We used this pipeline to determine sample sizes required to detect causal variants of different minor allele frequencies (MAF), effect sizes and phenotype heritability, and studied the effect of homoplasy and population diversity on the power to detect causal variants. Our pipeline and user documentation are made available and can be applied to other bacterial populations. PowerBacGWAS can be used to determine sample sizes required to find statistically significant associations, or the associations detectable with a given sample size. We recommend to perform power calculations using existing genomes of the bacterial species and population of study.
Collapse
Affiliation(s)
- Francesc Coll
- Department of Infection Biology, Faculty of Infectious & Tropical Diseases, London School of Hygiene & Tropical Medicine, London, UK.
| | - Theodore Gouliouris
- Department of Medicine, University of Cambridge, Cambridge, UK
- Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK
| | | | - Jody Phelan
- Department of Infection Biology, Faculty of Infectious & Tropical Diseases, London School of Hygiene & Tropical Medicine, London, UK
| | - Kathy E Raven
- Department of Medicine, University of Cambridge, Cambridge, UK
| | - Taane G Clark
- Department of Infection Biology, Faculty of Infectious & Tropical Diseases, London School of Hygiene & Tropical Medicine, London, UK
- Faculty of Epidemiology and Population Health, Department of Infectious Disease Epidemiology, London School of Hygiene & Tropical Medicine, London, UK
| | - Julian Parkhill
- Department of Veterinary Medicine, University of Cambridge, Cambridge, UK
| | | |
Collapse
|
28
|
Roe C, Vazquez AJ, Phillips PD, Allender CJ, Bowen RA, Nottingham RD, Doyle A, Wongsuwan G, Wuthiekanun V, Limmathurotsakul D, Peacock S, Keim P, Tuanyok A, Wagner DM, Sahl JW. Multiple phylogenetically-diverse, differentially-virulent Burkholderia pseudomallei isolated from a single soil sample collected in Thailand. PLoS Negl Trop Dis 2022; 16:e0010172. [PMID: 35143500 PMCID: PMC8865643 DOI: 10.1371/journal.pntd.0010172] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Revised: 02/23/2022] [Accepted: 01/14/2022] [Indexed: 11/24/2022] Open
Abstract
Burkholderia pseudomallei is a soil-dwelling bacterium endemic to Southeast Asia and northern Australia that causes the disease, melioidosis. Although the global genomic diversity of clinical B. pseudomallei isolates has been investigated, there is limited understanding of its genomic diversity across small geographic scales, especially in soil. In this study, we obtained 288 B. pseudomallei isolates from a single soil sample (~100g; intensive site 2, INT2) collected at a depth of 30cm from a site in Ubon Ratchathani Province, Thailand. We sequenced the genomes of 169 of these isolates that represent 7 distinct sequence types (STs), including a new ST (ST1820), based on multi-locus sequence typing (MLST) analysis. A core genome SNP phylogeny demonstrated that all identified STs share a recent common ancestor that diverged an estimated 796-1260 years ago. A pan-genomics analysis demonstrated recombination between clades and intra-MLST phylogenetic and gene differences. To identify potential differential virulence between STs, groups of BALB/c mice (5 mice/isolate) were challenged via subcutaneous injection (500 CFUs) with 30 INT2 isolates representing 5 different STs; over the 21-day experiment, eight isolates killed all mice, 2 isolates killed an intermediate number of mice (1-2), and 20 isolates killed no mice. Although the virulence results were largely stratified by ST, one virulent isolate and six attenuated isolates were from the same ST (ST1005), suggesting that variably conserved genomic regions may contribute to virulence. Genomes from the animal-challenged isolates were subjected to a bacterial genome-wide association study to identify genomic regions associated with differential virulence. One associated region is a unique variant of Hcp1, a component of the type VI secretion system, which may result in attenuation. The results of this study have implications for comprehensive sampling strategies, environmental exposure risk assessment, and understanding recombination and differential virulence in B. pseudomallei.
Collapse
Affiliation(s)
- Chandler Roe
- The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Adam J. Vazquez
- The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Paul D. Phillips
- The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Chris J. Allender
- The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Richard A. Bowen
- Department of Biological Sciences, Colorado State University, Ft. Collins, Colorado, United States of America
| | - Roxanne D. Nottingham
- The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Adina Doyle
- The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Gumphol Wongsuwan
- Mahidol-Oxford Tropical Medicine Research Unit, Mahidol University, Bangkok, Thailand
| | - Vanaporn Wuthiekanun
- Mahidol-Oxford Tropical Medicine Research Unit, Mahidol University, Bangkok, Thailand
| | | | - Sharon Peacock
- Department of Medicine, University of Cambridge, Cambridge, England
| | - Paul Keim
- The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Apichai Tuanyok
- The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - David M. Wagner
- The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States of America
| | - Jason W. Sahl
- The Pathogen and Microbiome Institute, Northern Arizona University, Flagstaff, Arizona, United States of America
| |
Collapse
|
29
|
Ben Khedher M, Ghedira K, Rolain JM, Ruimy R, Croce O. Application and Challenge of 3rd Generation Sequencing for Clinical Bacterial Studies. Int J Mol Sci 2022; 23:1395. [PMID: 35163319 PMCID: PMC8835973 DOI: 10.3390/ijms23031395] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Revised: 01/20/2022] [Accepted: 01/24/2022] [Indexed: 02/04/2023] Open
Abstract
Over the past 25 years, the powerful combination of genome sequencing and bioinformatics analysis has played a crucial role in interpreting information encoded in bacterial genomes. High-throughput sequencing technologies have paved the way towards understanding an increasingly wide range of biological questions. This revolution has enabled advances in areas ranging from genome composition to how proteins interact with nucleic acids. This has created unprecedented opportunities through the integration of genomic data into clinics for the diagnosis of genetic traits associated with disease. Since then, these technologies have continued to evolve, and recently, long-read sequencing has overcome previous limitations in terms of accuracy, thus expanding its applications in genomics, transcriptomics and metagenomics. In this review, we describe a brief history of the bacterial genome sequencing revolution and its application in public health and molecular epidemiology. We present a chronology that encompasses the various technological developments: whole-genome shotgun sequencing, high-throughput sequencing, long-read sequencing. We mainly discuss the application of next-generation sequencing to decipher bacterial genomes. Secondly, we highlight how long-read sequencing technologies go beyond the limitations of traditional short-read sequencing. We intend to provide a description of the guiding principles of the 3rd generation sequencing applications and ongoing improvements in the field of microbial medical research.
Collapse
Affiliation(s)
- Mariem Ben Khedher
- Bacteriology Laboratory, Archet 2 Hospital, CHU Nice, 06000 Nice, France
- Institute for Research on Cancer and Aging Nice (IRCAN), CNRS, INSERM, Université Côte d’Azur, 06108 Nice, France
| | - Kais Ghedira
- Laboratory of Bioinformatics, Biomathematics and Biostatistics, Institute Pasteur of Tunis, Tunis 1002, Tunisia;
| | - Jean-Marc Rolain
- IRD, APHM, MEPHI, IHU-Méditerranée Infection, Aix Marseille Université, 13005 Marseille, France;
| | - Raymond Ruimy
- Bacteriology Laboratory, Archet 2 Hospital, CHU Nice, 06000 Nice, France
- Centre Méditerranéen de Médecine Moléculaire (C3M), INSERM, Université Côte D’Azur, 06108 Nice, France
| | - Olivier Croce
- Institute for Research on Cancer and Aging Nice (IRCAN), CNRS, INSERM, Université Côte d’Azur, 06108 Nice, France
| |
Collapse
|
30
|
Bharat A, Petkau A, Avery BP, Chen JC, Folster JP, Carson CA, Kearney A, Nadon C, Mabon P, Thiessen J, Alexander DC, Allen V, El Bailey S, Bekal S, German GJ, Haldane D, Hoang L, Chui L, Minion J, Zahariadis G, Domselaar GV, Reid-smith RJ, Mulvey MR. Correlation between Phenotypic and In Silico Detection of Antimicrobial Resistance in Salmonella enterica in Canada Using Staramr. Microorganisms 2022; 10:292. [PMID: 35208747 PMCID: PMC8875511 DOI: 10.3390/microorganisms10020292] [Citation(s) in RCA: 53] [Impact Index Per Article: 26.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Revised: 01/18/2022] [Accepted: 01/21/2022] [Indexed: 01/25/2023] Open
Abstract
Whole genome sequencing (WGS) of Salmonella supports both molecular typing and detection of antimicrobial resistance (AMR). Here, we evaluated the correlation between phenotypic antimicrobial susceptibility testing (AST) and in silico prediction of AMR from WGS in Salmonella enterica (n = 1321) isolated from human infections in Canada. Phenotypic AMR results from broth microdilution testing were used as the gold standard. To facilitate high-throughput prediction of AMR from genome assemblies, we created a tool called Staramr, which incorporates the ResFinder and PointFinder databases and a custom gene-drug key for antibiogram prediction. Overall, there was 99% concordance between phenotypic and genotypic detection of categorical resistance for 14 antimicrobials in 1321 isolates (18,305 of 18,494 results in agreement). We observed an average sensitivity of 91.2% (range 80.5–100%), a specificity of 99.7% (98.6–100%), a positive predictive value of 95.4% (68.2–100%), and a negative predictive value of 99.1% (95.6–100%). The positive predictive value of gentamicin was 68%, due to seven isolates that carried aac(3)-IVa, which conferred MICs just below the breakpoint of resistance. Genetic mechanisms of resistance in these 1321 isolates included 64 unique acquired alleles and mutations in three chromosomal genes. In general, in silico prediction of AMR in Salmonella was reliable compared to the gold standard of broth microdilution. WGS can provide higher-resolution data on the epidemiology of resistance mechanisms and the emergence of new resistance alleles.
Collapse
|
31
|
Florensa AF, Kaas RS, Clausen PTLC, Aytan-Aktug D, Aarestrup FM. ResFinder - an open online resource for identification of antimicrobial resistance genes in next-generation sequencing data and prediction of phenotypes from genotypes. Microb Genom 2022; 8. [PMID: 35072601 PMCID: PMC8914360 DOI: 10.1099/mgen.0.000748] [Citation(s) in RCA: 110] [Impact Index Per Article: 55.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
Antimicrobial resistance (AMR) is one of the most important health threats globally. The ability to accurately identify resistant bacterial isolates and the individual antimicrobial resistance genes (ARGs) is essential for understanding the evolution and emergence of AMR and to provide appropriate treatment. The rapid developments in next-generation sequencing technologies have made this technology available to researchers and microbiologists at routine laboratories around the world. However, tools available for those with limited experience with bioinformatics are lacking, especially to enable researchers and microbiologists in low- and middle-income countries (LMICs) to perform their own studies. The CGE-tools (Center for Genomic Epidemiology) including ResFinder (https://cge.cbs.dtu.dk/services/ResFinder/) was developed to provide freely available easy to use online bioinformatic tools allowing inexperienced researchers and microbiologists to perform simple bioinformatic analyses. The main purpose was and is to provide these solutions for people involved in frontline diagnosis especially in LMICs. Since its original publication in 2012, ResFinder has undergone a number of improvements including improvement of the code and databases, inclusion of point mutations for selected bacterial species and predictions of phenotypes also for selected species. As of 28 September 2021, 820 803 analyses have been performed using ResFinder from 61 776 IP-addresses in 171 countries. ResFinder clearly fulfills a need for several people around the globe and we hope to be able to continue to provide this service free of charge in the future. We also hope and expect to provide further improvements including phenotypic predictions for additional bacterial species.
Collapse
Affiliation(s)
| | - Rolf Sommer Kaas
- National Food Institute, Technical University of Denmark, DK-2800 kgs. Lyngby, Denmark
| | | | - Derya Aytan-Aktug
- National Food Institute, Technical University of Denmark, DK-2800 kgs. Lyngby, Denmark
| | | |
Collapse
|
32
|
Abstract
Multidrug resistance (MDR) has been a significant threat to public health and effective treatment of bacterial infections. Current identification of MDR is primarily based upon the large proportions of isolates resistant to multiple antibiotics simultaneously, and therefore is a belated evaluation. For bacteria with MDR, we expect to see strong correlations in both the quantitative minimum inhibitory concentration (MIC) and the binary susceptibility as classified by the pre-determined breakpoints. Being able to detect correlations from these two perspectives allows us to find multidrug resistant bacteria proactively. In this paper, we provide a Bayesian framework that estimates the resistance level jointly for antibiotics belonging to different classes with a Gaussian mixture model, where the correlation in the latent MIC can be inferred from the Gaussian parameters and the correlation in binary susceptibility can be inferred from the mixing weights. By augmenting the laboratory measurement with the latent MIC variable to account for the censored data, and by adopting the latent class variable to represent the MIC components, our model was shown to be accurate and robust compared with the current assessment of correlations. Applying the model to Salmonella heidelberg samples isolated from human participants in National Antimicrobial Resistance Monitoring System (NARMS) provides us with signs of joint resistance to Amoxicillin-clavulanic acid & Cephalothin and joint resistance to Ampicillin & Cephalothin. Large correlations estimated from our model could serve as a timely tool for early detection of MDR, and hence a signal for clinical intervention.
Collapse
Affiliation(s)
- Min Zhang
- Department of Statistics, Iowa State University, Ames, Iowa, United States of America
| | - Chong Wang
- Department of Statistics, Iowa State University, Ames, Iowa, United States of America
- Department of Veterinary Diagnostic and Production Animal Medicine, Iowa State University, Ames, Iowa, United States of America
- * E-mail:
| | - Annette O’Connor
- Department of Veterinary Diagnostic and Production Animal Medicine, Iowa State University, Ames, Iowa, United States of America
- Department of Large Animal Clinical Sciences, Michigan State University, East Lansing, Michigan, United States of America
| |
Collapse
|
33
|
Májek P, Lüftinger L, Beisken S, Rattei T, Materna A. Genome-Wide Mutation Scoring for Machine-Learning-Based Antimicrobial Resistance Prediction. Int J Mol Sci 2021; 22:13049. [PMID: 34884852 DOI: 10.3390/ijms222313049] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Revised: 11/25/2021] [Accepted: 11/29/2021] [Indexed: 01/21/2023] Open
Abstract
The prediction of antimicrobial resistance (AMR) based on genomic information can improve patient outcomes. Genetic mechanisms have been shown to explain AMR with accuracies in line with standard microbiology laboratory testing. To translate genetic mechanisms into phenotypic AMR, machine learning has been successfully applied. AMR machine learning models typically use nucleotide k-mer counts to represent genomic sequences. While k-mer representation efficiently captures sequence variation, it also results in high-dimensional and sparse data. With limited training data available, achieving acceptable model performance or model interpretability is challenging. In this study, we explore the utility of feature engineering with several biologically relevant signals. We propose to predict the functional impact of observed mutations with PROVEAN to use the predicted impact as a new feature for each protein in an organism’s proteome. The addition of the new features was tested on a total of 19,521 isolates across nine clinically relevant pathogens and 30 different antibiotics. The new features significantly improved the predictive performance of trained AMR models for Pseudomonas aeruginosa, Citrobacter freundii, and Escherichia coli. The balanced accuracy of the respective models of those three pathogens improved by 6.0% on average.
Collapse
|
34
|
Chaplin AV, Korzhanova M, Korostin DO. Identification of bacterial antibiotic resistance genes in next-generation sequencing data (review of literature). Klin Lab Diagn 2021; 66:684-688. [PMID: 34882354 DOI: 10.51620/0869-2084-2021-66-11-684-688] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
The spread of antibiotic-resistant human bacterial pathogens is a serious threat to modern medicine. Antibiotic susceptibility testing is essential for treatment regimens optimization and preventing dissemination of antibiotic resistance. Therefore, development of antibiotic susceptibility testing methods is a priority challenge of laboratory medicine. The aim of this review is to analyze the capabilities of the bioinformatics tools for bacterial whole genome sequence data processing. The PubMed database, Russian scientific electronic library eLIBRARY, information networks of World health organization and European Society of Clinical Microbiology and Infectious Diseases (ESCMID) were used during the analysis. In this review, the platforms for whole genome sequencing, which are suitable for detection of bacterial genetic resistance determinants, are described. The classic step of genetic resistance determinants searching is an alignment between the query nucleotide/protein sequence and the subject (database) nucleotide/protein sequence, which is performed using the nucleotide and protein sequence databases. The most commonly used databases are Resfinder, CARD, Bacterial Antimicrobial Resistance Reference Gene Database. The results of the resistance determinants searching in genome assemblies is more correct in comparison to results of the searching in contigs. The new resistance genes searching bioinformatics tools, such as neural networks and machine learning, are discussed in the review. After critical appraisal of the current antibiotic resistance databases we designed a protocol for predicting antibiotic resistance using whole genome sequence data. The designed protocol can be used as a basis of the algorithm for qualitative and quantitative antimicrobial susceptibility testing based on whole genome sequence data.
Collapse
Affiliation(s)
- A V Chaplin
- Pirogov Russian National Research Medical University
| | - M Korzhanova
- Pirogov Russian National Research Medical University
| | - D O Korostin
- Pirogov Russian National Research Medical University
| |
Collapse
|
35
|
Liu W, Ying N, Mo Q, Li S, Shao M, Sun L, Zhu L. Machine learning for identifying resistance features of Klebsiella pneumoniae using whole-genome sequence single nucleotide polymorphisms. J Med Microbiol 2021; 70. [PMID: 34812714 DOI: 10.1099/jmm.0.001474] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Introduction. Klebsiella pneumoniae, a gram-negative bacterium, is a common pathogen causing nosocomial infection. The drug-resistance rate of K. pneumoniae is increasing year by year, posing a severe threat to public health worldwide. K. pneumoniae has been listed as one of the pathogens causing the global crisis of antimicrobial resistance in nosocomial infections. We need to explore the drug resistance of K. pneumoniae for clinical diagnosis. Single nucleotide polymorphisms (SNPs) are of high density and have rich genetic information in whole-genome sequencing (WGS), which can affect the structure or expression of proteins. SNPs can be used to explore mutation sites associated with bacterial resistance.Hypothesis/Gap Statement. Machine learning methods can detect genetic features associated with the drug resistance of K. pneumoniae from whole-genome SNP data.Aims. This work used Fast Feature Selection (FFS) and Codon Mutation Detection (CMD) machine learning methods to detect genetic features related to drug resistance of K. pneumoniae from whole-genome SNP data.Methods. WGS data on resistance of K. pneumoniae strains to four antibiotics (tetracycline, gentamicin, imipenem, amikacin) were downloaded from the European Nucleotide Archive (ENA). Sequence alignments were performed with MUMmer 3 to complete SNP calling using K. pneumoniae HS11286 chromosome as the reference genome. The FFS algorithm was applied to feature selection of the SNP dataset. The training set was constructed based on mutation sites with mutation frequency >0.995. Based on the original SNP training set, 70% of SNPs were randomly selected from each dataset as the test set to verify the accuracy of the training results. Finally, the resistance genes were obtained by the CMD algorithm and Venny.Results. The number of strains resistant to tetracycline, gentamicin, imipenem and amikacin was 931, 1048, 789 and 203, respectively. Machine learning algorithms were applied to the SNP training set and test set, and 28 and 23 resistance genes were predicted, respectively. The 28 resistance genes in the training set included 22 genes in the test set, which verified the accuracy of gene prediction. Among them, some genes (KPHS_35310, KPHS_18220, KPHS_35880, etc.) corresponded to known resistance genes (Eef2, lpxK, MdtC, etc). Logistic regression classifiers were established based on the identified SNPs in the training set. The area under the curves (AUCs) of the four antibiotics was 0.939, 0.950, 0.912 and 0.935, showing a strong ability to predict bacterial resistance.Conclusion. Machine learning methods can effectively be used to predict resistance genes and associated SNPs. The FFS and CMD algorithms have wide applicability. They can be used for the drug-resistance analysis of any microorganism with genomic variation and phenotypic data. This work lays a foundation for resistance research in clinical applications.
Collapse
Affiliation(s)
- Wenjia Liu
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China
| | - Nanjiao Ying
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China.,Institute of Biomedical Engineering, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China
| | - Qiusi Mo
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China
| | - Shanshan Li
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China
| | - Mengjie Shao
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China
| | - Lingli Sun
- Key Laboratory of Microorganism Technology and Bioinformatics Research of Zhejiang Province, Hangzhou, Zhejiang, 310012, PR China.,NMPA Key Laboratory for Testing and Risk Warning of Pharmaceutical Microbiology, Hangzhou, Zhejiang, 310012, PR China
| | - Lei Zhu
- College of Automation, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China.,Institute of Biomedical Engineering, Hangzhou Dianzi University, Hangzhou, Zhejiang, 310018, PR China
| |
Collapse
|
36
|
VanOeffelen M, Nguyen M, Aytan-Aktug D, Brettin T, Dietrich EM, Kenyon RW, Machi D, Mao C, Olson R, Pusch GD, Shukla M, Stevens R, Vonstein V, Warren AS, Wattam AR, Yoo H, Davis JJ. A genomic data resource for predicting antimicrobial resistance from laboratory-derived antimicrobial susceptibility phenotypes. Brief Bioinform 2021; 22:bbab313. [PMID: 34379107 PMCID: PMC8575023 DOI: 10.1093/bib/bbab313] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Revised: 06/18/2021] [Accepted: 07/20/2021] [Indexed: 11/14/2022] Open
Abstract
Antimicrobial resistance (AMR) is a major global health threat that affects millions of people each year. Funding agencies worldwide and the global research community have expended considerable capital and effort tracking the evolution and spread of AMR by isolating and sequencing bacterial strains and performing antimicrobial susceptibility testing (AST). For the last several years, we have been capturing these efforts by curating data from the literature and data resources and building a set of assembled bacterial genome sequences that are paired with laboratory-derived AST data. This collection currently contains AST data for over 67 000 genomes encompassing approximately 40 genera and over 100 species. In this paper, we describe the characteristics of this collection, highlighting areas where sampling is comparatively deep or shallow, and showing areas where attention is needed from the research community to improve sampling and tracking efforts. In addition to using the data to track the evolution and spread of AMR, it also serves as a useful starting point for building machine learning models for predicting AMR phenotypes. We demonstrate this by describing two machine learning models that are built from the entire dataset to show where the predictive power is comparatively high or low. This AMR metadata collection is freely available and maintained on the Bacterial and Viral Bioinformatics Center (BV-BRC) FTP site ftp://ftp.bvbrc.org/RELEASE_NOTES/PATRIC_genomes_AMR.txt.
Collapse
Affiliation(s)
| | - Marcus Nguyen
- University of Chicago Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL, USA
- Data Science and Learning Division, Argonne National Laboratory, Argonne, IL, USA
| | - Derya Aytan-Aktug
- National Food Institute, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Thomas Brettin
- University of Chicago Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL, USA
- Computing Environment and Life Sciences, Argonne National Laboratory, Argonne, IL, USA
| | - Emily M Dietrich
- University of Chicago Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL, USA
- Computing Environment and Life Sciences, Argonne National Laboratory, Argonne, IL, USA
| | - Ronald W Kenyon
- Biocomplexity Institute and Initiative, University of Virginia, Virginia, USA
| | - Dustin Machi
- Biocomplexity Institute and Initiative, University of Virginia, Virginia, USA
| | - Chunhong Mao
- Biocomplexity Institute and Initiative, University of Virginia, Virginia, USA
| | - Robert Olson
- University of Chicago Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL, USA
- Data Science and Learning Division, Argonne National Laboratory, Argonne, IL, USA
| | - Gordon D Pusch
- Fellowship for Interpretation of Genomes, Burr Ridge, IL, USA
| | - Maulik Shukla
- University of Chicago Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL, USA
- Data Science and Learning Division, Argonne National Laboratory, Argonne, IL, USA
| | - Rick Stevens
- Computing Environment and Life Sciences, Argonne National Laboratory, Argonne, IL, USA
- Department of Computer Science, University of Chicago, Chicago, IL, USA
| | | | - Andrew S Warren
- Biocomplexity Institute and Initiative, University of Virginia, Virginia, USA
| | - Alice R Wattam
- Data Science and Learning Division, Argonne National Laboratory, Argonne, IL, USA
- Biocomplexity Institute and Initiative, University of Virginia, Virginia, USA
| | - Hyunseung Yoo
- University of Chicago Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL, USA
- Data Science and Learning Division, Argonne National Laboratory, Argonne, IL, USA
| | - James J Davis
- University of Chicago Consortium for Advanced Science and Engineering, University of Chicago, Chicago, IL, USA
- Data Science and Learning Division, Argonne National Laboratory, Argonne, IL, USA
- Northwestern Argonne Institute for Science and Engineering, Evanston, IL, USA
| |
Collapse
|
37
|
He S, Leanse LG, Feng Y. Artificial intelligence and machine learning assisted drug delivery for effective treatment of infectious diseases. Adv Drug Deliv Rev 2021; 178:113922. [PMID: 34461198 DOI: 10.1016/j.addr.2021.113922] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2021] [Revised: 07/14/2021] [Accepted: 08/09/2021] [Indexed: 12/23/2022]
Abstract
In the era of antimicrobial resistance, the prevalence of multidrug-resistant microorganisms that resist conventional antibiotic treatment has steadily increased. Thus, it is now unquestionable that infectious diseases are significant global burdens that urgently require innovative treatment strategies. Emerging studies have demonstrated that artificial intelligence (AI) can transform drug delivery to promote effective treatment of infectious diseases. In this review, we propose to evaluate the significance, essential principles, and popular tools of AI in drug delivery for infectious disease treatment. Specifically, we will focus on the achievements and key findings of current research, as well as the applications of AI on drug delivery throughout the whole antimicrobial treatment process, with an emphasis on drug development, treatment regimen optimization, drug delivery system and administration route design, and drug delivery outcome prediction. To that end, the challenges of AI in drug delivery for infectious disease treatments and their current solutions and future perspective will be presented and discussed.
Collapse
Affiliation(s)
- Sheng He
- Boston Children's Hospital, Harvard Medical School, Harvard University, Boston, MA, USA.
| | - Leon G Leanse
- Massachusetts General Hospital, Harvard Medical School, Harvard University, Boston, MA, USA
| | - Yanfang Feng
- Massachusetts General Hospital, Harvard Medical School, Harvard University, Boston, MA, USA.
| |
Collapse
|
38
|
Bulman ZP, Krapp F, Pincus NB, Wenzler E, Murphy KR, Qi C, Ozer EA, Hauser AR. Genomic Features Associated with the Degree of Phenotypic Resistance to Carbapenems in Carbapenem-Resistant Klebsiella pneumoniae. mSystems 2021; 6:e0019421. [PMID: 34519526 DOI: 10.1128/mSystems.00194-21] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Carbapenem-resistant Klebsiella pneumoniae strains cause severe infections that are difficult to treat. The production of carbapenemases such as the K. pneumoniae carbapenemase (KPC) is a common mechanism by which these strains resist killing by the carbapenems. However, the degree of phenotypic carbapenem resistance (MIC) may differ markedly between isolates with similar carbapenemase genes, suggesting that our understanding of the underlying mechanisms of carbapenem resistance remains incomplete. To address this problem, we determined the whole-genome sequences of 166 K. pneumoniae clinical isolates resistant to meropenem, imipenem, or ertapenem. Multiple linear regression analysis of this collection of largely blaKPC-3-containing sequence type 258 (ST258) isolates indicated that blaKPC copy number and some outer membrane porin gene mutations were associated with higher MICs to carbapenems. A trend toward higher MICs was also observed with those blaKPC genes carried by the d isoform of Tn4401. In contrast, ompK37 mutations were associated with lower carbapenem MICs, and extended spectrum β-lactamase genes were not associated with higher or lower MICs in carbapenem-resistant K. pneumoniae. A machine learning approach based on the whole-genome sequences of these isolates did not result in a substantial improvement in prediction of isolates with high or low MICs. These results build upon previous findings suggesting that multiple factors influence the overall carbapenem resistance levels in carbapenem-resistant K. pneumoniae isolates. IMPORTANCEKlebsiella pneumoniae can cause severe infections in the blood, urinary tract, and lungs. Resistance to carbapenems in K. pneumoniae is an urgent public health threat, since it can make these isolates difficult to treat. While individual contributors to carbapenem resistance in K. pneumoniae have been studied, few reports explore their combined effects in clinical isolates. We sequenced 166 clinical carbapenem-resistant K. pneumoniae isolates to evaluate the contribution of known genes to carbapenem MICs and to try to identify novel genes associated with higher carbapenem MICs. The blaKPC copy number and some outer membrane porin gene mutations were associated with higher carbapenem MICs. In contrast, mutations in one specific porin, ompK37, were associated with lower carbapenem MICs. Machine learning did not result in a substantial improvement in the prediction of carbapenem resistance nor did it identify novel genes associated with carbapenem resistance. These findings enhance our understanding of the many contributors to carbapenem resistance in K. pneumoniae.
Collapse
|
39
|
Huang Y, Rana AP, Wenzler E, Ozer EA, Krapp F, Bulitta JB, Hauser AR, Bulman ZP. Aminoglycoside-resistance gene signatures are predictive of aminoglycoside MICs for carbapenem-resistant Klebsiella pneumoniae. J Antimicrob Chemother 2021; 77:356-363. [PMID: 34668007 DOI: 10.1093/jac/dkab381] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Accepted: 09/27/2021] [Indexed: 01/02/2023] Open
Abstract
BACKGROUND Aminoglycoside-containing regimens may be an effective treatment option for infections caused by carbapenem-resistant Klebsiella pneumoniae (CR-Kp), but aminoglycoside-resistance genes are common in these strains. The relationship between the aminoglycoside-resistance genes and aminoglycoside MICs remains poorly defined. OBJECTIVES To identify genotypic signatures capable of predicting aminoglycoside MICs for CR-Kp. METHODS Clinical CR-Kp isolates (n = 158) underwent WGS to detect aminoglycoside-resistance genes. MICs of amikacin, gentamicin, plazomicin and tobramycin were determined by broth microdilution (BMD). Principal component analysis was used to initially separate isolates based on genotype. Multiple linear regression was then used to generate models that predict aminoglycoside MICs based on the aminoglycoside-resistance genes. Last, the performance of the predictive models was tested against a validation cohort of 29 CR-Kp isolates. RESULTS Among the original 158 CR-Kp isolates, 91.77% (145/158) had at least one clinically relevant aminoglycoside-resistance gene. As a group, 99.37%, 84.81%, 82.28% and 10.76% of the CR-Kp isolates were susceptible to plazomicin, amikacin, gentamicin and tobramycin, respectively. The first two principal components explained 72.23% of the total variance in aminoglycoside MICs and separated isolates into four groups with aac(6')-Ib, aac(6')-Ib', aac(6')-Ib+aac(6')-Ib' or no clinically relevant aminoglycoside-resistance genes. Regression models predicted aminoglycoside MICs with adjusted R2 values of 56%-99%. Within the validation cohort, the categorical agreement when comparing the observed BMD MICs with the predicated MICs was 96.55%, 89.66%, 86.21% and 82.76% for plazomicin, gentamicin, amikacin and tobramycin, respectively. CONCLUSIONS Susceptibility to each aminoglycoside varies in CR-Kp. Detection of aminoglycoside-resistance genes may be useful to predict aminoglycoside MICs for CR-Kp.
Collapse
Affiliation(s)
- Yanqin Huang
- Department of Pharmacy Practice, University of Illinois at Chicago College of Pharmacy, Chicago, IL, USA
| | - Amisha P Rana
- Department of Pharmacy Practice, University of Illinois at Chicago College of Pharmacy, Chicago, IL, USA
| | - Eric Wenzler
- Department of Pharmacy Practice, University of Illinois at Chicago College of Pharmacy, Chicago, IL, USA
| | - Egon A Ozer
- Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Fiorella Krapp
- Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA.,Instituto de Medicina Tropical Alexander von Humboldt, Universidad Peruana Cayetano Heredia, Lima, Peru
| | - Jürgen B Bulitta
- Center for Pharmacometrics and Systems Pharmacology, College of Pharmacy, University of Florida, Orlando, FL, USA
| | - Alan R Hauser
- Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Zackery P Bulman
- Department of Pharmacy Practice, University of Illinois at Chicago College of Pharmacy, Chicago, IL, USA
| |
Collapse
|
40
|
Zhuang H, Zhu F, Lan P, Ji S, Sun L, Chen Y, Wang Z, Jiang S, Zhang L, Zhu Y, Jiang Y, Chen Y, Yu Y. A random forest model based on core genome allelic profiles of MRSA for penicillin plus potassium clavulanate susceptibility prediction. Microb Genom 2021; 7. [PMID: 34554083 PMCID: PMC8715440 DOI: 10.1099/mgen.0.000610] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
Treatment failure of methicillin-resistant Staphylococcus aureus (MRSA) infections remains problematic in clinical practice because therapeutic options are limited. Penicillin plus potassium clavulanate combination (PENC) was shown to have potential for treating some MRSA infections. We investigated the susceptibility of MRSA isolates and constructed a drug susceptibility prediction model for the phenotype of the PENC. We determined the minimum inhibitory concentration of PENC for MRSA (n=284) in a teaching hospital (SRRSH-MRSA). PENC susceptibility genotypes were analysed using a published genotyping scheme based on the mecA sequence. mecA expression in MRSA isolates was analysed by qPCR. We established a random forest model for predicting PENC-susceptible phenotypes using core genome allelic profiles from cgMLST analysis. We identified S2-R isolates with susceptible mecA genotypes but PENC-resistant phenotypes; these isolates expressed mecA at higher levels than did S2 MRSA (2.61 vs 0.98, P<0.05), indicating the limitation of using a single factor for predicting drug susceptibility. Using the data of selected UK-sourced MRSA (n=74) and MRSA collected in a previous national survey (NA-MRSA, n=471) as a training set, we built a model with accuracies of 0.94 and 0.93 for SRRSH-MRSA and UK-sourced MRSA (n=287, NAM-MRSA) validation sets. The AUROC of this model for SRRSH-MRSA and NAM-MRSA was 0.96 and 0.97. Although the source of the training set data affects the scope of application of the prediction model, our data demonstrated the power of the machine learning approach in predicting susceptibility from cgMLST results.
Collapse
Affiliation(s)
- Hemu Zhuang
- Department of Infectious Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine; Key Laboratory of Microbial Technology and Bioinformatics of Zhejiang Province; Regional Medical Center for National Institute of Respiratory Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, PR China
| | - Feiteng Zhu
- Department of Infectious Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine; Key Laboratory of Microbial Technology and Bioinformatics of Zhejiang Province; Regional Medical Center for National Institute of Respiratory Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, PR China
| | - Peng Lan
- Department of Infectious Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine; Key Laboratory of Microbial Technology and Bioinformatics of Zhejiang Province; Regional Medical Center for National Institute of Respiratory Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, PR China
| | - Shujuan Ji
- Department of Infectious Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine; Key Laboratory of Microbial Technology and Bioinformatics of Zhejiang Province; Regional Medical Center for National Institute of Respiratory Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, PR China
| | - Lu Sun
- Department of Infectious Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine; Key Laboratory of Microbial Technology and Bioinformatics of Zhejiang Province; Regional Medical Center for National Institute of Respiratory Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, PR China
| | - Yiyi Chen
- Department of Infectious Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine; Key Laboratory of Microbial Technology and Bioinformatics of Zhejiang Province; Regional Medical Center for National Institute of Respiratory Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, PR China
| | - Zhengan Wang
- Department of Infectious Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine; Key Laboratory of Microbial Technology and Bioinformatics of Zhejiang Province; Regional Medical Center for National Institute of Respiratory Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, PR China
| | - Shengnan Jiang
- Department of Infectious Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine; Key Laboratory of Microbial Technology and Bioinformatics of Zhejiang Province; Regional Medical Center for National Institute of Respiratory Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, PR China
| | - Linyue Zhang
- Department of Infectious Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine; Key Laboratory of Microbial Technology and Bioinformatics of Zhejiang Province; Regional Medical Center for National Institute of Respiratory Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, PR China
| | - Yiwei Zhu
- Department of Infectious Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine; Key Laboratory of Microbial Technology and Bioinformatics of Zhejiang Province; Regional Medical Center for National Institute of Respiratory Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, PR China
| | - Yan Jiang
- Department of Infectious Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine; Key Laboratory of Microbial Technology and Bioinformatics of Zhejiang Province; Regional Medical Center for National Institute of Respiratory Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, PR China
| | - Yan Chen
- Department of Infectious Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine; Key Laboratory of Microbial Technology and Bioinformatics of Zhejiang Province; Regional Medical Center for National Institute of Respiratory Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, PR China
| | - Yunsong Yu
- Department of Infectious Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine; Key Laboratory of Microbial Technology and Bioinformatics of Zhejiang Province; Regional Medical Center for National Institute of Respiratory Diseases, Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, PR China
| |
Collapse
|
41
|
Tan R, Yu A, Liu Z, Liu Z, Jiang R, Wang X, Liu J, Gao J, Wang X. Prediction of Minimal Inhibitory Concentration of Meropenem Against Klebsiella pneumoniae Using Metagenomic Data. Front Microbiol 2021; 12:712886. [PMID: 34497594 PMCID: PMC8421019 DOI: 10.3389/fmicb.2021.712886] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Accepted: 07/26/2021] [Indexed: 11/29/2022] Open
Abstract
Minimal inhibitory concentration (MIC) is defined as the lowest concentration of an antimicrobial agent that can inhibit the visible growth of a particular microorganism after overnight incubation. Clinically, antibiotic doses for specific infections are determined according to the fraction of MIC. Therefore, credible assessment of MICs will provide a physician valuable information on the choice of therapeutic strategy. Early and precise usage of antibiotics is the key to an infection therapy. Compared with the traditional culture-based method, the approach of whole genome sequencing to identify MICs can shorten the experimental time, thereby improving clinical efficacy. Klebsiella pneumoniae is one of the most significant members of the genus Klebsiella in the Enterobacteriaceae family and also a common non-social pathogen. Meropenem is a broad-spectrum antibacterial agent of the carbapenem family, which can produce antibacterial effects of most Gram-positive and -negative bacteria. In this study, we used single-nucleotide polymorphism (SNP) information and nucleotide k-mers count based on metagenomic data to predict MICs of meropenem against K. pneumoniae. Then, features of 110 sequenced K. pneumoniae genome data were combined and modeled with XGBoost algorithm and deep neural network (DNN) algorithm to predict MICs. We first use the XGBoost classification model and the XGBoost regression model. After five runs, the average accuracy of the test set was calculated. The accuracy of using nucleotide k-mers to predict MICs of the XGBoost classification model and XGBoost regression model was 84.5 and 89.1%. The accuracy of SNP in predicting MIC was 80 and 81.8%, respectively. The results show that XGBoost regression is better than XGBoost classification in both nucleotide k-mers and SNPs to predict MICs. We further selected 40 nucleotide k-mers and 40 SNPs with the highest correlation with MIC values as features to retrain the XGBoost regression model and DNN regression model. After 100 and 1,000 runs, the results show that the accuracy of the two models was improved. The accuracy of the XGBoost regression model for k-mers, SNPs, and k-mers & SNPs was 91.1, 85.2, and 91.3%, respectively. The accuracy of the DNN regression model was 91.9, 87.1, and 91.8%, respectively. Through external verification, some of the selected features were found to be related to drug resistance.
Collapse
Affiliation(s)
- Rundong Tan
- Shanghai Biotecan Pharmaceuticals Co., Ltd., Shanghai, China.,Shanghai Zhangjiang Institute of Medical Innovation, Shanghai, China
| | - Anqi Yu
- Shanghai Biotecan Pharmaceuticals Co., Ltd., Shanghai, China.,Shanghai Zhangjiang Institute of Medical Innovation, Shanghai, China
| | - Ziming Liu
- Medical Information Engineering, Department of Medical Information, Harbin Medical University, Harbin, China
| | - Ziqi Liu
- Department of Biostatistics, School of Global Public Health, New York University, New York, NY, United States
| | - Rongfeng Jiang
- Shanghai Biotecan Pharmaceuticals Co., Ltd., Shanghai, China.,Shanghai Zhangjiang Institute of Medical Innovation, Shanghai, China
| | - Xiaoli Wang
- Department of Critical Care Medicine, Ruijin Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Jialin Liu
- Department of Critical Care Medicine, Ruijin Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Junhui Gao
- Shanghai Biotecan Pharmaceuticals Co., Ltd., Shanghai, China.,Shanghai Zhangjiang Institute of Medical Innovation, Shanghai, China
| | - Xinjun Wang
- Translational Medical Center for Stem Cell Therapy, Shanghai East Hospital, School of Medicine, Tongji University, Shanghai, China
| |
Collapse
|
42
|
Abstract
By targeting invasive organisms, antibiotics insert themselves into the ancient struggle of the host-pathogen evolutionary arms race. As pathogens evolve tactics for evading antibiotics, therapies decline in efficacy and must be replaced, distinguishing antibiotics from most other forms of drug development. Together with a slow and expensive antibiotic development pipeline, the proliferation of drug-resistant pathogens drives urgent interest in computational methods that promise to expedite candidate discovery. Strides in artificial intelligence (AI) have encouraged its application to multiple dimensions of computer-aided drug design, with increasing application to antibiotic discovery. This review describes AI-facilitated advances in the discovery of both small molecule antibiotics and antimicrobial peptides. Beyond the essential prediction of antimicrobial activity, emphasis is also given to antimicrobial compound representation, determination of drug-likeness traits, antimicrobial resistance, and de novo molecular design. Given the urgency of the antimicrobial resistance crisis, we analyze uptake of open science best practices in AI-driven antibiotic discovery and argue for openness and reproducibility as a means of accelerating preclinical research. Finally, trends in the literature and areas for future inquiry are discussed, as artificially intelligent enhancements to drug discovery at large offer many opportunities for future applications in antibiotic development.
Collapse
Affiliation(s)
- Marcelo C R Melo
- Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA
- Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, PA, USA
| | - Jacqueline R M A Maasch
- Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA
- Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, PA, USA
- Department of Computer and Information Science, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA
| | - Cesar de la Fuente-Nunez
- Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
- Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA.
- Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
43
|
Benkwitz-Bedford S, Palm M, Demirtas TY, Mustonen V, Farewell A, Warringer J, Parts L, Moradigaravand D. Machine Learning Prediction of Resistance to Subinhibitory Antimicrobial Concentrations from Escherichia coli Genomes. mSystems 2021; 6:e0034621. [PMID: 34427505 DOI: 10.1128/mSystems.00346-21] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Escherichia coli is an important cause of bacterial infections worldwide, with multidrug-resistant strains incurring substantial costs on human lives. Besides therapeutic concentrations of antimicrobials in health care settings, the presence of subinhibitory antimicrobial residues in the environment and in clinics selects for antimicrobial resistance (AMR), but the underlying genetic repertoire is less well understood. Here, we used machine learning to predict the population doubling time and cell growth yield of 1,407 genetically diverse E. coli strains expanding under exposure to three subinhibitory concentrations of six classes of antimicrobials from single-nucleotide genetic variants, accessory gene variation, and the presence of known AMR genes. We predicted cell growth yields in the held-out test data with an average correlation (Spearman's ρ) of 0.63 (0.36 to 0.81 across concentrations) and cell doubling times with an average correlation of 0.59 (0.32 to 0.92 across concentrations), with moderate increases in sample size unlikely to improve predictions further. This finding points to the remaining missing heritability of growth under antimicrobial exposure being explained by effects that are too rare or weak to be captured unless sample size is dramatically increased, or by effects other than those conferred by the presence of individual single-nucleotide polymorphisms (SNPs) and genes. Predictions based on whole-genome information were generally superior to those based only on known AMR genes and were accurate for AMR resistance at therapeutic concentrations. We pinpointed genes and SNPs determining the predicted growth and thereby recapitulated many known AMR determinants. Finally, we estimated the effect sizes of resistance genes across the entire collection of strains, disclosing the growth effects for known resistance genes in each individual strain. Our results underscore the potential of predictive modeling of growth patterns from genomic data under subinhibitory concentrations of antimicrobials, although the remaining missing heritability poses a challenge for achieving the accuracy and precision required for clinical use. IMPORTANCE Predicting bacterial growth from genome sequences is important for a rapid characterization of strains in clinical diagnostics and to disclose candidate novel targets for anti-infective drugs. Previous studies have dissected the relationship between bacterial growth and genotype in mutant libraries for laboratory strains, yet no study so far has examined the predictive power of genome sequence in natural strains. In this study, we used a high-throughput phenotypic assay to measure the growth of a systematic collection of natural Escherichia coli strains and then employed machine learning models to predict bacterial growth from genomic data under nontherapeutic subinhibitory concentrations of antimicrobials that are common in nonclinical settings. We found a moderate to strong correlation between predicted and actual values for the different collected data sets. Moreover, we observed that the known resistance genes are still effective at sublethal concentrations, pointing to clinical implications of these concentrations.
Collapse
|
44
|
Pearcy N, Hu Y, Baker M, Maciel-Guerra A, Xue N, Wang W, Kaler J, Peng Z, Li F, Dottorini T. Genome-Scale Metabolic Models and Machine Learning Reveal Genetic Determinants of Antibiotic Resistance in Escherichia coli and Unravel the Underlying Metabolic Adaptation Mechanisms. mSystems 2021; 6:e0091320. [PMID: 34342537 DOI: 10.1128/mSystems.00913-20] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Antimicrobial resistance (AMR) is becoming one of the largest threats to public health worldwide, with the opportunistic pathogen Escherichia coli playing a major role in the AMR global health crisis. Unravelling the complex interplay between drug resistance and metabolic rewiring is key to understand the ability of bacteria to adapt to new treatments and to the development of new effective solutions to combat resistant infections. We developed a computational pipeline that combines machine learning with genome-scale metabolic models (GSMs) to elucidate the systemic relationships between genetic determinants of resistance and metabolism beyond annotated drug resistance genes. Our approach was used to identify genetic determinants of 12 AMR profiles for the opportunistic pathogenic bacterium E. coli. Then, to interpret the large number of identified genetic determinants, we applied a constraint-based approach using the GSM to predict the effects of genetic changes on growth, metabolite yields, and reaction fluxes. Our computational platform leads to multiple results. First, our approach corroborates 225 known AMR-conferring genes, 35 of which are known for the specific antibiotic. Second, integration with the GSM predicted 20 top-ranked genetic determinants (including accA, metK, fabD, fabG, murG, lptG, mraY, folP, and glmM) essential for growth, while a further 17 top-ranked genetic determinants linked AMR to auxotrophic behavior. Third, clusters of AMR-conferring genes affecting similar metabolic processes are revealed, which strongly suggested that metabolic adaptations in cell wall, energy, iron and nucleotide metabolism are associated with AMR. The computational solution can be used to study other human and animal pathogens. IMPORTANCEEscherichia coli is a major public health concern given its increasing level of antibiotic resistance worldwide and extraordinary capacity to acquire and spread resistance via horizontal gene transfer with surrounding species and via mutations in its existing genome. E. coli also exhibits a large amount of metabolic pathway redundancy, which promotes resistance via metabolic adaptability. In this study, we developed a computational approach that integrates machine learning with metabolic modeling to understand the correlation between AMR and metabolic adaptation mechanisms in this model bacterium. Using our approach, we identified AMR genetic determinants associated with cell wall modifications for increased permeability, virulence factor manipulation of host immunity, reduction of oxidative stress toxicity, and changes to energy metabolism. Unravelling the complex interplay between antibiotic resistance and metabolic rewiring may open new opportunities to understand the ability of E. coli, and potentially of other human and animal pathogens, to adapt to new treatments.
Collapse
|
45
|
Lüftinger L, Ferreira I, Frank BJH, Beisken S, Weinberger J, von Haeseler A, Rattei T, Hofstaetter JG, Posch AE, Materna A. Predictive Antibiotic Susceptibility Testing by Next-Generation Sequencing for Periprosthetic Joint Infections: Potential and Limitations. Biomedicines 2021; 9:910. [PMID: 34440114 DOI: 10.3390/biomedicines9080910] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2021] [Revised: 07/21/2021] [Accepted: 07/22/2021] [Indexed: 01/18/2023] Open
Abstract
Joint replacement surgeries are one of the most frequent medical interventions globally. Infections of prosthetic joints are a major health challenge and typically require prolonged or even indefinite antibiotic treatment. As multidrug-resistant pathogens continue to rise globally, novel diagnostics are critical to ensure appropriate treatment and help with prosthetic joint infections (PJI) management. To this end, recent studies have shown the potential of molecular methods such as next-generation sequencing to complement established phenotypic, culture-based methods. Together with advanced bioinformatics approaches, next-generation sequencing can provide comprehensive information on pathogen identity as well as antimicrobial susceptibility, potentially enabling rapid diagnosis and targeted therapy of PJIs. In this review, we summarize current developments in next generation sequencing based predictive antibiotic susceptibility testing and discuss potential and limitations for common PJI pathogens.
Collapse
|
46
|
Avershina E, Shapovalova V, Shipulin G. Fighting Antibiotic Resistance in Hospital-Acquired Infections: Current State and Emerging Technologies in Disease Prevention, Diagnostics and Therapy. Front Microbiol 2021; 12:707330. [PMID: 34367112 PMCID: PMC8334188 DOI: 10.3389/fmicb.2021.707330] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2021] [Accepted: 06/29/2021] [Indexed: 12/20/2022] Open
Abstract
Rising antibiotic resistance is a global threat that is projected to cause more deaths than all cancers combined by 2050. In this review, we set to summarize the current state of antibiotic resistance, and to give an overview of the emerging technologies aimed to escape the pre-antibiotic era recurrence. We conducted a comprehensive literature survey of >150 original research and review articles indexed in the Web of Science using "antimicrobial resistance," "diagnostics," "therapeutics," "disinfection," "nosocomial infections," "ESKAPE pathogens" as key words. We discuss the impact of nosocomial infections on the spread of multi-drug resistant bacteria, give an overview over existing and developing strategies for faster diagnostics of infectious diseases, review current and novel approaches in therapy of infectious diseases, and finally discuss strategies for hospital disinfection to prevent MDR bacteria spread.
Collapse
Affiliation(s)
- Ekaterina Avershina
- Department of Biotechnology, Inland Norway University of Applied Sciences, Hamar, Norway
- Laboratory or Postgenomic Technologies, Izmerov Research Institute of Occupational Health, Moscow, Russia
| | - Valeria Shapovalova
- Federal State Budgetary Institution “Centre for Strategic Planning and Management of Biomedical Health Risks” of the Federal Medical Biological Agency, Centre for Strategic Planning of FMBA of Russia, Moscow, Russia
| | - German Shipulin
- Federal State Budgetary Institution “Centre for Strategic Planning and Management of Biomedical Health Risks” of the Federal Medical Biological Agency, Centre for Strategic Planning of FMBA of Russia, Moscow, Russia
| |
Collapse
|
47
|
Anahtar MN, Yang JH, Kanjilal S. Applications of Machine Learning to the Problem of Antimicrobial Resistance: an Emerging Model for Translational Research. J Clin Microbiol 2021; 59:e0126020. [PMID: 33536291 DOI: 10.1128/JCM.01260-20] [Citation(s) in RCA: 51] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Antimicrobial resistance (AMR) remains one of the most challenging phenomena of modern medicine. Machine learning (ML) is a subfield of artificial intelligence that focuses on the development of algorithms that learn how to accurately predict outcome variables using large sets of predictor variables that are typically not hand selected and are minimally curated. Models are parameterized using a training data set and then applied to a test data set on which predictive performance is evaluated. The application of ML algorithms to the problem of AMR has garnered increasing interest in the past 5 years due to the exponential growth of experimental and clinical data, heavy investment in computational capacity, improvements in algorithm performance, and increasing urgency for innovative approaches to reducing the burden of disease. Here, we review the current state of research at the intersection of ML and AMR with an emphasis on three domains of work. The first is the prediction of AMR using genomic data. The second is the use of ML to gain insight into the cellular functions disrupted by antibiotics, which forms the basis for understanding mechanisms of action and developing novel anti-infectives. The third focuses on the application of ML for antimicrobial stewardship using data extracted from the electronic health record. Although the use of ML for understanding, diagnosing, treating, and preventing AMR is still in its infancy, the continued growth of data and interest ensures it will become an important tool for future translational research programs.
Collapse
|
48
|
Aytan-Aktug D, Nguyen M, Clausen PTLC, Stevens RL, Aarestrup FM, Lund O, Davis JJ. Predicting Antimicrobial Resistance Using Partial Genome Alignments. mSystems 2021; 6:e0018521. [PMID: 34128695 DOI: 10.1128/mSystems.00185-21] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Antimicrobial resistance (AMR) is an important global health threat that impacts millions of people worldwide each year. Developing methods that can detect and predict AMR phenotypes can help to mitigate the spread of AMR by informing clinical decision making and appropriate mitigation strategies. Many bioinformatic methods have been developed for predicting AMR phenotypes from whole-genome sequences and AMR genes, but recent studies have indicated that predictions can be made from incomplete genome sequence data. In order to more systematically understand this, we built random forest-based machine learning classifiers for predicting susceptible and resistant phenotypes for Klebsiella pneumoniae (1,640 strains), Mycobacterium tuberculosis (2,497 strains), and Salmonella enterica (1,981 strains). We started by building models from alignments that were based on a reference chromosome for each species. We then subsampled each chromosomal alignment and built models for the resulting subalignments, finding that very small regions, representing approximately 0.1 to 0.2% of the chromosome, are predictive. In K. pneumoniae, M. tuberculosis, and S. enterica, the subalignments are able to predict multiple AMR phenotypes with at least 70% accuracy, even though most do not encode an AMR-related function. We used these models to identify regions of the chromosome with high and low predictive signals. Finally, subalignments that retain high accuracy across larger phylogenetic distances were examined in greater detail, revealing genes and intergenic regions with potential links to AMR, virulence, transport, and survival under stress conditions. IMPORTANCE Antimicrobial resistance causes thousands of deaths annually worldwide. Understanding the regions of the genome that are involved in antimicrobial resistance is important for developing mitigation strategies and preventing transmission. Machine learning models are capable of predicting antimicrobial resistance phenotypes from bacterial genome sequence data by identifying resistance genes, mutations, and other correlated features. They are also capable of implicating regions of the genome that have not been previously characterized as being involved in resistance. In this study, we generated global chromosomal alignments for Klebsiella pneumoniae, Mycobacterium tuberculosis, and Salmonella enterica and systematically searched them for small conserved regions of the genome that enable the prediction of antimicrobial resistance phenotypes. In addition to known antimicrobial resistance genes, this analysis identified genes involved in virulence and transport functions, as well as many genes with no previous implication in antimicrobial resistance.
Collapse
|
49
|
Medina-Cordoba LK, Chande AT, Rishishwar L, Mayer LW, Valderrama-Aguirre LC, Valderrama-Aguirre A, Gaby JC, Kostka JE, Jordan IK. Genomic characterization and computational phenotyping of nitrogen-fixing bacteria isolated from Colombian sugarcane fields. Sci Rep 2021; 11:9187. [PMID: 33911103 PMCID: PMC8080613 DOI: 10.1038/s41598-021-88380-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2020] [Accepted: 04/07/2021] [Indexed: 01/26/2023] Open
Abstract
Previous studies have shown the sugarcane microbiome harbors diverse plant growth promoting microorganisms, including nitrogen-fixing bacteria (diazotrophs), which can serve as biofertilizers. The genomes of 22 diazotrophs from Colombian sugarcane fields were sequenced to investigate potential biofertilizers. A genome-enabled computational phenotyping approach was developed to prioritize sugarcane associated diazotrophs according to their potential as biofertilizers. This method selects isolates that have potential for nitrogen fixation and other plant growth promoting (PGP) phenotypes while showing low risk for virulence and antibiotic resistance. Intact nitrogenase (nif) genes and operons were found in 18 of the isolates. Isolates also encode phosphate solubilization and siderophore production operons, and other PGP genes. The majority of sugarcane isolates showed uniformly low predicted virulence and antibiotic resistance compared to clinical isolates. Six strains with the highest overall genotype scores were experimentally evaluated for nitrogen fixation, phosphate solubilization, and the production of siderophores, gibberellic acid, and indole acetic acid. Results from the biochemical assays were consistent and validated computational phenotype predictions. A genotypic and phenotypic threshold was observed that separated strains by their potential for PGP versus predicted pathogenicity. Our results indicate that computational phenotyping is a promising tool for the assessment of bacteria detected in agricultural ecosystems.
Collapse
Affiliation(s)
- Luz K Medina-Cordoba
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA.,PanAmerican Bioinformatics Institute, Cali, Valle del Cauca, Colombia
| | - Aroon T Chande
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA.,PanAmerican Bioinformatics Institute, Cali, Valle del Cauca, Colombia.,Applied Bioinformatics Laboratory, Atlanta, GA, USA
| | - Lavanya Rishishwar
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA.,PanAmerican Bioinformatics Institute, Cali, Valle del Cauca, Colombia.,Applied Bioinformatics Laboratory, Atlanta, GA, USA
| | - Leonard W Mayer
- PanAmerican Bioinformatics Institute, Cali, Valle del Cauca, Colombia.,Applied Bioinformatics Laboratory, Atlanta, GA, USA
| | - Lina C Valderrama-Aguirre
- PanAmerican Bioinformatics Institute, Cali, Valle del Cauca, Colombia.,Laboratory of Microorganismal Production (Bioinoculums), Department of Field Research in Sugarcane, INCAUCA S.A.S., Cali, Valle del Cauca, Colombia
| | - Augusto Valderrama-Aguirre
- PanAmerican Bioinformatics Institute, Cali, Valle del Cauca, Colombia.,Universidad Santiago de Cali, Cali, Colombia
| | - John Christian Gaby
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA
| | - Joel E Kostka
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA. .,PanAmerican Bioinformatics Institute, Cali, Valle del Cauca, Colombia. .,School of Biological Sciences, Georgia Institute of Technology, 310 Ferst Dr NW, Atlanta, GA, 30332, USA.
| | - I King Jordan
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA. .,PanAmerican Bioinformatics Institute, Cali, Valle del Cauca, Colombia. .,Applied Bioinformatics Laboratory, Atlanta, GA, USA. .,School of Biological Sciences, Georgia Institute of Technology, 950 Atlantic Dr NW, Atlanta, GA, 30332, USA.
| |
Collapse
|
50
|
Maryam L, Usmani SS, Raghava GPS. Computational resources in the management of antibiotic resistance: Speeding up drug discovery. Drug Discov Today 2021; 26:2138-2151. [PMID: 33892146 DOI: 10.1016/j.drudis.2021.04.016] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2020] [Revised: 12/24/2020] [Accepted: 04/12/2021] [Indexed: 01/19/2023]
Abstract
This article reviews more than 50 computational resources developed in past two decades for forecasting of antibiotic resistance (AR)-associated mutations, genes and genomes. More than 30 databases have been developed for AR-associated information, but only a fraction of them are updated regularly. A large number of methods have been developed to find AR genes, mutations and genomes, with most of them based on similarity-search tools such as BLAST and HMMER. In addition, methods have been developed to predict the inhibition potential of antibiotics against a bacterial strain from the whole-genome data of bacteria. This review also discuss computational resources that can be used to manage the treatment of AR-associated diseases.
Collapse
Affiliation(s)
- Lubna Maryam
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi 110020, India
| | - Salman Sadullah Usmani
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi 110020, India
| | - Gajendra P S Raghava
- Department of Computational Biology, Indraprastha Institute of Information Technology, New Delhi 110020, India.
| |
Collapse
|